Abstract
To find the number of speaker roles in meeting speech and the speech belonging to each role, a clustering method for multiple speaker roles is proposed. First, features for speaker role clustering are defined. Second, geodesic distance is used to measure the similarity between features. Third, a multi-speaker-role clustering method is built in which intra-class distance controls inter-class merging. Finally, the method is tested on four different types of meeting speech. The results show that, for meeting speech segmented either manually or automatically, geodesic distance outperforms the traditional distance when the same clustering method is used, and the proposed method outperforms the traditional hierarchical clustering method when the same distance measure is used.
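The abstract does not say how the geodesic distance between features is computed. As an illustration only, the sketch below uses one common approximation: shortest-path length over a k-nearest-neighbour graph built from Euclidean distances (the construction used in Isomap). The function name `geodesic_distances`, the parameter `k`, and the toy `horseshoe` data are assumptions made for this example and are not taken from the paper.

```python
import math


def geodesic_distances(features, k=2):
    """Approximate pairwise geodesic distances (illustrative sketch).

    Assumption: geodesic distance is approximated by the shortest-path
    length on a k-nearest-neighbour graph; the paper may define it differently.
    """
    n = len(features)
    # Pairwise Euclidean distances between feature vectors.
    euclid = [[math.dist(p, q) for q in features] for p in features]
    # Start with no edges: zero on the diagonal, infinity elsewhere.
    graph = [[0.0 if i == j else math.inf for j in range(n)] for i in range(n)]
    # Connect each point to its k nearest neighbours (kept symmetric).
    for i in range(n):
        order = sorted((j for j in range(n) if j != i),
                       key=lambda j: euclid[i][j])
        for j in order[:k]:
            graph[i][j] = graph[j][i] = euclid[i][j]
    # Floyd-Warshall: shortest path between every pair of points.
    for m in range(n):
        for i in range(n):
            for j in range(n):
                via = graph[i][m] + graph[m][j]
                if via < graph[i][j]:
                    graph[i][j] = via
    return graph


# Hypothetical toy data: points laid out along a horseshoe shape.
horseshoe = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (3, 2), (3, 1), (3, 0)]
dist = geodesic_distances(horseshoe, k=2)
```

On this toy data the two ends of the horseshoe are 3 apart in Euclidean terms but 7 apart along the graph, which shows why a geodesic measure can separate points that a straight-line distance would treat as close.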
Source
Journal of South China University of Technology (Natural Science Edition)
EI
CAS
CSCD
PKU Core (Peking University Core Journals)
2015, No. 1, pp. 21-27, 33 (8 pages in total)
Funding
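National Natural Science Foundation of China (61101160)
Guangzhou Pearl River Science and Technology New Star Program (2013J2200070)
Fundamental Research Funds for the Central Universities, South China University of Technology, Key Project (2013ZZ0053)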
Keywords
speaker role
feature distance measure
role clustering
geodesic distance
unsupervised clustering