期刊文献+

基于LLE和模糊核聚类的语音可视化仿真

Speech Visualization Simulation Based on LLE and Fuzzy Kernel Clustering Algorithm
下载PDF
导出
摘要 根据语音信号的时变特性,提出了一种具有很好分类定位能力的语音可视化方法——局部线性嵌入(LLE)和模糊核聚类相结合的算法.通过利用LLE对提取的语音特征进行非线性降维,然后再利用模糊核聚类算法对其进行聚类分析,即利用Mercer核,将原始空间通过非线性映射到高维特征空间,在高维特征空间中对语音信号特征进行模糊核聚类分析.由于经过了核函数的映射,使原来没有显现的特征突现出来,从而能够更好地支持基于位置的语音可视化.以10名男生和10名女生在实验室环境下的720个语音资料(汉语元音)作为样本进行了试验,试验结果验证了该方法的可行性和有效性. According to the time-varying speech signal, a novel method combining LLE(locally linear embedding) with fuzzy kernel clustering algorithm was proposed for speech visualization, where LLE could reduce the nonlinear dimensionality of the speech features and then the fuzzy kernel clustering algorithm was used for clustering analysis, i.e. the Mercer kernel function was used to change the data in original space into a high-dimensional eigenspace through nonlinear mapping, and then the fuzzy clustering analysis was made in the high-dimensional eigenspace. Thus, after the kernel function mapping, the original inherent features of speech were highlighted to improve the position-based speech visualization. 720 data in Chinese vowels were obtained from 10 male and 10 female students' speech in lab, the results of simulation experiments show the feasibility and validity of the method.
出处 《东北大学学报(自然科学版)》 EI CAS CSCD 北大核心 2009年第6期790-793,共4页 Journal of Northeastern University(Natural Science)
基金 国家自然科学基金资助项目(50477015)
关键词 语音信号 可视化 局部线性嵌入 核方法 模糊核聚类 speech signal visualization LLE kernel-based method fuzzy kernel clustering
  • 相关文献

参考文献8

  • 1王枫,胡旭君,王永华.听力障碍儿童与正常儿童视觉记忆能力比较研究[J].中国特殊教育,2002(4):32-34. 被引量:14
  • 2Kuhn G M. Description of a color spectrogram [ J ]. The Journal of the Acoustical Society of America, 1984,76 (3) :682-685.
  • 3Tran D, Wagner M, Le T V. A proposed decision rule for speaker recognition based on fuzzy C-means clustering [ C]// The 5th International Conference on Spoken Language Processing. Sydney: ASSTA, 1998 : 755 - 758.
  • 4Girolami M. Mercer kernel based clustering in feature space [J]. IEEE Trans on Neural Networks, 2002, 13(3) : 780 - 784.
  • 5Roweis S T, Saul L K. Nonlinear dimensionality reduction by locally embedding [ J ]. Science, 2000, 290 ( 5500 ) : 2323 - 2326.
  • 6Muller K R, Mika S, Ratsch G, et al. An introduction to kernel based learning algorithms[J ]. IEEE Trans on Neural Networks, 2001,12(2) : 181 - 202.
  • 7Scholkopf B, Mika S, Burges C, et al. Input space versus feature space in kernel-based method[J]. IEEE TransNeural Networks, 1999,10(5) : 1000 - 1017.
  • 8林琳,王树勋,郭纲.短语音说话人识别新方法的研究[J].系统仿真学报,2007,19(10):2272-2275. 被引量:10

二级参考文献16

  • 1伍忠东,高新波,谢维信.基于核方法的模糊聚类算法[J].西安电子科技大学学报,2004,31(4):533-537. 被引量:75
  • 2张建星.聋童心理与行为问题浅析[J].听力学及言语疾病杂志,1996,4(2):109-110. 被引量:3
  • 3李公正,于之坤,刘灵.聋童视知觉发育测试分析[J].西安医科大学学报,1996,17(2):223-224. 被引量:5
  • 4曲成毅 孙喜斌 郑日昌 等.我国1758例聋儿智力发育现状调查[J].中华耳鼻咽喉科杂志,1995,30:361-364.
  • 5[4]Hiskey MS. Manual for the Hiskey - Nebraska test of learning aptitude. Lincoln: Nebraska,1986:1 - 22
  • 6[7]Anastasi A . Psychological testing. 5th ed. MacMillian: New York 1982: 280 - 281.
  • 7Matsui T,Furui S.Comparison of Text-independent Speaker Recognition Methods Using VQ-distortion and Discrete/Continuous HMMs[C]// Proc.IEEE Internat.Conf.on Acoust.Speech,Signal Processing,San Francisco:IEEE.1991.
  • 8Karayiannis N B,Pin-I Pai.Fuzzy Vector Quantization Algorithms[C]// Fuzzy Systems,IEEE World Congress on Computational Intelligence,Orlando,Florida:IEEE.1994.
  • 9Tran D,Wagner M,Van Le T.A Proposed Decision Rule for Speaker Recognition Based on Fuzzy C-Means Clustering[C]//5th International Conference on Spoken Language Processing,ICSLP'98.Sydney Australia:Australian Speech Science and Technology Association,Incorporated (ASSTA).1998.
  • 10Girolami M.Mercer kernel Based Clustering in Feature Space[J].IEEE Trans Neural Networks (S1045-9227),2002,29(1):123-127.

共引文献22

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部