期刊文献+

结合分层条件随机场与标点符号的维吾尔语韵律边界预测 被引量:4

Uyghur Language Prosodic Boundary Prediction Combined with Hierarchical Conditional Random Field and Punctuation
下载PDF
导出
摘要 韵律结构的正确预测是高自然度语音合成系统的重要组成部分。针对维吾尔语的黏着性特点,给出其相应的韵律层次结构,采用基于条件随机场(CRF)的分层自底向上方法预测维吾尔语的韵律词和韵律短语边界,并将维吾尔语形态特征作为韵律边界预测模型的重要特征。为进一步纠正韵律边界预测错误并消除标点符号边界中不同韵律边界之间的歧义,以标点符号边界为单位建立基于CRF的标点符号韵律边界预测模型,并与双层自底向上CRF模型相结合,提出一种韵律边界预测方法。通过对不同的特征模板和模型进行反复实验,以得到更好的韵律边界预测性能。实验结果表明,该方法明显提高了韵律边界的预测召回率。 Correct prosodic boundary prediction is crucial for the quality of synthesized speech. This paper presents the prosodic hierarchy of Uyghur language which belongs to agglutinative language. A two-layer bottom-up hierarchical approach based on Conditional Random Field (CRF) is used for predicting prosodic word and prosodic phrase boundaries. Morphological features are considered useful for prosodic boundary prediction and added into the feature sets. In order to further enhance the accuracy of prosodic boundary prediction at punctuation sites, CRF based prosodic boundary determination method is used and integrated with bottom-up hierarchical approach. Consequently, the best prosodic boundary prediction performance is achieved by large and repeated experiment of different feature sets and different models. Experimental results show that the proposed method obviously improves the recall rate prediction of the prosodic boundary.
出处 《计算机工程》 CAS CSCD 北大核心 2015年第11期299-302,307,共5页 Computer Engineering
基金 国家自然科学基金资助项目(61462087) 教育部社科基金资助项目(10YJA740027) 新疆维吾尔自治区高校科研计划基金资助项目(XJEDU2013S27) 新疆师范大学博士 博士后科研启动基金资助项目(XJNUBS1308)
关键词 维吾尔语 韵律边界 分层方法 标点符号 形态特征 Uyghur language prosodic boundary hierarchical approach punctuation morphological feature
  • 相关文献

参考文献13

  • 1Taylor P, Black A W. Assigning Phrase Breaks from Part-of-Speech Sequences [J]. Computer Speech and Language, 1998,12:99-117.
  • 2Yang Chenyu, Ling Zhenhua, Dai Lirong. Unsupervised Pro-sodic Phrase Boundary Labeling of Mandarin Speech Syn-thesis Database Using Context-dependent HMM[ C]// Pro-ceedings of the 38th IEEE International Conference on Acoustics ,Speech and Signal Processing. Washington D. C., USA : IEEE Press ,2013:6875-6879.
  • 3Chu Min, Qian Yao. Locating Boundaries for Prosodic Constituents in Unrestricted Mandarin Texts [ J ]. Com- putational Linguistics and Chinese Language Processing, 2001,6(1) :61-82.
  • 4Xu Dawei, Wang Haifeng, Li Guohua, et al. Parsing Hierarchical Prosodic Structure for Mandarin Speech Synthesis [ C ]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. Washington D. C. ,USA :IEEE Press ,2006 : 1745-1748.
  • 5Prahallad K, Raghavendra V, Black A. Learning Speaker- specific Phrase Breaks for Text-to-Speech Systems [ C ]// Proceedings of Speech Synthesis Workshop. Kyoto, Japan:[ s. n. ] ,2010.
  • 6李剑锋,胡国平,王仁华.基于最大熵模型的韵律短语边界预测[J].中文信息学报,2004,18(5):56-63. 被引量:20
  • 7Zhang Xiaonan, Xu Jun, Cai Lianhong. Prosodic Boundary Prediction Based on Maximum Entropy Model with Error-driven Modification [ C ]//Proceedings of International Symposium on Chinese Spoken Language Processing. Singapore : [ s. n. ] ,2006 : 149-160.
  • 8Liu Fangzhou, Jia Huibin, Tao Jianhua. A Maximum Entropy Based Hierarchical Model for Automatic Prosodic Boundary Labeling in Mandarin [ C]//Proceedings of the 6th International Symposium on Chinese Spoken Language Processing. Kunming ,China: [ s. n. ] ,2008:253-256.
  • 9古力米热·依玛木,艾斯卡尔·艾木都拉.维吾尔语句韵律层级的人工标注规则研究[C]//第三届全国少数民族青年自然语言信息处理学术研讨会论文集.乌鲁木齐:[出版者不详],2010:179-182.
  • 10Lafferty J,McCallum A, Pereira F. Conditional Random Fields :Probabilistic Models for Segmenting and Labeling Sequence Data [ C]//Proceedings of the 18th Inter- national Conference on Machine Learning. Burlington, USA : Morgan Kaufrnann Publishers, 2001 : 282-289.

二级参考文献12

  • 1M. Chu, Y. Qian, Locating Boundaries for Prosodic Constituents in Unrestricted Mandarin Texts[J]. Computational Linguistics and Chinese Language Processing, February 2001,6(1) :61 - 82.
  • 2Bachenko J, Fitzpatrick E. A computational grammar of discourse-neutral prosodic phrasing in English[J]. Computational Linguistics, 1990, 16(3): 155-170.
  • 3J. Hirschberg, P. Prieto. Training intonational phrasing rules automnatically for English and Spanish text-to-speech[J]. Speech Communication, 1996.
  • 4G. J. Busser, W. Daelemans, Van den Bosch, A. Predicting phrase breaks with memory-based learning[A]. Proceedings 4th ISCA Tutorial and Research Workshop on Speech Synthesis[ C], Perthshire Scotland, August 29th - September 1st, 2001.
  • 5Adam L. Berger, Stephen A. Della Pietra, Vincent J. Della Pietra. A maximum entropy approach to natural language processing[J]. Computational Linguistics 1996, 23(4): 597-618.
  • 6Adwait Ratnaparkhi. A Maximum Entropy Part-Of-Speech Tagger[ A]. Proceedings of the Empirical Methods in Natural Language Processing Conference[C], May 17- 18, 1996.
  • 7Hanna Wallach. Efficient training of conditional random fields[D]. Master's thesis, University of Edinburgh, 2002.
  • 8Adwait Ratnaparkhi. (1998). Maximum Entropy Models for Natural Language Ambiguity Resolution[ D ]. Ph. D.Dissertation. University of Pennsylvania, 1998.
  • 9牛正雨,柴佩琪.基于边界点词性特征统计的韵律短语切分[J].中文信息学报,2001,15(5):19-25. 被引量:13
  • 10赵晟,陶建华,蔡莲红.基于规则学习的韵律结构预测[J].中文信息学报,2002,16(5):30-37. 被引量:25

共引文献19

同被引文献29

引证文献4

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部