
Research on Smoothing Techniques in Tibetan N-gram Language Models (藏语N-gram语言模型中的平滑技术研究). Cited by: 1
Abstract: This paper sets up an SRILM modeling platform in a Linux environment and splits the corpus into blocks. The SRILM tools ngram-count and ngram are then used to count n-grams and build the language model, and several smoothing algorithms are evaluated by measuring perplexity. Finally, the perplexity values are compared and analyzed to identify the smoothing method best suited to the current corpus and language setting.
Author: REN Qing-ji (仁青吉), Tibetan Intangible Cultural Heritage Key Laboratory, Gansu Normal University for Nationalities, Hezuo 747000, China
Source: Journal of Northwest Minzu University (Natural Science) (《西北民族大学学报(自然科学版)》), 2019, No. 4, pp. 26-30 (5 pages)
Keywords: Tibetan language model; N-gram; smoothing algorithms; perplexity
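The workflow summarized in the abstract (count n-grams, apply a smoothing method, compare perplexity) can be illustrated with a minimal sketch. This is not the author's code and does not use SRILM; it assumes a bigram model with simple additive (add-k) smoothing, which may not be one of the methods the paper tested. The function names, the `<unk>` handling, and the toy tokens `a`, `b`, `c` are all hypothetical placeholders rather than Tibetan corpus data.

```python
import math
from collections import Counter

UNK = "<unk>"  # assumed placeholder for out-of-vocabulary tokens


def train_bigram_counts(sentences):
    """Count bigram contexts and bigrams, padding each sentence with <s> and </s>."""
    context_counts = Counter()
    bigram_counts = Counter()
    vocab = {UNK}
    for tokens in sentences:
        padded = ["<s>"] + list(tokens) + ["</s>"]
        vocab.update(padded[1:])  # predictable events: words plus </s>
        for prev, cur in zip(padded, padded[1:]):
            context_counts[prev] += 1
            bigram_counts[(prev, cur)] += 1
    return context_counts, bigram_counts, vocab


def add_k_prob(prev, cur, context_counts, bigram_counts, vocab_size, k):
    """Additive (add-k) smoothed estimate of P(cur | prev)."""
    return (bigram_counts[(prev, cur)] + k) / (context_counts[prev] + k * vocab_size)


def perplexity(sentences, context_counts, bigram_counts, vocab, k):
    """Perplexity = exp of the negative mean log-probability per predicted token."""
    log_prob = 0.0
    n_tokens = 0
    for tokens in sentences:
        mapped = [t if t in vocab else UNK for t in tokens]
        padded = ["<s>"] + mapped + ["</s>"]
        for prev, cur in zip(padded, padded[1:]):
            p = add_k_prob(prev, cur, context_counts, bigram_counts, len(vocab), k)
            log_prob += math.log(p)
            n_tokens += 1
    return math.exp(-log_prob / n_tokens)


if __name__ == "__main__":
    # Toy placeholder data; a real experiment would use segmented corpus text.
    train = [["a", "b", "c"], ["a", "b"], ["b", "c"]]
    held_out = [["a", "c"], ["b", "d"]]
    cc, bc, vocab = train_bigram_counts(train)
    for k in (1.0, 0.5, 0.1):
        print(f"k={k}: perplexity={perplexity(held_out, cc, bc, vocab, k):.3f}")
```

Lower perplexity on held-out text indicates a better fit, which is the criterion the abstract uses to rank smoothing methods. In an SRILM setup the counting and evaluation steps would be handled by ngram-count and ngram instead.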
