Abstract
Because they are trained on large-scale corpora, statistical language models often exceed the storage capacity of handheld devices. With the rapid spread of resource-constrained devices, research on language model compression has become increasingly important. This paper proposes a language model compression method that combines count cutoff with rule pruning to reduce the size of the model, and additionally uses grouping to compress the model without reducing the number of cells. The perplexity of the language model produced by the new algorithm is compared against models obtained by count cutoff and by rule pruning alone. The experimental results show that, at the same model size, the new method yields a language model with better performance (lower perplexity).
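The abstract only names the ingredients of the method, so the following minimal Python sketch illustrates one plausible reading of them: count cutoff drops n-grams at or below a frequency threshold, "grouping" is interpreted here as quantizing the surviving log-probabilities into a small shared codebook so that every entry (cell) is kept but stored as a compact group index, and perplexity is the evaluation metric. All function names, thresholds, and the quantization scheme are assumptions for illustration, not the paper's actual implementation; the paper's rule-pruning criterion is not specified in this abstract and is therefore omitted.

```python
import math
from collections import Counter

def count_cutoff(ngram_counts, threshold=1):
    """Count cutoff: discard n-grams seen no more than `threshold` times."""
    return Counter({ng: c for ng, c in ngram_counts.items() if c > threshold})

def quantize_logprobs(logprobs, num_groups=256):
    """'Grouping' read as scalar quantization: every n-gram entry is kept
    (no cell reduction), but its log-probability is replaced by the index
    of the nearest of `num_groups` shared levels (e.g. one byte per entry)."""
    lo, hi = min(logprobs.values()), max(logprobs.values())
    step = (hi - lo) / (num_groups - 1) or 1.0  # guard against a flat range
    codebook = [lo + i * step for i in range(num_groups)]
    indices = {ng: round((lp - lo) / step) for ng, lp in logprobs.items()}
    return indices, codebook

def perplexity(token_logprobs):
    """Perplexity of a sequence: exp of the mean negative log-probability."""
    return math.exp(-sum(token_logprobs) / len(token_logprobs))

# Toy usage: prune rare bigrams, then quantize the surviving log-probs.
counts = Counter({("the", "cat"): 5, ("cat", "sat"): 1, ("sat", "on"): 3})
kept = count_cutoff(counts, threshold=1)          # drops ("cat", "sat")
total = sum(kept.values())
logprobs = {ng: math.log(c / total) for ng, c in kept.items()}
indices, codebook = quantize_logprobs(logprobs, num_groups=16)
print(perplexity(list(logprobs.values())))        # sanity check on toy data
```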
Source
Computer Engineering & Science (《计算机工程与科学》), CSCD
2008, No. 11, pp. 129-133 (5 pages)
Keywords
language model compression
count cutoff
rule pruning
grouping
perplexity