期刊文献+

基于Maximum Likelihood与HMM的文本挖掘 被引量:1

Text Information Mining Based on Maximum Likelihood and Hidden Markov Model
下载PDF
导出
摘要 随着信息技术、数据库技术、网络技术的发展,各行各业均存储了大量的文本数据,怎样从这些文本数据中发掘有价值的信息和知识成为人们急需解决的问题。提出基于Maximum Likelihood与HMM的文本挖掘方法,利用Maximum Likelihood构建隐马尔可夫模型,对论文条目进行特定信息的发掘,并克服了实验过程中"零概率"的缺陷。实验结果表明准确率平均达到0.9,召回率平均达到0.85,从理论和实践上证明该方法是有效的。 With the development of information technology, database technology and network technology, a large number of texts are produced in all kinds of fields, the question should be solved quickly that how to mine useful information and knowledge from texts. Introduces how to mine information using maximum likelihood and hidden Markov model. It constructs HMM with maximum likelihood and mines customizing messages from thesis entries with HMM. During the process of extracting, it deals with the questing of"zero probability". The experiment results indicate that the average precise rate arrives to 0.9 and the average recall rate arrives to 0.85. Both in theory and in practice the method are effective.
出处 《计算机技术与发展》 2007年第12期110-112,共3页 Computer Technology and Development
基金 湖南省自然科学基金资助项目(04JJ40051) 湖南省教育厅资助项目(06C724)
关键词 隐马尔可夫模型 最大似然 文本挖掘 信息抽取 hidden Markov model maximum likelihood text mining information extraction
  • 相关文献

参考文献8

二级参考文献34

  • 1KharHengChoo,JooChuanTong:,LouxinZhang.Recent Applications of Hidden Markov Models in Computational Biology[J].Genomics, Proteomics & Bioinformatics,2004,2(2):84-96. 被引量:6
  • 2[1]Abney S.Parsing by chunk.In Berwick,A.and Tenny,editors,Principle-Based Parsing.Kluwer,1991
  • 3[2]Erik F.Tjong Kim Sang and Sabine Buchholz Introduction to the CoNLL-2000 Shared Task: Chunking.CoNLL-2000 and LLL-2000.Lisbon,Portugal,pp.127~132
  • 4[3]Erik F,Sang T K.Text chunking by system combination.In:Proc.of CoNLL-2000 and LLL-2000.Lisbon,Portugal,2000
  • 5[4]Brants T.TnT -a statistical part-of-speech tagger.In:Proc.of the Sixth Applied Natural Language Processing (ANLP-2000),Seattle,WA,2000
  • 6[5]Ramshaw L,Marcus M.Text Chunking Using Transformation-Based Learning.In:Proc.of third Workshop on Very Large Corpora,June 1995.82~94
  • 7[6]Ratnaparkhi A.Maximum Entropy Models for Natural Language Ambiguity Resolution:[Phd.Thesis].University of Pennsylvania,1998
  • 8[7]Merialdo B.Tagging English Text with a Probabilistic Mod-el.Computational Linguistics,1994,20(2):155~171
  • 9[8]Church K W.A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text.In:Proc.of the 1st Conf.on Applied Natural Language Processing,ANLP,ACL,1988.136~143
  • 10[9]Daelemans W,Buchholz S,Veenstra J.Memory-Based Shallow Parsing.In:Proc.of EMNLP/VLC-99,University of Maryland,USA,June 1999.239~246

共引文献44

同被引文献11

引证文献1

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部