期刊文献+

贝叶斯决策树在英文现在分词词性识别中的应用 被引量:6

Application of Bayesian decision tree to recognition of English present participle
下载PDF
导出
摘要 针对英文现在分词词性标注这一特定问题存在的难点分析了隐马尔可夫模型(HMM)的不足,提出了贝叶斯决策树模型。对一个已经标注好的语料库进行统计,运用决策树C4.5算法从单边条件和双边条件两个方面对英语现在分词的三种词性进行合理的分类消歧。对于双边条件下仍然存在歧义的情况,用贝叶斯最小风险对决策树改进,用标注好的语料库对模型进行训练。最后,采用一个未经过标注的语料库进行测试,取得了非常好的效果,证明了模型的优越性。 Concerning the difficulties in part-of-speech tagging in English present participle, the authors analyzed the drawbacks of Hidden Markov Models (HMM) and proposed Bayesian decision tree model. Firstly, the tagged corpus was calculated and CA. 5 in decision tree was used for proper classification and disambiguation of the three classes of present participle. Then, the decision tree was improved by Bayesian least risk. At last, an untagged corpus was used to test the model and the result is very good, which proves the superiority of the model.
作者 徐哲 刘循
出处 《计算机应用》 CSCD 北大核心 2009年第9期2571-2574,共4页 journal of Computer Applications
基金 国家自然科学基金资助项目(60773169)
关键词 分类 消歧 贝叶斯决策树 隐马尔可夫模型 classification disambiguation Bayesian decision tree Hidden Markov Model (HMM)
  • 相关文献

参考文献8

  • 1POLAT K. A novel hybrid intelligent method based on C4.5 decision tree classifier and one against all approach for muhielass classification problems [ J]. Expert Systems with Applieations, 2007, 36 (2): 1587-1592.
  • 2SHUKLA S K, TIWARI M K. Soft decision trees: A genetically optimized cluster oriented approach [ J]. Systems with Applications, 2009, 36(1): 551-563.
  • 3QUINLANN J R. Induction of decision trees [ J]. Machine Learning, 1986, 1(1): 81-106.
  • 4PULKKINEN P. Fuzzy classifier identification Using decision tree and multiobjective evolutionary algorithms [ J]. International Journal of Approximate Reasoning, 2008, 36(2): 526-543.
  • 5KUPIEC J. Robust part of speech tagging using a hidden Markov model [J]. Computer Speech and Language, 1992, 6(3): 225 -242.
  • 6CHEN JING-NIAN, HUANG HOU-KUAN, TIAN SHENG-FENG, et al. Feature selection for text classification with Naive Bayes [ J]. Expert Systems with Applications: An International Journal, 2009, 36(3): 5432 -5435.
  • 7LI REN-PU. Mining classification rules using rough sets and neural networks [ J]. European Journal of Operational Research, 2004, 157(2): 439 -448.
  • 8GARSIDE R, LEECH G, SAMPSON G. The computational analysis of English [M]. London: Longman, 1987.

同被引文献122

引证文献6

二级引证文献41

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部