WORD SENSE DISAMBIGUATION BASED ON IMPROVED BAYESIAN CLASSIFIERS 被引量：1

WORD SENSE DISAMBIGUATION BASED ON IMPROVED BAYESIAN CLASSIFIERS

下载PDF

导出

摘要 Word Sense Disambiguation (WSD) is to decide the sense of an ambiguous word on particular context. Most of current studies on WSD only use several ambiguous words as test samples, thus leads to some limitation in practical application. In this paper, we perform WSD study based on large scale real-world corpus using two unsupervised learning algorithms based on ±n-improved Bayesian model and Dependency Grammar (DG)-improved Bayesian model. ±n-improved classifiers reduce the window size of context of ambiguous words with close-distance feature extraction method, and decrease the jamming of useless features, thus obviously improve the accuracy, reaching 83.18% (in open test). DG-improved classifier can more effectively conquer the noise effect existing in Naive-Bayesian classifier. Experimental results show that this approach does better on Chinese WSD, and the open test achieved an accuracy of 86.27%. Word Sense Disambiguation （WSD） is to decide the sense of an ambiguous word on particular context. Most of current studies on WSD only use several ambiguous words as test samples, thus leads to some limitation in practical application. In this paper, we perform WSD study based on large scale real-world corpus using two unsupervised learning algorithms based on ±n-improved Bayesian model and Dependency Grammar （DG）-improved Bayesian model. ±n-improved classifiers reduce the window size of context of ambiguous words with close-distance feature extraction method, and decrease the jamming of useless features, thus obviously improve the accuracy, reaching 83.18% （in open test）. DG-improved classifier can more effectively conquer the noise effect existing in Naive-Bayesian classifier. Experimental results show that this approach does better on Chinese WSD, and the open test achieved an accuracy of 86.27%.

作者 Liu Ting Lu Zhimao Li Sheng

机构地区 Computer Science ＆ Technology School Computer Science ＆ Technology School

出处《Journal of Electronics(China)》 2006年第3期394-398,共5页 电子科学学刊（英文版）

基金 Supported by the National Natural Science Foundation of China (No.60435020).

关键词 Word Sense Disambiguation （WSD） Natural Language Processing （NLP） Unsupervised learning algorithm Dependency Grammar （DG） Bayesian classifier 叶贝斯分级器自然语言处理 NLP 学习算法依赖性

分类号 TP3 [自动化与计算机技术—计算机科学与技术]

引文网络
相关文献

同被引文献6

1刘群,张华平,俞鸿魁,程学旗.基于层叠隐马模型的汉语词法分析[J].计算机研究与发展,2004,41(8):1421-1429. 被引量：198
2张玉芳,彭时名,吕佳.基于文本分类TFIDF方法的改进与应用[J].计算机工程,2006,32(19):76-78. 被引量：121
3LEUSCH G,UEFFING N, NEY H, et al. A novel string-to-string distance measure with applications to machine translation evaluation [A]. Machine Translation Summit IX [C]. New Orleans: [s. n. ],2003. 240-247.
4张立岩,吕玲,王井阳.基于最大熵算法的全文检索研究[J].河北科技大学学报,2009,30(2):112-115. 被引量：6
5崔春生.基于可拓的Vague相似度计算[J].河北科技大学学报,2010,31(2):108-111. 被引量：4
6秦兵,刘挺,王洋,郑实福,李生.基于常问问题集的中文问答系统研究[J].哈尔滨工业大学学报,2003,35(10):1179-1182. 被引量：96

引证文献1

1王保民,刘明生,邢飞.基于语义的语句相似度计算研究[J].河北科技大学学报,2011,32(4):364-367.

1Fang Min.Novel ensemble learning based on multiple section distribution in distributed environment[J].Journal of Systems Engineering and Electronics,2008,19(2):377-380.
2薄丽丽,付主木,梁坤峰.基于PSO的BP网络在苹果颜色分级中的应用[J].信息化纵横,2009(16):63-66.
3ZHU Kai-hua,QI Fei-hu,JIANG Ren-jie,XU Li.Automatic character detection and segmentation in natural scene images[J].Journal of Zhejiang University-Science A(Applied Physics & Engineering),2007,8(1):63-71. 被引量：12
4薄丽丽,付主木,马建伟.基于GA的BP网络在苹果缺陷识别中的应用[J].河南科技大学学报（自然科学版）,2009,30(6):42-44.
5YANG Che-Yu.Word sense disambiguation using semantic relatedness measurement[J].Journal of Zhejiang University-Science A(Applied Physics & Engineering),2006,7(10):1609-1625. 被引量：7
6张祖昶,王诚,奚建春.电信反欺诈系统(AFS)的设计与实现[J].信息技术,2004,28(2):5-8.
7张全新,郑建军,牛振东,原达.贝叶斯分类器集成的增量学习方法[J].北京理工大学学报,2008,28(5):397-400. 被引量：3
8鹿文鹏,黄河燕,吴昊.基于领域知识的图模型词义消歧方法[J].自动化学报,2014,40(12):2836-2850. 被引量：10
9曹乐平,温芝元.补偿模糊神经网络水果形状分级器分级误差[J].农业工程学报,2008,24(12):102-106. 被引量：8
10李生,张晶,赵铁军,姚建民.词义消歧研究的现状与发展方向[J].计算机科学,2001,28(9):95-98. 被引量：8

Journal of Electronics(China)

2006年第3期

浏览历史

内容加载中请稍等...

WORD SENSE DISAMBIGUATION BASED ON IMPROVED BAYESIAN CLASSIFIERS 被引量：1

同被引文献6

引证文献1

相关作者

相关机构

相关主题

浏览历史