期刊文献+

基于支持向量机的汉语问句分类 被引量:20

Chinese Question Classification Based on Support Vector Machine
下载PDF
导出
摘要 目前汉语问句分类一般都依据疑问词及其相关词的组合规则,但由于规则的提取很深地依赖于语言知识,而且很难穷举出所有的特征规则,因此会影响分类的效果.支持向量机(SVM)是建立在统计理论基础上的机器学习方法,对于小样本分类问题有很好的识别效果.文中分析和定义了汉语问句的类型,建立了以SVM为基础的问句分类模型,详细描述了问句分类特征的选取过程,并在句法特征的基础上引入语义特征进行汉语问句分类实验,分类准确率达88.7%,表明结合句法和语义特征以SVM进行汉语问句分类具有很好的效果. At present, Chinese question classification is commonly based on the combinatorial rules between the interrogatives and their interrelated words. Because the extraction of the combinatorial rules greatly depends on language knowledge and not all combinatorial rules can be listed, the classification performance is not desirable. As the SVM (Support Vector Machine), a machine learning method based on the statistical theory, possesses excellent discriminating effect on small sample classification, this paper establishes a question classification model based on SVM after the analysis and definition of Chinese question types. The process of the feature selection for question classification is then described in detail. Finally, a question classification experiment is carried out by introducing corresponding semantic features based on syntactic features, with a classification accuracy of 88.7% being achieved, which indicates that Chinese questions can be excellently classified by means of SVM with the combination of syntactic features and semantic features.
出处 《华南理工大学学报(自然科学版)》 EI CAS CSCD 北大核心 2005年第9期25-29,34,共6页 Journal of South China University of Technology(Natural Science Edition)
基金 云南省信息技术基金资助项目(2002IT03)
关键词 问答系统 问句分类 支持向量机 句法特征 语义特征 question-answering system question classification support vector machine syntactic feature semantic feature
  • 相关文献

参考文献13

  • 1张学工.关于统计学习理论与支持向量机[J].自动化学报,2000,26(1):32-42. 被引量:2257
  • 2郑实福,刘挺,秦兵,李生.自动问答综述[J].中文信息学报,2002,16(6):46-52. 被引量:165
  • 3郑实福.[D].哈尔滨:哈尔滨工业大学计算机科学与工程系,2002.
  • 4Li Xin, Roth Dan. Learning question classifier [A]. Proceedings of the 19th International Conference on Computational Linguistics [C]. Taipei: Morgan Kaufmann Publishers ,2002.556 - 562.
  • 5Li Xin, Roth Dan, Small Kevin. The role of semantic information in learning question classifiers [A]. Proceedings of the 1st International Joint Conference on Natural Language Processing [C]. Berlin: Spring-Verlag,2004.451 -458.
  • 6Zhang Dell, Lee Wee Sun. Question classification using support vector machines [A]. Proceedings of the 26th annual international ACM SIGIR Conference on Research and Development in Informaion Retrieval [C]. New York: ACM Press ,2003.26 - 32.
  • 7Hacioglu Kadri, Ward Wayne. Question classification using support vector machines and error correcting code[A]. Proceedings of HLT-NACCL 2003 [C]. Edmonton,2003.28 - 30.
  • 8Roth Dan, Cumby Chad, Li Xin, et al. Question-answering via enhanced understanding of questions [A]. Proceedings of the 1 1th Text Retrieval Conference [C]. Gait hersburg: NIST Special Publication, 2002. 667 - 676.
  • 9Hermjakob U. Parsing and question classification for question answering [A]. ACL-2001 Workshop on Open-Domain Question Answering [C]. Toulouse, 2001. 255 -262.
  • 10Taira Jun Suzuki, Sasaki Yutaka, Maeda Eisaku. Question classification using HDAG kernel [A]. ACL Workshop on Mulitilingual Summarization and Question Answering [C]. Sapporo,2003.61 - 68.

二级参考文献23

  • 1[8]Ulf Hermjakob. Parsing and Question Classification for Question Answering. Proceeding of the workshop on Open-Domain Question Answering at ACL-2001
  • 2[9]Eugene Agichtein, Steve Lawrence, Luis Gravano. Learning Search Engine Specific Query Transformations for Question Answering. ACM 2001,169- 178
  • 3[10]Soo-Min Kim, ae-Ho Baek, Sang-Beom Kim, Hae-Chang Rim Question Answering Considering Semantic Categories and Co-occurrence Density. Proceedings of the night Text Retrieval Conference (TREC-9)
  • 4[11]Marius Pasca, Sanda Harabagiu. High-Performance Question/Answering. 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( Sigir-01 ). New Orleans, LA. September 9 - 13,2001
  • 5[1]Ittycheriah,M. Franz,W-J Zhu,A. Ratnaparkhi. IBM's Statistical Question Answering System. Proceedings of the night Text Retrieval Conference (TREC-9)
  • 6[2]D. Elworthy. Question Answering Using a Large NLP System. Proceedings of the night Text Retrieval Conference (TREC-9)
  • 7[3]L. Wu,X-j Huang,Y. Guo,B. Liu,Y. Zhang. FDU at TREC-9:CLIR,Filtering and QA Tasks. Proceedings of the night Text Retrieval Conference(TREC-9)
  • 8[4]R.J. Cooper, S. M. Rüger. A Simple Question Answering System. Proceedings of the night Text Retrieval Conference(TREC-9)
  • 9[5]C.L.A. Clarke, G. V. Cormack, D. I. E. Kisman, T. R. Lynam. Question Answering by Passage Selection. Proceedings of the night Text Retrieval Conference (TREC-9)
  • 10[6]S-M Kim,D-H Baek,S-B Kim,H-C Rim. Question Answering Considering Semantic Categories and CoOccurrence Density. Proceedings of the night Text Retrieval Conference(TREC-9)

共引文献2474

同被引文献225

引证文献20

二级引证文献108

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部