期刊文献+

汉语定义语句的抽取方法 被引量:1

Extraction of Chinese Sentences for Definition
下载PDF
导出
摘要 该文介绍了在汉语文本中抽取定义语句的方法。方法的主要特点是:给定被定义的词汇(字符串),应用Bo-yer-Moore算法查找该串在文本中的位置,继而在该句子中查找符合定义特征的谓词。在这个工作基础上,根据谓词字符串的特征排除谓词歧义,并按照句法分析的结果对定义语句修饰谓词的不同语法单元进行识别,从而实现了基于字符串和语法特征的识别的定义语句抽取。 Using Boyer-Moore algorithm and grammar parsing technology,the paper studies extracting the Chinese sentences,which make some definitions,in texts.In the work,defined words,verbs and some functional phrases for definition are recognized so a more precise extractions are completed,and some words which might make incorrect extractions are avoided.
作者 张寅生
出处 《计算机与数字工程》 2011年第10期45-47,112,共4页 Computer & Digital Engineering
关键词 知识抽取 信息抽取 汉语句子识别 定义语句 BOYER-MOORE算法 自然语言理解 knowledge extraction information extraction Chinese sentence recognition definition recognition Boyer-Moore algorithm natural language processing
  • 相关文献

参考文献7

  • 1汉斯·波塞尔 李文潮译.科学:什么是科学[M].上海:上海三联书店,2002..
  • 2Siddharth Patwardhan, Ellen Riloff. Effective Information Extraction with Semantic Affinity Patterns and Relevant Regions[C]//Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Prague, Czech Republic, http://wenku, baidu. com/view/26eafb2c453610661ed9f4ce, html, accessedon June 21,2011.
  • 3刘德喜,何炎祥,姬东鸿,杨华.一种基于演化算法进行句子抽取的多文档自动摘要系统SBGA[J].中文信息学报,2006,20(6):46-53. 被引量:10
  • 4吕雅娟,李生,赵铁军.基于双语模型的汉语句法分析知识自动获取[J].计算机学报,2003,26(1):32-38. 被引量:6
  • 5Von-Wun Soo, Hsing-Yuen Yeh, Shih-Neng Lin, et al. Ontology-based Knowledge Extraction from Semantic Annotated Biological Literature[C]//The Ninth Conference on Artificial Intelligence and Application, 2004.
  • 6秦殿英,焦庆争.Boyer-Moore串查找JAVA算法[J].安徽工业大学学报(自然科学版),2001,18(1):89-92. 被引量:1
  • 7俞士汶.北京大学语言知识库概况.汉语语言与计算学报,2003,13(2):119-120.

二级参考文献24

  • 1钟伟才,刘静,刘芳焦,李成.组合优化多智能体进化算法[J].计算机学报,2004,27(10):1341-1353. 被引量:34
  • 2王萌,何婷婷,姬东鸿,王晓荣.基于HowNet概念获取的中文自动文摘系统[J].中文信息学报,2005,19(3):87-93. 被引量:22
  • 3秦兵,刘挺,李生.多文档自动文摘综述[J].中文信息学报,2005,19(6):13-20. 被引量:51
  • 4辛运帏.JAVA算法[M].北京:电子工业出版社,1998..
  • 5[1]Marcus M P, Marcinkiewicz M A, Santorini B. Building a large annotated corpus of English: the Penn Treebank. Computational Linguistics, 1993,19(2) :313~330
  • 6[3]Zhou Ming. A block-based robust dependency parser for unrestricted Chinese text. In: Proceedings of the 2nd Chinese Language Processing Workshop Attached to ACL-2000, Hong Kong, 2000. 78~84
  • 7[5]Wu De-Kai. Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational Linguistics, 1997, 23(3): 377~403
  • 8[6]Wu De-Kai. Stochastic inversion transduction grammars, with application to segmentation, bracketing, and alignment of parallel corpora. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI 1995), Montreal,1995. 1328~1335
  • 9[7]Wu De Kai, Wong Hong Sing. Machine translation with a stochastic grammatical channel. In: Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and the 17th International Conference on Computational Linguistics, Montreal, 1998. 1408~1415
  • 10[8]Wu De Kai. Trainable coarse bilingual grammars for parallel text bracketing. In: Proceedings of the 3rd Annual Workshop on Very Large Corpora, Cambridge, 1994. 69~82

共引文献30

同被引文献9

  • 1]iang Huixing,Wang Xiaojie, Tian Jilei. Second-order HMM for event extraction from short message [ J ]. Lecture Notes in Computer Science,2010,6 177 : 149-156.
  • 2Zhou Deyu, Yulan Heb. Biomedical events extraction using the hidden vector state model [ J ]. Artificial Intelligence in Medicine,2011,53(3) :205-213.
  • 3Li Qing,Yuanzhu Peter Chen. Personalized text snippet extraction using statistical language models [ J ]. Pattern Recognition, 2010,43( 1 ) :378-386.
  • 4Scheffer T, Decomain C, Wrobel S. Active hidden Markov models for information extraction [ C ]//Proceedings of the Interna- tional Symposium on Intelligent Data Analysis. Berlin:Springer,2001:301-109.
  • 5Bolanle Ojokoh ,Zhang Ming,Tang Jian. A trigram hidden Markov model for metadata extraction from heterogeneous references[J]. Information Sciences,2011,181 (9) : 1 538-1 551.
  • 6Liu Jiangyang. Resolution to combinational ambiguity of Chinese word segmentation [ C ]//Proceedings of the International Conference on E-Learning, E-Business, Enterprise Information Systems, and E-Government, 2009. Hongkong: IEEE, 2009 : 141-145.
  • 7Souyma Ray, Mark Craven. Representing sentence structure in hidden markov models for information extraction [ C ]// Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence. Washington: Morgan Kanfmann Publishers,2001 : 1 273-1279.
  • 8袁里驰.基于依存关系的句法分析统计模型[J].中南大学学报(自然科学版),2009,40(6):1630-1635. 被引量:13
  • 9梁吉光,田俊华,姜杰.基于改进HMM的文本信息抽取模型[J].计算机工程,2011,37(20):178-179. 被引量:9

引证文献1

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部