
DOP-Based Chinese Syntactic Parsing Technique (基于DOP的汉语句法分析技术) Cited by: 4

Implementing Chinese Parsing Based on DOP Technique
Abstract This paper proposes a method for Chinese syntactic parsing that takes the DOP (data-oriented parsing) technique as its basic framework and combines it with a similarity-based probability estimation technique. Each input sentence first passes through two levels of preselection, one at the word level and one at the part-of-speech level. The fragment combination forms of the input sentence are then obtained from a constructed knowledge source, which comprises a treebank, a fragment bank, and a fragment combination bank. Finally, the similarity between the input sentence and the preselection results is estimated with the similarity-based probability estimation technique, which completes the combinational parsing of the input sentence. To demonstrate the effectiveness of the method, the knowledge source was built from a real-world Chinese corpus of 1,000 sentences, and a separate real-world Chinese corpus of 100 sentences was used as the test set. The experiments show that all parsing metrics are satisfactory and that the method parses Chinese effectively.
Source Journal of Chinese Information Processing (《中文信息学报》), CSCD, PKU Core, 2000, No. 1, pp. 13-21 (9 pages)
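Funding National Natural Science Foundation of China (Grant No. 69675019); Doctoral Program Special Fund of the State Education Commission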
Keywords DOP (data-oriented parsing); Chinese; syntactic parsing; Chinese parsing; similarity estimate; treebank; fragment bank; fragment combination form bank
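
As a rough illustration of the DOP framework that the abstract builds on, the sketch below scores one derivation with the plain relative-frequency estimate used in Bod's original DOP model: a fragment's probability is its count divided by the count of all fragments with the same root label, and a derivation's probability is the product over its fragments. It does not reproduce the paper's similarity-based estimate or its two-level preselection, whose details are not given in this record. The toy fragment bank, its counts, and the function names are all hypothetical.

```python
from collections import Counter
from math import prod

# Hypothetical toy fragment bank: (root label, bracketed fragment) -> count.
# These fragments and counts are invented for illustration only.
FRAGMENT_COUNTS = Counter({
    ("S", "(S NP VP)"): 3,
    ("S", "(S (NP 我) VP)"): 1,
    ("NP", "(NP 我)"): 2,
    ("NP", "(NP 书)"): 2,
    ("VP", "(VP (V 读) NP)"): 1,
    ("VP", "(VP V NP)"): 3,
})


def fragment_probability(root, fragment, counts=FRAGMENT_COUNTS):
    """Relative frequency of a fragment among all fragments sharing its root."""
    root_total = sum(c for (r, _), c in counts.items() if r == root)
    if root_total == 0:
        return 0.0  # unseen root label: no probability mass
    return counts.get((root, fragment), 0) / root_total


def derivation_probability(derivation, counts=FRAGMENT_COUNTS):
    """Product of the fragment probabilities along one derivation."""
    return prod(fragment_probability(r, f, counts) for r, f in derivation)


# One derivation of "我 读 书": an S fragment, then a VP and an NP substituted in.
derivation = [
    ("S", "(S (NP 我) VP)"),
    ("VP", "(VP (V 读) NP)"),
    ("NP", "(NP 书)"),
]
print(derivation_probability(derivation))
```

A similarity-based estimate, as named in the abstract, would replace `fragment_probability` with a function that also gives mass to fragments resembling unseen ones; how the paper does this is not stated here.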

References (7)

  • [1] Rens Bod. Data-oriented parsing (DOP). Proceedings of COLING'92, Nantes, France, 1992.
  • [2] Rens Bod. Using an annotated corpus as a stochastic grammar. Proceedings of EACL'93, Utrecht, The Netherlands, 1993.
  • [3] Zhu Jingbo (朱靖波), Yao Tianshun (姚天顺). Data-oriented parsing techniques [J]. Journal of Chinese Information Processing, 1998, 12(1): 1-8. Cited by: 9
  • [4] Rens Bod. Monte Carlo parsing. Recent Advances in Parsing Technology, Kluwer Academic Publishers.
  • [5] Khalil Sima'an. An optimized algorithm for data oriented parsing. Proceedings of the International Conference on Recent Advances in Natural Language Processing, Tzigov Chark, Bulgaria.
  • [6] Joshua Goodman. Parsing algorithms and metrics. Proceedings of the 34th Annual Meeting of the ACL, June 1996.
  • [7] Lillian Jane Lee. Similarity-Based Approaches to Natural Language Processing. Doctoral thesis, Computer Science, Harvard University, Cambridge, Massachusetts, 1997.

Secondary References (1)

  • 1 Yao Tianshun (姚天顺). Natural Language Understanding: A Study of Making Machines Understand Human Language, 1995, p. 12.

Co-citing Documents (8)

Co-cited Documents (32)

  • 1 Zhou Ming (周明), Huang Changning (黄昌宁), Zhang Min (张敏), Bai Shuanhu (白栓虎), Wu Sheng (吴升). A Chinese parsing model combining statistics and rules [J]. Journal of Computer Research and Development, 1994, 31(2): 40-49. Cited by: 8
  • 2 Li Xing (李幸), Zong Chengqing (宗成庆). A hierarchical parsing method for long Chinese sentences with punctuation processing [J]. Journal of Chinese Information Processing, 2006, 20(4): 8-15. Cited by: 22
  • 3 Shi Chunyi (石纯一). Principles of Artificial Intelligence [M]. Beijing: Tsinghua University Press, 2000.
  • 4 Chelba, Ciprian, Frederick Jelinek. Exploiting syntactic structure for language modeling [C]. ACL, 1998. 225-231.
  • 5 Caraballo, Sharon A, Eugene Charniak. New figures of merit for best-first probabilistic chart parsing [J]. Computational Linguistics, 1998, 24: 275-298.
  • 6 Collins, Michael John. A new statistical parser based on bigram lexical dependencies [C]. ACL, 1996. 184-191.
  • 7 Bod, Rens, Ron Kaplan. A data-oriented approach to lexical-functional grammar [C]. Eindhoven, Netherlands: Computational Linguistics in the Netherlands, 1996.
  • 8 Collins, Michael John. Three generative lexicalized models for statistical parsing [C]. ACL, 1997. 16-23.
  • 9 Magerman, David M. Statistical decision-tree models for parsing [C]. ACL, 1995. 276-283.
  • 10 Goodman, Joshua. Parsing algorithms and metrics [C]. ACL, 1996. 177-183.

Citing Documents (4)

Secondary Citing Documents (20)
