Disambiguation for Data-Oriented Parsing (面向数据的句法分析消歧)


Abstract: Data-Oriented Parsing (DOP) is a probabilistic parsing strategy. The main goal of its probability model is to find the most probable parse for a given input sentence, that is, parse disambiguation. Research on computational complexity has shown that this disambiguation problem is NP-complete. To compute the most probable parse efficiently, researchers have therefore proposed a number of approximate parsing algorithms. This paper presents an approximate parsing algorithm, based on the Monte Carlo method within the DOP framework, for finding the most probable parse. It shows that the method can be implemented at reasonable (i.e., polynomial) algorithmic cost and is statistically controlled, which ensures that the approximate solution obtained indeed corresponds to the exact solution of the disambiguation problem.
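The sampling idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the derivation table, the parse names, and the function names are hypothetical, and a real DOP system would sample derivations from the fragments of a stochastic tree substitution grammar rather than from a fixed list. The sketch only shows why the most frequent parse among randomly sampled derivations estimates the most probable parse, and how a standard error gives a simple form of statistical control.

```python
import random
from collections import Counter

# Hypothetical toy data: each derivation of one sentence, the parse tree it
# yields, and its probability. Several derivations can yield the same parse,
# so a parse's probability is the sum over its derivations.
DERIVATIONS = [
    ("d1", "parse_A", 0.30),  # the single most probable derivation
    ("d2", "parse_B", 0.20),
    ("d3", "parse_B", 0.20),
    ("d4", "parse_B", 0.15),  # parse_B totals 0.55, so it is the best parse
    ("d5", "parse_C", 0.15),
]


def sample_parse(rng):
    """Draw one derivation according to its probability; return its parse."""
    weights = [p for _, _, p in DERIVATIONS]
    _, parse, _ = rng.choices(DERIVATIONS, weights=weights, k=1)[0]
    return parse


def monte_carlo_best_parse(n_samples, seed=0):
    """Estimate the most probable parse from n_samples random derivations.

    Returns the most frequent parse, its relative frequency, and the
    standard error of that frequency (binomial approximation).
    """
    rng = random.Random(seed)
    counts = Counter(sample_parse(rng) for _ in range(n_samples))
    parse, hits = counts.most_common(1)[0]
    freq = hits / n_samples
    stderr = (freq * (1 - freq) / n_samples) ** 0.5
    return parse, freq, stderr


if __name__ == "__main__":
    parse, freq, stderr = monte_carlo_best_parse(2000)
    print(parse, round(freq, 3), round(stderr, 4))
```

In this toy setting the most probable derivation (`d1`) and the most probable parse (`parse_B`) differ, which is the situation that makes disambiguation hard. Increasing `n_samples` shrinks the standard error, so the sample size can be chosen to reach a desired confidence in the selected parse.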

Authors: Zhang Yuejie (张玥杰), Zhang Tao (张涛), Zhu Jingbo (朱靖波), Yao Tianshun (姚天顺)

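Affiliations: Department of Computer Science and Engineering, Fudan University; School of Information Management and Engineering, Shanghai University of Finance and Economics; School of Information Science and Engineering, Northeastern University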

Source: Computer Science (《计算机科学》), indexed in CSCD and the Peking University Core Journals list, 2006, No. 3, pp. 174-178 (5 pages)

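Funding: This work was supported by the National Natural Science Foundation of China (Grant Nos. 60203010, 70501018, and 605333100) and the "211 Project" of Shanghai University of Finance and Economics (2004).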

Keywords: Data-oriented parsing (DOP), Stochastic tree substitution grammar (STSG), Disambiguation, Monte Carlo method

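Classification: TP391 [Automation and Computer Technology: Computer Application Technology]; TP391.2 [Automation and Computer Technology: Computer Application Technology]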

