统计句法分析建模中基于信息论的特征类型分析被引量：4

The Information-Theory-Based Feature Type Analysis in the Modeling for Probabilistic Parsing

下载PDF

导出

摘要统计句法分析利用概率评价模型评价每棵候选句法树存在的可能性 ,选择概率值最高的候选句法树作为最终的句法分析结果 .因此 ,统计句法分析的核心是一个概率评价模型 ,而各种概率评价模型的本质区别主要在于它们分别是根据上下文中的哪些特征来赋予句法树概率的 .在统计句法分析研究领域 ,虽然已经提出了大量的概率评价模型 ,然而 ,不同的模型用到了不同类型的特征 .如何评价这些特征类型对于句法分析的作用呢 ?针对以上的问题 ,本研究为统计句法分析提出了一种特征类型的分析模型 ,该模型可以从信息论的角度量化地分析不同类型的上下文特征对于句法结构的预测作用 .其基本思想是利用信息论中熵与条件熵的度量来显示一个特征类型是否抓住了预测句法结构的主要信息 .如果加入某个特征类型之后当前句法结构的不确定性 (熵 )明显下降 ,则认为该特征类型抓住了上下文中影响句法结构的某些主要信息 .特征类型分析的信息论模型利用预测信息量、预测信息增益、预测信息关联度以及预测信息总量四种度量从不同的侧面量化地分析各种特征类型及特征类型组合对于当前目标的预测作用 .实验以 Penn Tree Bank为训练集 ,将上下文中不同的特征类型对于句法分析规则的预测作用进行了系统的量化分析。 The paper proposes an information-theory-based feature type analysis model. Using the method, we can quantitatively analyze the power of different feature types for syntactic structure prediction from the viewpoint of information theory. The basic idea is that we use entropy and conditional entropy to measure whether a feature type grasps some of the information for syntactic structure prediction. If the average uncertainty of the syntactic structures declines apparently, the feature type is deemed to have grasped some intrinsic linguistic information in the context that has close relation to the syntactic structure. Using Penn-Treebank training and testing set, our experiment quantitatively analyze the different feature types' predictive power for syntactic structure predictive power for syntactic structure prediction in a systematic way and draws a series of conclusions which reflect the predictive power of different feature types and feature type combination for syntactic parsing.

作者穗志方赵军俞士汶

机构地区北京大学计算机科学与技术系计算语言学研究所香港科技大学计算机科学系人类语言技术中心

出处《计算机学报》 EI CSCD 北大核心 2001年第2期144-151,共8页 Chinese Journal of Computers

基金国家"九七三"项目! (G19980 30 5 0 7-4 ) 国家自然科学基金! (6 94830 0 3)资助

关键词统计句法分析信息论概率建模特征类型分析语音识别 Entropy Information theory

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献10

1[1]Kenneth Ward Church. A stochastic parts program and noun phrase parser for unrestricted text. In: Proc 2nd Conference on Applied Natural Language Processing, ACL, Austin, Texas, 1988. 136-143
2[2]Magerman D M, Marcus M P. Pearl:A probabilistic chart parser. In: Proc European ACL Conference, Berlin, Germany,1991, http://www-cs-students.stanford.edu/～magerman/pubs.html
3[3]Briscoe T, Carroll J. Generalized LR parsing of natural language (corpora) with unification-based grammars. Computational Linguistics, 1993, 19(1):25-60
4[4]Magerman D M, Weir C. Probabilistic prediction and Picky chart parsing. In: Proc DARPA Speech and Natural Language Workshop, Arden House, NY, 1992, http://www-cs-students.stanford.edu/～magerman/pubs.html
5[5]Magerman D M. Statistical decision-tree models for parsing. In: Proc 33th Annual Meeting of the ACL, Cambridge, MA, 1995. 276-283
6[6]Collins M J. A new statistical parser based on bigram lexical dependencies. In: Proc 34th Annual Meeting of the ACL, Santa Cruz, CA, 1996.184-191
7[7]Charniak E. Statistical parsing with a context-free grammar and word statistics. In: Proc 14th National Conference on Artificial Intelligence, Menlo Park, CA, 1997. 598-603
8[8]Black E, Jelinek F, Lafferty J et al. Towards history-based grammars: Using richer models of context in probabilistic parsing. In: Proc 31st Annual Meeting of the ACL, Columbus, Ohio, 1993. 31-37
9[9]Marcus M P, Santorini B, Marcinkiewicz M A. Building a large annotated corpus of English:The Penn treebank. Computational Linguistics, 1993, 19(2):313-330
10[10]Bell T C, Cleary J G, Witten I H. Text compression. Englewood Cliffs, New Jersey 07632: Prentice Hall, 1992

同被引文献64

1董振东.语义关系的表达和知识系统的建造[J].语言文字应用,1998(3):79-85. 被引量：59
2由丽萍,范开泰,刘开瑛.汉语语义分析模型研究述评[J].中文信息学报,2005,19(6):57-63. 被引量：22
3秦春秀,赵捧未,刘怀亮.词语相似度计算研究[J].情报理论与实践,2007,30(1):105-108. 被引量：30
4[2]Darroch J N,Ratcliff D.Generalized iterative scaling for log-linear models[J].The Annals of Mathematical Statistics, 1972;43(5): 1470-1480
5[3]Au R Rosenfeld. Adaptive language modeling using the maximum entropy principle[C].ln:Proceedings of the Human Language Technology Workshop ,ARPA: 1993: 108-113
6[4]Rosenfeld R.A maximum entropy approach to adaptive statistical language modeling[J].Computer, Speech, and Language, 1996; 10
7[5]Jaynes E T.Notes on present status and future prospects[C].ln:Grandy W T,Schick L Heds. Maximum Entropy and Bayesian Methods,Kluwer: 1990:1-13
8Quillian M R. Semantic memory[ M]//Minsky M Y. Semantic In- formation Processing. Cambridge: MIT Press, 1968.
9Sowa J F. Conceptual structures:Information processing in mind and machine[ M]. Boston: Addison - Wesley Longman Publishing Co. , Inc. , 1984.
10Gruber T R. A translation approach to portable ontology specifica- tions[J]. Knowledge Acquisition, 1993, 5(2) : 199 -220.

引证文献4

1袁毅,张丹,张晓东,谢建明,孙啸.基因相关生物医学文献挖掘研究[J].电脑知识与技术,2008,3(5):620-623. 被引量：3
2徐延勇,郭忠伟,周献中.基于最大熵方法的统计语言模型[J].计算机工程与应用,2002,38(5):53-55. 被引量：4
3秦春秀,祝婷,赵捧未,张毅.自然语言语义分析研究进展[J].图书情报工作,2014,58(22):130-137. 被引量：31
4孙淑婷,刘铖枨,周广茵,韩锐,陈立超,羊月褀,许玥.图像分割算法在医学图像中的应用综述[J].现代仪器与医疗,2024,30(2):59-68.

二级引证文献38

1郑海山.大数据时代建构人工智能辅助量刑系统的路径探讨[J].湘江青年法学,2018,4(1):68-87. 被引量：4
2李明杰,贾巨涛,宋德超,吴伟,韩林峄.一种基于少量训练数据的口语语义理解技术[J].家电科技,2020(S01):222-224. 被引量：4
3陈欣,和金生,董丽平.知识创新随机过程最大熵模型[J].中国工程科学,2004,6(12):43-46.
4曹波,苏一丹,邓琦.基于最大熵模型的中国人名自动识别[J].计算机工程与应用,2009,45(4):227-228. 被引量：7
5陈文君,於文雪.汉英跨语言检索系统中关键词提取方法的研究[J].电脑知识与技术,2009,5(10):7848-7849.
6张克菊,韩毅.关系抽取技术的发展与应用——以生物信息学为例[J].情报科学,2010,28(1):102-106. 被引量：1
7吕婷,姜友好.文本挖掘在生物医学领域中的应用及其系统工具[J].中华医学图书情报杂志,2010,19(4):56-64. 被引量：19
8张富利,郑海山.大数据时代人工智能辅助量刑问题研究[J].昆明理工大学学报（社会科学版）,2018,18(6):1-10. 被引量：9
9高志鹏,牛琨,刘杰.面向大数据的分析技术[J].北京邮电大学学报,2015,38(3):1-12. 被引量：49
10祝婷,秦春秀,马晓悦,李祖海.基于本体与LDA主题模型的文本资源推荐方法研究[J].情报杂志,2015,34(11):150-156. 被引量：4

1朱胜.分布式电源的概率建模及其对电力系统的影响[J].电子测试,2016,27(11):127-128.
2何亚平.条件熵在图像压缩中的应用[J].贵州科技工程职业学院学报,2007,2(4):31-32.
3俞一彪,袁保宗.连续语音识别中句法结构知识的利用[J].电子学报,1990,18(6):68-74. 被引量：5
4王振华,田金文,柳健.一种适合于遥感图像的逐行扫描压缩算法[J].宇航学报,2005,26(1):60-65. 被引量：1
5季玉玉,俞晓磊,赵志敏,汪东华.射频识别系统碰撞过程的概率建模及防碰撞检测[J].理化检验（物理分册）,2013,49(1):6-10. 被引量：2
6邵银波,贺玲,秦江敏.BMP神经网络在句法分析中的运用[J].空军雷达学院学报,2000,14(4):11-14.
7陈月,赵岩,王世刚.图像局部特征自适应的快速SIFT图像拼接方法[J].中国光学,2016,9(4):415-421. 被引量：25
8肖小玲,李腊元.基于概率支持向量机方法的人脸识别[J].武汉理工大学学报（交通科学与工程版）,2009,33(2):345-348. 被引量：4
9电视技术:未来10年有10变[J].电视研究,1994(3):54-54.
10马洁.目标跟踪技术在智能视频监控中的应用研究[J].煤矿机电,2016,37(2):41-43. 被引量：1

计算机学报

2001年第2期

浏览历史

内容加载中请稍等...

统计句法分析建模中基于信息论的特征类型分析被引量：4

参考文献10

同被引文献64

引证文献4

二级引证文献38

相关作者

相关机构

相关主题

浏览历史

统计句法分析建模中基于信息论的特征类型分析 被引量：4

参考文献10

同被引文献64

引证文献4

二级引证文献38

相关作者

相关机构

相关主题

浏览历史

统计句法分析建模中基于信息论的特征类型分析被引量：4