基于句法结构约束的模糊限制信息范围检测 (cited by: 1)

Hedge Scope Detection Based on Syntactic Structural Constraints

Abstract: Hedge detection distinguishes hedged (uncertain) information from factual information, which improves the authenticity and reliability of extracted information. Determining the scope of a hedge depends on both semantic and syntactic structure, which makes it one of the difficult parts of hedge detection. This paper proposes a hedge scope detection method based on syntactic structural constraints. Decision trees are built on dependency structure trees and phrase structure trees to obtain a set of syntactic structural constraints. The constraints are then used to generate syntactic structural constraint features, which are added to a Conditional Random Fields (CRF) model for hedge scope detection. Experiments on the CoNLL-2010 shared task dataset, using gold-standard hedge cue annotations, achieve an F-score of 70.28%, which is 4.22% higher than a system using common syntactic structure features.
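As a rough illustration of how a syntactic constraint could be turned into a per-token feature for a sequence labeler such as a CRF, the sketch below applies one hypothetical rule: the candidate scope runs from the hedge cue to the rightmost token in the dependency subtree of the cue's governor. This rule, the `IN`/`OUT` feature values, and the names `subtree` and `constraint_features` are assumptions made for the example. The abstract does not list the constraints learned by the decision trees, so this should not be read as the authors' constraint set.

```python
from typing import List


def subtree(heads: List[int], root: int) -> List[int]:
    """Indices of every token in the dependency subtree rooted at `root`."""
    children = {i: [] for i in range(len(heads))}
    for child, head in enumerate(heads):
        if head >= 0:  # -1 marks the sentence root
            children[head].append(child)
    seen, stack = set(), [root]
    while stack:
        node = stack.pop()
        if node not in seen:
            seen.add(node)
            stack.extend(children[node])
    return sorted(seen)


def constraint_features(heads: List[int], cue: int) -> List[str]:
    """Hypothetical constraint: scope = cue .. rightmost token under the cue's governor."""
    governor = heads[cue] if heads[cue] >= 0 else cue
    right = subtree(heads, governor)[-1]
    return ["IN" if cue <= i <= right else "OUT" for i in range(len(heads))]


# Toy sentence with hand-written dependency heads (index of each token's parent).
tokens = ["This", "may", "indicate", "a", "role"]
heads = [2, 2, -1, 4, 2]
features = constraint_features(heads, cue=1)
for tok, feat in zip(tokens, features):
    print(tok, feat)
```

In a pipeline of the kind the abstract describes, a column like `features` would sit alongside the usual lexical and syntactic columns in the CRF input, so the model can learn how far to trust each constraint.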

Authors: 周惠巍 (Zhou Huiwei), 杨欢 (Yang Huan), 黄德根 (Huang Degen), 李瑶 (Li Yao), 李丽双 (Li Lishuang)

Affiliation: School of Computer Science and Technology, Dalian University of Technology (大连理工大学计算机科学与技术学院)

Source: Journal of Chinese Information Processing (《中文信息学报》), indexed in CSCD and the Peking University Core Journals list (北大核心), 2013, No. 5, pp. 137-143 (7 pages)

Funding: National Natural Science Foundation of China (grants 61272375, 61173100, 61173101)

Keywords: hedge scope detection; syntactic structural constraints; decision tree; conditional random fields

Classification number: TP391 [Automation and Computer Technology, Computer Application Technology]
