期刊文献+

基于句法结构约束的模糊限制信息范围检测 被引量:1

Hedge Scope Detection Based on Syntactic Structural Constraints
下载PDF
导出
摘要 模糊限制信息检测用于区分模糊限制信息与事实信息,提高抽取信息的真实性和可靠性。模糊限制信息范围的界定具有依赖于语义和句法结构的特点,是模糊限制信息检测的一个难点。该文提出一种基于句法结构约束的模糊限制信息范围检测方法,基于依存结构树和短语结构树构建决策树,获取句法结构约束集,用于产生句法结构约束特征,并加入到条件随机域模型中进行模糊限制信息范围检测。实验采用CoNLL-2010共享任务数据集,在标准的模糊限制语标注语料上,获得了70.28%的F值,比采用普通的句法结构特征提高了4.22%。 Hedge scope detection is used to distinguish factual information and uncertain information,which could improve the authenticity and reliability in information extraction.Hedge scope detection is a difficult task because of its dependency of the semantic and syntactic structures.In this paper,we propose a hedge scope detection method based on syntactic structural constraints.First,two decision trees are constructed on dependency structure and phrase structure respectively to build the syntactic constraint set.And then the hedge scope detection results based on the syntactic constraint set are used as the syntactic constraint features for Conditional Random Fields(CRF)models.Experiments on the CoNLL-2010corpus achieve the 70.28% F-score on the golden standard hedge cues,which is 4.22% higher than the system with the common syntactic construction features.
出处 《中文信息学报》 CSCD 北大核心 2013年第5期137-143,共7页 Journal of Chinese Information Processing
基金 国家自然科学基金资助项目(61272375 61173100 61173101)
关键词 模糊限制信息范围检测 句法结构约束 决策树 条件随机域 hedge scope detection syntactic structural constraints decision tree conditional random fields
  • 相关文献

参考文献15

  • 1George L.Hedges:a study in meaning criteria and the logic of fuzzy concepts[J].Journal of Philosophical Logic,1973,2(4):458-508.
  • 2Marc L,Qiu X Y,Pandmini S.The language of bioscience:facts,speculations,and statements in between[C]//Proceedings of the BioLINK,Boston,2004,17-24.
  • 3Szarvas G,Vincze V,Farkas R,et al.The BioScope corpus:biomedical texts annotated for uncertainty,negation and their scopes[J].BMC Bioinformatics,2008,9(11):S9.
  • 4Medlock B,Briscoe T.Weakly supervised learning for hedge classification in scientific iterature[C]//Proceedings of ACL,the 45th Annual Meeting of the Association of Computational Linguistics,2007,992-999.
  • 5Farkas R,Vincze V,Móra G,et al.The CoNLL 2010 Shared Task:Learning to detect hedges and their scope in natural language text[C]//Proceedings of the CoNLL,Uppsala,Sweden.2010,1-12.
  • 6(O)zgür A,Radev D R.Detecting speculations and their scopes in scientific text[C]//Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing,Singapore,August,Association for Computational Linguistics.2009:1398-1407.
  • 7Velldal E,Ovrelid L,Oepen S.Resolving speculation:MaxEnt cue classification and dependency-based scope rules[C]//Proceedings of the CoNLL,Uppsala,Sweden,2010,48-55.
  • 8Morante R,Asch V V,Daelemans W.Memory-based resolution of In-Sentence scopes of hedge cues[C]//Proceedings of the CoNLL,Uppsala,Sweden,2010:40-47.
  • 9Qiaoming Zhu,Junhui Li,Hongling Wang,et al.A unified framework for scope learning via simplified shallow semantic parsing[C]//Proceedings of the 2010Conference on Empirical Methods in Natural Language Processing,2010:714-724.
  • 10ZHOU Huiwei HUANG Degen LI Xiaoyan YANG Yuansheng.Combining Structured and Flat Features by a Composite Kernel to Detect Hedges Scope in Biological Texts[J].Chinese Journal of Electronics,2011,20(3):476-482. 被引量:2

二级参考文献25

  • 1郑家恒,卢娇丽.关键词抽取方法的研究[J].计算机工程,2005,31(18):194-196. 被引量:41
  • 2Moore AW, Zuev D. Internet traffic classification using Bayesian analysis techniques. In: Proc. of the 2005 ACM SIGMETRICS Int'l Conf. on Measurement and Modeling of Computer Systems, Banff, 2005. 50-60. http://www.cl.cam.ac.uk/-awm22 /publications/moore2005internet.pdf.
  • 3Madhukar A, Williamson C. A longitudinal study of P2P traffic classification. In: Proc. of the 14th IEEE Int'l Syrup. on Modeling, Analysis, and Simulation. Monterey, 2006. http://ieeexplore.ieee.org/xpl/ffeeabs_all.jsp?arnumber=1698549.
  • 4Moore AW, Papagiannaki K. Toward the accurate identification of network applications. In: Dovrolis C, ed. Proc. of the PAM 2005. LNCS 3431, Heidelberg: Springer-Verlag, 2005.41-54.
  • 5Karagiannis T, Papagiannaki K, Faloutsos M. BLINC: Multilevel traffic classification in the dark. In: Proc. of the ACM SIGCOMM. Philadelphia, 2005. 229-240. http://conferences.sigcomm.org/sigcomm/2005/paper-KarPap.pdf.
  • 6Roughan M, Sen S, Spatscheck O, Dutfield N. Class-of-Service mapping for QoS: A statistical signature-based approach to IP traffic classification. In: Proc. of the ACM SIGCOMM Internet Measurement Conf. Taormina, 2004. 135-148. http://www.imconf.net/imc-2004/papers/p 135-roughan.pdf.
  • 7Zuev D, Moore AW. Traffic classification using a statistical approach. In: Dovrolis C, ed. Proc. of the PAM 2005. LNCS 3431, Heidelberg: Springer-Verlag, 2005. 321-324.
  • 8Nguyen T, Armitage G. Training on multiple sub-flows to optimise the use of Machine Learning classifiers in real-world IP networks. In: Proc. of the 31 st IEEE LCN 2006. Tampa, 2006. http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=4116573.
  • 9Eerman J, Mahanti A, Arlitt M. Internct traffic identification using machine learning techniques. In: Proc. of the 49th IEEE GLOBECOM. San Francisco, 2006. http://pages.cpsc.ucalgary.ca/-mahanti/papers/globecom06.pdf.
  • 10Erman J, Arlitt M, Mahanti A. Traffic classification using clustering algorithms. In: Proc. of the ACM SIGCOMM Workshop on Mining Network Data (MineNet). Pisa, 2006. http://conferences.sigcomm.org/sigcomm/2006/papers/minenet-01.pdf.

共引文献172

同被引文献5

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部