期刊文献+

基于词汇-语义模式的金融事件信息抽取方法 被引量:16

Information extraction method of financial events based on lexical-semantic pattern
下载PDF
导出
摘要 信息抽取是自然语言处理工作中的重要任务之一。针对由于自然语言的多样性、歧义性和结构性而导致的信息抽取困难的问题,提出了一种面向金融事件信息抽取的层次化词汇-语义模式方法。首先,定义了一个金融事件表示模型;然后应用基于深度学习的词向量方法来实现自动生成同义概念词典;最后采用基于有限状态机驱动的层次化词汇-语义规则模式实现了对各类金融事件信息自动抽取的目标。实验结果表明,所提方法可以从金融新闻文本中准确地抽取出各类金融事件信息,并且对26类金融事件的微平均识别准确率达到93.9%,微平均召回率达到86.9%,微平均F1值达到90.3%。 Information extraction is one of the most important tasks in natural language processing. A hierarchical Lexical-Semantic Pattern (LSP) method for the extraction of financial events was proposed for the problem of information extraction in natural language processing due to linguistic diversity, ambiguity and structure. Firstly, a financial event representation model was defined. Secondly, a word vector method based on deep learning was used to realize the automatic generation of synonymous concept ~lictionary. Finally, some hierarchical LSPs based on finite state machine were used to extract various kinds of financial events. The experimental results show that by using the proposed method various kinds of financial events can be accurately extracted from the financial news text, and for 26 types of financial events recognition the micro average precision is 93.9%, the micro average recall is 86.9%, the micro average F1 value reaches 90.3%.
作者 罗明 黄海量
出处 《计算机应用》 CSCD 北大核心 2018年第1期84-90,共7页 journal of Computer Applications
基金 上海市科技人才计划项目(14XD1421000) 上海市科技创新行动计划项目(16511102900)~~
关键词 词汇-语义模式 信息抽取 金融事件 词向量 词列表 概念词典 Lexical-Semantic Pattern (LSP) information extraction financial event word vector word list conceptgazetteer
  • 相关文献

参考文献5

二级参考文献81

共引文献153

同被引文献140

引证文献16

二级引证文献40

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部