期刊文献+

基于文本挖掘机制的区域经济关系分析 被引量:5

Text Mining for the Relations between Entities
下载PDF
导出
摘要 已有的经济关系研究大都采用实证的或单纯的计量学的方法来实现的。本文则针对非结构化的文本特点,采用信息抽取和文本挖掘方法挖掘用户感兴趣的区域经济关系是具有十分重大应用价值的研究课题。本文在探讨了基于实体关系的文本挖掘机制的基础上,对31个省、市、自治区的区域经济关系进行了分析。运用文本挖掘技术对经济关系的挖掘包括两种方式:一是基于属性的经济关系挖掘,利用信息抽取获取各个实体属性,采用聚类方法分析经济实体关系;二是基于相互引用的经济关系挖掘,首先构造经济实体关系分类词典,提出了实体关系标注算法,利用信息抽取获得实体之间的引用情况,然后构造关系有向图,从中挖掘区域经济之间的关系。研究表明,运用文本挖掘技术,既可以对各个区域经济发展状况进行分析和评价,也可以发现特定区域经济之间的内在关系。 Text mining plays an important role in knowledge acquisition, and it is valuable issue to apply information extraction and text mining to mine relations among entities from non-structure texts in the internet. In this paper, the approach of text mining for relations between named entities is presented, and it includes two mining schemes. One is based on the attributes of entities. It applies the approach of information extraction to collect their attributes, and then adopt the clustering algorithm to analyze the relations between named entities. The other is based on the reference between entities. It constructs the relation dictionary and presents the algorithm of annotating relations. It set up the vector-graph based on the references between entities, and it derives several interesting information patterns from the vector-graph. As a result, it shows a better effect on mining the relationship between named entities from a specific domain.
出处 《情报学报》 CSSCI 北大核心 2008年第3期418-424,共7页 Journal of the China Society for Scientific and Technical Information
基金 本文得到国家自然科学基金项目(编号:60373095,60673039)的资助.
关键词 文本挖掘 同义扩充 实体关系 关系标注 聚类分析 区域经济 text mining, synonymous expansion, relations between named entities, annotating relations, clustering analysis, regional economics
  • 相关文献

参考文献15

二级参考文献44

  • 1朱德熙.现代书面汉语里的虚化动词和名动词 为第一届国际汉语教学讨论会而作[J].北京大学学报(哲学社会科学版),1985,22(5):3-8. 被引量:131
  • 2周明,黄昌宁.面向语料库标注的汉语依存体系的探讨[J].中文信息学报,1994,8(3):35-52. 被引量:39
  • 3王建波,杜春玲,王开铸.基于篇章理解的自动文摘研究[J].中文信息学报,1995,9(3):33-42. 被引量:12
  • 4[1]ELLEN R. Information extraction as a stepping stone toward story understanding [M]. Montreal:MIT press, 1999.
  • 5[2]SODERLAND G. Learning text analysis rules for domain-specific natural language processing [D]. Amherst: University of Massachusetts, 1997.
  • 6[3]COLLINS M, MILLER S. Semantic tagging using a probabilistic context free grammar [EB/OL].http://www.ai.mit.edu/people/mcollins/papers/wvlcb.fin-as.ps,2003-12-22.
  • 7[4]HOBBS, JERRY R, DOUGLAS E, et al. FASTUS: a cascaded finite-state transducer for extracting information from natural-language text[A]. In: Finite State Devices for Natural Language Processing[C]. Cambridge: MIT Press, 1997.
  • 8Cowie J,Lchnert W. Information Extraction. Special NLP Issuc of the Communications of the ACM, 1996, 39( 1 ): 80-91
  • 9Dong Zhendong,Dong Qiang. HowNet. http://www.keenage.com.2000
  • 10Grishman R. Information Extraction: Techniques and Challenges.In Information Extraction, Springer Notes in Artificial Intelligence,Springer-Verlag, 1997

共引文献196

同被引文献75

引证文献5

二级引证文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部