Construction and Analysis of Chinese News Event Data Based on Unsupervised Techniques (基于无监督技术的中文新闻事件数据构建与分析)
Abstract This paper explores the construction of Chinese news event data for media and communication research. Using natural language processing, deep learning, and unsupervised clustering, it builds an open-ended news event extraction framework. The Chinese news event database is constructed by preprocessing the raw news text, performing syntactic analysis and semantic role labeling, extracting triples, extracting the verbs and converting them into vector representations, and then applying dimensionality reduction and clustering combined with manual annotation to produce structured data. Finally, an event importance score is proposed to assess how events are distributed in the news. Experiments on news data from the People's Daily validate the theoretical and practical value of the research.
Authors YUAN Fang (元方); LU Wei (卢伟); SHEN Hao (沈浩) (State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing 100024, China)
Source Journal of Communication University of China: Science and Technology (《中国传媒大学学报(自然科学版)》), 2023, No. 5, pp. 1-9 (9 pages)
Funding Supported by the Fundamental Research Funds for the Central Universities, Communication University of China (CUC23GY004).
Keywords news event; event data; unsupervised learning
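
The clustering stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the clustering algorithm, so plain k-means is used here as a stand-in, and the triples, the 2-D verb vectors, and the function names (`kmeans`, `cluster_verbs`, `events_per_cluster`) are all hypothetical. The per-cluster count at the end is only a simple frequency tally, not the paper's event importance score, whose definition is not given in the abstract.

```python
from collections import Counter
from math import dist

# Hypothetical toy triples (subject, verb, object). In the described pipeline
# these would come from syntactic analysis and semantic role labeling.
TRIPLES = [
    ("delegation", "visit", "factory"),
    ("minister", "meet", "ambassador"),
    ("company", "build", "bridge"),
    ("president", "visit", "school"),
    ("team", "construct", "railway"),
]

# Hypothetical 2-D verb vectors, standing in for learned embeddings that have
# already been through dimensionality reduction.
VERB_VECTORS = {
    "visit": (0.9, 0.1),
    "meet": (0.8, 0.2),
    "build": (0.1, 0.9),
    "construct": (0.2, 0.8),
}


def kmeans(points, k, iterations=20):
    """Plain k-means with deterministic init: the first k points are centroids."""
    centroids = list(points[:k])
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        labels = [min(range(k), key=lambda c: dist(p, centroids[c])) for p in points]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(x) / len(members) for x in zip(*members))
    return labels


def cluster_verbs(verb_vectors, k):
    """Map each verb to a cluster id; verbs are sorted for reproducibility."""
    verbs = sorted(verb_vectors)
    labels = kmeans([verb_vectors[v] for v in verbs], k)
    return dict(zip(verbs, labels))


def events_per_cluster(triples, verb_to_cluster):
    """Count how many extracted triples fall into each verb cluster."""
    return Counter(verb_to_cluster[verb] for _, verb, _ in triples)


verb_to_cluster = cluster_verbs(VERB_VECTORS, k=2)
counts = events_per_cluster(TRIPLES, verb_to_cluster)
print(verb_to_cluster)
print(counts)
```

In this toy setup the cluster ids carry no meaning by themselves; in the workflow the abstract describes, manual annotation is what turns such clusters into named event types.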