摘要
全球迎来数据革命背景下,数据要素市场迎来爆发式增长,商业和学术界对数据的需求不断增长,如何有效挖掘数据背后深层次信息成为研究热点及重难点问题。GDELT项目数据作为免费的全球新闻媒体数据库,为人文社会科学领域提供了全球视野问题研究的数据支撑。通过介绍GDELT项目概况、数据获取方式,梳理其数据应用领域及问题,以Event数据库为例,选取其字段指标,结合以往研究,构建基本变量、选取研究方法并提炼数学思想,最终对研究进行讨论及展望。目前研究问题呈现“多点开花”现象,可分为源数据处理及解析、社会热点问题挖掘与国际关系研究三方面,多采用经典方法及数学思想解决热点问题。未来研究应打破学科壁垒,多学科融合交叉协同研究,再度深入挖掘,探索GDELT项目数据新应用领域,完善数据共享平台,增加更多搜索查询模块,融合多方数据,搭建本土实时监控预警平台,充分了解数据局限,科学性使用数据。
The GDELT project is a free global news media database that provides data support for re⁃search on global issues in the humanities and social sciences.After the introduction of the GDELT project,the data acquisition methods,and the application areas and problems of the data;taking the Event database as an example,selecting its field indicators,to construct its basic variables;selecting the research methods and the refinement of mathematical ideas,comes the discussion and outlook of the research.The current re⁃search problem is“multi-faceted”and can be divided into three areas:metadata processing and parsing,social hotspot mining,and international relations research,mostly with the help of classical methods and mathematical ideas to solve hotspot problems.Future research should(1)break down disciplinary barriers,and integrate and collaborate with multiple disciplines,(2)dig deeper again,and explore new applications of GDELT project data,(3)improve the data sharing platform,add more search and query modules,and in⁃tegrate data from multiple parties,to build a local real-time monitoring and early warning platform,so as to fully understand data limitations and use data scientifically.
作者
李振福
张琦琦
LI Zhenfu;ZHANG Qiqi
出处
《信息技术与管理应用》
2024年第3期67-78,共12页
Information Technology and Management Application
基金
国家社会科学基金重大项目“‘一带一路’沿线国家和地区非传统安全问题研究”(23&ZD147)。