摘要
[目的/意义]人工智能技术的更迭应用驱动着数据集在科学计量研究领域发挥着日渐重要的作用。从传统面向信息供应的数据资料集合到如今辅助知识发现和关系网络构建的知识资源,数据集各功能取得了快速发展,进而为拓展科学计量研究的深度和广度提供支持。[研究设计/方法]以Scientometrics期刊2016-2020年收录的论文为数据源,分析数据集的整体使用情况,探究数据集的使用热度与文献数量之间的关系,针对典型数据集进行特征分析,并探讨人工智能技术对于数据集工作的影响,展望数据集的未来建设方向。[结论/发现]数据集的被使用频次与其收录论文数量之间存在一定正相关关系,同一科学计量研究倾向于同时使用多种数据集,且基于科研文献数据集的科学计量研究与人工智能技术之间的关系日益紧密。[创新/价值]旨在通过分析科学计量相关论文所使用数据集的特征,总结归纳近年来数据集的建设发展规律,并为开展科学计量研究选用数据集提供参考。
[Purpose/Significance]Driven by the application development of artificial intelligence technologies,academic literature datasets are playing an increasingly important role in the field of scientometrics.Developing from traditional data collections oriented to information supply to knowledge resources that assist in knowledge discovery and relationship network construction nowadays,every aspect of the function of data sets have been greatly improved,which in turn provides support for expanding the depth and breadth of research in scientometrics.[Design/Methodology]The article uses journal articles published by Scientometrics in 2016-2020 as the data source.Through the analysis of the data sets usage records of scientometrics research,the overall use of the data sets is summarized,and the relationship between the usage popularity of the data sets and the number of documents is explored.In terms of typical data sets that have been frequently used,the article specifically analyzes their characteristics,explores the impact of artificial intelligence technology on datasets and looks forward to its future construction direction.[Findings/Conclusion]There is a certain positive correlation between the usage frequency of datasets and their collection volume,the same study tends to cross use multiple datasets,and the relationship between scientometrics research and artificial intelligence technology is getting increasingly close.[Originality/Value]The purpose of this study is to summarize the construction and development rules of data sets in recent years by analyzing the striking features of data sets used in scientometrics related papers,so as to provide references for the selection of data sets for scientometrics research.
作者
张彤阳
王楚涵
俞超
徐健
ZHANG Tongyang;WANG Chuhan;YU Chao;XU Jian(School of Information Management,Sun Yat-sen University,Guangzhou,510006;Department of Information Management,Beijing University,Beijing,100091)
出处
《图书情报知识》
北大核心
2023年第5期39-49,共11页
Documentation,Information & Knowledge
基金
广州市哲学社会科学发展“十四五”规划2023年度一般课题项目“人才是第一资源的广州实践和对策研究—高校科技人才与企业技术需求对接问题与对策研究”(2023GZYB04)的研究成果之一。
关键词
数据集
人工智能
科学计量
特征分析
Data sets
Artificial intelligence
Scientometrics
Feature analysis