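Abstract: [Purpose] Open-source code is an important basis for verifying and reproducing research results in computer science. This paper examines how releasing source code, and the type of code release, affect a paper's citation count. [Methods] Taking 2043 computer science journal papers on Papers with Code as the sample, the study applies a multiple regression model with robust standard errors. [Results] Releasing source code is significantly and positively correlated with citation count, and the citation advantage differs across types of code release. [Conclusions] For computer science papers, open-source code not only provides a means of reproducing research results but also helps increase citations; in addition, mentioning the original paper in the README file of the GitHub repository helps promote citations of the paper.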
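Abstract: [Research purpose] Scientific data sharing is key to driving scientific and technological innovation. This study examines and summarizes the publication output, research hotspots, knowledge base, and evolution of the scientific data sharing research field, to provide a reference for advancing related theoretical research and practice in China. [Research method] Using the scientific knowledge mapping method, it performs collaboration network, keyword co-occurrence, and document co-citation analyses on the literature on scientific data sharing indexed in the Web of Science Core Collection from 2001 to 2020. [Research conclusions] Research hotspots have expanded from early work on scientific data sharing technologies, sharing models, and scientific data sharing platforms to current questions of data sharing policy, data sharing infrastructure, and long-tail scientific data sharing. In addition, the field was initially shaped mainly by institutional theory and the theory of planned behavior, and is now shaped mainly by virtual organization theory, data life cycle theory, and long-tail theory.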
Funding: This work was supported by the University Innovation Research Group of Chongqing (No. CXQT21005).
Abstract: Purpose: This study proposes an improved credit allocation method that makes the leading author of a paper more distinguishable and makes the identification more robust under malicious manipulation. Design/methodology/approach: We use a modified sigmoid function to handle fat-tailed citation counts. We also remove the target paper when calculating the contribution of co-citations. Following previous studies, we use 30 Nobel Prize-winning papers and their citation networks, based on the American Physical Society (APS) and Microsoft Academic Graph (MAG) datasets, to test the accuracy of our proposed method (NCCAS). In addition, we use 654,148 computer science articles published from 2000 to 2009 in the MAG dataset to validate the distinguishability and robustness of NCCAS. Findings: Compared with state-of-the-art methods, NCCAS gives the most accurate prediction of Nobel laureates. Furthermore, the leading author identified by NCCAS is more distinguishable from the other co-authors. The results of NCCAS are also more robust to malicious manipulation. Finally, we perform ablation studies to show the contribution of the different components of our method. Research limitations: Because ground truth on the true leading author of a work is limited, the accuracy of NCCAS and related methods can only be tested on Nobel Physics Prize-winning papers. Practical implications: NCCAS is successfully applied to a large number of publications, demonstrating its potential for analyzing the relationship between the contribution and the recognition of authors in different byline positions. Originality/value: Compared with existing methods, NCCAS not only identifies the leading author of a paper more accurately, but also makes the identification more distinguishable and more robust, providing a new tool for related studies.
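Abstract: Drawing on the experience of four international ocean observation programs in the standardized storage and shared management of research data, namely the World Ocean Circulation Experiment (WOCE), the Joint Global Ocean Flux Study (JGOFS), the Global Ocean Ship-based Hydrographic Investigations Program (GO-SHIP), and An International Study of the Marine Biogeochemical Cycles of Trace Elements and Isotopes (GEOTRACES), this paper analyzes how multidisciplinary integrated programs can store the massive volumes of observational data they collect over the long term, effectively, and in a form that can be passed on. It also discusses the importance of establishing data centers and data repositories for standardized data management and for advancing data sharing. The paper distills a data management scheme: at project launch, establish a project/data management office and implement standardized management of research data. For the project's research themes, compile a complete catalog of entity parameters; unify data names, units, classifications, and content; and develop metadata templates. The data management office must be able to store and share data and ensure data security, and project members should reach consensus on management rules covering data quality assurance measures and data sharing and citation.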