期刊文献+

基于类中心向量的论文作者归属机构自动识别方法研究 被引量:5

Auto-Identification of Authors Affiliation Based on Class-Center Vectors
下载PDF
导出
摘要 对大规模科技文献进行整理分析时,常常需要自动识别论文作者所归属的组织机构,此时需要将论文中的作者地址信息与对应的机构名称进行自动匹配。同一个机构的作者地址信息在不同的英文论文中可能出现多种不同的写法,这给匹配造成了困难。针对这一问题,设计出一种机器学习方法,此方法充分利用英文论文中作者地址的书写特点,在基于类中心向量的基础上将作者地址信息与机构名称进行自动匹配。与传统方法比较,该方法不需要手工编写烦琐的匹配规则,被应用于中国科学院作者地址信息数据集,实验结果证明了此方法的可行性。 When analyzing a large amount of scientific and technical literature, identification of the author's affiliation is always necessary. A key step in this task is matching the author 's address to the corresponding institution. Authors from one institution often state their affiliations in various forms in English. This causes string-matching methods to yield unsatisfactory results. In this paper, a machine learning method known as“class-center vectors”has been proposed to solve this problem according to the characteristics of the author's address. Compared with traditional methods, our method does not require matching rules to be written manually. The experimental results of Chinese Academy of Sciences (CAS) author's address data sets illustrate the feasibility of our method.
作者 何涛 王桂芳 马廷灿 He Tao;Wang Guifang;Ma Tingcan(Wuhan Documentation and Information Center, Chinese Academy of Sciences, Wuhan 430071)
出处 《情报学报》 CSSCI CSCD 北大核心 2019年第7期716-721,共6页 Journal of the China Society for Scientific and Technical Information
基金 中国科学院青年创新促进会项目(2016160)
关键词 作者地址 机构名称 类中心向量 机器学习 author’s address institution name class-center vectors machine learning
  • 相关文献

参考文献2

二级参考文献30

  • 1About SWORD (Simple Web-service Offering Repository Deposit) [EB/OL]. [2014-08-12]. http://swordapp.org/about/.
  • 2ANSI/NISO Z39.96-2012 JATS: Journal Article Tag Suite [EB/OL]. [2014-08-12]. http://www.niso.org/apps/group~r~ublic/ project/details.php?proj ect_id=93.
  • 3The RIOXX Metadata Profile and Guidelines: Application Profile Version 2.0 beta I[EB/OL]. [2014-08-12]. http://docs. rioxx.net/v2-0-beta- 1/.
  • 4Journal Article Versions (JAV): Recommendations of theNISO/ALPSP JAV [EB/OL]. [2014-08-12]. http://docs.rioxx. net/v2-0-beta-I/.
  • 5The DOl(Digital Object Identifier) System[EB/OL]. [2014- 08-12]. http://www.doi.org/.
  • 6ORCID[EB/OL]. [2014-08-12]. http://orcid.org/.
  • 7ResearcherlD [EB/OL]. [2014-08-12]. http://www.researcherid. com/.
  • 8FundRef [EB/OL]. [2014-08-12]. http://www.crossref.org/ fundrefL.
  • 9V4OA[EB/OL]. [2014-08-12]. http://v4oa.net/about/.
  • 10A Proposed NISO Work Item: Specification for Open Access Metadata and Indicators [EB/OL]. [2014-08-12]. http:// www.niso.org/apps/group_public/download.php/9845/Open% 20Access%20Metadata%20-%20Work%2011em%20for%20ba llot.pdf.

共引文献6

同被引文献58

引证文献5

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部