摘要
[目的/意义]旨在对特定科研单位的SCIE论文数据本地化处理,为进一步开展科研情报分析服务提供基础。[方法/过程]利用HTTP协议客户端编程工具包实现SCIE论文数据的本地化,再结合人机构建的二级机构词表,采用数据库技术和高效的字符串匹配算法,实现作者二级机构识别。[结果/结论]通过实证分析发现,采用该方法,能大幅提高作者二级机构识别准确率和效率。
[Purpose/significance] The paper is to localize SCIE paper date of special scientific research institutions, so as to provide basis for further scientific research information analysis service.[Method/process] The paper uses HTTP protocol client programming toolkit to localize SCIE paper data. Then, combined with the vocabulary of the second institution constructed by man-machine, it uses database technology and efficient string matching algorithm to realize the second institution of author recognition.[Result/conclusion] Empirical analysis shows that this method can greatly improve the recognition accuracy and rate of the second institution of author.
作者
潘春华
郑勇
Pan Chunhua;Zheng Yong(Beijing Forest University Campus Information Center, Beijing 100083;Beijing Forest University Library, Beijing 100083)
出处
《情报探索》
2019年第9期50-53,共4页
Information Research
基金
北京林业大学热点追踪项目“北京林业大学科研情报服务研究”(项目编号:2018BLRD)成果之一
关键词
科学引文索引
数据本地化
作者二级机构
识别
science citation index
data localization
the second level institutions of author
identify