Abstract
With the rise of the Semantic Web and linked data, describing online thesauri in SKOS has become mainstream practice, which opens new opportunities for publishing, sharing, and applying them. In this study, we first crawled the library and information science (LIS) online thesaurus of the EBSCO LISTA database to build a data set of 11,243 concept terms, comprising 4,255 formal descriptors and 6,988 entry terms (non-descriptors). We then used SKOS to give a standardized description of the semantic relations in the thesaurus; used qSKOS to verify the integrity of the resulting vocabulary, supporting the correctness and validity of the SKOS thesaurus; used Apache Jena Fuseki to publish the SKOS thesaurus as linked data; and built a Jena text index to support Lucene full-text search. Finally, we used Graphviz to draw and visualize the thesaurus network, and used Skosmos to build an LIS thesaurus retrieval system that supports browsing, querying, and retrieving LIS concepts in both Chinese and English. The experiments show that SKOS can describe and reveal the semantic relations between online thesaurus terms well, and that the LISTA/SKOS thesaurus is of significant value for domain concept queries, academic knowledge retrieval, and domain ontology construction.
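The SKOS description step can be illustrated with a minimal sketch. It assumes that each formal descriptor maps to a `skos:Concept` with a `skos:prefLabel`, that entry terms map to `skos:altLabel`, and that hierarchical and associative links map to `skos:broader`, `skos:narrower`, and `skos:related`. The base URI, the `Concept` class, the `to_turtle` helper, and the sample terms are all hypothetical and are not taken from the paper or from LISTA.

```python
from dataclasses import dataclass, field

# Hypothetical base URI for concept identifiers (an assumption, not from the paper).
BASE = "http://example.org/lista/"
SKOS_PREFIX = "@prefix skos: <http://www.w3.org/2004/02/skos/core#> ."


@dataclass
class Concept:
    """One formal descriptor with its entry terms and semantic links."""
    concept_id: str
    pref_label: str
    entry_terms: list = field(default_factory=list)  # non-descriptors
    broader: list = field(default_factory=list)      # ids of broader concepts
    narrower: list = field(default_factory=list)     # ids of narrower concepts
    related: list = field(default_factory=list)      # ids of related concepts


def escape(text: str) -> str:
    """Escape backslashes and double quotes for a Turtle string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def to_turtle(concept: Concept, lang: str = "en") -> str:
    """Serialize one concept as a Turtle statement block using SKOS terms."""
    statements = [
        "a skos:Concept",
        f'skos:prefLabel "{escape(concept.pref_label)}"@{lang}',
    ]
    statements += [f'skos:altLabel "{escape(t)}"@{lang}' for t in concept.entry_terms]
    statements += [f"skos:broader <{BASE}{c}>" for c in concept.broader]
    statements += [f"skos:narrower <{BASE}{c}>" for c in concept.narrower]
    statements += [f"skos:related <{BASE}{c}>" for c in concept.related]
    body = " ;\n    ".join(statements)
    return f"<{BASE}{concept.concept_id}> {body} ."


# Illustrative sample data only; not an actual LISTA record.
sample = Concept(
    concept_id="c1",
    pref_label="Information retrieval",
    entry_terms=["Data retrieval"],
    broader=["c0"],
)

if __name__ == "__main__":
    print(SKOS_PREFIX)
    print(to_turtle(sample))
```

A file assembled this way could then be checked with a SKOS quality tool and loaded into a triple store; those steps depend on the tools' own interfaces and are not sketched here.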
Authors
石泽顺
肖明
Shi Zeshun; Xiao Ming (School of Government, Beijing Normal University, Beijing 100875)
Source
《情报学报》
CSSCI
CSCD
Peking University Core Journals
2018, Issue 3, pp. 274-284 (11 pages)
Journal of the China Society for Scientific and Technical Information
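Funding
National Social Science Fund of China project "Research on the Theory, Methods, and Applications of Citation Analysis Based on Semantic Recognition" (16BTQ073)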
Keywords
library and information science
SKOS
linked data
visualization