Abstract
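This article introduces a range of semantic enrichment methods and their applications, organized by the types of data held by libraries, archives, and museums (LAMs), in order to interpret new trends in how LAMs support digital humanities. It divides the data and information processing that LAMs have carried out on large volumes of precious cultural heritage resources since the establishment of the World Wide Web into three stages: digitization, datafication, and contextualization. Through a series of cases, it describes how semantic technologies are applied directly to LAM data and the lessons these cases offer, and on that basis discusses feasible approaches and modes for the semantic enrichment of LAM data (including structured, semi-structured, and unstructured data). Semantic enrichment is a strategy for enhancing the value of data by applying semantic technologies; in recent years it has turned into large-scale practical action as LAMs digitally process original materials. By adopting semantic enrichment strategies and methods, LAMs can improve the quality, discoverability, and reusability of their data, thereby promoting broader and deeper use of LAM data in digital humanities research.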
This article explores a variety of semantic enrichment methods that have been applied to the data provided by libraries, archives, and museums (LAMs), and interprets the new trends that enable more thorough use of LAM data in support of digital humanities. Since the establishment of the World Wide Web, LAMs have made significant strides in processing the data and information of their vast holdings of precious cultural heritage resources. These efforts fall into three stages: digitization, datafication, and contextualization. Through a series of cases, the article explores how semantic technologies are applied directly to LAM data, including structured, semi-structured, and unstructured data, regardless of the type of original artifact that carries the data. The article also discusses feasible strategies and modes for achieving semantic enrichment of LAM data. Semantic enrichment is the process of enhancing the value of data by applying semantic technologies. In recent years, as LAMs have focused on the digital processing of original materials, it has become a large-scale undertaking. Semantic enrichment can improve the quality, discoverability, usability, and reusability of LAM data, thereby promoting deeper and wider use of LAM data in digital humanities research.
Authors
曾蕾
谭旭
Zeng Marcia Lei; Tan Xu
Source
《数字人文研究》
2021, No. 1, pp. 65-86 (22 pages)
Digital Humanities Research
Keywords
Semantic Enrichment (语义增强)
Digital Humanities (数字人文)
Libraries, Archives, and Museums (LAMs; 图档博)
Smart Data (智慧数据)