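A Structured Exploration of Historical Classics: Construction and Visualization of a Digital Humanities Knowledge Base for the Biographies of the Shiji (《史记·列传》)
Cited by: 4

Published English title: Explore the structuration of historical books: the construction and quantitative analysis of digital humanities database of the Biographies of the Shiji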
Abstract: China's ancient texts are vast in number and hold a great deal of historical and humanistic knowledge. Digitization of ancient books, centered on electronic text and full-text retrieval, has become an important basic resource and tool for disciplines such as language and literature, history, and philosophy. With the development of artificial intelligence and big data technology, the research paradigm of digital humanities continues to evolve. Converting the text of traditional classics into a highly structured digital humanities database is a new line of exploration: organizing the words, persons, geographical entities, and other elements of a text into a coherent whole is of great significance for visualizing historical phenomena and quantifying historical patterns. Taking the Biographies of the Shiji (《史记·列传》) as the object, this study performs automatic word segmentation and part-of-speech tagging of ancient Chinese, followed by manual proofreading and manual annotation of entity information, to build a multi-level, high-quality digital humanities knowledge base. The knowledge base supports quantitative analysis and visual retrieval of the words, persons, and places in the text, and is used to mine the distribution of persons and places, the relationships between persons, and the relationships between persons and places in the Biographies. The results show that 1,787 persons and 1,173 places appear in the Biographies of the Shiji; compared with the Benji (Basic Annals) and Shijia (Hereditary Houses) sections of the Shiji, 1,092 persons and 556 places are unique to the Biographies. The study offers new ideas and a framework for building digital humanities knowledge bases of ancient books.
Authors: ZHENG Tongzheheng (郑童哲恒); LI Bin (李斌); FENG Minxuan (冯敏萱); CHANG Bolin (常博林); WANG Dongbo (王东波) (School of Chinese Language and Literature, Nanjing Normal University, Nanjing 210097, China; College of Information Management, Nanjing Agricultural University, Nanjing 210095, China)
Source: Big Data Research (《大数据》), 2022, No. 6, pp. 40-55 (16 pages)
Funding: Jiangsu Provincial Social Science Foundation Project (No. 20JYB004); National Social Science Fund of China (No. 18BYY127, No. 21&ZD331).
Keywords: digital humanities; the Biographies of the Shiji (《史记·列传》); knowledge service; big data; ancient Chinese information processing
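The abstract reports how many persons and places are unique to the Biographies when compared with the Benji and Shijia sections. One plausible way to derive such counts from entity annotations is a per-section set difference. The sketch below is only an illustration under stated assumptions: the record layout `(section, entity_type, entity_name)`, the function names, and the toy rows are all hypothetical and are not taken from the authors' knowledge base or code.

```python
from collections import defaultdict

# Hypothetical toy annotations: (section, entity_type, entity_name).
# These rows are illustrative only and are not data from the knowledge base.
ANNOTATIONS = [
    ("liezhuan", "person", "Han Xin"),
    ("liezhuan", "person", "Liu Bang"),
    ("liezhuan", "person", "Kuai Tong"),
    ("liezhuan", "place", "Huaiyin"),
    ("liezhuan", "place", "Xianyang"),
    ("benji", "person", "Liu Bang"),
    ("benji", "place", "Xianyang"),
    ("shijia", "person", "Zhang Liang"),
    ("shijia", "place", "Xiapi"),
]


def entities_by_section(annotations, entity_type):
    """Group the distinct entity names of one type by section."""
    grouped = defaultdict(set)
    for section, etype, name in annotations:
        if etype == entity_type:
            grouped[section].add(name)
    return grouped


def unique_to(target, grouped):
    """Return entities that appear in `target` but in no other section."""
    others = set()
    for section, names in grouped.items():
        if section != target:
            others |= names
    return grouped.get(target, set()) - others


persons = entities_by_section(ANNOTATIONS, "person")
places = entities_by_section(ANNOTATIONS, "place")
unique_persons = unique_to("liezhuan", persons)
unique_places = unique_to("liezhuan", places)

print(len(persons["liezhuan"]), sorted(unique_persons))
print(len(places["liezhuan"]), sorted(unique_places))
```

Sets are used here because each person or place should be counted once per section regardless of how often it is mentioned. A real pipeline would also need to resolve aliases and variant names to a single entity before counting, which this sketch does not attempt.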