
A Research on Internal Hierarchical Topic Organization Model of the Book Based on hLDA (基于hLDA的图书内部主题层次组织研究)

Cited by: 5
Abstract [Purpose/significance] This paper performs within-document hierarchical topic analysis and organization for multi-topic long documents, represented by books, and offers fine-grained mining results that help users understand a book's topics and quickly grasp the structure and relationships of the topics within it. [Method/process] First, the hierarchical topic model hLDA and context information are used to build a model for organizing the topic hierarchy within a book, and the model is implemented as a prototype system. Second, an experiment is designed to evaluate the model. [Result/conclusion] The experimental results show that the hLDA-based internal hierarchical topic organization of books achieves higher recall and precision.
Source: Library and Information Service (《图书情报工作》), CSSCI, PKU Core Journal, 2016, No. 18, pp. 140-148 (9 pages)
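Funding: One of the research outputs of the National Natural Science Foundation of China project "Research on Automatic Indexing of Hierarchical Book Topics" (Grant No. 71303089) and the Central China Normal University 2016 university-level teaching research project "Research on Constructing a 'Knowledge Topic-Course' System Network for Information Management Programs" (Project No. 201623).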
Keywords: e-book; topic model; hLDA; context information; multi-topic documents

