摘要
报纸文献是一种未被充分开发的重要信息源。我国报纸文献数据库建设已经实现从题录库向全文库的发展,为报纸文献内容加工和挖掘提供了保障。但目前报纸文献缺乏统一完善的加工规范和标准,内容加工的方式也以简单的分类索引和人工剪报为主,加工自动化水平和加工深度不够,应向深层次、自动化、产品化方向发展。
Newspaper is a kind of important information sources which is not fully exploited. The construction of Chinese newspaper literature database, which has shifted from bibliographic da- tabase to full-text database, is the base for its deep content processing and mining, There are many problems in the process of newspaper literature treatment, such as lacking of uniform processing criteria and standards, simply processing methods, low-level automation and shallow processing depth. Thus, the content processing of newspaper literature should be directed to deeper, automation and product.
出处
《中国索引》
2011年第4期48-53,共6页
Journal of the China Society of Indexers
基金
教育部人文社会科学研究青年基金项目“电子报纸内容深加工研究”(09YJC870014)
江苏省社会科学基金青年项目“数字报纸的自动标引研究”(09TQC011)的研究成果之一
关键词
报纸文献
内容加工
文献数据库
报纸著录
报纸标引
Newspaper Literature, Content Processing, Literature Database, Newspaper Article Cataloguing, Newspaper Article Indexing