Abstract
This article studies data quality problems in library collection bibliographic databases. Bibliographic records are classified using a rule-based classification method, and similar duplicate records are detected and identified. The article proposes a time-based cleaning method for incomplete bibliographic records and a priority-based merging method for similar duplicate bibliographic records.
Author
喻亚琴
YU Ya-qin (Library, Nantong Vocational & Technical Shipping College, Nantong, Jiangsu 226010, China)
Source
《四川图书馆学报》
2017, No. 4, pp. 62-65 (4 pages)
Journal of The Library Science Society of Sichuan
Keywords
incomplete bibliographic data
similar duplicate bibliographic data
data cleaning