摘要
高校信息化发展所面临的巨大挑战之一是数据质量问题,需要采用有效的数据治理手段全面提升数据质量。人员信息作为高校的核心主数据,是数据治理的重要内容。提出了一种基于关键属性匹配的人员信息整合方法,通过数据预处理、人员唯一性识别、可疑数据人工处理等步骤,对人员赋予唯一编号标识有助于消除多源系统中的重复人员信息形成人员的"黄金视图",同时支撑学校其它各类业务应用。
Data quality is one of the great challenges for the informatization development of colleges and universities.It is necessary to improve the quality of data with a kind of effective data governance methods.Personnel information,as the core data,is an important part in data governance.This paper presents a method of integrating personnel information based on key feature matching,including data preprocessing,person identification and manual processing.This method can assign a unique identification number to each person,form a consistent golden view without redundant information,and support other business applications in colleges and universities.
作者
邹恒华
张志飞
刘波
许维胜
ZOU Henghua;ZHANG Zhifei;LIU Bo;XU Weisheng(Educational Technology and Computing Center,Tongji University;Research Center for Marine Science and Technology,Tongji University;Informatization Office of Tongji University,Shanghai 200092;College of Electronics and Information Engineering,Tongji University,Shanghai 201804)
出处
《微型电脑应用》
2019年第2期13-17,共5页
Microcomputer Applications
基金
国家自然科学基金项目(71540022)
中央高校基本科研业务费项目(20153503)
关键词
关键属性匹配
数据治理
主数据
人员信息整合
Key feature matching
Data governance
Master data
Personnel information integration