1[1]Erhard R., Do H.H. Data Cleaning:Problem and Current Approaches[J]. IEEE Techn. Bulletin Data Engineering,2000,23(4).
2[2]Hern′andez M.A.,Stolfo S.J. The merge/purge problem for large databases[A]. Proceedings of the ACM SIGMOD,International Conference on Management of Data[C]. ACM Press,May 1995. 127-138.
3[3]Monge A.E. An adaptive and efficient algorithm for detecting approximately duplicate database records[J]. Submitted for journal publication, June 2000.
4[4]Monge A. E.,Elkan C.P. The field matching problem: Algorithms and applications[A]. Proc. 2nd Intl. Conf. Knowledge Discovery and Data Mining[C]. Portland, Oregon,1996.
5[5]Lee M.L.,Lu H., Ling T.W. et al. Cleansing Data for Mining and Warehousing[A]. 10th International Conference and Workshop on Database and Expert Systems Applications (DEXA99)[C]. Florence, Italy, August 30 - September 3,1999.
3Madnick S E,Wang R r.A framework for corporate householding[C]∥Fisher C,Davidson B N,eds.Proceedings of the 7th International Conference on Information Quality,MIT,2002:36-46.
4Apers P,Atzeni P,Ceri S,et al.Proceedings of the 27th International Conference on Very Large Data Bases[C]∥Proceedings of Very Large Databases,Rome,2001:381~390.
6Monge A E.Matching algorithms within a duplicate detection system[J].IEEE Data Engineer Bulletin,2000,23(4):14-20.
7Bunke H,Jiang X,Abegglen K,et al.On the weighted mean of a pair of strings[J].Pattern Analysis & Applications,2002,5(5):23-30.
8Batista G,Monard M C.An analysis of four missing data treatment methods for supervised learning[J].Applied Artificial Intelligence,2003,17(5-6):519-533.
9Diego M,Monica S,Tiziana C.Using ontologies for XML data cleaning[C]∥OTM Confederated Internationl Workshops and Posters,Rome,2005:562-571.
10Naumann F,Freytag J,Leser U.Completeness of integrated information sources[J].Information Systems,2004,29(7):583-615.