期刊文献+

基于分布特征的异常成绩检测方法 被引量:1

An Approach of Grade-Outlier Detection Based on its Distribution Characteristics
下载PDF
导出
摘要 数据质量对于学生成绩具有十分重要的意义.本文将教育学原理与数据清洗技术相结合,提出了一种基于分布特征的异常成绩检测方法.在理论上论证了方法的合理性,并通过实验验证了方法的有效性.本文的工作不仅对于提高成绩管理系统的运行质量有直接的作用,而且为将数据质量研究应用于教育信息化领域提供了很好的开端. Information Quality to Student grade has fundamental significance. Having combined the theory of pedagogy with data cleansing technique, we propose a detection approach of grade outliers based on distribution characteristics in this paper. The rationality of the approach has been argued and its availability has been verified through our experiment. The results of this paper not only have direct effect on improving the quality of grade management system, but also have provided a fine start to apply the study of Information quality to educational informationization field.
作者 阳小华 李萌
出处 《南华大学学报(自然科学版)》 2008年第4期7-9,21,共4页 Journal of University of South China:Science and Technology
基金 湖南省教育十五规划2003资助项目(XJK03CG021)
关键词 数据质量 数据清洗 异常成绩检测 Information Quality Data Cleansing Detection of Grade Outliers
  • 相关文献

参考文献4

  • 1韩京宇,徐立臻,董逸生.数据质量研究综述[J].计算机科学,2008,35(2):1-5. 被引量:102
  • 2Kahn B, Strong D, Wang R Y. Information Quality Benchmarks: Product and Service Performance [ J ]. Communications of the ACM, 2002,45 (4) : 184 - 192.
  • 3Richard Y Wang, Diane M Strong. Beyond accuracy: What data quality means to data consumers [ J ]. Journal of Management Information System, 1996,12 ( 4 ) : 5 - 34.
  • 4Hipp J, Guntzer U, Grimmer U. Data quality mining: making a virtue of necessity [ C ]//Workshop on Re- search Issues in Data Mining and Knowledge Discovery, Santa Barbara :2001,52 - 57.

二级参考文献78

  • 1韩京宇,徐立臻,董逸生.一种大数据量的相似记录检测方法[J].计算机研究与发展,2005,42(12):2206-2212. 被引量:32
  • 2Monge A, Elkan C. An efficient domain-independent algorithm for detecting approximately duplicate database records [C]. In: Proceedings of the ACM-SIGMOD Workshop on Research Issues on Knowledge Discovery and Data Mining,Tucson, AZ, 1997.
  • 3Motro A, Rakov I. Estimating the quality of data in relational databases [C]. In.. Proeeedings of the 1996 Conferenee on Informtion Quality, Cambridge, Massaehusetts, Oetober 1996.
  • 4Motro A, Anokhin P, Acar A C. Utility-based resolution of data inconsistencies [C]. IQIS 2004. 35-43.
  • 5Parssian A, Sarkar S, Jacob V S. Assessing data quality for information products [C]. 1999.
  • 6Parssian A, Sarkar S, Jacob V S. Assessing information quality for the composite relational operation ioins [C]. In:Proc. of Seventh International Conference on Information Quality, 2002.
  • 7Kahn B K, Strong D M. Product and Service Performance Model for Information Quality: An Update. IQ 1998. 102-115.
  • 8Barnett V , Lewis T. Outliers in statistical data. New York: John Wiley and Sons Inc , 1994.
  • 9Liu B, Hsu W, Ma Y. Integrating classification and association rule mining [C]. In.. Proc. of 4^th International Conference on Knowledge Discovery and Data Mining (KDD98), ACM press, 1998. 80-86.
  • 10Pluempitiwiriyawej C. A new hierarchical clustering model for speeding up the reconciliation of XML based, semistructured data in mediation systems [D]:[Doctoral Thesis]. 2001.

共引文献101

同被引文献12

  • 1Pearson R K.The problem of disguised missing data[J].ACM SIGKDD Explorations,2006,8(1):83-92.
  • 2Hua M,Pei J.Cleaning disguised missing data:A heuristic approach[C]//KDD.2007:950-958.
  • 3Des Jardins D.Outliers,inliers,and just plain liars-new graphical EDA+(EDA Plus)techniques for understanding data[C]//Proc.SAS User's Group International Conference(SUGI26).Long Beach,CA,2001.
  • 4刘新平,刘存侠.教务统计与测评导论[M].北京:科学出版社,2003.
  • 5Rahm E,Do Hong Hai.Data cleaning:Problems and current approaches[J].IEEE Data Engineering Bulletin,2000,23(4):3-13.
  • 6Dasu T,Johnson T.Exploratory Data Mining and Data Cleaning[M].John Wiley,2003.
  • 7Monge A,Elkan C.An efficient domain independent algorithm for detecting approximately duplicate database records[C]//Proceedings of the ACM2SIGMOD Workshop on Research Issues on Knowledge Discovery and Data Mining.Tucson,AZ,1997.
  • 8Parssian A,Sarkar S,Jacob V S.Assessing data quality for information products[C]//International Conference on Information systems.1999.
  • 9Parssian A,Sarkar S,Jacob V S.Assessing information quality for the composite relational operation joins[C]//Proc.of Seventh International Conference on Information Quality.2002.
  • 10Kahn B K,Strong D M.Product and service performance model for information quality:An update[C]//Proc.of the Conference on Information Quality.1998:102-115.

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部