Research on Data Stream Mining and Its Availability to Continuous Audit (数据流挖掘及其在持续审计中的可用性研究)

Cited by: 3


Abstract: As enterprises become more informatized and the Internet more widespread, massive volumes of real-time data are produced every day, and data stream mining offers a new way to analyze them. Research on clustering, classification, and outlier detection algorithms for data streams has made progress, which makes applying data stream mining to continuous audit feasible. This paper reviews the state of the art in the field, discusses its applicability to continuous audit, and proposes a continuous audit model based on data stream mining. The model overcomes several drawbacks of traditional continuous audit models: high storage requirements on the auditor side, heavy use of hardware resources, long online analysis times, and delayed detection of abnormal data.

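Authors: Gu Ruijun (谷瑞军), Chen Shenglei (陈圣磊)

Affiliation: School of Information Science, Nanjing Audit University (南京审计学院信息科学学院)

Source: Journal of Nanjing Audit University (《南京审计学院学报》), 2011, No. 1, pp. 36-40 (5 pages)

Funding: National Natural Science Foundation of China (70971067/G0112); National Social Science Foundation of China (10BGL016); Natural Science Research Project of Jiangsu Higher Education Institutions (09KJD520006)

Keywords: data stream mining; continuous audit; auditing model; clustering; classification; outlier detection

Classification number: TP391 [Automation and Computer Technology: Computer Application Technology]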
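
The abstract does not describe the algorithms behind the proposed model. As a minimal sketch of the general idea only, the Python below flags outliers in a stream of transaction amounts in a single pass. It keeps just a running count, mean, and variance (Welford's method) instead of storing the data, which is the kind of low-storage, low-latency behavior the abstract attributes to stream-based auditing. The names `RunningStats` and `flag_outliers`, the z-score rule, and the `threshold` and `warmup` values are illustrative assumptions, not taken from the paper.

```python
import math
from dataclasses import dataclass


@dataclass
class RunningStats:
    """Constant-memory running mean and variance (Welford's method)."""
    n: int = 0
    mean: float = 0.0
    m2: float = 0.0  # sum of squared deviations from the running mean

    def update(self, x: float) -> None:
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (x - self.mean)

    @property
    def std(self) -> float:
        # Sample standard deviation; 0.0 until at least two values are seen.
        return math.sqrt(self.m2 / (self.n - 1)) if self.n > 1 else 0.0


def flag_outliers(stream, threshold=3.0, warmup=5):
    """Yield (index, value) for values far from the running mean.

    Hypothetical rule: a value is flagged when its z-score against the
    statistics seen so far exceeds `threshold`, once `warmup` values
    have been absorbed. Flagged values are not added to the baseline.
    """
    stats = RunningStats()
    for index, amount in enumerate(stream):
        if stats.n >= warmup and stats.std > 0:
            z = abs(amount - stats.mean) / stats.std
            if z > threshold:
                yield index, amount
                continue  # keep the anomaly out of the baseline
        stats.update(amount)


if __name__ == "__main__":
    amounts = [100, 102, 98, 101, 99, 100, 5000, 101]
    print(list(flag_outliers(amounts)))  # [(6, 5000)]
```

Each record is examined once as it arrives and then discarded, so memory use stays constant regardless of stream length. A real audit system would likely need a windowed or decaying baseline to follow changes in the data over time.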

