A Novel Operational Partition between Neural Network Classifiers on Vulnerability to Data Mining Bias

A Novel Operational Partition between Neural Network Classifiers on Vulnerability to Data Mining Bias

下载PDF

导出

摘要 It is difficult if not impossible to appropriately and effectively select from among the vast pool of existing neural network machine learning predictive models for industrial incorporation or academic research exploration and enhancement. When all models outperform all the others under disparate circumstances, none of the models do. Selecting the ideal model becomes a matter of ill-supported opinion ungrounded on the extant real world environment. This paper proposes a novel grouping of the model pool grounded along a non-stationary real world data line into two groups: Permanent Data Learning and Reversible Data Learning. This paper further proposes a novel approach towards qualitatively and quantitatively demonstrating their significant differences based on how they alternatively approach dynamic and raw real world data vs static and prescient data mining biased laboratory data. The results across 2040 separate simulation runs using 15,600 data points in realistically operationally controlled data environments show that the two-group division is effective and significant with clear qualitative, quantitative and theoretical support. Results across the empirical and theoretical spectrum are internally and externally consistent yet demonstrative of why and how this result is non-obvious. It is difficult if not impossible to appropriately and effectively select from among the vast pool of existing neural network machine learning predictive models for industrial incorporation or academic research exploration and enhancement. When all models outperform all the others under disparate circumstances, none of the models do. Selecting the ideal model becomes a matter of ill-supported opinion ungrounded on the extant real world environment. This paper proposes a novel grouping of the model pool grounded along a non-stationary real world data line into two groups: Permanent Data Learning and Reversible Data Learning. This paper further proposes a novel approach towards qualitatively and quantitatively demonstrating their significant differences based on how they alternatively approach dynamic and raw real world data vs static and prescient data mining biased laboratory data. The results across 2040 separate simulation runs using 15,600 data points in realistically operationally controlled data environments show that the two-group division is effective and significant with clear qualitative, quantitative and theoretical support. Results across the empirical and theoretical spectrum are internally and externally consistent yet demonstrative of why and how this result is non-obvious.

作者 Charles Wong

机构地区 Theta Rhythms

出处《Journal of Software Engineering and Applications》 2014年第4期264-272,共9页 软件工程与应用（英文）

关键词 Machine LEARNING Neural Networks DATA Mining DATA DREDGING NON-STATIONARY Time Series Analysis Permanent DATA LEARNING Reversible DATA LEARNING Machine Learning Neural Networks Data Mining Data Dredging Non-Stationary Time Series Analysis Permanent Data Learning Reversible Data Learning

分类号 R73 [医药卫生—肿瘤]

引文网络
相关文献

1Qian LI,Gang LI,Wenjia NIU,Yanan CAO,Liang CHANG,Jianlong TAN,Li GUO.Boosting imbalanced data learning with Wiener process oversampling[J].Frontiers of Computer Science,2017,11(5):836-851. 被引量：1
2Junru Lu,Chunkai Zhang,Fengxing Shi.A Classification Method of Imbalanced Data Base on PSO Algorithm[J].国际计算机前沿大会会议论文集,2016(2):37-39.
3Ossama E. GOUDA,Ghada M. AMER,Waleed A. SALEM.Computational Aspects of Electromagnetic Fields near H.V. Transmission Lines[J].Energy and Power Engineering,2009,1(2):65-71.
4Xingmin Guan,Xiang Zong.Severe Convective Weather Nowcasting System in Heilongjiang Meteorological Bureau and Its Preliminary Performance Evaluation[J].Atmospheric and Climate Sciences,2019,9(3):323-330.
5Omar Asif,Md. Belayat Hossain,Mamun Hasan,Mir Toufikur Rahman,Muhammad E. H. Chowdhury.Fire-Detectors Review and Design of an Automated, Quick Responsive Fire-Alarm System Based on SMS[J].International Journal of Communications, Network and System Sciences,2014,7(9):386-395. 被引量：2
6Salvadora Ortega-Requena,Serge Rebouillat,Fernand Pla.Paving the High-Way to Sustainable, Value Adding Open-Innovation Integrating Bigger-Data Challenges: Three Examples from Bio-Ingredients to Robust Durable Applications of Electrochemical Impacts[J].Journal of Biomaterials and Nanobiotechnology,2018,9(2):117-188. 被引量：1
7Takayoshi Matsui,Toshiro Fujimoto.Treatment for Depression with Chronic Neck Pain Completely Cured in 94.2% of Patients Following Neck Muscle Treatment[J].Neuroscience & Medicine,2011,2(2):71-77. 被引量：1

Journal of Software Engineering and Applications

2014年第4期

浏览历史

内容加载中请稍等...

A Novel Operational Partition between Neural Network Classifiers on Vulnerability to Data Mining Bias

相关作者

相关机构

相关主题

浏览历史