A predictive DEA model for outlier detection 被引量：3

导出

摘要 Outlier detection is one of the key issues in any data-driven analytics.In this paper,we propose Bi-super DEA,a super DEA-based method that constructs both efficient and inefficient frontiers for outlier detection.In evaluating its predictive performance,we develop a novel predictive DEA procedure,PDEA,which extends the conventional DEA approaches that have been primarily used for in-sample efficiency estimation,to predict outputs for the out-of-sample.This enables us to compare the predictive performance of our approach against several popular outlier detection methods including the parametric robust regression in statistics and non-parametric k-means in data mining.We conduct comprehensive simulation experiments to examine the relative performance of these outlier detection methods under the influence of five factors:sample size,linearity of production function,normality of noise distribution,homogeneity of data,and levels of random noise contaminating the data generating process(DGP).We find that,somewhat surprisingly,Bi-super CCR consistently outperforms Bi-super BCC in detecting outliers.Under the linearity,normality and homogeneity conditions,the parametric robust regression method works best.However,when the DGP violates these conditions,Bi-super DEA emerges as the better choice due to its distribution-free property.Our results shed light on the conditions that each method excels or fails and provide users with practical guidelines on how to choose appropriate methods to detect outliers.

作者 Mingwen Yang Guohua Wan Eric Zheng

机构地区 Antai College of Economics and Management Naveen Jindal School of Management

出处《Journal of Management Analytics》 EI 2014年第1期20-41,共22页 管理分析学报（英文）

基金 This research is supported in part by NSF of China[71125003]and NCET-10-0578.

关键词 predictive DEA Bi-super DEA outlier detection SIMULATION

分类号 TP3 [自动化与计算机技术—计算机科学与技术]

引文网络
相关文献

同被引文献3

1Yuan Xu,Yong Shin Park,Ju Dong Park,Wonjoo Cho.Evaluating the environmental efficiency of the U.S.airline industry using a directional distance function DEA approach[J].Journal of Management Analytics,2021,8(1):1-18. 被引量：5
2Shenghai Zhou,Yang Zhan.A new method for performance evaluation of decision-making units with application to service industry[J].Journal of Management Analytics,2021,8(1):84-100. 被引量：2
3Wei Hu,Ye Hou,Longwei Tian,Yuan Li.Selection of logistics distribution center location for SDN enterprises[J].Journal of Management Analytics,2015,2(3):202-215. 被引量：4

引证文献3

1Shashi K.Shahi,Mohamed Dia.Comparison of Ontario’s roundwood and recycled fibre pulp and paper mills’performance using data Envelopment analysis[J].Journal of Management Analytics,2021,8(2):222-251.
2Yuan Xu,Yong Shin Park,Ju Dong Park,Wonjoo Cho.Evaluating the environmental efficiency of the U.S.airline industry using a directional distance function DEA approach[J].Journal of Management Analytics,2021,8(1):1-18. 被引量：5
3Pravin Kumar,Rajesh Kumar Singh,Prerna Sinha.Optimal site selection for a hospital using a fuzzy extended ELECTRE approach[J].Journal of Management Analytics,2016,3(2):115-135. 被引量：2

二级引证文献7

1胡鹏.董事会特征对企业可持续性绩效的影响[J].当代经理人,2023(3):61-71.
2Shashi K.Shahi,Mohamed Dia.Comparison of Ontario’s roundwood and recycled fibre pulp and paper mills’performance using data Envelopment analysis[J].Journal of Management Analytics,2021,8(2):222-251.
3Qingchuan Cui,Wei Jiang.Panel data study on the appropriate proportion of personal expenses in total health expenditure in China[J].Journal of Management Analytics,2018,5(1):18-31.
4Neha Bansal,Arun Sharma,R.K.Singh.Fuzzy AHP approach for legal judgement summarization[J].Journal of Management Analytics,2019,6(3):323-340. 被引量：1
5刘伟晗,李伟,卢灿.基于三阶段SBM-DEA的碳排放效率分解研究[J].电力科学与工程,2023,39(7):24-33. 被引量：1
6郑琰,巴文婷,肖玉杰.考虑碳排放的长三角港口群动态效率测度[J].交通运输系统工程与信息,2023,23(4):34-46. 被引量：1
7杨扬,郭挂梅.基于超效率SBM模型的航空企业碳排放效率研究[J].环境工程技术学报,2023,13(5):1779-1786. 被引量：2

1Shelby R.Buckman,Reuven Glick,Kevin J.Lansing,Nicolas Petrosky-Nadeau,Lily M.Seitelman.Replicating and projecting the path of COVID-19 with a model-implied reproduction number[J].Infectious Disease Modelling,2020,5(1):635-651. 被引量：1
2Xin Fang,Fengjiao Yuan.The coordination and preference of supply chain contracts based on time-sensitivity promotional mechanism[J].Journal of Management Science and Engineering,2018,3(3):158-178.
3Huiming Zhang,Song Xi Chen.Concentration Inequalities for Statistical Inference[J].Communications in Mathematical Research,2021,37(1):1-85.
4张攀峰,甘子莹,吴正中,陈元维,罗祥林.载Ce6的双敏感胶束嵌合到载DOX·HCl的温敏凝胶中用于肿瘤的光动与化疗联合治疗[J].高分子材料科学与工程,2021,37(3):168-175. 被引量：2
5Moustafa Esa,Mostafa S.Amin,Ahmed Hassan.Relative performance of novel blast wave mitigation system to conventional system based on mitigation percent criteria[J].Defence Technology（防务技术）,2021,17(3):912-922. 被引量：1
6秦婉亭,老松杨,汤俊,卢聪.基于变分自编码器的飓风轨迹异常检测方法[J].系统仿真学报,2021,33(9):2191-2201. 被引量：9
7Puneet Pasricha,Dharmaraja Selvamuthu,Guglielmo D’Amico,Raimondo Manca.Portfolio optimization of credit risky bonds: a semi-Markov process approach[J].Financial Innovation,2020,6(1):456-469. 被引量：1
8Yun Zhou,Zheng Yan,Naihu Li,Lingfeng Yu,Lingfeng Zhou,Lixia Chen.Cloud-Data Envelopment Analysis Method Used for Assessment of Restoration Building Block Schemes[J].CSEE Journal of Power and Energy Systems,2015,1(2):43-52. 被引量：1
9Haibin Xie,Shouyang Wang.Timing the market: the economic value of price extremes[J].Financial Innovation,2018,4(1):443-466. 被引量：2
10Wujie Shi,Heng Lv.A Noteof CP_(2) Groups[J].Communications in Mathematics and Statistics,2017,5(4):447-451.

Journal of Management Analytics

2014年第1期

浏览历史

内容加载中请稍等...

A predictive DEA model for outlier detection 被引量：3

同被引文献3

引证文献3

二级引证文献7

相关作者

相关机构

相关主题

浏览历史