Abstract: Outlier mining is an important part of data mining, and outlier mining based on Cook's distance is the most commonly used approach. When the data exhibit multicollinearity, however, the traditional Cook's distance method is no longer effective. Given the good properties of principal component estimation, we use it in place of least squares estimation and derive a Cook's distance measure based on principal component estimation that can be used for outlier mining. We also study related theoretical and application problems.
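
The idea can be illustrated with a minimal sketch; it is not the paper's exact measure, which the abstract does not spell out. The sketch assumes that the response is regressed on the leading principal component scores of the standardized predictors, that the principal directions are held fixed when an observation is deleted, and that the usual Cook's distance formula is then applied to that reduced fit. The function names `cook_distance` and `pc_cook_distance`, the synthetic data, and the choice of two retained components are all hypothetical.

```python
import numpy as np

def cook_distance(design, y):
    """Cook's distance for a least squares fit of y on `design`
    (the intercept column is expected to be part of `design`)."""
    n, p = design.shape
    # Orthonormal basis of the column space; row sums of squares give leverages.
    q, _ = np.linalg.qr(design)
    leverage = np.sum(q**2, axis=1)
    residuals = y - q @ (q.T @ y)
    s2 = residuals @ residuals / (n - p)
    return residuals**2 * leverage / (p * s2 * (1.0 - leverage) ** 2)

def pc_cook_distance(X, y, n_components):
    """Assumed analogue: Cook's distance of the regression on the leading
    principal component scores (directions treated as fixed)."""
    Xs = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    # Rows of vt are the principal directions of the standardized predictors.
    _, _, vt = np.linalg.svd(Xs, full_matrices=False)
    scores = Xs @ vt[:n_components].T
    design = np.column_stack([np.ones(len(y)), scores])
    return cook_distance(design, y)

# Synthetic illustration: x1 and x2 are nearly collinear, and one response
# value (index 20) is shifted to act as a planted outlier.
rng = np.random.default_rng(0)
t = np.linspace(-1.0, 1.0, 40)
X = np.column_stack([t, t + 0.01 * np.sin(7 * t), np.cos(3 * t)])
y = 1.0 + 2.0 * X[:, 0] + 2.0 * X[:, 1] + X[:, 2] + 0.1 * rng.standard_normal(40)
y[20] += 50.0

d_pc = pc_cook_distance(X, y, n_components=2)
print("most influential observation:", int(np.argmax(d_pc)))
```

Dropping the component with the near-zero eigenvalue keeps the fit away from the ill-conditioned direction that makes the least squares version unreliable under multicollinearity. When all components are retained, this sketch reduces to the ordinary Cook's distance of the least squares fit.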