期刊文献+

Probabilistic outlier detection for sparse multivariate geotechnical site investigation data using Bayesian learning 被引量:2

下载PDF
导出
摘要 Various uncertainties arising during acquisition process of geoscience data may result in anomalous data instances(i.e.,outliers)that do not conform with the expected pattern of regular data instances.With sparse multivariate data obtained from geotechnical site investigation,it is impossible to identify outliers with certainty due to the distortion of statistics of geotechnical parameters caused by outliers and their associated statistical uncertainty resulted from data sparsity.This paper develops a probabilistic outlier detection method for sparse multivariate data obtained from geotechnical site investigation.The proposed approach quantifies the outlying probability of each data instance based on Mahalanobis distance and determines outliers as those data instances with outlying probabilities greater than 0.5.It tackles the distortion issue of statistics estimated from the dataset with outliers by a re-sampling technique and accounts,rationally,for the statistical uncertainty by Bayesian machine learning.Moreover,the proposed approach also suggests an exclusive method to determine outlying components of each outlier.The proposed approach is illustrated and verified using simulated and real-life dataset.It showed that the proposed approach properly identifies outliers among sparse multivariate data and their corresponding outlying components in a probabilistic manner.It can significantly reduce the masking effect(i.e.,missing some actual outliers due to the distortion of statistics by the outliers and statistical uncertainty).It also found that outliers among sparse multivariate data instances affect significantly the construction of multivariate distribution of geotechnical parameters for uncertainty quantification.This emphasizes the necessity of data cleaning process(e.g.,outlier detection)for uncertainty quantification based on geoscience data.
出处 《Geoscience Frontiers》 SCIE CAS CSCD 2021年第1期425-439,共15页 地学前缘(英文版)
基金 supported by the National Key R&D Program of China(Project No.2016YFC0800200) the NRF-NSFC 3rd Joint Research Grant(Earth Science)(Project No.41861144022) the National Natural Science Foundation of China(Project Nos.51679174,and 51779189) the Shenzhen Key Technology R&D Program(Project No.20170324) The financial support is grateful acknowledged。
  • 相关文献

参考文献1

二级参考文献21

  • 1Attoh-Okine, N.O., Cooger, K., Mensah, S., 2009. Multivariate adaptive regression (MARS) and hinged hyperplanes (HHP) for doweled pavement performance modeling. Journal of Construction and Building Materials 23, 3020-3023.
  • 2Das, S.K., Basudhar, EK., 2006. Undrained lateral load capacity of piles in clay using artificial neural network. Computer and Geotechnics 33 (8), 454-459.
  • 3Demuth, H., Beale, M., 2003. Neural Network Toolbox for MATLAB-user Guide Version 4.1. The Math Works Inc.
  • 4Friedman, J.H., 1991. Multivariate adaptive regression splines. The Annals of Sta- tistics 19, 1-141.
  • 5Gandomi, A.H., Roke, D.A., 2013. Intelligent formulation of structural engineering systems. In: Seventh M1T Conference on Computational Fluid and Solid Me- chanics- Focus: Multiphysics and Multiscale, 12-14 Jun., Cambridge, USA.
  • 6Garson, G.D., 1991. Interpreting neural-network connection weights. Al Expert 6 (7), 47-51.
  • 7Goh, A.T.C., Zhang, W.G., 2014. An improvement to MLR model for predicting liquefaction-induced lateral spread using Multivariate Adaptive Regression Splines. Engineering Geology 170, 1 10.
  • 8Hastie, T., Tibshirani, R., Friedman, J., 2009. The Elements of Statistical Learning: Data Mining, Inference and Prediction, second ed. Springer.
  • 9Jekabsons, G., 2010. VariReg: a Software Tool for Regression Modelling Using Various Modeling Methods. Riga Technical University. http://www.cs.rtu.lv/ jekabsons/.
  • 10Jeon, J.K., Rahman, M.S., 2008. Fuzzy Neural Network Models for Geotechnical Problems. Research Project FHWA/NC/2006-52. North Carolina State Univer- sity, Raleigh, N.C.

共引文献33

同被引文献13

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部