Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of po...Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of possible future trajectories can be consid-erable(multi-modal).Most prior approaches proposed to address multi-modal motion prediction are based on complex machine learning systems that have limited interpret-ability.Moreover,the metrics used in current benchmarks do not evaluate all aspects of the problem,such as the diversity and admissibility of the output.The authors aim to advance towards the design of trustworthy motion prediction systems,based on some of the re-quirements for the design of Trustworthy Artificial Intelligence.The focus is on evaluation criteria,robustness,and interpretability of outputs.First,the evaluation metrics are comprehensively analysed,the main gaps of current benchmarks are identified,and a new holistic evaluation framework is proposed.Then,a method for the assessment of spatial and temporal robustness is introduced by simulating noise in the perception system.To enhance the interpretability of the outputs and generate more balanced results in the proposed evaluation framework,an intent prediction layer that can be attached to multi-modal motion prediction models is proposed.The effectiveness of this approach is assessed through a survey that explores different elements in the visualisation of the multi-modal trajectories and intentions.The proposed approach and findings make a significant contribution to the development of trustworthy motion prediction systems for autono-mous vehicles,advancing the field towards greater safety and reliability.展开更多
Association rule learning(ARL)is a widely used technique for discovering relationships within datasets.However,it often generates excessive irrelevant or ambiguous rules.Therefore,post-processing is crucial not only f...Association rule learning(ARL)is a widely used technique for discovering relationships within datasets.However,it often generates excessive irrelevant or ambiguous rules.Therefore,post-processing is crucial not only for removing irrelevant or redundant rules but also for uncovering hidden associations that impact other factors.Recently,several post-processing methods have been proposed,each with its own strengths and weaknesses.In this paper,we propose THAPE(Tunable Hybrid Associative Predictive Engine),which combines descriptive and predictive techniques.By leveraging both techniques,our aim is to enhance the quality of analyzing generated rules.This includes removing irrelevant or redundant rules,uncovering interesting and useful rules,exploring hidden association rules that may affect other factors,and providing backtracking ability for a given product.The proposed approach offers a tailored method that suits specific goals for retailers,enabling them to gain a better understanding of customer behavior based on factual transactions in the target market.We applied THAPE to a real dataset as a case study in this paper to demonstrate its effectiveness.Through this application,we successfully mined a concise set of highly interesting and useful association rules.Out of the 11,265 rules generated,we identified 125 rules that are particularly relevant to the business context.These identified rules significantly improve the interpretability and usefulness of association rules for decision-making purposes.展开更多
Objective Clinical medical record data associated with hepatitis B-related acute-on-chronic liver failure(HBV-ACLF)generally have small sample sizes and a class imbalance.However,most machine learning models are desig...Objective Clinical medical record data associated with hepatitis B-related acute-on-chronic liver failure(HBV-ACLF)generally have small sample sizes and a class imbalance.However,most machine learning models are designed based on balanced data and lack interpretability.This study aimed to propose a traditional Chinese medicine(TCM)diagnostic model for HBV-ACLF based on the TCM syndrome differentiation and treatment theory,which is clinically interpretable and highly accurate.Methods We collected medical records from 261 patients diagnosed with HBV-ACLF,including three syndromes:Yang jaundice(214 cases),Yang-Yin jaundice(41 cases),and Yin jaundice(6 cases).To avoid overfitting of the machine learning model,we excluded the cases of Yin jaundice.After data standardization and cleaning,we obtained 255 relevant medical records of Yang jaundice and Yang-Yin jaundice.To address the class imbalance issue,we employed the oversampling method and five machine learning methods,including logistic regression(LR),support vector machine(SVM),decision tree(DT),random forest(RF),and extreme gradient boosting(XGBoost)to construct the syndrome diagnosis models.This study used precision,F1 score,the area under the receiver operating characteristic(ROC)curve(AUC),and accuracy as model evaluation metrics.The model with the best classification performance was selected to extract the diagnostic rule,and its clinical significance was thoroughly analyzed.Furthermore,we proposed a novel multiple-round stable rule extraction(MRSRE)method to obtain a stable rule set of features that can exhibit the model’s clinical interpretability.Results The precision of the five machine learning models built using oversampled balanced data exceeded 0.90.Among these models,the accuracy of RF classification of syndrome types was 0.92,and the mean F1 scores of the two categories of Yang jaundice and Yang-Yin jaundice were 0.93 and 0.94,respectively.Additionally,the AUC was 0.98.The extraction rules of the RF syndrome differentiation model based on the MRSRE method revealed that the common features of Yang jaundice and Yang-Yin jaundice were wiry pulse,yellowing of the urine,skin,and eyes,normal tongue body,healthy sublingual vessel,nausea,oil loathing,and poor appetite.The main features of Yang jaundice were a red tongue body and thickened sublingual vessels,whereas those of Yang-Yin jaundice were a dark tongue body,pale white tongue body,white tongue coating,lack of strength,slippery pulse,light red tongue body,slimy tongue coating,and abdominal distension.This is aligned with the classifications made by TCM experts based on TCM syndrome differentiation and treatment theory.Conclusion Our model can be utilized for differentiating HBV-ACLF syndromes,which has the potential to be applied to generate other clinically interpretable models with high accuracy on clinical data characterized by small sample sizes and a class imbalance.展开更多
Fault diagnosis plays an irreplaceable role in the normal operation of equipment.A fault diagnosis model is often required to be interpretable for increasing the trust between humans and the model.Due to the understan...Fault diagnosis plays an irreplaceable role in the normal operation of equipment.A fault diagnosis model is often required to be interpretable for increasing the trust between humans and the model.Due to the understandable knowledge expression and transparent reasoning process,the belief rule base(BRB)has extensive applications as an interpretable expert system in fault diagnosis.Optimization is an effective means to weaken the subjectivity of experts in BRB,where the interpretability of BRB may be weakened.Hence,to obtain a credible result,the weakening factors of interpretability in the BRB-based fault diagnosis model are firstly analyzed,which are manifested in deviation from the initial judgement of experts and over-optimization of parameters.For these two factors,three indexes are proposed,namely the consistency index of rules,consistency index of the rule base and over-optimization index,tomeasure the interpretability of the optimizedmodel.Considering both the accuracy and interpretability of amodel,an improved coordinate ascent(I-CA)algorithmis proposed to fine-tune the parameters of the fault diagnosis model based on BRB.In I-CA,the algorithm combined with the advance and retreat method and the golden section method is employed to be one-dimensional search algorithm.Furthermore,the random optimization sequence and adaptive step size are proposed to improve the accuracy of the model.Finally,a case study of fault diagnosis in aerospace relays based on BRB is carried out to verify the effectiveness of the proposed method.展开更多
Early fault diagnosis of bearings is crucial for ensuring safe and reliable operations.Convolutional neural networks(CNNs)have achieved significant breakthroughs in machinery fault diagnosis.However,complex and varyin...Early fault diagnosis of bearings is crucial for ensuring safe and reliable operations.Convolutional neural networks(CNNs)have achieved significant breakthroughs in machinery fault diagnosis.However,complex and varying working conditions can lead to inter-class similarity and intra-class variability in datasets,making it more challenging for CNNs to learn discriminative features.Furthermore,CNNs are often considered“black boxes”and lack sufficient interpretability in the fault diagnosis field.To address these issues,this paper introduces a residual mixed domain attention CNN method,referred to as RMA-CNN.This method comprises multiple residual mixed domain attention modules(RMAMs),each employing one attention mechanism to emphasize meaningful features in both time and channel domains.This significantly enhances the network’s ability to learn fault-related features.Moreover,we conduct an in-depth analysis of the inherent feature learning mechanism of the attention module RMAM to improve the interpretability of CNNs in fault diagnosis applications.Experiments conducted on two datasets—a high-speed aeronautical bearing dataset and a motor bearing dataset—demonstrate that the RMA-CNN achieves remarkable results in diagnostic tasks.展开更多
Prediction systems are an important aspect of intelligent decisions.In engineering practice,the complex system structure and the external environment cause many uncertain factors in the model,which influence the model...Prediction systems are an important aspect of intelligent decisions.In engineering practice,the complex system structure and the external environment cause many uncertain factors in the model,which influence the modeling accuracy of the model.The belief rule base(BRB)can implement nonlinear modeling and express a variety of uncertain information,including fuzziness,ignorance,randomness,etc.However,the BRB system also has two main problems:Firstly,modeling methods based on expert knowledge make it difficult to guarantee the model’s accuracy.Secondly,interpretability is not considered in the optimization process of current research,resulting in the destruction of the interpretability of BRB.To balance the accuracy and interpretability of the model,a self-growth belief rule basewith interpretability constraints(SBRB-I)is proposed.The reasoning process of the SBRB-I model is based on the evidence reasoning(ER)approach.Moreover,the self-growth learning strategy ensures effective cooperation between the datadriven model and the expert system.A case study showed that the accuracy and interpretability of the model could be guaranteed.The SBRB-I model has good application prospects in prediction systems.展开更多
With the development of Fintech, applying artificial intelligence (AI) technologies to the financial field is a general trend. However, there are some inappropriate conditions, for instance, the AI model is always tre...With the development of Fintech, applying artificial intelligence (AI) technologies to the financial field is a general trend. However, there are some inappropriate conditions, for instance, the AI model is always treated as a black box and cannot be interpreted. This paper studies the AI model interpretability when the models are applied in the financial field. We analyze the reasons of black box problem and explore the effective solutions. We propose a new kind of automatic Regtech tool—LIMER, and put forward policy suggestions, thereby continuously promoting the development of Fintech to a higher level.展开更多
The accurate and reliable interpretation of regional land cover data is very important for natural resource monitoring and environmental assessment.At present,refined land cover data are mainly obtained by manual visu...The accurate and reliable interpretation of regional land cover data is very important for natural resource monitoring and environmental assessment.At present,refined land cover data are mainly obtained by manual visual interpretation,which has the problems of heavy workload and inconsistent interpretation scales.Deep learning has greatly improved the automatic processing and analysis of remote sensing data.However,the accurate interpretation of feature information from massive datasets remains a difficult problem in wide regional land cover classification.To improve the efficiency of deep learning-based remote sensing image interpretation,we selected multisource remote sensing data,assessed the interpretability of the U-Net model based on surface spatial scenes with different levels of complexity,and proposed a new method of stereoscopic accuracy verification(SAV)to evaluate the reliability of the classification result.The results show that classification accuracy is more highly correlated with terrain and landscape than with other factors related to image data,such as platform and spatial resolution.As the complexity of surface spatial scenes increases,the accuracy of the classification results mainly shows a fluctuating declining trend.We also find the distribution characteristics from the SAV evaluation results of different land cover types in each surface spatial scene.Based on the results observed in this study,we consider the distinction of interpretability and reliability in diverse ground object types and design targeted classification strategies for different surface scenes,which can greatly improve the classification efficiency.The key achievement of this study is to provide the theoretical basis for remote sensing information analysis and an accuracy evaluation method for regional land cover classification,and the proposed method can help improve the likelihood that intelligent interpretation can replace manual acquisition.展开更多
This paper introduces the system of game-theoretic interactions,which connects both the explanation of knowledge encoded in a deep neural networks(DNN)and the explanation of the representation power of a DNN.In this s...This paper introduces the system of game-theoretic interactions,which connects both the explanation of knowledge encoded in a deep neural networks(DNN)and the explanation of the representation power of a DNN.In this system,we define two gametheoretic interaction indexes,namely the multi-order interaction and the multivariate interaction.More crucially,we use these interaction indexes to explain feature representations encoded in a DNN from the following four aspects:(1)Quantifying knowledge concepts encoded by a DNN;(2)Exploring how a DNN encodes visual concepts,and extracting prototypical concepts encoded in the DNN;(3)Learning optimal baseline values for the Shapley value,and providing a unified perspective to compare fourteen different attribution methods;(4)Theoretically explaining the representation bottleneck of DNNs.Furthermore,we prove the relationship between the interaction encoded in a DNN and the representation power of a DNN(e.g.,generalization power,adversarial transferability,and adversarial robustness).In this way,game-theoretic interactions successfully bridge the gap between“the explanation of knowledge concepts encoded in a DNN”and"the explanation of the representation capacity of a DNN"as a unified explanation.展开更多
Tropical cloud clusters(TCCs)can potentially develop into tropical cyclones(TCs),leading to significant casualties and economic losses.Accurate prediction of tropical cyclogenesis(TCG)is crucial for early warnings.Mos...Tropical cloud clusters(TCCs)can potentially develop into tropical cyclones(TCs),leading to significant casualties and economic losses.Accurate prediction of tropical cyclogenesis(TCG)is crucial for early warnings.Most traditional deep learning methods applied to TCG prediction rely on predictors from a single time point,neglect the ocean-atmosphere interactions,and exhibit low model interpretability.This study proposes the Tropical Cyclogenesis Prediction-Net(TCGP-Net)based on the Swin Transformer,which leverages convolutional operations and attention mechanisms to encode spatiotemporal features and capture the temporal evolution of predictors.This model incorporates the coupled ocean-atmosphere interactions,including multiple variables such as sea surface temperature.Additionally,causal inference and integrated gradients are employed to validate the effectiveness of the predictors and provide an interpretability analysis of the model's decision-making process.The model is trained using GridSat satellite data and ERA5 reanalysis datasets.Experimental results demonstrate that TCGP-Net achieves high accuracy and stability,with a detection rate of 97.9%and a false alarm rate of 2.2%for predicting TCG 24 hours in advance,significantly outperforming existing models.This indicates that TCGP-Net is a reliable tool for tropical cyclogenesis prediction.展开更多
Modern medicine is reliant on various medical imaging technologies for non-invasively observing patients’anatomy.However,the interpretation of medical images can be highly subjective and dependent on the expertise of...Modern medicine is reliant on various medical imaging technologies for non-invasively observing patients’anatomy.However,the interpretation of medical images can be highly subjective and dependent on the expertise of clinicians.Moreover,some potentially useful quantitative information in medical images,especially that which is not visible to the naked eye,is often ignored during clinical practice.In contrast,radiomics performs high-throughput feature extraction from medical images,which enables quantitative analysis of medical images and prediction of various clinical endpoints.Studies have reported that radiomics exhibits promising performance in diagnosis and predicting treatment responses and prognosis,demonstrating its potential to be a non-invasive auxiliary tool for personalized medicine.However,radiomics remains in a developmental phase as numerous technical challenges have yet to be solved,especially in feature engineering and statistical modeling.In this review,we introduce the current utility of radiomics by summarizing research on its application in the diagnosis,prognosis,and prediction of treatment responses in patients with cancer.We focus on machine learning approaches,for feature extraction and selection during feature engineering and for imbalanced datasets and multi-modality fusion during statistical modeling.Furthermore,we introduce the stability,reproducibility,and interpretability of features,and the generalizability and interpretability of models.Finally,we offer possible solutions to current challenges in radiomics research.展开更多
In the existing landslide susceptibility prediction(LSP)models,the influences of random errors in landslide conditioning factors on LSP are not considered,instead the original conditioning factors are directly taken a...In the existing landslide susceptibility prediction(LSP)models,the influences of random errors in landslide conditioning factors on LSP are not considered,instead the original conditioning factors are directly taken as the model inputs,which brings uncertainties to LSP results.This study aims to reveal the influence rules of the different proportional random errors in conditioning factors on the LSP un-certainties,and further explore a method which can effectively reduce the random errors in conditioning factors.The original conditioning factors are firstly used to construct original factors-based LSP models,and then different random errors of 5%,10%,15% and 20%are added to these original factors for con-structing relevant errors-based LSP models.Secondly,low-pass filter-based LSP models are constructed by eliminating the random errors using low-pass filter method.Thirdly,the Ruijin County of China with 370 landslides and 16 conditioning factors are used as study case.Three typical machine learning models,i.e.multilayer perceptron(MLP),support vector machine(SVM)and random forest(RF),are selected as LSP models.Finally,the LSP uncertainties are discussed and results show that:(1)The low-pass filter can effectively reduce the random errors in conditioning factors to decrease the LSP uncertainties.(2)With the proportions of random errors increasing from 5%to 20%,the LSP uncertainty increases continuously.(3)The original factors-based models are feasible for LSP in the absence of more accurate conditioning factors.(4)The influence degrees of two uncertainty issues,machine learning models and different proportions of random errors,on the LSP modeling are large and basically the same.(5)The Shapley values effectively explain the internal mechanism of machine learning model predicting landslide sus-ceptibility.In conclusion,greater proportion of random errors in conditioning factors results in higher LSP uncertainty,and low-pass filter can effectively reduce these random errors.展开更多
Predicting the displacement of landslide is of utmost practical importance as the landslide can pose serious threats to both human life and property.However,traditional methods have the limitation of random selection ...Predicting the displacement of landslide is of utmost practical importance as the landslide can pose serious threats to both human life and property.However,traditional methods have the limitation of random selection in sliding window selection and seldom incorporate weather forecast data for displacement prediction,while a single structural model cannot handle input sequences of different lengths at the same time.In order to solve these limitations,in this study,a new approach is proposed that utilizes weather forecast data and incorporates the maximum information coefficient(MIC),long short-term memory network(LSTM),and attention mechanism to establish a teacher-student coupling model with parallel structure for short-term landslide displacement prediction.Through MIC,a suitable input sequence length is selected for the LSTM model.To investigate the influence of rainfall on landslides during different seasons,a parallel teacher-student coupling model is developed that is able to learn sequential information from various time series of different lengths.The teacher model learns sequence information from rainfall intensity time series while incorporating reliable short-term weather forecast data from platforms such as China Meteorological Administration(CMA)and Reliable Prognosis(https://rp5.ru)to improve the model’s expression capability,and the student model learns sequence information from other time series.An attention module is then designed to integrate different sequence information to derive a context vector,representing seasonal temporal attention mode.Finally,the predicted displacement is obtained through a linear layer.The proposed method demonstrates superior prediction accuracies,surpassing those of the support vector machine(SVM),LSTM,recurrent neural network(RNN),temporal convolutional network(TCN),and LSTM-Attention models.It achieves a mean absolute error(MAE)of 0.072 mm,root mean square error(RMSE)of 0.096 mm,and pearson correlation coefficients(PCCS)of 0.85.Additionally,it exhibits enhanced prediction stability and interpretability,rendering it an indispensable tool for landslide disaster prevention and mitigation.展开更多
User representation learning is crucial for capturing different user preferences,but it is also critical challenging because user intentions are latent and dispersed in complex and different patterns of user-generated...User representation learning is crucial for capturing different user preferences,but it is also critical challenging because user intentions are latent and dispersed in complex and different patterns of user-generated data,and thus cannot be measured directly.Text-based data models can learn user representations by mining latent semantics,which is beneficial to enhancing the semantic function of user representations.However,these technologies only extract common features in historical records and cannot represent changes in user intentions.However,sequential feature can express the user’s interests and intentions that change time by time.But the sequential recommendation results based on the user representation of the item lack the interpretability of preference factors.To address these issues,we propose in this paper a novel model with Dual-Layer User Representation,named DLUR,where the user’s intention is learned based on two different layer representations.Specifically,the latent semantic layer adds an interactive layer based on Transformer to extract keywords and key sentences in the text and serve as a basis for interpretation.The sequence layer uses the Transformer model to encode the user’s preference intention to clarify changes in the user’s intention.Therefore,this dual-layer user mode is more comprehensive than a single text mode or sequence mode and can effectually improve the performance of recommendations.Our extensive experiments on five benchmark datasets demonstrate DLUR’s performance over state-of-the-art recommendation models.In addition,DLUR’s ability to explain recommendation results is also demonstrated through some specific cases.展开更多
In enterprise operations,maintaining manual rules for enterprise processes can be expensive,time-consuming,and dependent on specialized domain knowledge in that enterprise domain.Recently,rule-generation has been auto...In enterprise operations,maintaining manual rules for enterprise processes can be expensive,time-consuming,and dependent on specialized domain knowledge in that enterprise domain.Recently,rule-generation has been automated in enterprises,particularly through Machine Learning,to streamline routine tasks.Typically,these machine models are black boxes where the reasons for the decisions are not always transparent,and the end users need to verify the model proposals as a part of the user acceptance testing to trust it.In such scenarios,rules excel over Machine Learning models as the end-users can verify the rules and have more trust.In many scenarios,the truth label changes frequently thus,it becomes difficult for the Machine Learning model to learn till a considerable amount of data has been accumulated,but with rules,the truth can be adapted.This paper presents a novel framework for generating human-understandable rules using the Classification and Regression Tree(CART)decision tree method,which ensures both optimization and user trust in automated decision-making processes.The framework generates comprehensible rules in the form of if condition and then predicts class even in domains where noise is present.The proposed system transforms enterprise operations by automating the production of human-readable rules from structured data,resulting in increased efficiency and transparency.Removing the need for human rule construction saves time and money while guaranteeing that users can readily check and trust the automatic judgments of the system.The remarkable performance metrics of the framework,which achieve 99.85%accuracy and 96.30%precision,further support its efficiency in translating complex data into comprehensible rules,eventually empowering users and enhancing organizational decision-making processes.展开更多
Traffic flow forecasting constitutes a crucial component of intelligent transportation systems(ITSs).Numerous studies have been conducted for traffic flow forecasting during the past decades.However,most existing stud...Traffic flow forecasting constitutes a crucial component of intelligent transportation systems(ITSs).Numerous studies have been conducted for traffic flow forecasting during the past decades.However,most existing studies have concentrated on developing advanced algorithms or models to attain state-of-the-art forecasting accuracy.For real-world ITS applications,the interpretability of the developed models is extremely important but has largely been ignored.This study presents an interpretable traffic flow forecasting framework based on popular tree-ensemble algorithms.The framework comprises multiple key components integrated into a highly flexible and customizable multi-stage pipeline,enabling the seamless incorporation of various algorithms and tools.To evaluate the effectiveness of the framework,the developed tree-ensemble models and another three typical categories of baseline models,including statistical time series,shallow learning,and deep learning,were compared on three datasets collected from different types of roads(i.e.,arterial,expressway,and freeway).Further,the study delves into an in-depth interpretability analysis of the most competitive tree-ensemble models using six categories of interpretable machine learning methods.Experimental results highlight the potential of the proposed framework.The tree-ensemble models developed within this framework achieve competitive accuracy while maintaining high inference efficiency similar to statistical time series and shallow learning models.Meanwhile,these tree-ensemble models offer interpretability from multiple perspectives via interpretable machine-learning techniques.The proposed framework is anticipated to provide reliable and trustworthy decision support across various ITS applications.展开更多
The aperture of natural rock fractures significantly affects the deformation and strength properties of rock masses,as well as the hydrodynamic properties of fractured rock masses.The conventional measurement methods ...The aperture of natural rock fractures significantly affects the deformation and strength properties of rock masses,as well as the hydrodynamic properties of fractured rock masses.The conventional measurement methods are inadequate for collecting data on high-steep rock slopes in complex mountainous regions.This study establishes a high-resolution three-dimensional model of a rock slope using unmanned aerial vehicle(UAV)multi-angle nap-of-the-object photogrammetry to obtain edge feature points of fractures.Fracture opening morphology is characterized using coordinate projection and transformation.Fracture central axis is determined using vertical measuring lines,allowing for the interpretation of aperture of adaptive fracture shape.The feasibility and reliability of the new method are verified at a construction site of a railway in southeast Tibet,China.The study shows that the fracture aperture has a significant interval effect and size effect.The optimal sampling length for fractures is approximately 0.5e1 m,and the optimal aperture interpretation results can be achieved when the measuring line spacing is 1%of the sampling length.Tensile fractures in the study area generally have larger apertures than shear fractures,and their tendency to increase with slope height is also greater than that of shear fractures.The aperture of tensile fractures is generally positively correlated with their trace length,while the correlation between the aperture of shear fractures and their trace length appears to be weak.Fractures of different orientations exhibit certain differences in their distribution of aperture,but generally follow the forms of normal,log-normal,and gamma distributions.This study provides essential data support for rock and slope stability evaluation,which is of significant practical importance.展开更多
On September 5, 2022, a magnitude Ms 6.8 earthquake occurred along the Moxi fault in the southern part of the Xianshuihe fault zone located in the southeastern margin of the Tibetan Plateau,resulting in severe damage ...On September 5, 2022, a magnitude Ms 6.8 earthquake occurred along the Moxi fault in the southern part of the Xianshuihe fault zone located in the southeastern margin of the Tibetan Plateau,resulting in severe damage and substantial economic loss. In this study, we established a coseismic landslide database triggered by Luding Ms 6.8 earthquake, which includes 4794 landslides with a total area of 46.79 km^(2). The coseismic landslides primarily consisted of medium and small-sized landslides, characterized by shallow surface sliding. Some exhibited characteristics of high-position initiation resulted in the obstruction or partial obstruction of rivers, leading to the formation of dammed lakes. Our research found that the coseismic landslides were predominantly observed on slopes ranging from 30° to 50°, occurring at between 1000 m and 2500 m, with slope aspects varying from 90° to 180°. Landslides were also highly developed in granitic bodies that had experienced structural fracturing and strong-tomoderate weathering. Coseismic landslides concentrated within a 6 km range on both sides of the Xianshuihe and Daduhe fault zones. The area and number of coseismic landslides exhibited a negative correlation with the distance to fault lines, road networks, and river systems, as they were influenced by fault activity, road excavation, and river erosion. The coseismic landslides were mainly distributed in the southeastern region of the epicenter, exhibiting relatively concentrated patterns within the IX-degree zones such as Moxi Town, Wandong River basin, Detuo Town to Wanggangping Township. Our research findings provide important data on the coseismic landslides triggered by the Luding Ms 6.8 earthquake and reveal the spatial distribution patterns of these landslides. These findings can serve as important references for risk mitigation, reconstruction planning, and regional earthquake disaster research in the earthquake-affected area.展开更多
This paper proposes a new approach for online power system transient security assessment(TSA)and preventive control based on XGBoost and DC optimal power flow(DCOPF).The novelty of this proposal is that it applies the...This paper proposes a new approach for online power system transient security assessment(TSA)and preventive control based on XGBoost and DC optimal power flow(DCOPF).The novelty of this proposal is that it applies the XGBoost and data selection method based on the 1-norm distance in local feature importance evaluation which can provide a certain model interpretability.The method of SMOTE+ENN is adopted for data rebalancing.The contingency-oriented XGBoost model is trained with databases generated by time domain simulations to represent the transient security constraint in the DCOPF model,which has a relatively fast speed of calculation.The transient security constrained generation rescheduling is implemented with the differential evolution algorithm,which is utilized to optimize the rescheduled generation in the preventive control.Feasibility and effectiveness of the proposed approach are demonstrated on an IEEE 39-bus test system and a 500-bus operational model for South Carolina,USA.展开更多
The prediction of structural performance plays a significant role in damage assessment of glass fiber reinforcement polymer(GFRP)elastic gridshell structures.Machine learning(ML)approaches are implemented in this stud...The prediction of structural performance plays a significant role in damage assessment of glass fiber reinforcement polymer(GFRP)elastic gridshell structures.Machine learning(ML)approaches are implemented in this study,to predict maximum stress and displacement of GFRP elastic gridshell structures.Several ML algorithms,including linear regression(LR),ridge regression(RR),support vector regression(SVR),K-nearest neighbors(KNN),decision tree(DT),random forest(RF),adaptive boosting(AdaBoost),extreme gradient boosting(XGBoost),category boosting(CatBoost),and light gradient boosting machine(LightGBM),are implemented in this study.Output features of structural performance considered in this study are the maximum stress as f1(x)and the maximum displacement to self-weight ratio as f2(x).A comparative study is conducted and the Catboost model presents the highest prediction accuracy.Finally,interpretable ML approaches,including shapely additive explanations(SHAP),partial dependence plot(PDP),and accumulated local effects(ALE),are applied to explain the predictions.SHAP is employed to describe the importance of each variable to structural performance both locally and globally.The results of sensitivity analysis(SA),feature importance of the CatBoost model and SHAP approach indicate the same parameters as the most significant variables for f1(x)and f2(x).展开更多
基金European Commission,Joint Research Center,Grant/Award Number:HUMAINTMinisterio de Ciencia e Innovación,Grant/Award Number:PID2020‐114924RB‐I00Comunidad de Madrid,Grant/Award Number:S2018/EMT‐4362 SEGVAUTO 4.0‐CM。
文摘Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of possible future trajectories can be consid-erable(multi-modal).Most prior approaches proposed to address multi-modal motion prediction are based on complex machine learning systems that have limited interpret-ability.Moreover,the metrics used in current benchmarks do not evaluate all aspects of the problem,such as the diversity and admissibility of the output.The authors aim to advance towards the design of trustworthy motion prediction systems,based on some of the re-quirements for the design of Trustworthy Artificial Intelligence.The focus is on evaluation criteria,robustness,and interpretability of outputs.First,the evaluation metrics are comprehensively analysed,the main gaps of current benchmarks are identified,and a new holistic evaluation framework is proposed.Then,a method for the assessment of spatial and temporal robustness is introduced by simulating noise in the perception system.To enhance the interpretability of the outputs and generate more balanced results in the proposed evaluation framework,an intent prediction layer that can be attached to multi-modal motion prediction models is proposed.The effectiveness of this approach is assessed through a survey that explores different elements in the visualisation of the multi-modal trajectories and intentions.The proposed approach and findings make a significant contribution to the development of trustworthy motion prediction systems for autono-mous vehicles,advancing the field towards greater safety and reliability.
文摘Association rule learning(ARL)is a widely used technique for discovering relationships within datasets.However,it often generates excessive irrelevant or ambiguous rules.Therefore,post-processing is crucial not only for removing irrelevant or redundant rules but also for uncovering hidden associations that impact other factors.Recently,several post-processing methods have been proposed,each with its own strengths and weaknesses.In this paper,we propose THAPE(Tunable Hybrid Associative Predictive Engine),which combines descriptive and predictive techniques.By leveraging both techniques,our aim is to enhance the quality of analyzing generated rules.This includes removing irrelevant or redundant rules,uncovering interesting and useful rules,exploring hidden association rules that may affect other factors,and providing backtracking ability for a given product.The proposed approach offers a tailored method that suits specific goals for retailers,enabling them to gain a better understanding of customer behavior based on factual transactions in the target market.We applied THAPE to a real dataset as a case study in this paper to demonstrate its effectiveness.Through this application,we successfully mined a concise set of highly interesting and useful association rules.Out of the 11,265 rules generated,we identified 125 rules that are particularly relevant to the business context.These identified rules significantly improve the interpretability and usefulness of association rules for decision-making purposes.
基金Key research project of Hunan Provincial Administration of Traditional Chinese Medicine(A2023048)Key Research Foundation of Education Bureau of Hunan Province,China(23A0273).
文摘Objective Clinical medical record data associated with hepatitis B-related acute-on-chronic liver failure(HBV-ACLF)generally have small sample sizes and a class imbalance.However,most machine learning models are designed based on balanced data and lack interpretability.This study aimed to propose a traditional Chinese medicine(TCM)diagnostic model for HBV-ACLF based on the TCM syndrome differentiation and treatment theory,which is clinically interpretable and highly accurate.Methods We collected medical records from 261 patients diagnosed with HBV-ACLF,including three syndromes:Yang jaundice(214 cases),Yang-Yin jaundice(41 cases),and Yin jaundice(6 cases).To avoid overfitting of the machine learning model,we excluded the cases of Yin jaundice.After data standardization and cleaning,we obtained 255 relevant medical records of Yang jaundice and Yang-Yin jaundice.To address the class imbalance issue,we employed the oversampling method and five machine learning methods,including logistic regression(LR),support vector machine(SVM),decision tree(DT),random forest(RF),and extreme gradient boosting(XGBoost)to construct the syndrome diagnosis models.This study used precision,F1 score,the area under the receiver operating characteristic(ROC)curve(AUC),and accuracy as model evaluation metrics.The model with the best classification performance was selected to extract the diagnostic rule,and its clinical significance was thoroughly analyzed.Furthermore,we proposed a novel multiple-round stable rule extraction(MRSRE)method to obtain a stable rule set of features that can exhibit the model’s clinical interpretability.Results The precision of the five machine learning models built using oversampled balanced data exceeded 0.90.Among these models,the accuracy of RF classification of syndrome types was 0.92,and the mean F1 scores of the two categories of Yang jaundice and Yang-Yin jaundice were 0.93 and 0.94,respectively.Additionally,the AUC was 0.98.The extraction rules of the RF syndrome differentiation model based on the MRSRE method revealed that the common features of Yang jaundice and Yang-Yin jaundice were wiry pulse,yellowing of the urine,skin,and eyes,normal tongue body,healthy sublingual vessel,nausea,oil loathing,and poor appetite.The main features of Yang jaundice were a red tongue body and thickened sublingual vessels,whereas those of Yang-Yin jaundice were a dark tongue body,pale white tongue body,white tongue coating,lack of strength,slippery pulse,light red tongue body,slimy tongue coating,and abdominal distension.This is aligned with the classifications made by TCM experts based on TCM syndrome differentiation and treatment theory.Conclusion Our model can be utilized for differentiating HBV-ACLF syndromes,which has the potential to be applied to generate other clinically interpretable models with high accuracy on clinical data characterized by small sample sizes and a class imbalance.
基金supported by the Natural Science Foundation of China (No.61833016)the Shaanxi Outstanding Youth Science Foundation (No.2020JC-34)the Shaanxi Science and Technology Innovation Team (No.2022TD-24).
文摘Fault diagnosis plays an irreplaceable role in the normal operation of equipment.A fault diagnosis model is often required to be interpretable for increasing the trust between humans and the model.Due to the understandable knowledge expression and transparent reasoning process,the belief rule base(BRB)has extensive applications as an interpretable expert system in fault diagnosis.Optimization is an effective means to weaken the subjectivity of experts in BRB,where the interpretability of BRB may be weakened.Hence,to obtain a credible result,the weakening factors of interpretability in the BRB-based fault diagnosis model are firstly analyzed,which are manifested in deviation from the initial judgement of experts and over-optimization of parameters.For these two factors,three indexes are proposed,namely the consistency index of rules,consistency index of the rule base and over-optimization index,tomeasure the interpretability of the optimizedmodel.Considering both the accuracy and interpretability of amodel,an improved coordinate ascent(I-CA)algorithmis proposed to fine-tune the parameters of the fault diagnosis model based on BRB.In I-CA,the algorithm combined with the advance and retreat method and the golden section method is employed to be one-dimensional search algorithm.Furthermore,the random optimization sequence and adaptive step size are proposed to improve the accuracy of the model.Finally,a case study of fault diagnosis in aerospace relays based on BRB is carried out to verify the effectiveness of the proposed method.
基金The authors would like to acknowledge the support of the China Scholarship Council,the Flemish Government under the“Onderzoeksprogramma Artificiële Intelligentie(AI)Vlaanderen”Program and the Research Foundation–Flanders(FWO)under the ROBUSTIFY research grant no.S006119N.
文摘Early fault diagnosis of bearings is crucial for ensuring safe and reliable operations.Convolutional neural networks(CNNs)have achieved significant breakthroughs in machinery fault diagnosis.However,complex and varying working conditions can lead to inter-class similarity and intra-class variability in datasets,making it more challenging for CNNs to learn discriminative features.Furthermore,CNNs are often considered“black boxes”and lack sufficient interpretability in the fault diagnosis field.To address these issues,this paper introduces a residual mixed domain attention CNN method,referred to as RMA-CNN.This method comprises multiple residual mixed domain attention modules(RMAMs),each employing one attention mechanism to emphasize meaningful features in both time and channel domains.This significantly enhances the network’s ability to learn fault-related features.Moreover,we conduct an in-depth analysis of the inherent feature learning mechanism of the attention module RMAM to improve the interpretability of CNNs in fault diagnosis applications.Experiments conducted on two datasets—a high-speed aeronautical bearing dataset and a motor bearing dataset—demonstrate that the RMA-CNN achieves remarkable results in diagnostic tasks.
基金This work was supported in part by the Postdoctoral Science Foundation of China under Grant No.2020M683736in part by the Natural Science Foundation of Heilongjiang Province of China under Grant No.LH2021F038+2 种基金in part by the innovation practice project of college students in Heilongjiang Province under Grant Nos.202010231009,202110231024,and 202110231155in part by the basic scientific research business expenses scientific research projects of provincial universities in Heilongjiang Province Grant Nos.XJGZ2021001in part by the Education and teaching reform program of 2021 in Heilongjiang Province under Grant No.SJGY20210457.
文摘Prediction systems are an important aspect of intelligent decisions.In engineering practice,the complex system structure and the external environment cause many uncertain factors in the model,which influence the modeling accuracy of the model.The belief rule base(BRB)can implement nonlinear modeling and express a variety of uncertain information,including fuzziness,ignorance,randomness,etc.However,the BRB system also has two main problems:Firstly,modeling methods based on expert knowledge make it difficult to guarantee the model’s accuracy.Secondly,interpretability is not considered in the optimization process of current research,resulting in the destruction of the interpretability of BRB.To balance the accuracy and interpretability of the model,a self-growth belief rule basewith interpretability constraints(SBRB-I)is proposed.The reasoning process of the SBRB-I model is based on the evidence reasoning(ER)approach.Moreover,the self-growth learning strategy ensures effective cooperation between the datadriven model and the expert system.A case study showed that the accuracy and interpretability of the model could be guaranteed.The SBRB-I model has good application prospects in prediction systems.
文摘With the development of Fintech, applying artificial intelligence (AI) technologies to the financial field is a general trend. However, there are some inappropriate conditions, for instance, the AI model is always treated as a black box and cannot be interpreted. This paper studies the AI model interpretability when the models are applied in the financial field. We analyze the reasons of black box problem and explore the effective solutions. We propose a new kind of automatic Regtech tool—LIMER, and put forward policy suggestions, thereby continuously promoting the development of Fintech to a higher level.
基金Under the auspices of National Natural Science Foundation of China(No.41971352)Key Research and Development Project of Shaanxi Province(No.2022ZDLSF06-01)。
文摘The accurate and reliable interpretation of regional land cover data is very important for natural resource monitoring and environmental assessment.At present,refined land cover data are mainly obtained by manual visual interpretation,which has the problems of heavy workload and inconsistent interpretation scales.Deep learning has greatly improved the automatic processing and analysis of remote sensing data.However,the accurate interpretation of feature information from massive datasets remains a difficult problem in wide regional land cover classification.To improve the efficiency of deep learning-based remote sensing image interpretation,we selected multisource remote sensing data,assessed the interpretability of the U-Net model based on surface spatial scenes with different levels of complexity,and proposed a new method of stereoscopic accuracy verification(SAV)to evaluate the reliability of the classification result.The results show that classification accuracy is more highly correlated with terrain and landscape than with other factors related to image data,such as platform and spatial resolution.As the complexity of surface spatial scenes increases,the accuracy of the classification results mainly shows a fluctuating declining trend.We also find the distribution characteristics from the SAV evaluation results of different land cover types in each surface spatial scene.Based on the results observed in this study,we consider the distinction of interpretability and reliability in diverse ground object types and design targeted classification strategies for different surface scenes,which can greatly improve the classification efficiency.The key achievement of this study is to provide the theoretical basis for remote sensing information analysis and an accuracy evaluation method for regional land cover classification,and the proposed method can help improve the likelihood that intelligent interpretation can replace manual acquisition.
基金supported by National Science and Technology Major Project(No.2021ZD0111602)the National Nature Science Foundation of China(Nos.62276165 and U19B2043)Shanghai Natural Science Foundation,China(Nos.21JC1403800 and 21ZR1434600).
文摘This paper introduces the system of game-theoretic interactions,which connects both the explanation of knowledge encoded in a deep neural networks(DNN)and the explanation of the representation power of a DNN.In this system,we define two gametheoretic interaction indexes,namely the multi-order interaction and the multivariate interaction.More crucially,we use these interaction indexes to explain feature representations encoded in a DNN from the following four aspects:(1)Quantifying knowledge concepts encoded by a DNN;(2)Exploring how a DNN encodes visual concepts,and extracting prototypical concepts encoded in the DNN;(3)Learning optimal baseline values for the Shapley value,and providing a unified perspective to compare fourteen different attribution methods;(4)Theoretically explaining the representation bottleneck of DNNs.Furthermore,we prove the relationship between the interaction encoded in a DNN and the representation power of a DNN(e.g.,generalization power,adversarial transferability,and adversarial robustness).In this way,game-theoretic interactions successfully bridge the gap between“the explanation of knowledge concepts encoded in a DNN”and"the explanation of the representation capacity of a DNN"as a unified explanation.
基金supported by the National Natural Science Foundation of China(Grant Nos.U2142211,42075141&42341202)the National Key Research and Development Program of China(Grant No.2020YFA0608000)+1 种基金the Shanghai Municipal Science and Technology Major Project(Grant No.2021SHZDZX0100)the Fundamental Research Funds for the Central Universities。
文摘Tropical cloud clusters(TCCs)can potentially develop into tropical cyclones(TCs),leading to significant casualties and economic losses.Accurate prediction of tropical cyclogenesis(TCG)is crucial for early warnings.Most traditional deep learning methods applied to TCG prediction rely on predictors from a single time point,neglect the ocean-atmosphere interactions,and exhibit low model interpretability.This study proposes the Tropical Cyclogenesis Prediction-Net(TCGP-Net)based on the Swin Transformer,which leverages convolutional operations and attention mechanisms to encode spatiotemporal features and capture the temporal evolution of predictors.This model incorporates the coupled ocean-atmosphere interactions,including multiple variables such as sea surface temperature.Additionally,causal inference and integrated gradients are employed to validate the effectiveness of the predictors and provide an interpretability analysis of the model's decision-making process.The model is trained using GridSat satellite data and ERA5 reanalysis datasets.Experimental results demonstrate that TCGP-Net achieves high accuracy and stability,with a detection rate of 97.9%and a false alarm rate of 2.2%for predicting TCG 24 hours in advance,significantly outperforming existing models.This indicates that TCGP-Net is a reliable tool for tropical cyclogenesis prediction.
基金supported in part by the National Natural Science Foundation of China(82072019)the Shenzhen Basic Research Program(JCYJ20210324130209023)+5 种基金the Shenzhen-Hong Kong-Macao S&T Program(Category C)(SGDX20201103095002019)the Mainland-Hong Kong Joint Funding Scheme(MHKJFS)(MHP/005/20),the Project of Strategic Importance Fund(P0035421)the Projects of RISA(P0043001)from the Hong Kong Polytechnic University,the Natural Science Foundation of Jiangsu Province(BK20201441)the Provincial and Ministry Co-constructed Project of Henan Province Medical Science and Technology Research(SBGJ202103038,SBGJ202102056)the Henan Province Key R&D and Promotion Project(Science and Technology Research)(222102310015)the Natural Science Foundation of Henan Province(222300420575),and the Henan Province Science and Technology Research(222102310322).
文摘Modern medicine is reliant on various medical imaging technologies for non-invasively observing patients’anatomy.However,the interpretation of medical images can be highly subjective and dependent on the expertise of clinicians.Moreover,some potentially useful quantitative information in medical images,especially that which is not visible to the naked eye,is often ignored during clinical practice.In contrast,radiomics performs high-throughput feature extraction from medical images,which enables quantitative analysis of medical images and prediction of various clinical endpoints.Studies have reported that radiomics exhibits promising performance in diagnosis and predicting treatment responses and prognosis,demonstrating its potential to be a non-invasive auxiliary tool for personalized medicine.However,radiomics remains in a developmental phase as numerous technical challenges have yet to be solved,especially in feature engineering and statistical modeling.In this review,we introduce the current utility of radiomics by summarizing research on its application in the diagnosis,prognosis,and prediction of treatment responses in patients with cancer.We focus on machine learning approaches,for feature extraction and selection during feature engineering and for imbalanced datasets and multi-modality fusion during statistical modeling.Furthermore,we introduce the stability,reproducibility,and interpretability of features,and the generalizability and interpretability of models.Finally,we offer possible solutions to current challenges in radiomics research.
基金This work is funded by the National Natural Science Foundation of China(Grant Nos.42377164 and 52079062)the National Science Fund for Distinguished Young Scholars of China(Grant No.52222905).
文摘In the existing landslide susceptibility prediction(LSP)models,the influences of random errors in landslide conditioning factors on LSP are not considered,instead the original conditioning factors are directly taken as the model inputs,which brings uncertainties to LSP results.This study aims to reveal the influence rules of the different proportional random errors in conditioning factors on the LSP un-certainties,and further explore a method which can effectively reduce the random errors in conditioning factors.The original conditioning factors are firstly used to construct original factors-based LSP models,and then different random errors of 5%,10%,15% and 20%are added to these original factors for con-structing relevant errors-based LSP models.Secondly,low-pass filter-based LSP models are constructed by eliminating the random errors using low-pass filter method.Thirdly,the Ruijin County of China with 370 landslides and 16 conditioning factors are used as study case.Three typical machine learning models,i.e.multilayer perceptron(MLP),support vector machine(SVM)and random forest(RF),are selected as LSP models.Finally,the LSP uncertainties are discussed and results show that:(1)The low-pass filter can effectively reduce the random errors in conditioning factors to decrease the LSP uncertainties.(2)With the proportions of random errors increasing from 5%to 20%,the LSP uncertainty increases continuously.(3)The original factors-based models are feasible for LSP in the absence of more accurate conditioning factors.(4)The influence degrees of two uncertainty issues,machine learning models and different proportions of random errors,on the LSP modeling are large and basically the same.(5)The Shapley values effectively explain the internal mechanism of machine learning model predicting landslide sus-ceptibility.In conclusion,greater proportion of random errors in conditioning factors results in higher LSP uncertainty,and low-pass filter can effectively reduce these random errors.
基金This research work is supported by Sichuan Science and Technology Program(Grant No.2022YFS0586)the National Key R&D Program of China(Grant No.2019YFC1509301)the National Natural Science Foundation of China(Grant No.61976046).
文摘Predicting the displacement of landslide is of utmost practical importance as the landslide can pose serious threats to both human life and property.However,traditional methods have the limitation of random selection in sliding window selection and seldom incorporate weather forecast data for displacement prediction,while a single structural model cannot handle input sequences of different lengths at the same time.In order to solve these limitations,in this study,a new approach is proposed that utilizes weather forecast data and incorporates the maximum information coefficient(MIC),long short-term memory network(LSTM),and attention mechanism to establish a teacher-student coupling model with parallel structure for short-term landslide displacement prediction.Through MIC,a suitable input sequence length is selected for the LSTM model.To investigate the influence of rainfall on landslides during different seasons,a parallel teacher-student coupling model is developed that is able to learn sequential information from various time series of different lengths.The teacher model learns sequence information from rainfall intensity time series while incorporating reliable short-term weather forecast data from platforms such as China Meteorological Administration(CMA)and Reliable Prognosis(https://rp5.ru)to improve the model’s expression capability,and the student model learns sequence information from other time series.An attention module is then designed to integrate different sequence information to derive a context vector,representing seasonal temporal attention mode.Finally,the predicted displacement is obtained through a linear layer.The proposed method demonstrates superior prediction accuracies,surpassing those of the support vector machine(SVM),LSTM,recurrent neural network(RNN),temporal convolutional network(TCN),and LSTM-Attention models.It achieves a mean absolute error(MAE)of 0.072 mm,root mean square error(RMSE)of 0.096 mm,and pearson correlation coefficients(PCCS)of 0.85.Additionally,it exhibits enhanced prediction stability and interpretability,rendering it an indispensable tool for landslide disaster prevention and mitigation.
基金supported by the Applied Research Center of Artificial Intelligence,Wuhan College(Grant Number X2020113)the Wuhan College Research Project(Grant Number KYZ202009).
文摘User representation learning is crucial for capturing different user preferences,but it is also critical challenging because user intentions are latent and dispersed in complex and different patterns of user-generated data,and thus cannot be measured directly.Text-based data models can learn user representations by mining latent semantics,which is beneficial to enhancing the semantic function of user representations.However,these technologies only extract common features in historical records and cannot represent changes in user intentions.However,sequential feature can express the user’s interests and intentions that change time by time.But the sequential recommendation results based on the user representation of the item lack the interpretability of preference factors.To address these issues,we propose in this paper a novel model with Dual-Layer User Representation,named DLUR,where the user’s intention is learned based on two different layer representations.Specifically,the latent semantic layer adds an interactive layer based on Transformer to extract keywords and key sentences in the text and serve as a basis for interpretation.The sequence layer uses the Transformer model to encode the user’s preference intention to clarify changes in the user’s intention.Therefore,this dual-layer user mode is more comprehensive than a single text mode or sequence mode and can effectually improve the performance of recommendations.Our extensive experiments on five benchmark datasets demonstrate DLUR’s performance over state-of-the-art recommendation models.In addition,DLUR’s ability to explain recommendation results is also demonstrated through some specific cases.
文摘In enterprise operations,maintaining manual rules for enterprise processes can be expensive,time-consuming,and dependent on specialized domain knowledge in that enterprise domain.Recently,rule-generation has been automated in enterprises,particularly through Machine Learning,to streamline routine tasks.Typically,these machine models are black boxes where the reasons for the decisions are not always transparent,and the end users need to verify the model proposals as a part of the user acceptance testing to trust it.In such scenarios,rules excel over Machine Learning models as the end-users can verify the rules and have more trust.In many scenarios,the truth label changes frequently thus,it becomes difficult for the Machine Learning model to learn till a considerable amount of data has been accumulated,but with rules,the truth can be adapted.This paper presents a novel framework for generating human-understandable rules using the Classification and Regression Tree(CART)decision tree method,which ensures both optimization and user trust in automated decision-making processes.The framework generates comprehensible rules in the form of if condition and then predicts class even in domains where noise is present.The proposed system transforms enterprise operations by automating the production of human-readable rules from structured data,resulting in increased efficiency and transparency.Removing the need for human rule construction saves time and money while guaranteeing that users can readily check and trust the automatic judgments of the system.The remarkable performance metrics of the framework,which achieve 99.85%accuracy and 96.30%precision,further support its efficiency in translating complex data into comprehensible rules,eventually empowering users and enhancing organizational decision-making processes.
基金funded by the National Key R&D Program of China(Grant No.2023YFE0106800)the Humanity and Social Science Youth Foundation of Ministry of Education of China(Grant No.22YJC630109).
文摘Traffic flow forecasting constitutes a crucial component of intelligent transportation systems(ITSs).Numerous studies have been conducted for traffic flow forecasting during the past decades.However,most existing studies have concentrated on developing advanced algorithms or models to attain state-of-the-art forecasting accuracy.For real-world ITS applications,the interpretability of the developed models is extremely important but has largely been ignored.This study presents an interpretable traffic flow forecasting framework based on popular tree-ensemble algorithms.The framework comprises multiple key components integrated into a highly flexible and customizable multi-stage pipeline,enabling the seamless incorporation of various algorithms and tools.To evaluate the effectiveness of the framework,the developed tree-ensemble models and another three typical categories of baseline models,including statistical time series,shallow learning,and deep learning,were compared on three datasets collected from different types of roads(i.e.,arterial,expressway,and freeway).Further,the study delves into an in-depth interpretability analysis of the most competitive tree-ensemble models using six categories of interpretable machine learning methods.Experimental results highlight the potential of the proposed framework.The tree-ensemble models developed within this framework achieve competitive accuracy while maintaining high inference efficiency similar to statistical time series and shallow learning models.Meanwhile,these tree-ensemble models offer interpretability from multiple perspectives via interpretable machine-learning techniques.The proposed framework is anticipated to provide reliable and trustworthy decision support across various ITS applications.
基金This work was supported by the National Nature Science Foundation of China(Grant Nos.42177139 and 41941017)the Natural Science Foundation Project of Jilin Province,China(Grant No.20230101088JC).The authors would like to thank the anonymous reviewers for their comments and suggestions.
文摘The aperture of natural rock fractures significantly affects the deformation and strength properties of rock masses,as well as the hydrodynamic properties of fractured rock masses.The conventional measurement methods are inadequate for collecting data on high-steep rock slopes in complex mountainous regions.This study establishes a high-resolution three-dimensional model of a rock slope using unmanned aerial vehicle(UAV)multi-angle nap-of-the-object photogrammetry to obtain edge feature points of fractures.Fracture opening morphology is characterized using coordinate projection and transformation.Fracture central axis is determined using vertical measuring lines,allowing for the interpretation of aperture of adaptive fracture shape.The feasibility and reliability of the new method are verified at a construction site of a railway in southeast Tibet,China.The study shows that the fracture aperture has a significant interval effect and size effect.The optimal sampling length for fractures is approximately 0.5e1 m,and the optimal aperture interpretation results can be achieved when the measuring line spacing is 1%of the sampling length.Tensile fractures in the study area generally have larger apertures than shear fractures,and their tendency to increase with slope height is also greater than that of shear fractures.The aperture of tensile fractures is generally positively correlated with their trace length,while the correlation between the aperture of shear fractures and their trace length appears to be weak.Fractures of different orientations exhibit certain differences in their distribution of aperture,but generally follow the forms of normal,log-normal,and gamma distributions.This study provides essential data support for rock and slope stability evaluation,which is of significant practical importance.
基金supported by the National Natural Science Foundation of China project (No. 42372339)the China Geological Survey Project (Nos. DD20221816, DD20190319)。
文摘On September 5, 2022, a magnitude Ms 6.8 earthquake occurred along the Moxi fault in the southern part of the Xianshuihe fault zone located in the southeastern margin of the Tibetan Plateau,resulting in severe damage and substantial economic loss. In this study, we established a coseismic landslide database triggered by Luding Ms 6.8 earthquake, which includes 4794 landslides with a total area of 46.79 km^(2). The coseismic landslides primarily consisted of medium and small-sized landslides, characterized by shallow surface sliding. Some exhibited characteristics of high-position initiation resulted in the obstruction or partial obstruction of rivers, leading to the formation of dammed lakes. Our research found that the coseismic landslides were predominantly observed on slopes ranging from 30° to 50°, occurring at between 1000 m and 2500 m, with slope aspects varying from 90° to 180°. Landslides were also highly developed in granitic bodies that had experienced structural fracturing and strong-tomoderate weathering. Coseismic landslides concentrated within a 6 km range on both sides of the Xianshuihe and Daduhe fault zones. The area and number of coseismic landslides exhibited a negative correlation with the distance to fault lines, road networks, and river systems, as they were influenced by fault activity, road excavation, and river erosion. The coseismic landslides were mainly distributed in the southeastern region of the epicenter, exhibiting relatively concentrated patterns within the IX-degree zones such as Moxi Town, Wandong River basin, Detuo Town to Wanggangping Township. Our research findings provide important data on the coseismic landslides triggered by the Luding Ms 6.8 earthquake and reveal the spatial distribution patterns of these landslides. These findings can serve as important references for risk mitigation, reconstruction planning, and regional earthquake disaster research in the earthquake-affected area.
基金supported in part by the National Key Research and Development Program of China under Grant 2020YFB0905900.
文摘This paper proposes a new approach for online power system transient security assessment(TSA)and preventive control based on XGBoost and DC optimal power flow(DCOPF).The novelty of this proposal is that it applies the XGBoost and data selection method based on the 1-norm distance in local feature importance evaluation which can provide a certain model interpretability.The method of SMOTE+ENN is adopted for data rebalancing.The contingency-oriented XGBoost model is trained with databases generated by time domain simulations to represent the transient security constraint in the DCOPF model,which has a relatively fast speed of calculation.The transient security constrained generation rescheduling is implemented with the differential evolution algorithm,which is utilized to optimize the rescheduled generation in the preventive control.Feasibility and effectiveness of the proposed approach are demonstrated on an IEEE 39-bus test system and a 500-bus operational model for South Carolina,USA.
基金The research work was supported by the National Natural Science Foundation of China(Grant No.51978400)the National Key Research and Development Program of China(No.2021YFE0107800).The support is gratefully acknowledged.
文摘The prediction of structural performance plays a significant role in damage assessment of glass fiber reinforcement polymer(GFRP)elastic gridshell structures.Machine learning(ML)approaches are implemented in this study,to predict maximum stress and displacement of GFRP elastic gridshell structures.Several ML algorithms,including linear regression(LR),ridge regression(RR),support vector regression(SVR),K-nearest neighbors(KNN),decision tree(DT),random forest(RF),adaptive boosting(AdaBoost),extreme gradient boosting(XGBoost),category boosting(CatBoost),and light gradient boosting machine(LightGBM),are implemented in this study.Output features of structural performance considered in this study are the maximum stress as f1(x)and the maximum displacement to self-weight ratio as f2(x).A comparative study is conducted and the Catboost model presents the highest prediction accuracy.Finally,interpretable ML approaches,including shapely additive explanations(SHAP),partial dependence plot(PDP),and accumulated local effects(ALE),are applied to explain the predictions.SHAP is employed to describe the importance of each variable to structural performance both locally and globally.The results of sensitivity analysis(SA),feature importance of the CatBoost model and SHAP approach indicate the same parameters as the most significant variables for f1(x)and f2(x).