Machine learning has been widely used for solving partial differential equations(PDEs)in recent years,among which the random feature method(RFM)exhibits spectral accuracy and can compete with traditional solvers in te...Machine learning has been widely used for solving partial differential equations(PDEs)in recent years,among which the random feature method(RFM)exhibits spectral accuracy and can compete with traditional solvers in terms of both accuracy and efficiency.Potentially,the optimization problem in the RFM is more difficult to solve than those that arise in traditional methods.Unlike the broader machine-learning research,which frequently targets tasks within the low-precision regime,our study focuses on the high-precision regime crucial for solving PDEs.In this work,we study this problem from the following aspects:(i)we analyze the coeffcient matrix that arises in the RFM by studying the distribution of singular values;(ii)we investigate whether the continuous training causes the overfitting issue;(ii)we test direct and iterative methods as well as randomized methods for solving the optimization problem.Based on these results,we find that direct methods are superior to other methods if memory is not an issue,while iterative methods typically have low accuracy and can be improved by preconditioning to some extent.展开更多
The development of defect prediction plays a significant role in improving software quality. Such predictions are used to identify defective modules before the testing and to minimize the time and cost. The software w...The development of defect prediction plays a significant role in improving software quality. Such predictions are used to identify defective modules before the testing and to minimize the time and cost. The software with defects negatively impacts operational costs and finally affects customer satisfaction. Numerous approaches exist to predict software defects. However, the timely and accurate software bugs are the major challenging issues. To improve the timely and accurate software defect prediction, a novel technique called Nonparametric Statistical feature scaled QuAdratic regressive convolution Deep nEural Network (SQADEN) is introduced. The proposed SQADEN technique mainly includes two major processes namely metric or feature selection and classification. First, the SQADEN uses the nonparametric statistical Torgerson–Gower scaling technique for identifying the relevant software metrics by measuring the similarity using the dice coefficient. The feature selection process is used to minimize the time complexity of software fault prediction. With the selected metrics, software fault perdition with the help of the Quadratic Censored regressive convolution deep neural network-based classification. The deep learning classifier analyzes the training and testing samples using the contingency correlation coefficient. The softstep activation function is used to provide the final fault prediction results. To minimize the error, the Nelder–Mead method is applied to solve non-linear least-squares problems. Finally, accurate classification results with a minimum error are obtained at the output layer. Experimental evaluation is carried out with different quantitative metrics such as accuracy, precision, recall, F-measure, and time complexity. The analyzed results demonstrate the superior performance of our proposed SQADEN technique with maximum accuracy, sensitivity and specificity by 3%, 3%, 2% and 3% and minimum time and space by 13% and 15% when compared with the two state-of-the-art methods.展开更多
The main purpose of reverse engineering is to convert discrete data pointsinto piecewise smooth, continuous surface models. Before carrying out model reconstruction it issignificant to extract geometric features becau...The main purpose of reverse engineering is to convert discrete data pointsinto piecewise smooth, continuous surface models. Before carrying out model reconstruction it issignificant to extract geometric features because the quality of modeling greatly depends on therepresentation of features. Some fitting techniques of natural quadric surfaces with least-squaresmethod are described. And these techniques can be directly used to extract quadric surfaces featuresduring the process of segmentation for point cloud.展开更多
Accurate prediction of shield tunneling-induced settlement is a complex problem that requires consideration of many influential parameters.Recent studies reveal that machine learning(ML)algorithms can predict the sett...Accurate prediction of shield tunneling-induced settlement is a complex problem that requires consideration of many influential parameters.Recent studies reveal that machine learning(ML)algorithms can predict the settlement caused by tunneling.However,well-performing ML models are usually less interpretable.Irrelevant input features decrease the performance and interpretability of an ML model.Nonetheless,feature selection,a critical step in the ML pipeline,is usually ignored in most studies that focused on predicting tunneling-induced settlement.This study applies four techniques,i.e.Pearson correlation method,sequential forward selection(SFS),sequential backward selection(SBS)and Boruta algorithm,to investigate the effect of feature selection on the model’s performance when predicting the tunneling-induced maximum surface settlement(S_(max)).The data set used in this study was compiled from two metro tunnel projects excavated in Hangzhou,China using earth pressure balance(EPB)shields and consists of 14 input features and a single output(i.e.S_(max)).The ML model that is trained on features selected from the Boruta algorithm demonstrates the best performance in both the training and testing phases.The relevant features chosen from the Boruta algorithm further indicate that tunneling-induced settlement is affected by parameters related to tunnel geometry,geological conditions and shield operation.The recently proposed Shapley additive explanations(SHAP)method explores how the input features contribute to the output of a complex ML model.It is observed that the larger settlements are induced during shield tunneling in silty clay.Moreover,the SHAP analysis reveals that the low magnitudes of face pressure at the top of the shield increase the model’s output。展开更多
This paper proposes a hybrid feature selection sequence comple-mented with filter and wrapper concepts to improve the accuracy of Machine Learning(ML)based supervised classifiers for classifying the survivability of b...This paper proposes a hybrid feature selection sequence comple-mented with filter and wrapper concepts to improve the accuracy of Machine Learning(ML)based supervised classifiers for classifying the survivability of breast cancer patients into classes,living and deceased using METABRIC and Surveillance,Epidemiology and End Results(SEER)datasets.The ML-based classifiers used in the analysis are:Multiple Logistic Regression,K-Nearest Neighbors,Decision Tree,Random Forest,Support Vector Machine and Multilayer Perceptron.The workflow of the proposed ML algorithm sequence comprises the following stages:data cleaning,data balancing,feature selection via a filter and wrapper sequence,cross validation-based training,testing and performance evaluation.The results obtained are compared in terms of the following classification metrics:Accuracy,Precision,F1 score,True Positive Rate,True Negative Rate,False Positive Rate,False Negative Rate,Area under the Receiver Operating Characteristics curve,Area under the Precision-Recall curve and Mathews Correlation Coefficient.The comparison shows that the proposed feature selection sequence produces better results from all supervised classifiers than all other feature selection sequences considered in the analysis.展开更多
Aiming to deficiency of the filter and wrapper feature selection methods, anew method based on composite method of filter and wrapper method is proposed. First the methodfilters original features to form a feature sub...Aiming to deficiency of the filter and wrapper feature selection methods, anew method based on composite method of filter and wrapper method is proposed. First the methodfilters original features to form a feature subset which can meet classification correctness rate,then applies wrapper feature selection method select optimal feature subset. A successful techniquefor solving optimization problems is given by genetic algorithm (GA). GA is applied to the problemof optimal feature selection. The composite method saves computing time several times of the wrappermethod with holding the classification accuracy in data simulation and experiment on bearing faultfeature selection. So this method possesses excellent optimization property, can save more selectiontime, and has the characteristics of high accuracy and high efficiency.展开更多
An efficient multi-harmonic method is proposed for studying the effects of mistuning on resonant features of bladed disks with blade-to-blade dry friction damping. This method is able to predict accurately the forced ...An efficient multi-harmonic method is proposed for studying the effects of mistuning on resonant features of bladed disks with blade-to-blade dry friction damping. This method is able to predict accurately the forced response of bladed disks in frequency domain, which is validated by numerical integration method in time domain. The resonant features of both tuned and mistuned systems are investigated by using this method under various system coupling strengths, viscous dampings, and dry friction darnpings, etc. The results demonstrate that the proposed multi-harmonic method is very efficient for studying the mistuning effects on the resonant response of bladed disks with blade-to-blade dry friction damping, especially considering the combined effects of various system parameters.展开更多
In order to provide a guidance to specify the element size dynamically during adaptive finite element mesh generation, adaptive criteria are firstly defined according to the relationships between the geometrical featu...In order to provide a guidance to specify the element size dynamically during adaptive finite element mesh generation, adaptive criteria are firstly defined according to the relationships between the geometrical features and the elements of 3D solid. Various modes based on different datum geometrical elements, such as vertex, curve, surface, and so on, are then designed for generating local refined mesh. With the guidance of the defmed criteria, different modes are automatically selected to apply on the appropriate datum objects to program the element size in the local special areas. As a result, the control information of element size is successfully programmed covering the entire domain based on the geometrical features of 3D solid. A new algorithm based on Delatmay triangulation is then developed for generating 3D adaptive finite element mesh, in which the element size is dynamically specified to catch the geometrical features and suitable tetrahedron facets are selected to locate interior nodes continuously. As a result, adaptive mesh with good-quality elements is generated. Examples show that the proposed method can be successfully applied to adaptive finite element mesh automatic generation based on the geometrical features of 3D solid.展开更多
Several millions of people suffer from Parkinson’s disease globally.Parkinson’s affects about 1%of people over 60 and its symptoms increase with age.The voice may be affected and patients experience abnormalities in...Several millions of people suffer from Parkinson’s disease globally.Parkinson’s affects about 1%of people over 60 and its symptoms increase with age.The voice may be affected and patients experience abnormalities in speech that might not be noticed by listeners,but which could be analyzed using recorded speech signals.With the huge advancements of technology,the medical data has increased dramatically,and therefore,there is a need to apply data mining and machine learning methods to extract new knowledge from this data.Several classification methods were used to analyze medical data sets and diagnostic problems,such as Parkinson’s Disease(PD).In addition,to improve the performance of classification,feature selection methods have been extensively used in many fields.This paper aims to propose a comprehensive approach to enhance the prediction of PD using several machine learning methods with different feature selection methods such as filter-based and wrapper-based.The dataset includes 240 recodes with 46 acoustic features extracted from3 voice recording replications for 80 patients.The experimental results showed improvements when wrapper-based features selection method was used with K-NN classifier with accuracy of 88.33%.The best obtained results were compared with other studies and it was found that this study provides comparable and superior results.展开更多
In thefield of diagnosis of medical images the challenge lies in tracking and identifying the defective cells and the extent of the defective region within the complex structure of a brain cavity.Locating the defective...In thefield of diagnosis of medical images the challenge lies in tracking and identifying the defective cells and the extent of the defective region within the complex structure of a brain cavity.Locating the defective cells precisely during the diagnosis phase helps tofight the greatest exterminator of mankind.Early detec-tion of these defective cells requires an accurate computer-aided diagnostic system(CAD)that supports early treatment and promotes survival rates of patients.An ear-lier version of CAD systems relies greatly on the expertise of radiologist and it con-sumed more time to identify the defective region.The manuscript takes the efficacy of coalescing features like intensity,shape,and texture of the magnetic resonance image(MRI).In the Enhanced Feature Fusion Segmentation based classification method(EEFS)the image is enhanced and segmented to extract the prominent fea-tures.To bring out the desired effect the EEFS method uses Enhanced Local Binary Pattern(EnLBP),Partisan Gray Level Co-occurrence Matrix Histogram of Oriented Gradients(PGLCMHOG),and iGrab cut method to segment image.These prominent features along with deep features are coalesced to provide a single-dimensional fea-ture vector that is effectively used for prediction.The coalesced vector is used with the existing classifiers to compare the results of these classifiers with that of the gen-erated vector.The generated vector provides promising results with commendably less computatio nal time for pre-processing and classification of MR medical images.展开更多
This article introduces the concept of load aggregation,which involves a comprehensive analysis of loads to acquire their external characteristics for the purpose of modeling and analyzing power systems.The online ide...This article introduces the concept of load aggregation,which involves a comprehensive analysis of loads to acquire their external characteristics for the purpose of modeling and analyzing power systems.The online identification method is a computer-involved approach for data collection,processing,and system identification,commonly used for adaptive control and prediction.This paper proposes a method for dynamically aggregating large-scale adjustable loads to support high proportions of new energy integration,aiming to study the aggregation characteristics of regional large-scale adjustable loads using online identification techniques and feature extraction methods.The experiment selected 300 central air conditioners as the research subject and analyzed their regulation characteristics,economic efficiency,and comfort.The experimental results show that as the adjustment time of the air conditioner increases from 5 minutes to 35 minutes,the stable adjustment quantity during the adjustment period decreases from 28.46 to 3.57,indicating that air conditioning loads can be controlled over a long period and have better adjustment effects in the short term.Overall,the experimental results of this paper demonstrate that analyzing the aggregation characteristics of regional large-scale adjustable loads using online identification techniques and feature extraction algorithms is effective.展开更多
The feature-selection problem in training AdaBoost classifiers is addressed in this paper. A working feature subset is generated by adopting a novel feature subset selection method based on the partial least square (...The feature-selection problem in training AdaBoost classifiers is addressed in this paper. A working feature subset is generated by adopting a novel feature subset selection method based on the partial least square (PLS) regression, and then trained and selected from this feature subset in Boosting. The experiments show that the proposed PLS-based feature-selection method outperforms the current feature ranking method and the random sampling method.展开更多
We describe how the Unit-Feature Spatial Classification Method(UFSCM) can be used operationally to classify cloud types in satellite imagery efficiently and conveniently.By using a combination of Interactive Data Lang...We describe how the Unit-Feature Spatial Classification Method(UFSCM) can be used operationally to classify cloud types in satellite imagery efficiently and conveniently.By using a combination of Interactive Data Language(IDL) and Visual C++(VC) code in combination to extend the technique in three dimensions(3-D),this paper provides an efficient method to implement interactive computer visualization of the 3-D discrimination matrix modification,so as to deal with the bi-spectral limitations of traditional two dimensional(2-D) UFSCM.The case study of cloud-type classification based on FY-2C satellite data (0600 UTC 18 and 0000 UTC 10 September 2007) is conducted by comparison with ground station data, and indicates that 3-D UFSCM makes more use of the pattern recognition information in multi-spectral imagery,resulting in more reasonable results and an improvement over the 2-D method.展开更多
In order to improve the accuracy of building structure identification using remote sensing images,a building structure classification method based on multi-feature fusion of UAV remote sensing image is proposed in thi...In order to improve the accuracy of building structure identification using remote sensing images,a building structure classification method based on multi-feature fusion of UAV remote sensing image is proposed in this paper.Three identification approaches of remote sensing images are integrated in this method:object-oriented,texture feature,and digital elevation based on DSM and DEM.So RGB threshold classification method is used to classify the identification results.The accuracy of building structure classification based on each feature and the multi-feature fusion are compared and analyzed.The results show that the building structure classification method is feasible and can accurately identify the structures in large-area remote sensing images.展开更多
The task of indoor visual localization, utilizing camera visual information for user pose calculation, was a core component of Augmented Reality (AR) and Simultaneous Localization and Mapping (SLAM). Existing indoor l...The task of indoor visual localization, utilizing camera visual information for user pose calculation, was a core component of Augmented Reality (AR) and Simultaneous Localization and Mapping (SLAM). Existing indoor localization technologies generally used scene-specific 3D representations or were trained on specific datasets, making it challenging to balance accuracy and cost when applied to new scenes. Addressing this issue, this paper proposed a universal indoor visual localization method based on efficient image retrieval. Initially, a Multi-Layer Perceptron (MLP) was employed to aggregate features from intermediate layers of a convolutional neural network, obtaining a global representation of the image. This approach ensured accurate and rapid retrieval of reference images. Subsequently, a new mechanism using Random Sample Consensus (RANSAC) was designed to resolve relative pose ambiguity caused by the essential matrix decomposition based on the five-point method. Finally, the absolute pose of the queried user image was computed, thereby achieving indoor user pose estimation. The proposed indoor localization method was characterized by its simplicity, flexibility, and excellent cross-scene generalization. Experimental results demonstrated a positioning error of 0.09 m and 2.14° on the 7Scenes dataset, and 0.15 m and 6.37° on the 12Scenes dataset. These results convincingly illustrated the outstanding performance of the proposed indoor localization method.展开更多
ICSED (Improved Cluster Shade Edge Detection) algorithm and other various methods to accurately and efficiently detect edges on satellite data are presented. Error rate criterion is used to statistically evaluate the ...ICSED (Improved Cluster Shade Edge Detection) algorithm and other various methods to accurately and efficiently detect edges on satellite data are presented. Error rate criterion is used to statistically evaluate the performances of these methods in detecting oceanic features for both noise free and noise contaminated AVHRR (Advanced Very High Resolution Radiometer) IR image with Kuroshio. Also, practical experiments in detecting the eddy of Kuroshio with these methods are carried out for comparison. Results show that the ICSED algorithm has more advantages than other methods in detecting mesoscale features of ocean. Finally, the effectiveness of window size of ICSED method to oceanic features detection is quantitatively discussed.展开更多
Electrical resistivity tomography (ERT) has been used to experimentally detect shallow buried faults in urban areas in the past a few years, with some progress and experience obtained. According to the results from Ol...Electrical resistivity tomography (ERT) has been used to experimentally detect shallow buried faults in urban areas in the past a few years, with some progress and experience obtained. According to the results from Olympic Park, Beijing, Shandong Province, Gansu Province and Shanxi Province, we have generalized the method and procedure for inferring the discontinuity of electrical structures (DES) indicating a buried fault in urban areas from resistivity tomograms and its typical electrical features. In general, the layered feature of the electrical structure is first analyzed to preliminarily define whether or not a DES exists in the target area. Resistivity contours in resistivity tomograms are then analyzed from the deep to the shallow. If they extend upward from the deep to the shallow and shape into an integral dislocation, sharp flexure (convergence) or gradient zone, it is inferred that the DES exists, indicating a buried fault. Finally, horizontal tracing is be carried out to define the trend of the DES. The DES can be divided into three types-type AB, ABA and AC. In the present paper, the Zhangdian-Renhe fault system in Zibo city is used as an example to illustrate how to use the method to infer the location and spatial extension of a target fault. Geologic drilling holes are placed based on our research results, and the drilling logs testify that our results are correct. However, the method of this paper is not exclusive and inflexible. It is expected to provide reference and assistance for inferring the shallow buried faults in urban areas from resistivity tomograms in the future.展开更多
An empirical test on long memory between price and trading volume of China metals futures market was given with MF-DCCA method. The empirical results show that long memory feature with a certain period exists in price...An empirical test on long memory between price and trading volume of China metals futures market was given with MF-DCCA method. The empirical results show that long memory feature with a certain period exists in price-volume correlation and a fittther proof was given by analyzing the source of multifractal feature. The empirical results suggest that it is of important practical significance to bring the fractal market theory and other nonlinear theory into the analysis and explanation of the behavior in metal futures market.展开更多
Tillage methods play a crucial role in controlling rainwater partitioning and soil erosion.This study utilized rainfall simulation experiments to investigate the impact of four tillage methods(manual digging(MD),manua...Tillage methods play a crucial role in controlling rainwater partitioning and soil erosion.This study utilized rainfall simulation experiments to investigate the impact of four tillage methods(manual digging(MD),manual hoeing(MH),traditional ploughing(TP),and ridged ploughing(RP))on runoff and soil erosion at the plot scale.The smooth slope(SS)was used as a benchmark.Rainfall intensities of 30,60,90,and 120 mm h−1 were considered.The study revealed that tillage altered rainwater distribution into depression storage,infiltration,and runoff.Tillage reduces runoff and increases infiltration.The four tillage methods(30–73%)increased the proportion of rainwater converted to infiltration to varying degrees compared to the SS(22–53%).Microrelief features influenced the role of tillage methods in soil erosion.Surface roughness and depression storage accounted for 79%of the variation in sediment yield.The four tillage methods reduced runoff by 2.1–64.7%and sediment yield by 2.5–77.2%.Moreover,increased rainfall intensity weakens the ability of tillage to control soil erosion.When rainfall intensity increased to 120 mm h−1,there was no significant difference in runoff yield among RP,TP,MH,and SS.Therefore,assessing the effectiveness of tillage in reducing soil erosion should consider changes in rainfall intensity.Additionally,the cover management(C)factor of the RUSLE was used to assess the effects of different tillage methods on soil loss.Overall,the C factor values for tilled slopes are in the order MH>TP>RP>MD with a range of 0.23–0.97.As the surface roughness increases,the C factor tends to decrease,and the two are exponential functions(R2=0.86).These studies contribute to our understanding of how different tillage methods impact runoff and soil erosion in sloped farmland and provide guidance for selecting appropriate local manual tillage methods.展开更多
Formulating criteria for the assessment system of historic settlements is challenging due to complex geographical conditions and evaluator knowledge limitations, leading to subjective bias in the assessment process. T...Formulating criteria for the assessment system of historic settlements is challenging due to complex geographical conditions and evaluator knowledge limitations, leading to subjective bias in the assessment process. To address this issue, this study proposes a data-driven method for assessing the features of historical settlements to carry out scientific and refined assessment and result analysis. Focusing on Northeast Hubei as the study area, this paper selects 3 historical settlements for validation and analysis. The results of the study show that (1) the data-driven method expands the methodological chain of assessing historical settlement features, and improves the assessment efficiency and scientificity of the assessment results by applying it to the new assessment process;(2) Through comparing the assessment results of the validation cases and data samples, the study establishes a comprehensive quantitative ranking of the assessment of historical settlement features and identifies the main influencing factors, thus enhancing the precision of result analysis;(3) By comparing the resulting assessment framework with the current assessment system, this study confirms the advantages of the proposed framework in identifying nuanced features and aligning with geographical conditions, thereby verifying the effectiveness of the data-driven method.展开更多
基金supported by the NSFC Major Research Plan--Interpretable and Generalpurpose Next-generation Artificial Intelligence(No.92370205).
文摘Machine learning has been widely used for solving partial differential equations(PDEs)in recent years,among which the random feature method(RFM)exhibits spectral accuracy and can compete with traditional solvers in terms of both accuracy and efficiency.Potentially,the optimization problem in the RFM is more difficult to solve than those that arise in traditional methods.Unlike the broader machine-learning research,which frequently targets tasks within the low-precision regime,our study focuses on the high-precision regime crucial for solving PDEs.In this work,we study this problem from the following aspects:(i)we analyze the coeffcient matrix that arises in the RFM by studying the distribution of singular values;(ii)we investigate whether the continuous training causes the overfitting issue;(ii)we test direct and iterative methods as well as randomized methods for solving the optimization problem.Based on these results,we find that direct methods are superior to other methods if memory is not an issue,while iterative methods typically have low accuracy and can be improved by preconditioning to some extent.
文摘The development of defect prediction plays a significant role in improving software quality. Such predictions are used to identify defective modules before the testing and to minimize the time and cost. The software with defects negatively impacts operational costs and finally affects customer satisfaction. Numerous approaches exist to predict software defects. However, the timely and accurate software bugs are the major challenging issues. To improve the timely and accurate software defect prediction, a novel technique called Nonparametric Statistical feature scaled QuAdratic regressive convolution Deep nEural Network (SQADEN) is introduced. The proposed SQADEN technique mainly includes two major processes namely metric or feature selection and classification. First, the SQADEN uses the nonparametric statistical Torgerson–Gower scaling technique for identifying the relevant software metrics by measuring the similarity using the dice coefficient. The feature selection process is used to minimize the time complexity of software fault prediction. With the selected metrics, software fault perdition with the help of the Quadratic Censored regressive convolution deep neural network-based classification. The deep learning classifier analyzes the training and testing samples using the contingency correlation coefficient. The softstep activation function is used to provide the final fault prediction results. To minimize the error, the Nelder–Mead method is applied to solve non-linear least-squares problems. Finally, accurate classification results with a minimum error are obtained at the output layer. Experimental evaluation is carried out with different quantitative metrics such as accuracy, precision, recall, F-measure, and time complexity. The analyzed results demonstrate the superior performance of our proposed SQADEN technique with maximum accuracy, sensitivity and specificity by 3%, 3%, 2% and 3% and minimum time and space by 13% and 15% when compared with the two state-of-the-art methods.
基金This project is supported by Research Foundation for Doctoral Program of Higher Education, China (No.98033532)
文摘The main purpose of reverse engineering is to convert discrete data pointsinto piecewise smooth, continuous surface models. Before carrying out model reconstruction it issignificant to extract geometric features because the quality of modeling greatly depends on therepresentation of features. Some fitting techniques of natural quadric surfaces with least-squaresmethod are described. And these techniques can be directly used to extract quadric surfaces featuresduring the process of segmentation for point cloud.
基金support provided by The Science and Technology Development Fund,Macao SAR,China(File Nos.0057/2020/AGJ and SKL-IOTSC-2021-2023)Science and Technology Program of Guangdong Province,China(Grant No.2021A0505080009).
文摘Accurate prediction of shield tunneling-induced settlement is a complex problem that requires consideration of many influential parameters.Recent studies reveal that machine learning(ML)algorithms can predict the settlement caused by tunneling.However,well-performing ML models are usually less interpretable.Irrelevant input features decrease the performance and interpretability of an ML model.Nonetheless,feature selection,a critical step in the ML pipeline,is usually ignored in most studies that focused on predicting tunneling-induced settlement.This study applies four techniques,i.e.Pearson correlation method,sequential forward selection(SFS),sequential backward selection(SBS)and Boruta algorithm,to investigate the effect of feature selection on the model’s performance when predicting the tunneling-induced maximum surface settlement(S_(max)).The data set used in this study was compiled from two metro tunnel projects excavated in Hangzhou,China using earth pressure balance(EPB)shields and consists of 14 input features and a single output(i.e.S_(max)).The ML model that is trained on features selected from the Boruta algorithm demonstrates the best performance in both the training and testing phases.The relevant features chosen from the Boruta algorithm further indicate that tunneling-induced settlement is affected by parameters related to tunnel geometry,geological conditions and shield operation.The recently proposed Shapley additive explanations(SHAP)method explores how the input features contribute to the output of a complex ML model.It is observed that the larger settlements are induced during shield tunneling in silty clay.Moreover,the SHAP analysis reveals that the low magnitudes of face pressure at the top of the shield increase the model’s output。
文摘This paper proposes a hybrid feature selection sequence comple-mented with filter and wrapper concepts to improve the accuracy of Machine Learning(ML)based supervised classifiers for classifying the survivability of breast cancer patients into classes,living and deceased using METABRIC and Surveillance,Epidemiology and End Results(SEER)datasets.The ML-based classifiers used in the analysis are:Multiple Logistic Regression,K-Nearest Neighbors,Decision Tree,Random Forest,Support Vector Machine and Multilayer Perceptron.The workflow of the proposed ML algorithm sequence comprises the following stages:data cleaning,data balancing,feature selection via a filter and wrapper sequence,cross validation-based training,testing and performance evaluation.The results obtained are compared in terms of the following classification metrics:Accuracy,Precision,F1 score,True Positive Rate,True Negative Rate,False Positive Rate,False Negative Rate,Area under the Receiver Operating Characteristics curve,Area under the Precision-Recall curve and Mathews Correlation Coefficient.The comparison shows that the proposed feature selection sequence produces better results from all supervised classifiers than all other feature selection sequences considered in the analysis.
基金This project is supported by Scientific Research Foundation of National Defence of China (No.41319040202).
文摘Aiming to deficiency of the filter and wrapper feature selection methods, anew method based on composite method of filter and wrapper method is proposed. First the methodfilters original features to form a feature subset which can meet classification correctness rate,then applies wrapper feature selection method select optimal feature subset. A successful techniquefor solving optimization problems is given by genetic algorithm (GA). GA is applied to the problemof optimal feature selection. The composite method saves computing time several times of the wrappermethod with holding the classification accuracy in data simulation and experiment on bearing faultfeature selection. So this method possesses excellent optimization property, can save more selectiontime, and has the characteristics of high accuracy and high efficiency.
基金National Nature Science Foundation of China (NO.50275121)
文摘An efficient multi-harmonic method is proposed for studying the effects of mistuning on resonant features of bladed disks with blade-to-blade dry friction damping. This method is able to predict accurately the forced response of bladed disks in frequency domain, which is validated by numerical integration method in time domain. The resonant features of both tuned and mistuned systems are investigated by using this method under various system coupling strengths, viscous dampings, and dry friction darnpings, etc. The results demonstrate that the proposed multi-harmonic method is very efficient for studying the mistuning effects on the resonant response of bladed disks with blade-to-blade dry friction damping, especially considering the combined effects of various system parameters.
基金This project is supported by Provincial Project Foundation of Science and Technology of Guangdong, China(No.2002104040101).
文摘In order to provide a guidance to specify the element size dynamically during adaptive finite element mesh generation, adaptive criteria are firstly defined according to the relationships between the geometrical features and the elements of 3D solid. Various modes based on different datum geometrical elements, such as vertex, curve, surface, and so on, are then designed for generating local refined mesh. With the guidance of the defmed criteria, different modes are automatically selected to apply on the appropriate datum objects to program the element size in the local special areas. As a result, the control information of element size is successfully programmed covering the entire domain based on the geometrical features of 3D solid. A new algorithm based on Delatmay triangulation is then developed for generating 3D adaptive finite element mesh, in which the element size is dynamically specified to catch the geometrical features and suitable tetrahedron facets are selected to locate interior nodes continuously. As a result, adaptive mesh with good-quality elements is generated. Examples show that the proposed method can be successfully applied to adaptive finite element mesh automatic generation based on the geometrical features of 3D solid.
基金This research was funded by the Deputyship for Research&Innovation,Ministry of Education in Saudi Arabia under the Project Number(77/442).
文摘Several millions of people suffer from Parkinson’s disease globally.Parkinson’s affects about 1%of people over 60 and its symptoms increase with age.The voice may be affected and patients experience abnormalities in speech that might not be noticed by listeners,but which could be analyzed using recorded speech signals.With the huge advancements of technology,the medical data has increased dramatically,and therefore,there is a need to apply data mining and machine learning methods to extract new knowledge from this data.Several classification methods were used to analyze medical data sets and diagnostic problems,such as Parkinson’s Disease(PD).In addition,to improve the performance of classification,feature selection methods have been extensively used in many fields.This paper aims to propose a comprehensive approach to enhance the prediction of PD using several machine learning methods with different feature selection methods such as filter-based and wrapper-based.The dataset includes 240 recodes with 46 acoustic features extracted from3 voice recording replications for 80 patients.The experimental results showed improvements when wrapper-based features selection method was used with K-NN classifier with accuracy of 88.33%.The best obtained results were compared with other studies and it was found that this study provides comparable and superior results.
文摘In thefield of diagnosis of medical images the challenge lies in tracking and identifying the defective cells and the extent of the defective region within the complex structure of a brain cavity.Locating the defective cells precisely during the diagnosis phase helps tofight the greatest exterminator of mankind.Early detec-tion of these defective cells requires an accurate computer-aided diagnostic system(CAD)that supports early treatment and promotes survival rates of patients.An ear-lier version of CAD systems relies greatly on the expertise of radiologist and it con-sumed more time to identify the defective region.The manuscript takes the efficacy of coalescing features like intensity,shape,and texture of the magnetic resonance image(MRI).In the Enhanced Feature Fusion Segmentation based classification method(EEFS)the image is enhanced and segmented to extract the prominent fea-tures.To bring out the desired effect the EEFS method uses Enhanced Local Binary Pattern(EnLBP),Partisan Gray Level Co-occurrence Matrix Histogram of Oriented Gradients(PGLCMHOG),and iGrab cut method to segment image.These prominent features along with deep features are coalesced to provide a single-dimensional fea-ture vector that is effectively used for prediction.The coalesced vector is used with the existing classifiers to compare the results of these classifiers with that of the gen-erated vector.The generated vector provides promising results with commendably less computatio nal time for pre-processing and classification of MR medical images.
基金supported by the State Grid Science&Technology Project(5100-202114296A-0-0-00).
文摘This article introduces the concept of load aggregation,which involves a comprehensive analysis of loads to acquire their external characteristics for the purpose of modeling and analyzing power systems.The online identification method is a computer-involved approach for data collection,processing,and system identification,commonly used for adaptive control and prediction.This paper proposes a method for dynamically aggregating large-scale adjustable loads to support high proportions of new energy integration,aiming to study the aggregation characteristics of regional large-scale adjustable loads using online identification techniques and feature extraction methods.The experiment selected 300 central air conditioners as the research subject and analyzed their regulation characteristics,economic efficiency,and comfort.The experimental results show that as the adjustment time of the air conditioner increases from 5 minutes to 35 minutes,the stable adjustment quantity during the adjustment period decreases from 28.46 to 3.57,indicating that air conditioning loads can be controlled over a long period and have better adjustment effects in the short term.Overall,the experimental results of this paper demonstrate that analyzing the aggregation characteristics of regional large-scale adjustable loads using online identification techniques and feature extraction algorithms is effective.
基金Supported by the National Natural Science Foundation of China(60772066)
文摘The feature-selection problem in training AdaBoost classifiers is addressed in this paper. A working feature subset is generated by adopting a novel feature subset selection method based on the partial least square (PLS) regression, and then trained and selected from this feature subset in Boosting. The experiments show that the proposed PLS-based feature-selection method outperforms the current feature ranking method and the random sampling method.
基金supported by the National Natural Science Foundation of China(Grant No.40875012)the National Basic Research Program of China(Grant No.2009CB421502)the Meteorology Open Fund of Huaihe River Basin(HRM200704).
文摘We describe how the Unit-Feature Spatial Classification Method(UFSCM) can be used operationally to classify cloud types in satellite imagery efficiently and conveniently.By using a combination of Interactive Data Language(IDL) and Visual C++(VC) code in combination to extend the technique in three dimensions(3-D),this paper provides an efficient method to implement interactive computer visualization of the 3-D discrimination matrix modification,so as to deal with the bi-spectral limitations of traditional two dimensional(2-D) UFSCM.The case study of cloud-type classification based on FY-2C satellite data (0600 UTC 18 and 0000 UTC 10 September 2007) is conducted by comparison with ground station data, and indicates that 3-D UFSCM makes more use of the pattern recognition information in multi-spectral imagery,resulting in more reasonable results and an improvement over the 2-D method.
基金sponsored by National Key R&D Program of China(2018YFC1504504)Youth Foundation of Yunnan Earthquake Agency(2021K01)Project of Yunnan Earthquake Agency“Chuan bang dai”(CQ3-2021001).
文摘In order to improve the accuracy of building structure identification using remote sensing images,a building structure classification method based on multi-feature fusion of UAV remote sensing image is proposed in this paper.Three identification approaches of remote sensing images are integrated in this method:object-oriented,texture feature,and digital elevation based on DSM and DEM.So RGB threshold classification method is used to classify the identification results.The accuracy of building structure classification based on each feature and the multi-feature fusion are compared and analyzed.The results show that the building structure classification method is feasible and can accurately identify the structures in large-area remote sensing images.
文摘The task of indoor visual localization, utilizing camera visual information for user pose calculation, was a core component of Augmented Reality (AR) and Simultaneous Localization and Mapping (SLAM). Existing indoor localization technologies generally used scene-specific 3D representations or were trained on specific datasets, making it challenging to balance accuracy and cost when applied to new scenes. Addressing this issue, this paper proposed a universal indoor visual localization method based on efficient image retrieval. Initially, a Multi-Layer Perceptron (MLP) was employed to aggregate features from intermediate layers of a convolutional neural network, obtaining a global representation of the image. This approach ensured accurate and rapid retrieval of reference images. Subsequently, a new mechanism using Random Sample Consensus (RANSAC) was designed to resolve relative pose ambiguity caused by the essential matrix decomposition based on the five-point method. Finally, the absolute pose of the queried user image was computed, thereby achieving indoor user pose estimation. The proposed indoor localization method was characterized by its simplicity, flexibility, and excellent cross-scene generalization. Experimental results demonstrated a positioning error of 0.09 m and 2.14° on the 7Scenes dataset, and 0.15 m and 6.37° on the 12Scenes dataset. These results convincingly illustrated the outstanding performance of the proposed indoor localization method.
文摘ICSED (Improved Cluster Shade Edge Detection) algorithm and other various methods to accurately and efficiently detect edges on satellite data are presented. Error rate criterion is used to statistically evaluate the performances of these methods in detecting oceanic features for both noise free and noise contaminated AVHRR (Advanced Very High Resolution Radiometer) IR image with Kuroshio. Also, practical experiments in detecting the eddy of Kuroshio with these methods are carried out for comparison. Results show that the ICSED algorithm has more advantages than other methods in detecting mesoscale features of ocean. Finally, the effectiveness of window size of ICSED method to oceanic features detection is quantitatively discussed.
基金The project entitled "Urban Active Fault Surveying Project"(143623) funded by the National Development and Roform Commission of China"Active Faults Exploration and Seismic Hazard Assessment in Zibo City"(SD1501) funded by the Department of Science & Technology of Shangdong Province,China
文摘Electrical resistivity tomography (ERT) has been used to experimentally detect shallow buried faults in urban areas in the past a few years, with some progress and experience obtained. According to the results from Olympic Park, Beijing, Shandong Province, Gansu Province and Shanxi Province, we have generalized the method and procedure for inferring the discontinuity of electrical structures (DES) indicating a buried fault in urban areas from resistivity tomograms and its typical electrical features. In general, the layered feature of the electrical structure is first analyzed to preliminarily define whether or not a DES exists in the target area. Resistivity contours in resistivity tomograms are then analyzed from the deep to the shallow. If they extend upward from the deep to the shallow and shape into an integral dislocation, sharp flexure (convergence) or gradient zone, it is inferred that the DES exists, indicating a buried fault. Finally, horizontal tracing is be carried out to define the trend of the DES. The DES can be divided into three types-type AB, ABA and AC. In the present paper, the Zhangdian-Renhe fault system in Zibo city is used as an example to illustrate how to use the method to infer the location and spatial extension of a target fault. Geologic drilling holes are placed based on our research results, and the drilling logs testify that our results are correct. However, the method of this paper is not exclusive and inflexible. It is expected to provide reference and assistance for inferring the shallow buried faults in urban areas from resistivity tomograms in the future.
基金Project(13&ZD024)supported by the Major Program of the National Social Science Fund of ChinaProject(71073177)supported by the National Natural Science Foundation of China+3 种基金Project(CX2012B107)supported by the Graduate Student Innovation Project of Hunan Province,ChinaProject(13YJAZH149)supported by the Social Science Fund of Ministry of Education of ChinaProject(2011ZK2043)supported by the Key Program of the Soft Science Research Project of Hunan Province,ChinaProject(12JJ4077)supported by Natural Science Foundation of Hunan Province of China
文摘An empirical test on long memory between price and trading volume of China metals futures market was given with MF-DCCA method. The empirical results show that long memory feature with a certain period exists in price-volume correlation and a fittther proof was given by analyzing the source of multifractal feature. The empirical results suggest that it is of important practical significance to bring the fractal market theory and other nonlinear theory into the analysis and explanation of the behavior in metal futures market.
基金jointly funded by the National Natural Science Foundation of China(41771259&41171034)Natural Science Basic Research Plan in Shaanxi Province of China(2023-JC-ZD-30&2019JZ-45)+1 种基金Shaanxi Provincial Water Conservancy Science and Technology Project(2016slkj-11)Shaanxi Provincial Key Laboratory Project of Department of Education(14JS059).
文摘Tillage methods play a crucial role in controlling rainwater partitioning and soil erosion.This study utilized rainfall simulation experiments to investigate the impact of four tillage methods(manual digging(MD),manual hoeing(MH),traditional ploughing(TP),and ridged ploughing(RP))on runoff and soil erosion at the plot scale.The smooth slope(SS)was used as a benchmark.Rainfall intensities of 30,60,90,and 120 mm h−1 were considered.The study revealed that tillage altered rainwater distribution into depression storage,infiltration,and runoff.Tillage reduces runoff and increases infiltration.The four tillage methods(30–73%)increased the proportion of rainwater converted to infiltration to varying degrees compared to the SS(22–53%).Microrelief features influenced the role of tillage methods in soil erosion.Surface roughness and depression storage accounted for 79%of the variation in sediment yield.The four tillage methods reduced runoff by 2.1–64.7%and sediment yield by 2.5–77.2%.Moreover,increased rainfall intensity weakens the ability of tillage to control soil erosion.When rainfall intensity increased to 120 mm h−1,there was no significant difference in runoff yield among RP,TP,MH,and SS.Therefore,assessing the effectiveness of tillage in reducing soil erosion should consider changes in rainfall intensity.Additionally,the cover management(C)factor of the RUSLE was used to assess the effects of different tillage methods on soil loss.Overall,the C factor values for tilled slopes are in the order MH>TP>RP>MD with a range of 0.23–0.97.As the surface roughness increases,the C factor tends to decrease,and the two are exponential functions(R2=0.86).These studies contribute to our understanding of how different tillage methods impact runoff and soil erosion in sloped farmland and provide guidance for selecting appropriate local manual tillage methods.
基金the National Natural Science Foundation of China(Grant No.52278018).
文摘Formulating criteria for the assessment system of historic settlements is challenging due to complex geographical conditions and evaluator knowledge limitations, leading to subjective bias in the assessment process. To address this issue, this study proposes a data-driven method for assessing the features of historical settlements to carry out scientific and refined assessment and result analysis. Focusing on Northeast Hubei as the study area, this paper selects 3 historical settlements for validation and analysis. The results of the study show that (1) the data-driven method expands the methodological chain of assessing historical settlement features, and improves the assessment efficiency and scientificity of the assessment results by applying it to the new assessment process;(2) Through comparing the assessment results of the validation cases and data samples, the study establishes a comprehensive quantitative ranking of the assessment of historical settlement features and identifies the main influencing factors, thus enhancing the precision of result analysis;(3) By comparing the resulting assessment framework with the current assessment system, this study confirms the advantages of the proposed framework in identifying nuanced features and aligning with geographical conditions, thereby verifying the effectiveness of the data-driven method.