Aiming at the rapid identification of rural buildings in complex environments from high-spatialresolution images, an improved Mahalanobis distance colour segmentation method(IMDCSM) is proposed and realised in Red, Gr...Aiming at the rapid identification of rural buildings in complex environments from high-spatialresolution images, an improved Mahalanobis distance colour segmentation method(IMDCSM) is proposed and realised in Red, Green and Blue(RGB) space. Vector sets of a lower discrete degree are obtained by filtering the colour vector sets of the building samples, and a standard ellipsoid equation can be constructed based on these vector sets. The threshold of interested colour range can be flexibly and intuitively selected by changing the shape and size of this ellipsoid. Then, according to the relationship between the location of the image pixel colour vector and the ellipsoid, all building information can be extracted quickly. To verify the effectiveness of the proposed method, unmanned aerial vehicle(UAV) images of two areas in the suburbs of Chengdu city and Deyang city were utilised as experimental data for image segmentation, and the existing colour segmentation method based on the Mahalanobis distance was selected as an indicator to assess the effectiveness of this method. The experimental results demonstrate that the completeness and correctness of this method reached 95% and 83.0%, respectively, values that are higher than those of the Mahalanobis distance colour segmentation method(MDCSM). In general, this method is suitable for the rapid extraction of rural building information, and provides a new threshold selection method for classification.展开更多
In order to select effective feature subsets for pattern classification, a novel statistics rough set method is presented based on generalized attribute reduction. Unlike classical reduction approaches, the objects in...In order to select effective feature subsets for pattern classification, a novel statistics rough set method is presented based on generalized attribute reduction. Unlike classical reduction approaches, the objects in universe of discourse are signs of training sample sets and values of attributes are taken as statistical parameters. The binary relation and discernibility matrix for the reduction are induced by distance function. Furthermore, based on the monotony of the distance function defined by Mahalanobis distance, the effective feature subsets are obtained as generalized attribute reducts. Experiment result shows that the classification performance can be improved by using the selected feature subsets.展开更多
The Mahalanobis distance features proposed by P.C.Mahalanobis, an Indian statistician, can be used in an automatic on-line cutting tool condition monitoring process based on digital image processing. In this paper, a ...The Mahalanobis distance features proposed by P.C.Mahalanobis, an Indian statistician, can be used in an automatic on-line cutting tool condition monitoring process based on digital image processing. In this paper, a new method of obtaining Mahalanobis distance features from a tool image is proposed. The key of calculating Mahalanobis distance is appropriately dividing the object into several component sets. Firstly, a technique is proposed that can automatically divide the component groups for calculating Mahalanobis distance based on the gray level of wearing or breakage regions in a tool image. The wearing region can be divided into high gray level component group and the tool-blade into low one. Then, the relation between Mahalanobis distance features of component groups and tool conditions is investigated. The results indicate that the high brightness region on the flank surface of the turning tool will change with its abrasion change and if the tool is heavily abraded, the area of high brightness will increase apparently. The Mahalanobis distance features of high gray level component group are related with wearing state of tool and low gray level component group correlated with breakage of tool. The experimental results show that the abrasion of the tool’s flank surface affected the Mahalanobis distances of high brightness component of the tool and the pixels of high brightness component set. Compared with the changes of them, we found that the Mahalanobis distance of high brightness component of the tool was more sensitive to the abrasion of cutting tool than the area of high brightness component set of the tool. Here we found that the relative changing rate of the area of high brightness component set was not quite obvious and it was ranging from 2% to 15%, while the relative changing rate of the Mahalanobis distance in table 1 ranges from 13.9% to 47%. It is 3 times higher than the changing rate of the area.展开更多
Inertial Navigation System/Celestial Navigation System(INS/CNS)integration,especially for the tightly-coupled mode,provides a promising autonomous tactics for Hypersonic Vehicle(HV)in military demands.However,INS/CNS ...Inertial Navigation System/Celestial Navigation System(INS/CNS)integration,especially for the tightly-coupled mode,provides a promising autonomous tactics for Hypersonic Vehicle(HV)in military demands.However,INS/CNS integration is a challenging research task due to its special characteristics such as strong nonlinearity,non-additive noise and dynamic complexity.This paper presents a novel nonlinear filtering method for INS/CNS integration by adopting the emerging Cubature Kalman Filter(CKF)to handle the strong INS error model nonlinearity caused by HV's high dynamics.It combines the state-augmentation technique into the nonlinear CKF to decrease the negative effect of non-additive noise in inertial measurements.Subsequently,a technique for the detection of dynamic model uncertainty is developed,and the augmented CKF is modified with fading memory to tackle dynamic model uncertainty by rigorously deriving the fading factor via the theory of Mahalanobis distance without artificial empiricism.Simulation results and comparison analysis prove that the proposed method can effectively curb the adverse impacts of non-additive noise and dynamic model uncertainty for inertial measurements,leading to improved performance for HV navigation with tightly-coupled INS/CNS integration.展开更多
A new fault diagnosis technique for rolling element bearing using multi-scale Lempel-Ziv complexity(LZC) and Mahalanobis distance(MD) criterion is proposed in this study. A multi-scale coarse-graining process is used ...A new fault diagnosis technique for rolling element bearing using multi-scale Lempel-Ziv complexity(LZC) and Mahalanobis distance(MD) criterion is proposed in this study. A multi-scale coarse-graining process is used to extract fault features for various bearing fault conditions to overcome the limitation of the single stage coarse-graining process in the LZC algorithm. This is followed by the application of MD criterion to calculate the accuracy rate of LZC at different scales, and the best scale corresponding to the maximum accuracy rate is identified for fault pattern recognition. A comparison analysis with Euclidean distance(ED) criterion is also presented to verify the superiority of the proposed method. The result confirms that the fault diagnosis technique using a multi-scale LZC and MD criterion is more effective in distinguishing various fault conditions of rolling element bearings.展开更多
Accurate diagnosis of different bronchopul- monary diseases is important in clinical practice. This study involved 20 healthy volunteers and 77 patients with bronchopulmonary diseases, including chronic obstructive pu...Accurate diagnosis of different bronchopul- monary diseases is important in clinical practice. This study involved 20 healthy volunteers and 77 patients with bronchopulmonary diseases, including chronic obstructive pulmonary disease (COPD), bronchial asthma, pulmonary tuberculosis, and community-acquired pneumonia. The absorption spectrum of exhaled air samples was recorded on an intra-cavity photo-acoustic gas analyzer (ILPA-1, Special Technologies, Ltd., Russia) with photo-acoustic detectors and CO2 laser with a tuning range from 9.2 to 10.8μm. In conclusion, analysis of the Mahalanobis distance-based absorption spectral profiles of breath air from bronchopulmonary patients and healthy volunteers allows the formulation of a preliminary diagnosis.展开更多
When building a classification model,the scenario where the samples of one class are significantly more than those of the other class is called data imbalance.Data imbalance causes the trained classification model to ...When building a classification model,the scenario where the samples of one class are significantly more than those of the other class is called data imbalance.Data imbalance causes the trained classification model to be in favor of the majority class(usually defined as the negative class),which may do harm to the accuracy of the minority class(usually defined as the positive class),and then lead to poor overall performance of the model.A method called MSHR-FCSSVM for solving imbalanced data classification is proposed in this article,which is based on a new hybrid resampling approach(MSHR)and a new fine cost-sensitive support vector machine(CS-SVM)classifier(FCSSVM).The MSHR measures the separability of each negative sample through its Silhouette value calculated by Mahalanobis distance between samples,based on which,the so-called pseudo-negative samples are screened out to generate new positive samples(over-sampling step)through linear interpolation and are deleted finally(under-sampling step).This approach replaces pseudo-negative samples with generated new positive samples one by one to clear up the inter-class overlap on the borderline,without changing the overall scale of the dataset.The FCSSVM is an improved version of the traditional CS-SVM.It considers influences of both the imbalance of sample number and the class distribution on classification simultaneously,and through finely tuning the class cost weights by using the efficient optimization algorithm based on the physical phenomenon of rime-ice(RIME)algorithm with cross-validation accuracy as the fitness function to accurately adjust the classification borderline.To verify the effectiveness of the proposed method,a series of experiments are carried out based on 20 imbalanced datasets including both mildly and extremely imbalanced datasets.The experimental results show that the MSHR-FCSSVM method performs better than the methods for comparison in most cases,and both the MSHR and the FCSSVM played significant roles.展开更多
This study presents a proposed method for assessing the condition and predicting the future status of condensers operating in seawater over an extended period.The aim is to address the problems of scaling and corrosio...This study presents a proposed method for assessing the condition and predicting the future status of condensers operating in seawater over an extended period.The aim is to address the problems of scaling and corrosion,which lead to increased loss of cold resources.The method involves utilising a set of multivariate feature parameters associated with the condenser as input for evaluation and trend prediction.This methodology offers a precise means of determining the optimal timing for condenser cleaning,with the ultimate goal of improving its overall performance.The proposed approach involves the integration of the analytic network process with subjective expert experience and the entropy weightmethod with objective big data analysis to develop a fusion health degreemodel.The mathematical model is constructed quantitatively using the improved Mahalanobis distance.Furthermore,a comprehensive prediction model is developed by integrating the improved Informer model and Markov error correction.This model takes into account the health status of the equipment and several influencing factors,includingmultivariate feature characteristics.This model facilitates the objective examination and prediction of the progression of equipment deterioration trends.The present study involves the computation and verification of the field time series data,which serves to demonstrate the accuracy of the condenser health-related models proposed in this research.These models effectively depict the real condition and temporal variations of the equipment,thus offering a valuable method for determining the precise cleaning time required for the condenser.展开更多
Peanuts pods grow underground and mature unevenly, resulting that choosing the correct time to harvest is more complicated than other crops. Pod maturity can be determined by blasting with a pressure washer to remove ...Peanuts pods grow underground and mature unevenly, resulting that choosing the correct time to harvest is more complicated than other crops. Pod maturity can be determined by blasting with a pressure washer to remove outer skin of the pod (exocarp) to expose the color of the middle layer (mesocarp). The mesocarp color changes with maturity from white to yellow, orange, brown and finally black. The sum of percentage from orange, brown, and black mesocarp (OBB) color and black color (BL) represents the kernels that are mature enough to harvest. The goal of this research is to identify methodologies to estimate OBB and BL of the pods using RGB images taken in the field and validate the proposed model using other pod images. The Mahalanobis distance classification method was used to process sets of images and calculate pod area (number of pixels) corresponding to two classes (mesocarp and background) with nine different color groups. The results showed a performance of 94% effectiveness for mesocarp using Mahalanobis distance classification. Statistical regression for OBB and BL was developed based on 315 images of peanut pods taken from the field. The R2 and root mean square error of predicted and actual OBB were 0.93 and 4.1%, respectively. The R2 and root mean square error of predicted and actual BL were 0.88 and 1.8%, respectively. The validation of OBB using other images provided reasonable estimation (R2 = 0.98 and RMSE = 2.73%). This study introduces a novel, cost-effective, and non-destructive method for estimating peanut maturity using RGB imagery and Mahalanobis distance classification in the field. This innovative approach addresses the limitations of traditional methods and offers a robust alternative for real-time maturity assessment.展开更多
One hundred and sixty-eight genotypes of cotton from the same growing region were used as a germplasm group to study the validity of different genetic distances in constructing cotton core subset. Mixed linear model a...One hundred and sixty-eight genotypes of cotton from the same growing region were used as a germplasm group to study the validity of different genetic distances in constructing cotton core subset. Mixed linear model approach was employed to unbiasedly predict genotypic values of 20 traits for eliminating the environmental effect. Six commonly used genetic distances(Euclidean,standardized Euclidean,Mahalanobis,city block,cosine and correlation distances) combining four commonly used hierarchical cluster methods(single distance,complete distance,unweighted pair-group average and Ward's methods) were used in the least distance stepwise sampling(LDSS) method for constructing different core subsets. The analyses of variance(ANOVA) of different evaluating parameters showed that the validities of cosine and correlation distances were inferior to those of Euclidean,standardized Euclidean,Mahalanobis and city block distances. Standardized Euclidean distance was slightly more effective than Euclidean,Mahalanobis and city block distances. The principal analysis validated standardized Euclidean distance in the course of constructing practical core subsets. The covariance matrix of accessions might be ill-conditioned when Mahalanobis distance was used to calculate genetic distance at low sampling percentages,which led to bias in small-sized core subset construction. The standardized Euclidean distance is recommended in core subset construction with LDSS method.展开更多
The local skin temperatures of 22 subjects at air temperatures of 21,24,26,29 ℃ are measured,and the mean skin temperatures are calculated by ten skin temperature measuring points.The thermal comfort levels and the t...The local skin temperatures of 22 subjects at air temperatures of 21,24,26,29 ℃ are measured,and the mean skin temperatures are calculated by ten skin temperature measuring points.The thermal comfort levels and the thermal sensations of these subjects are also investigated by a questionnaire.The Mahalanobis distance discrimination method is applied to establish the evaluation model for the thermal comfort based on the mean skin temperature.The experimental results indicate that the difference of the mean skin temperatures between the comfort level and the discomfort level is significant.Using the evaluation model,the mean skin temperature at the thermal comfort level is 32.6 to 33.7 ℃,and the thermal comfort levels of 72% of the subjects are correctly evaluated.The accuracy of the evaluation model can be improved when the effects of sex of the subject on the mean skin temperature and the thermal comfort are considered.It can be concluded that the mean skin temperature can be used as an effective physiological indicator to evaluate human thermal comfort in a steady thermal environment.展开更多
Land suitability assessment is a prerequisite phase in land use planning; it guides toward optimal land use by providing information on the opportunities and constraints involved in the use of a given land area. A geo...Land suitability assessment is a prerequisite phase in land use planning; it guides toward optimal land use by providing information on the opportunities and constraints involved in the use of a given land area. A geographic information system-based procedure, known as rural settlement suitability evaluation(RSSE) using an improved technique for order preference by similarity to ideal solution(TOPSIS), was adopted to determine the most suitable area for constructing rural settlements in different geographical locations. Given the distribution and independence of rural settlements, a distinctive evaluation criteria system that differed from that of urban suitability was established by considering the level of rural infrastructure services as well as living and working conditions. The unpredictable mutual interference among evaluation factors has been found in practical works. An improved TOPSIS using Mahalanobis distance was applied to solve the unpredictable correlation among the criteria in a suitability evaluation. Uncertainty and sensitivity analyses obtained via Monte Carlo simulation were performed to examine the robustness of the model. Daye, a resource-based city with rapid economic development, unsatisfied rural development, and geological environmental problems caused by mining, was used as a case study. Results indicate the following findings: 1) The RSSE model using the improved TOPSIS can assess the suitability of rural settlements, and the suitability maps generated using the improved TOPSIS have higher information density than those generated using traditional TOPSIS. The robustness of the model is improved, and the uncertainty is reduced in the suitability results. 2) Highly suitable land is mainly distributed in the northeast of the study area, and the majority of which is cultivated land, thereby leading to tremendous pressure on the loss of cultivated land. 3) Lastly, 12.54% of the constructive expansion permitted zone and 8.36% of the constructive expansion conditionally permitted zone are situated in an unsuitable area, which indicates that the general planning of Daye lacks the necessary verification of suitability evaluation. Guidance is provided on the development strategy of rural settlement patches to support decision making in general land use planning.展开更多
Nondestructive techniques for appraising gas metal arc welding(GMAW) faults plays a very important role in on-line quality controllability and prediction of the GMAW process. On-line welding quality controllability ...Nondestructive techniques for appraising gas metal arc welding(GMAW) faults plays a very important role in on-line quality controllability and prediction of the GMAW process. On-line welding quality controllability and prediction have several disadvantages such as high cost, low efficiency, complication and greatly being affected by the environment. An enhanced, efficient evaluation technique for evaluating welding faults based on Mahalanobis distance(MD) and normal distribution is presented. In addition, a new piece of equipment, designated the weld quality tester(WQT), is developed based on the proposed evaluation technique. MD is superior to other multidimensional distances such as Euclidean distance because the covariance matrix used for calculating MD takes into account correlations in the data and scaling. The values of MD obtained from welding current and arc voltage are assumed to follow a normal distribution. The normal distribution has two parameters: the meanm and standard deviations of the data. In the proposed evaluation technique used by the WQT, values of MD located in the range from zero tom+3s are regarded as “good”. Two experiments which involve changing the flow of shielding gas and smearing paint on the surface of the substrate are conducted in order to verify the sensitivity of the proposed evaluation technique and the feasibility of using WQT. The experimental results demonstrate the usefulness of the WQT for evaluating welding quality. The proposed technique can be applied to implement the on-line welding quality controllability and prediction, which is of great importance to design some novel equipment for weld quality detection.展开更多
Various uncertainties arising during acquisition process of geoscience data may result in anomalous data instances(i.e.,outliers)that do not conform with the expected pattern of regular data instances.With sparse mult...Various uncertainties arising during acquisition process of geoscience data may result in anomalous data instances(i.e.,outliers)that do not conform with the expected pattern of regular data instances.With sparse multivariate data obtained from geotechnical site investigation,it is impossible to identify outliers with certainty due to the distortion of statistics of geotechnical parameters caused by outliers and their associated statistical uncertainty resulted from data sparsity.This paper develops a probabilistic outlier detection method for sparse multivariate data obtained from geotechnical site investigation.The proposed approach quantifies the outlying probability of each data instance based on Mahalanobis distance and determines outliers as those data instances with outlying probabilities greater than 0.5.It tackles the distortion issue of statistics estimated from the dataset with outliers by a re-sampling technique and accounts,rationally,for the statistical uncertainty by Bayesian machine learning.Moreover,the proposed approach also suggests an exclusive method to determine outlying components of each outlier.The proposed approach is illustrated and verified using simulated and real-life dataset.It showed that the proposed approach properly identifies outliers among sparse multivariate data and their corresponding outlying components in a probabilistic manner.It can significantly reduce the masking effect(i.e.,missing some actual outliers due to the distortion of statistics by the outliers and statistical uncertainty).It also found that outliers among sparse multivariate data instances affect significantly the construction of multivariate distribution of geotechnical parameters for uncertainty quantification.This emphasizes the necessity of data cleaning process(e.g.,outlier detection)for uncertainty quantification based on geoscience data.展开更多
A wireless sensor network(WSN)consists of several tiny sensor nodes to monitor,collect,and transmit the physical information from an environment through the wireless channel.The node failure is considered as one of th...A wireless sensor network(WSN)consists of several tiny sensor nodes to monitor,collect,and transmit the physical information from an environment through the wireless channel.The node failure is considered as one of the main issues in the WSN which creates higher packet drop,delay,and energy consumption during the communication.Although the node failure occurred mostly due to persistent energy exhaustion during transmission of data packets.In this paper,Artificial Neural Network(ANN)based Node Failure Detection(NFD)is developed with cognitive radio for detecting the location of the node failure.The ad hoc on-demand distance vector(AODV)routing protocol is used for transmitting the data from the source node to the base station.Moreover,the Mahalanobis distance is used for detecting an adjacent node to the node failure which is used to create the routing path without any node failure.The performance of the proposed ANN-NFD method is analysed in terms of throughput,delivery rate,number of nodes alive,drop rate,end to end delay,energy consumption,and overhead ratio.Furthermore,the performance of the ANN-NFD method is evaluated with the header to base station and base station to header(H2B2H)protocol.The packet delivery rate of the ANN-NFD method is 0.92 for 150 nodes that are high when compared to the H2B2H protocol.Hence,the ANN-NFD method provides data consistency during data transmission under node and battery failure.展开更多
The applications of laser-induced breakdown spectroscopy(LIBS) on classifying complex natural organics are relatively limited and their accuracy still requires improvement.In this work,to study the methods on classifi...The applications of laser-induced breakdown spectroscopy(LIBS) on classifying complex natural organics are relatively limited and their accuracy still requires improvement.In this work,to study the methods on classification of complex organics,three kinds of fresh leaves were measured by LIBS.100 spectra from 100 samples of each kind of leaves were measured and then they were divided into a training set and a test set in a ratio of 7:3.Two algorithms of chemometric methods including the partial least squares discriminant analysis(PLS-DA) and principal component analysis Mahalanobis distance(PCA-MD) were used to identify these leaves.By using 23 lines from 16 elements or molecules as input data,these two methods can both classify these three kinds of leaves successfully.The classification accuracies of training sets are both up to 100% by PCA-MD and PLS-DA.The classification accuracies of the test set are 93.3% by PCA-MD and 97.8% by PLS-DA.It means that PLS-DA is better than PCA-MD in classifying plant leaves.Because the components in PLS-DA process are more suitable for classification than those in PCA-MD process.We think that this work can provide a reference for plant traceability using LIBS.展开更多
This paper describes an improved algorithm for fuzzy c-means clustering of remotely sensed data, by which the degree of fuzziness of the resultant classification is de- creased as comparing with that by a conventional...This paper describes an improved algorithm for fuzzy c-means clustering of remotely sensed data, by which the degree of fuzziness of the resultant classification is de- creased as comparing with that by a conventional algorithm: that is, the classification accura- cy is increased. This is achieved by incorporating covariance matrices at the level of individual classes rather than assuming a global one. Empirical results from a fuzzy classification of an Edinburgh suburban land cover confirmed the improved performance of the new algorithm for fuzzy c-means clustering, in particular when fuzziness is also accommodated in the assumed reference data.展开更多
As the key component in aeroengine rotor systems,the health status of rolling bearings directly influences the reliability and safety of aeroengine rotor systems.In order to monitor rolling bearing conditions,a fusion...As the key component in aeroengine rotor systems,the health status of rolling bearings directly influences the reliability and safety of aeroengine rotor systems.In order to monitor rolling bearing conditions,a fusion fault diagnosis method,namely empirical mode decomposition(EMD)-Mahalanobis distance(E2MD)and improved wavelet threshold(IWT)(E2MD-IWT)for vibrational signals and acoustic emission(AE)signals is developed to improve the diagnostic accuracy of rolling bearings.The IWT method is proposed with a hard wavelet threshold and a soft wavelet threshold.Moreover,it is shown to be effective through numerical simulation.EMD is utilized to process the original AE signals for rolling bearings so as to generate a set of components called intrinsic modes functions(IMFs).The Mahalanobis distance(MD)approach is introduced in order to determine the smallest MD between the original AE signal and IMF components.Then,the IWT approach is employed to select the IMF components with the largest MD.It is demonstrated that the proposed E2MD-IWT method for vibrational and AE signals can improve rolling bearing fault diagnosis,beyond its ability to effectively eliminate noise signals.This study offers a promising approach to fault diagnosis for rolling bearings in aeroengines with regard to vibration signals and AE signals.展开更多
Hardware Trojan(HT) refers to a special module intentionally implanted into a chip or an electronic system. The module can be exploited by the attacker to achieve destructive functions. Unfortunately the HT is difficu...Hardware Trojan(HT) refers to a special module intentionally implanted into a chip or an electronic system. The module can be exploited by the attacker to achieve destructive functions. Unfortunately the HT is difficult to detecte due to its minimal resource occupation. In order to achieve an accurate detection with high efficiency, a HT detection method based on the electromagnetic leakage of the chip is proposed in this paper. At first, the dimensionality reduction and the feature extraction of the electromagnetic leakage signals in each group(template chip, Trojan-free chip and target chip) were realized by principal component analysis(PCA). Then, the Mahalanobis distances between the template group and the other groups were calculated. Finally, the differences between the Mahalanobis distances and the threshold were compared to determine whether the HT had been implanted into the target chip. In addition, the concept of the HT Detection Quality(HTDQ) was proposed to analyze and compare the performance of different detection methods. Our experiment results indicate that the accuracy of this detection method is 91.93%, and the time consumption is 0.042s in average, which shows a high HTDQ compared with three other methods.展开更多
In this short review paper, the significant and profound impacts of the Chou’s “invariance theorem” have been briefly presented with crystal clear convincingness.
基金supported by National Science and Technology Support Project of the 12th Five-Year Plan of China (Grant No.2014BAL01B04)Sichuan Provincial Department of Land and Resources Research Project (Grant No.KJ-2018-13)
文摘Aiming at the rapid identification of rural buildings in complex environments from high-spatialresolution images, an improved Mahalanobis distance colour segmentation method(IMDCSM) is proposed and realised in Red, Green and Blue(RGB) space. Vector sets of a lower discrete degree are obtained by filtering the colour vector sets of the building samples, and a standard ellipsoid equation can be constructed based on these vector sets. The threshold of interested colour range can be flexibly and intuitively selected by changing the shape and size of this ellipsoid. Then, according to the relationship between the location of the image pixel colour vector and the ellipsoid, all building information can be extracted quickly. To verify the effectiveness of the proposed method, unmanned aerial vehicle(UAV) images of two areas in the suburbs of Chengdu city and Deyang city were utilised as experimental data for image segmentation, and the existing colour segmentation method based on the Mahalanobis distance was selected as an indicator to assess the effectiveness of this method. The experimental results demonstrate that the completeness and correctness of this method reached 95% and 83.0%, respectively, values that are higher than those of the Mahalanobis distance colour segmentation method(MDCSM). In general, this method is suitable for the rapid extraction of rural building information, and provides a new threshold selection method for classification.
基金This work was supported by the National Basic Research Program of China(No.2001CB309403)
文摘In order to select effective feature subsets for pattern classification, a novel statistics rough set method is presented based on generalized attribute reduction. Unlike classical reduction approaches, the objects in universe of discourse are signs of training sample sets and values of attributes are taken as statistical parameters. The binary relation and discernibility matrix for the reduction are induced by distance function. Furthermore, based on the monotony of the distance function defined by Mahalanobis distance, the effective feature subsets are obtained as generalized attribute reducts. Experiment result shows that the classification performance can be improved by using the selected feature subsets.
文摘The Mahalanobis distance features proposed by P.C.Mahalanobis, an Indian statistician, can be used in an automatic on-line cutting tool condition monitoring process based on digital image processing. In this paper, a new method of obtaining Mahalanobis distance features from a tool image is proposed. The key of calculating Mahalanobis distance is appropriately dividing the object into several component sets. Firstly, a technique is proposed that can automatically divide the component groups for calculating Mahalanobis distance based on the gray level of wearing or breakage regions in a tool image. The wearing region can be divided into high gray level component group and the tool-blade into low one. Then, the relation between Mahalanobis distance features of component groups and tool conditions is investigated. The results indicate that the high brightness region on the flank surface of the turning tool will change with its abrasion change and if the tool is heavily abraded, the area of high brightness will increase apparently. The Mahalanobis distance features of high gray level component group are related with wearing state of tool and low gray level component group correlated with breakage of tool. The experimental results show that the abrasion of the tool’s flank surface affected the Mahalanobis distances of high brightness component of the tool and the pixels of high brightness component set. Compared with the changes of them, we found that the Mahalanobis distance of high brightness component of the tool was more sensitive to the abrasion of cutting tool than the area of high brightness component set of the tool. Here we found that the relative changing rate of the area of high brightness component set was not quite obvious and it was ranging from 2% to 15%, while the relative changing rate of the Mahalanobis distance in table 1 ranges from 13.9% to 47%. It is 3 times higher than the changing rate of the area.
基金co-supported by the National Natural Science Foundation of China(Nos.41904028,42004021)the Natural Science Basic Research Plan in Shaanxi Province of China(Nos.2020JQ-150,2020JQ-234)the Soft Science Project of Xi’an Science and Technology Plan(No.XA2020RKXYJ-0150)。
文摘Inertial Navigation System/Celestial Navigation System(INS/CNS)integration,especially for the tightly-coupled mode,provides a promising autonomous tactics for Hypersonic Vehicle(HV)in military demands.However,INS/CNS integration is a challenging research task due to its special characteristics such as strong nonlinearity,non-additive noise and dynamic complexity.This paper presents a novel nonlinear filtering method for INS/CNS integration by adopting the emerging Cubature Kalman Filter(CKF)to handle the strong INS error model nonlinearity caused by HV's high dynamics.It combines the state-augmentation technique into the nonlinear CKF to decrease the negative effect of non-additive noise in inertial measurements.Subsequently,a technique for the detection of dynamic model uncertainty is developed,and the augmented CKF is modified with fading memory to tackle dynamic model uncertainty by rigorously deriving the fading factor via the theory of Mahalanobis distance without artificial empiricism.Simulation results and comparison analysis prove that the proposed method can effectively curb the adverse impacts of non-additive noise and dynamic model uncertainty for inertial measurements,leading to improved performance for HV navigation with tightly-coupled INS/CNS integration.
基金the National Natural Science Foundation of China(No.51075220)the Special Research Fund for the Higher Education Doctoral Program(No.20123721110001)+1 种基金the Basic Research Project of Qingdao Science and Technology Plan(No.12-1-4-4-(3)-JCH)the Privileged Shandong Provincial Government’s "Taishan Scholar" Program
文摘A new fault diagnosis technique for rolling element bearing using multi-scale Lempel-Ziv complexity(LZC) and Mahalanobis distance(MD) criterion is proposed in this study. A multi-scale coarse-graining process is used to extract fault features for various bearing fault conditions to overcome the limitation of the single stage coarse-graining process in the LZC algorithm. This is followed by the application of MD criterion to calculate the accuracy rate of LZC at different scales, and the best scale corresponding to the maximum accuracy rate is identified for fault pattern recognition. A comparison analysis with Euclidean distance(ED) criterion is also presented to verify the superiority of the proposed method. The result confirms that the fault diagnosis technique using a multi-scale LZC and MD criterion is more effective in distinguishing various fault conditions of rolling element bearings.
文摘Accurate diagnosis of different bronchopul- monary diseases is important in clinical practice. This study involved 20 healthy volunteers and 77 patients with bronchopulmonary diseases, including chronic obstructive pulmonary disease (COPD), bronchial asthma, pulmonary tuberculosis, and community-acquired pneumonia. The absorption spectrum of exhaled air samples was recorded on an intra-cavity photo-acoustic gas analyzer (ILPA-1, Special Technologies, Ltd., Russia) with photo-acoustic detectors and CO2 laser with a tuning range from 9.2 to 10.8μm. In conclusion, analysis of the Mahalanobis distance-based absorption spectral profiles of breath air from bronchopulmonary patients and healthy volunteers allows the formulation of a preliminary diagnosis.
基金supported by the Yunnan Major Scientific and Technological Projects(Grant No.202302AD080001)the National Natural Science Foundation,China(No.52065033).
文摘When building a classification model,the scenario where the samples of one class are significantly more than those of the other class is called data imbalance.Data imbalance causes the trained classification model to be in favor of the majority class(usually defined as the negative class),which may do harm to the accuracy of the minority class(usually defined as the positive class),and then lead to poor overall performance of the model.A method called MSHR-FCSSVM for solving imbalanced data classification is proposed in this article,which is based on a new hybrid resampling approach(MSHR)and a new fine cost-sensitive support vector machine(CS-SVM)classifier(FCSSVM).The MSHR measures the separability of each negative sample through its Silhouette value calculated by Mahalanobis distance between samples,based on which,the so-called pseudo-negative samples are screened out to generate new positive samples(over-sampling step)through linear interpolation and are deleted finally(under-sampling step).This approach replaces pseudo-negative samples with generated new positive samples one by one to clear up the inter-class overlap on the borderline,without changing the overall scale of the dataset.The FCSSVM is an improved version of the traditional CS-SVM.It considers influences of both the imbalance of sample number and the class distribution on classification simultaneously,and through finely tuning the class cost weights by using the efficient optimization algorithm based on the physical phenomenon of rime-ice(RIME)algorithm with cross-validation accuracy as the fitness function to accurately adjust the classification borderline.To verify the effectiveness of the proposed method,a series of experiments are carried out based on 20 imbalanced datasets including both mildly and extremely imbalanced datasets.The experimental results show that the MSHR-FCSSVM method performs better than the methods for comparison in most cases,and both the MSHR and the FCSSVM played significant roles.
基金supported by the National Natural Science Foundation of China (51906133).
文摘This study presents a proposed method for assessing the condition and predicting the future status of condensers operating in seawater over an extended period.The aim is to address the problems of scaling and corrosion,which lead to increased loss of cold resources.The method involves utilising a set of multivariate feature parameters associated with the condenser as input for evaluation and trend prediction.This methodology offers a precise means of determining the optimal timing for condenser cleaning,with the ultimate goal of improving its overall performance.The proposed approach involves the integration of the analytic network process with subjective expert experience and the entropy weightmethod with objective big data analysis to develop a fusion health degreemodel.The mathematical model is constructed quantitatively using the improved Mahalanobis distance.Furthermore,a comprehensive prediction model is developed by integrating the improved Informer model and Markov error correction.This model takes into account the health status of the equipment and several influencing factors,includingmultivariate feature characteristics.This model facilitates the objective examination and prediction of the progression of equipment deterioration trends.The present study involves the computation and verification of the field time series data,which serves to demonstrate the accuracy of the condenser health-related models proposed in this research.These models effectively depict the real condition and temporal variations of the equipment,thus offering a valuable method for determining the precise cleaning time required for the condenser.
文摘Peanuts pods grow underground and mature unevenly, resulting that choosing the correct time to harvest is more complicated than other crops. Pod maturity can be determined by blasting with a pressure washer to remove outer skin of the pod (exocarp) to expose the color of the middle layer (mesocarp). The mesocarp color changes with maturity from white to yellow, orange, brown and finally black. The sum of percentage from orange, brown, and black mesocarp (OBB) color and black color (BL) represents the kernels that are mature enough to harvest. The goal of this research is to identify methodologies to estimate OBB and BL of the pods using RGB images taken in the field and validate the proposed model using other pod images. The Mahalanobis distance classification method was used to process sets of images and calculate pod area (number of pixels) corresponding to two classes (mesocarp and background) with nine different color groups. The results showed a performance of 94% effectiveness for mesocarp using Mahalanobis distance classification. Statistical regression for OBB and BL was developed based on 315 images of peanut pods taken from the field. The R2 and root mean square error of predicted and actual OBB were 0.93 and 4.1%, respectively. The R2 and root mean square error of predicted and actual BL were 0.88 and 1.8%, respectively. The validation of OBB using other images provided reasonable estimation (R2 = 0.98 and RMSE = 2.73%). This study introduces a novel, cost-effective, and non-destructive method for estimating peanut maturity using RGB imagery and Mahalanobis distance classification in the field. This innovative approach addresses the limitations of traditional methods and offers a robust alternative for real-time maturity assessment.
基金Project supported by the National Natural Science Foundation of China (No. 30270759)the Cooperation Project in Science and Technology between China and Poland Governments (No. 32-38)the Scientific Research Foundation for Doctors in Shandong Academy of Agricultural Sciences (No. [2007]20), China
文摘One hundred and sixty-eight genotypes of cotton from the same growing region were used as a germplasm group to study the validity of different genetic distances in constructing cotton core subset. Mixed linear model approach was employed to unbiasedly predict genotypic values of 20 traits for eliminating the environmental effect. Six commonly used genetic distances(Euclidean,standardized Euclidean,Mahalanobis,city block,cosine and correlation distances) combining four commonly used hierarchical cluster methods(single distance,complete distance,unweighted pair-group average and Ward's methods) were used in the least distance stepwise sampling(LDSS) method for constructing different core subsets. The analyses of variance(ANOVA) of different evaluating parameters showed that the validities of cosine and correlation distances were inferior to those of Euclidean,standardized Euclidean,Mahalanobis and city block distances. Standardized Euclidean distance was slightly more effective than Euclidean,Mahalanobis and city block distances. The principal analysis validated standardized Euclidean distance in the course of constructing practical core subsets. The covariance matrix of accessions might be ill-conditioned when Mahalanobis distance was used to calculate genetic distance at low sampling percentages,which led to bias in small-sized core subset construction. The standardized Euclidean distance is recommended in core subset construction with LDSS method.
基金The National Natural Science Foundation of China(No.5087125)
文摘The local skin temperatures of 22 subjects at air temperatures of 21,24,26,29 ℃ are measured,and the mean skin temperatures are calculated by ten skin temperature measuring points.The thermal comfort levels and the thermal sensations of these subjects are also investigated by a questionnaire.The Mahalanobis distance discrimination method is applied to establish the evaluation model for the thermal comfort based on the mean skin temperature.The experimental results indicate that the difference of the mean skin temperatures between the comfort level and the discomfort level is significant.Using the evaluation model,the mean skin temperature at the thermal comfort level is 32.6 to 33.7 ℃,and the thermal comfort levels of 72% of the subjects are correctly evaluated.The accuracy of the evaluation model can be improved when the effects of sex of the subject on the mean skin temperature and the thermal comfort are considered.It can be concluded that the mean skin temperature can be used as an effective physiological indicator to evaluate human thermal comfort in a steady thermal environment.
基金Under the auspices of National Natural Science Foundation of China(No.41371429,41401196)
文摘Land suitability assessment is a prerequisite phase in land use planning; it guides toward optimal land use by providing information on the opportunities and constraints involved in the use of a given land area. A geographic information system-based procedure, known as rural settlement suitability evaluation(RSSE) using an improved technique for order preference by similarity to ideal solution(TOPSIS), was adopted to determine the most suitable area for constructing rural settlements in different geographical locations. Given the distribution and independence of rural settlements, a distinctive evaluation criteria system that differed from that of urban suitability was established by considering the level of rural infrastructure services as well as living and working conditions. The unpredictable mutual interference among evaluation factors has been found in practical works. An improved TOPSIS using Mahalanobis distance was applied to solve the unpredictable correlation among the criteria in a suitability evaluation. Uncertainty and sensitivity analyses obtained via Monte Carlo simulation were performed to examine the robustness of the model. Daye, a resource-based city with rapid economic development, unsatisfied rural development, and geological environmental problems caused by mining, was used as a case study. Results indicate the following findings: 1) The RSSE model using the improved TOPSIS can assess the suitability of rural settlements, and the suitability maps generated using the improved TOPSIS have higher information density than those generated using traditional TOPSIS. The robustness of the model is improved, and the uncertainty is reduced in the suitability results. 2) Highly suitable land is mainly distributed in the northeast of the study area, and the majority of which is cultivated land, thereby leading to tremendous pressure on the loss of cultivated land. 3) Lastly, 12.54% of the constructive expansion permitted zone and 8.36% of the constructive expansion conditionally permitted zone are situated in an unsuitable area, which indicates that the general planning of Daye lacks the necessary verification of suitability evaluation. Guidance is provided on the development strategy of rural settlement patches to support decision making in general land use planning.
基金Supported by Ningbo Municipal Natural Science Foundation of China (Grant No.2014A610063)
文摘Nondestructive techniques for appraising gas metal arc welding(GMAW) faults plays a very important role in on-line quality controllability and prediction of the GMAW process. On-line welding quality controllability and prediction have several disadvantages such as high cost, low efficiency, complication and greatly being affected by the environment. An enhanced, efficient evaluation technique for evaluating welding faults based on Mahalanobis distance(MD) and normal distribution is presented. In addition, a new piece of equipment, designated the weld quality tester(WQT), is developed based on the proposed evaluation technique. MD is superior to other multidimensional distances such as Euclidean distance because the covariance matrix used for calculating MD takes into account correlations in the data and scaling. The values of MD obtained from welding current and arc voltage are assumed to follow a normal distribution. The normal distribution has two parameters: the meanm and standard deviations of the data. In the proposed evaluation technique used by the WQT, values of MD located in the range from zero tom+3s are regarded as “good”. Two experiments which involve changing the flow of shielding gas and smearing paint on the surface of the substrate are conducted in order to verify the sensitivity of the proposed evaluation technique and the feasibility of using WQT. The experimental results demonstrate the usefulness of the WQT for evaluating welding quality. The proposed technique can be applied to implement the on-line welding quality controllability and prediction, which is of great importance to design some novel equipment for weld quality detection.
基金supported by the National Key R&D Program of China(Project No.2016YFC0800200)the NRF-NSFC 3rd Joint Research Grant(Earth Science)(Project No.41861144022)+2 种基金the National Natural Science Foundation of China(Project Nos.51679174,and 51779189)the Shenzhen Key Technology R&D Program(Project No.20170324)The financial support is grateful acknowledged。
文摘Various uncertainties arising during acquisition process of geoscience data may result in anomalous data instances(i.e.,outliers)that do not conform with the expected pattern of regular data instances.With sparse multivariate data obtained from geotechnical site investigation,it is impossible to identify outliers with certainty due to the distortion of statistics of geotechnical parameters caused by outliers and their associated statistical uncertainty resulted from data sparsity.This paper develops a probabilistic outlier detection method for sparse multivariate data obtained from geotechnical site investigation.The proposed approach quantifies the outlying probability of each data instance based on Mahalanobis distance and determines outliers as those data instances with outlying probabilities greater than 0.5.It tackles the distortion issue of statistics estimated from the dataset with outliers by a re-sampling technique and accounts,rationally,for the statistical uncertainty by Bayesian machine learning.Moreover,the proposed approach also suggests an exclusive method to determine outlying components of each outlier.The proposed approach is illustrated and verified using simulated and real-life dataset.It showed that the proposed approach properly identifies outliers among sparse multivariate data and their corresponding outlying components in a probabilistic manner.It can significantly reduce the masking effect(i.e.,missing some actual outliers due to the distortion of statistics by the outliers and statistical uncertainty).It also found that outliers among sparse multivariate data instances affect significantly the construction of multivariate distribution of geotechnical parameters for uncertainty quantification.This emphasizes the necessity of data cleaning process(e.g.,outlier detection)for uncertainty quantification based on geoscience data.
文摘A wireless sensor network(WSN)consists of several tiny sensor nodes to monitor,collect,and transmit the physical information from an environment through the wireless channel.The node failure is considered as one of the main issues in the WSN which creates higher packet drop,delay,and energy consumption during the communication.Although the node failure occurred mostly due to persistent energy exhaustion during transmission of data packets.In this paper,Artificial Neural Network(ANN)based Node Failure Detection(NFD)is developed with cognitive radio for detecting the location of the node failure.The ad hoc on-demand distance vector(AODV)routing protocol is used for transmitting the data from the source node to the base station.Moreover,the Mahalanobis distance is used for detecting an adjacent node to the node failure which is used to create the routing path without any node failure.The performance of the proposed ANN-NFD method is analysed in terms of throughput,delivery rate,number of nodes alive,drop rate,end to end delay,energy consumption,and overhead ratio.Furthermore,the performance of the ANN-NFD method is evaluated with the header to base station and base station to header(H2B2H)protocol.The packet delivery rate of the ANN-NFD method is 0.92 for 150 nodes that are high when compared to the H2B2H protocol.Hence,the ANN-NFD method provides data consistency during data transmission under node and battery failure.
基金supported by the Fundamental Research Funds for the Central Universities of Ministry of Education of China(No.JB190501)Science and Technology Innovation Team of Shaanxi Province(No.2019TD-002)National Natural Science Foundation of China(No.11774277)。
文摘The applications of laser-induced breakdown spectroscopy(LIBS) on classifying complex natural organics are relatively limited and their accuracy still requires improvement.In this work,to study the methods on classification of complex organics,three kinds of fresh leaves were measured by LIBS.100 spectra from 100 samples of each kind of leaves were measured and then they were divided into a training set and a test set in a ratio of 7:3.Two algorithms of chemometric methods including the partial least squares discriminant analysis(PLS-DA) and principal component analysis Mahalanobis distance(PCA-MD) were used to identify these leaves.By using 23 lines from 16 elements or molecules as input data,these two methods can both classify these three kinds of leaves successfully.The classification accuracies of training sets are both up to 100% by PCA-MD and PLS-DA.The classification accuracies of the test set are 93.3% by PCA-MD and 97.8% by PLS-DA.It means that PLS-DA is better than PCA-MD in classifying plant leaves.Because the components in PLS-DA process are more suitable for classification than those in PCA-MD process.We think that this work can provide a reference for plant traceability using LIBS.
文摘This paper describes an improved algorithm for fuzzy c-means clustering of remotely sensed data, by which the degree of fuzziness of the resultant classification is de- creased as comparing with that by a conventional algorithm: that is, the classification accura- cy is increased. This is achieved by incorporating covariance matrices at the level of individual classes rather than assuming a global one. Empirical results from a fuzzy classification of an Edinburgh suburban land cover confirmed the improved performance of the new algorithm for fuzzy c-means clustering, in particular when fuzziness is also accommodated in the assumed reference data.
基金This paper is supported by the National Natural Science Foundation of China(Grant No.51875465)the Civil Aircraft Scientific Research Project.The authors would like to thank them.
文摘As the key component in aeroengine rotor systems,the health status of rolling bearings directly influences the reliability and safety of aeroengine rotor systems.In order to monitor rolling bearing conditions,a fusion fault diagnosis method,namely empirical mode decomposition(EMD)-Mahalanobis distance(E2MD)and improved wavelet threshold(IWT)(E2MD-IWT)for vibrational signals and acoustic emission(AE)signals is developed to improve the diagnostic accuracy of rolling bearings.The IWT method is proposed with a hard wavelet threshold and a soft wavelet threshold.Moreover,it is shown to be effective through numerical simulation.EMD is utilized to process the original AE signals for rolling bearings so as to generate a set of components called intrinsic modes functions(IMFs).The Mahalanobis distance(MD)approach is introduced in order to determine the smallest MD between the original AE signal and IMF components.Then,the IWT approach is employed to select the IMF components with the largest MD.It is demonstrated that the proposed E2MD-IWT method for vibrational and AE signals can improve rolling bearing fault diagnosis,beyond its ability to effectively eliminate noise signals.This study offers a promising approach to fault diagnosis for rolling bearings in aeroengines with regard to vibration signals and AE signals.
基金supported by the Special Funds for Basic Scientific Research Business Expenses of Central Universities No. 2014GCYY0the Beijing Natural Science Foundation No. 4163076the Fundamental Research Funds for the Central Universities No. 328201801
文摘Hardware Trojan(HT) refers to a special module intentionally implanted into a chip or an electronic system. The module can be exploited by the attacker to achieve destructive functions. Unfortunately the HT is difficult to detecte due to its minimal resource occupation. In order to achieve an accurate detection with high efficiency, a HT detection method based on the electromagnetic leakage of the chip is proposed in this paper. At first, the dimensionality reduction and the feature extraction of the electromagnetic leakage signals in each group(template chip, Trojan-free chip and target chip) were realized by principal component analysis(PCA). Then, the Mahalanobis distances between the template group and the other groups were calculated. Finally, the differences between the Mahalanobis distances and the threshold were compared to determine whether the HT had been implanted into the target chip. In addition, the concept of the HT Detection Quality(HTDQ) was proposed to analyze and compare the performance of different detection methods. Our experiment results indicate that the accuracy of this detection method is 91.93%, and the time consumption is 0.042s in average, which shows a high HTDQ compared with three other methods.
文摘In this short review paper, the significant and profound impacts of the Chou’s “invariance theorem” have been briefly presented with crystal clear convincingness.