We investigated the parametric optimization on incremental sheet forming of stainless steel using Grey Relational Analysis(GRA) coupled with Principal Component Analysis(PCA). AISI 316L stainless steel sheets were use...We investigated the parametric optimization on incremental sheet forming of stainless steel using Grey Relational Analysis(GRA) coupled with Principal Component Analysis(PCA). AISI 316L stainless steel sheets were used to develop double wall angle pyramid with aid of tungsten carbide tool. GRA coupled with PCA was used to plan the experiment conditions. Control factors such as Tool Diameter(TD), Step Depth(SD), Bottom Wall Angle(BWA), Feed Rate(FR) and Spindle Speed(SS) on Top Wall Angle(TWA) and Top Wall Angle Surface Roughness(TWASR) have been studied. Wall angle increases with increasing tool diameter due to large contact area between tool and workpiece. As the step depth, feed rate and spindle speed increase,TWASR decreases with increasing tool diameter. As the step depth increasing, the hydrostatic stress is raised causing severe cracks in the deformed surface. Hence it was concluded that the proposed hybrid method was suitable for optimizing the factors and response.展开更多
Principal Component Analysis (PCA) is a widely used technique for data analysis and dimensionality reduction, but its sensitivity to feature scale and outliers limits its applicability. Robust Principal Component Anal...Principal Component Analysis (PCA) is a widely used technique for data analysis and dimensionality reduction, but its sensitivity to feature scale and outliers limits its applicability. Robust Principal Component Analysis (RPCA) addresses these limitations by decomposing data into a low-rank matrix capturing the underlying structure and a sparse matrix identifying outliers, enhancing robustness against noise and outliers. This paper introduces a novel RPCA variant, Robust PCA Integrating Sparse and Low-rank Priors (RPCA-SL). Each prior targets a specific aspect of the data’s underlying structure and their combination allows for a more nuanced and accurate separation of the main data components from outliers and noise. Then RPCA-SL is solved by employing a proximal gradient algorithm for improved anomaly detection and data decomposition. Experimental results on simulation and real data demonstrate significant advancements.展开更多
Ore production is usually affected by multiple influencing inputs at open-pit mines.Nevertheless,the complex nonlinear relationships between these inputs and ore production remain unclear.This becomes even more challe...Ore production is usually affected by multiple influencing inputs at open-pit mines.Nevertheless,the complex nonlinear relationships between these inputs and ore production remain unclear.This becomes even more challenging when training data(e.g.truck haulage information and weather conditions)are massive.In machine learning(ML)algorithms,deep neural network(DNN)is a superior method for processing nonlinear and massive data by adjusting the amount of neurons and hidden layers.This study adopted DNN to forecast ore production using truck haulage information and weather conditions at open-pit mines as training data.Before the prediction models were built,principal component analysis(PCA)was employed to reduce the data dimensionality and eliminate the multicollinearity among highly correlated input variables.To verify the superiority of DNN,three ANNs containing only one hidden layer and six traditional ML models were established as benchmark models.The DNN model with multiple hidden layers performed better than the ANN models with a single hidden layer.The DNN model outperformed the extensively applied benchmark models in predicting ore production.This can provide engineers and researchers with an accurate method to forecast ore production,which helps make sound budgetary decisions and mine planning at open-pit mines.展开更多
The safety and integrity requirements of aerospace composite structures necessitate real-time health monitoring throughout their service life.To this end,distributed optical fiber sensors utilizing back Rayleigh scatt...The safety and integrity requirements of aerospace composite structures necessitate real-time health monitoring throughout their service life.To this end,distributed optical fiber sensors utilizing back Rayleigh scattering have been extensively deployed in structural health monitoring due to their advantages,such as lightweight and ease of embedding.However,identifying the precise location of damage from the optical fiber signals remains a critical challenge.In this paper,a novel approach which namely Modified Sliding Window Principal Component Analysis(MSWPCA)was proposed to facilitate automatic damage identification and localization via distributed optical fiber sensors.The proposed method is able to extract signal characteristics interfered by measurement noise to improve the accuracy of damage detection.Specifically,we applied the MSWPCA method to monitor and analyze the debonding propagation process in honeycomb sandwich panel structures.Our findings demonstrate that the training model exhibits high precision in detecting the location and size of honeycomb debonding,thereby facilitating reliable and efficient online assessment of the structural health state.展开更多
The composition control of molten steel is one of the main functions in the ladle furnace(LF)refining process.In this study,a feasible model was established to predict the alloying element yield using principal compon...The composition control of molten steel is one of the main functions in the ladle furnace(LF)refining process.In this study,a feasible model was established to predict the alloying element yield using principal component analysis(PCA)and deep neural network(DNN).The PCA was used to eliminate collinearity and reduce the dimension of the input variables,and then the data processed by PCA were used to establish the DNN model.The prediction hit ratios for the Si element yield in the error ranges of±1%,±3%,and±5%are 54.0%,93.8%,and98.8%,respectively,whereas those of the Mn element yield in the error ranges of±1%,±2%,and±3%are 77.0%,96.3%,and 99.5%,respectively,in the PCA-DNN model.The results demonstrate that the PCA-DNN model performs better than the known models,such as the reference heat method,multiple linear regression,modified backpropagation,and DNN model.Meanwhile,the accurate prediction of the alloying element yield can greatly contribute to realizing a“narrow window”control of composition in molten steel.The construction of the prediction model for the element yield can also provide a reference for the development of an alloying control model in LF intelligent refining in the modern iron and steel industry.展开更多
The large blast furnace is essential equipment in the process of iron and steel manufacturing. Due to the complex operation process and frequent fluctuations of variables, conventional monitoring methods often bring f...The large blast furnace is essential equipment in the process of iron and steel manufacturing. Due to the complex operation process and frequent fluctuations of variables, conventional monitoring methods often bring false alarms. To address the above problem, an ensemble of greedy dynamic principal component analysis-Gaussian mixture model(EGDPCA-GMM) is proposed in this paper. First, PCA-GMM is introduced to deal with the collinearity and the non-Gaussian distribution of blast furnace data.Second, in order to explain the dynamics of data, the greedy algorithm is used to determine the extended variables and their corresponding time lags, so as to avoid introducing unnecessary noise. Then the bagging ensemble is adopted to cooperate with greedy extension to eliminate the randomness brought by the greedy algorithm and further reduce the false alarm rate(FAR) of monitoring results. Finally, the algorithm is applied to the blast furnace of a large iron and steel group in South China to verify performance.Compared with the basic algorithms, the proposed method achieves lowest FAR, while keeping missed alarm rate(MAR) remain stable.展开更多
Total organic carbon(TOC)content is one of the most important parameters for characterizing the quality of source rocks and assessing the hydrocarbon-generating potential of shales.The Lucaogou Formation shale reservo...Total organic carbon(TOC)content is one of the most important parameters for characterizing the quality of source rocks and assessing the hydrocarbon-generating potential of shales.The Lucaogou Formation shale reservoirs in the Jimusaer Sag,Junggar Basin,NW China,is characterized by extremely complex lithology and a wide variety of mineral compositions with source rocks mainly consisting of carbonaceous mudstone and dolomitic mudstone.The logging responses of organic matter in the shale reservoirs is quite different from those in conventional reservoirs.Analyses show that the traditional△logR method is not suitable for evaluating the TOC content in the study area.Analysis of the sensitivity characteristics of TOC content to well logs reveals that the TOC content has good correlation with the separation degree of porosity logs.After a dimension reduction processing by the principal component analysis technology,the principal components are determined through correlation analysis of porosity logs.The results show that the TOC values obtained by the new method are in good agreement with that measured by core analysis.The average absolute error of the new method is only 0.555,much less when compared with 1.222 of using traditional△logR method.The proposed method can be used to produce more accurate TOC estimates,thus providing a reliable basis for source rock mapping.展开更多
Principal component analysis(PCA)was employed to determine the implications of geochemical and isotopic data from Cenozoic volcanic activities in the Southeast Asian region,including China(South China Sea(SCS),Hainan ...Principal component analysis(PCA)was employed to determine the implications of geochemical and isotopic data from Cenozoic volcanic activities in the Southeast Asian region,including China(South China Sea(SCS),Hainan Island,Fujian-Zhejiang coast,Taiwan Island),and parts of Vietnam and Thailand.We analyzed 15 trace element indicators and 5 isotopic indicators for 623 volcanic rock samples collected from the study region.Two principal components(PCs)were extracted by PCA based on the trace elements and Sr-Nd-Pb isotopic ratios,which probably indicate an enriched oceanic island basalt-type mantle plume and a depleted mid-ocean ridge basalt-type spreading ridge.The results show that the influence of the Hainan mantle plume on younger volcanic activities(<13 Ma)is stronger than that on older ones(>13 Ma)at the same location in the Southeast Asian region.PCA was employed to verify the mantle-plume-ridge interaction model of volcanic activities beneath the expansion center of SCS and refute the hypothesis that the tension of SCS is triggered by the Hainan plume.This study reveals the efficiency and applicability of PCA in discussing mantle sources of volcanic activities;thus,PCA is a suitable research method for analyzing geochemical data.展开更多
In this research,an integrated classification method based on principal component analysis-simulated annealing genetic algorithm-fuzzy cluster means(PCA-SAGA-FCM)was proposed for the unsupervised classification of tig...In this research,an integrated classification method based on principal component analysis-simulated annealing genetic algorithm-fuzzy cluster means(PCA-SAGA-FCM)was proposed for the unsupervised classification of tight sandstone reservoirs which lack the prior information and core experiments.A variety of evaluation parameters were selected,including lithology characteristic parameters,poro-permeability quality characteristic parameters,engineering quality characteristic parameters,and pore structure characteristic parameters.The PCA was used to reduce the dimension of the evaluation pa-rameters,and the low-dimensional data was used as input.The unsupervised reservoir classification of tight sandstone reservoir was carried out by the SAGA-FCM,the characteristics of reservoir at different categories were analyzed and compared with the lithological profiles.The analysis results of numerical simulation and actual logging data show that:1)compared with FCM algorithm,SAGA-FCM has stronger stability and higher accuracy;2)the proposed method can cluster the reservoir flexibly and effectively according to the degree of membership;3)the results of reservoir integrated classification match well with the lithologic profle,which demonstrates the reliability of the classification method.展开更多
This work utilizes a statistical approach of Principal Component Ana-lysis(PCA)towards the detection of Methane(CH_(4))-Carbon Monoxide(CO)Poi-soning occurring in coal mines,forestfires,drainage systems etc.where the ...This work utilizes a statistical approach of Principal Component Ana-lysis(PCA)towards the detection of Methane(CH_(4))-Carbon Monoxide(CO)Poi-soning occurring in coal mines,forestfires,drainage systems etc.where the CH_(4) and CO emissions are very high in closed buildings or confined spaces during oxi-dation processes.Both methane and carbon monoxide are highly toxic,colorless and odorless gases.Both of the gases have their own toxic levels to be detected.But during their combined presence,the toxicity of the either one goes unidentified may be due to their low levels which may lead to an explosion.By using PCA,the correlation of CO and CH_(4) data is carried out and by identifying the areas of high correlation(along the principal component axis)the explosion suppression action can be triggered earlier thus avoiding adverse effects of massive explosions.Wire-less Sensor Network is deployed and simulations are carried with heterogeneous sensors(Carbon Monoxide and Methane sensors)in NS-2 Mannasim framework.The rise in the value of CO even when CH_(4) is below the toxic level may become hazardous to the people around.Thus our proposed methodology will detect the combined presence of both the gases(CH_(4) and CO)and provide an early warning in order to avoid any human losses or toxic effects.展开更多
Machine learning algorithms (MLs) can potentially improve disease diagnostics, leading to early detection and treatment of these diseases. As a malignant tumor whose primary focus is located in the bronchial mucosal e...Machine learning algorithms (MLs) can potentially improve disease diagnostics, leading to early detection and treatment of these diseases. As a malignant tumor whose primary focus is located in the bronchial mucosal epithelium, lung cancer has the highest mortality and morbidity among cancer types, threatening health and life of patients suffering from the disease. Machine learning algorithms such as Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbor (KNN) and Naïve Bayes (NB) have been used for lung cancer prediction. However they still face challenges such as high dimensionality of the feature space, over-fitting, high computational complexity, noise and missing data, low accuracies, low precision and high error rates. Ensemble learning, which combines classifiers, may be helpful to boost prediction on new data. However, current ensemble ML techniques rarely consider comprehensive evaluation metrics to evaluate the performance of individual classifiers. The main purpose of this study was to develop an ensemble classifier that improves lung cancer prediction. An ensemble machine learning algorithm is developed based on RF, SVM, NB, and KNN. Feature selection is done based on Principal Component Analysis (PCA) and Analysis of Variance (ANOVA). This algorithm is then executed on lung cancer data and evaluated using execution time, true positives (TP), true negatives (TN), false positives (FP), false negatives (FN), false positive rate (FPR), recall (R), precision (P) and F-measure (FM). Experimental results show that the proposed ensemble classifier has the best classification of 0.9825% with the lowest error rate of 0.0193. This is followed by SVM in which the probability of having the best classification is 0.9652% at an error rate of 0.0206. On the other hand, NB had the worst performance of 0.8475% classification at 0.0738 error rate.展开更多
Principal component analysis (PCA) was employed to examine the effect of nutritional and bioactive compounds of legume milk chocolate as well as the sensory to document the extend of variations and their significance ...Principal component analysis (PCA) was employed to examine the effect of nutritional and bioactive compounds of legume milk chocolate as well as the sensory to document the extend of variations and their significance with plant sources. PCA identified eight significant principle components, that reduce the size of the variables into one principal component in physiochemical analysis interpreting 73.5% of the total variability with/and 78.6% of total variability explained in sensory evaluation. Score plot indicates that Double Bean milk chocolate in-corporated with MOL and CML in nutritional profile have high positive correlations. In nutritional evaluation, carbohydrates and fat content shows negative/minimal correlations whereas no negative correlations were found in sensory evaluation which implies every sensorial variable had high correlation with each other.展开更多
To overcome the too fine-grained granularity problem of multivariate grey incidence analysis and to explore the comprehensive incidence analysis model, three multivariate grey incidences degree models based on princip...To overcome the too fine-grained granularity problem of multivariate grey incidence analysis and to explore the comprehensive incidence analysis model, three multivariate grey incidences degree models based on principal component analysis (PCA) are proposed. Firstly, the PCA method is introduced to extract the feature sequences of a behavioral matrix. Then, the grey incidence analysis between two behavioral matrices is transformed into the similarity and nearness measure between their feature sequences. Based on the classic grey incidence analysis theory, absolute and relative incidence degree models for feature sequences are constructed, and a comprehensive grey incidence model is proposed. Furthermore, the properties of models are researched. It proves that the proposed models satisfy the properties of translation invariance, multiple transformation invariance, and axioms of the grey incidence analysis, respectively. Finally, a case is studied. The results illustrate that the model is effective than other multivariate grey incidence analysis models.展开更多
Genetic variation analysis, correlation analysis and principal component analysis were conducted to investigate the relationship between 12 yield-related agronomic traits of 87 summer sowing soybean eultivars in Huang...Genetic variation analysis, correlation analysis and principal component analysis were conducted to investigate the relationship between 12 yield-related agronomic traits of 87 summer sowing soybean eultivars in Huang-Huai-Hai region. According to the experimental results, effective branch number showed the maxi- mum variation coefficient and growth duration showed the minimum variation coefficient. The variation coefficient of bottom pod height, pod number per plant, grain number per plant, grain weight per plant and 100-grain weight of semi-determinate summer sowing soybean ranged between 18.38% -27.56. The variation coeffi- eient of plant height, bottom pod height, pod number per plant, grain number per plant and grain weight per plant of determinate summer sowing soybean ranged from 21.02% to 8.04%. In semi-determinate summer sowing soybean, yield showed extremely significantly positive correlation with grain number per pod, grain weight per plant and 100-grain weight, but extremely significantly negative correlation with effective branch number and significantly negative correlation with growth duration. In determinate summer sowing soybean, yidd showed extremely significantly positive correlation with stem diameter but significantly positive correlation with bottom pod height, while it showed no significant correlation with other agronomic traits. Principal component analysis of yield-rdated agronomic traits showed that cumulative contribution rates of the former four principal components to the variation of seml-determinate and determinate summer sowing soybean were 79.92% and 79.50%, respectively. Agronomic traits with the greatest variation should be selected first. Semi-determinate and determinate summer sowing soybean individu- als in Huang-Hnai-Hai region should be selected according to different podding habits. In addition, semi-determlnate soybean varieties with moderate plant height and growth duration, fewer effective branches, more grains per pod and greater 100-grain weight should be selected; determinate soybean varieties with thicker stem diameter, higher plant height and bottom pod height, more nodes on main stem, fewer grains per pod, more pods per plant and grains per plant should be selected.展开更多
The objective of this paper was to develop a comprehensive evaluation method and index to evaluate the performance of sealants and fillers for cracks in asphalt concrete pavements using the method of principal compone...The objective of this paper was to develop a comprehensive evaluation method and index to evaluate the performance of sealants and fillers for cracks in asphalt concrete pavements using the method of principal component analysis. The performance experiments including cone penetration, softening point, flow, resilience and tension at low temperature respectively were conducted by reference of ASTM D5329 for eight sealants and fillers often used in China. There by a principal component model was developed and weight of every index was calculated. The experimental results show that there are significantly different performances for sealants and fillers often used in China. Principal component analysis is an objective method that evaluates and selects the performance of sealants and fillers for cracks in asphalt concrete pavements.展开更多
In practical process industries,a variety of online and offline sensors and measuring instruments have been used for process control and monitoring purposes,which indicates that the measurements coming from different ...In practical process industries,a variety of online and offline sensors and measuring instruments have been used for process control and monitoring purposes,which indicates that the measurements coming from different sources are collected at different sampling rates.To build a complete process monitoring strategy,all these multi-rate measurements should be considered for data-based modeling and monitoring.In this paper,a novel kernel multi-rate probabilistic principal component analysis(K-MPPCA)model is proposed to extract the nonlinear correlations among different sampling rates.In the proposed model,the model parameters are calibrated using the kernel trick and the expectation-maximum(EM)algorithm.Also,the corresponding fault detection methods based on the nonlinear features are developed.Finally,a simulated nonlinear case and an actual pre-decarburization unit in the ammonia synthesis process are tested to demonstrate the efficiency of the proposed method.展开更多
5 critical quality characteristics must be controlled in the surface mount and wire-bond process in semiconductor packaging. And these characteristics are correlated with each other. So the principal components analy...5 critical quality characteristics must be controlled in the surface mount and wire-bond process in semiconductor packaging. And these characteristics are correlated with each other. So the principal components analysis(PCA) is used in the analysis of the sample data firstly. And then the process is controlled with hotelling T^2 control chart for the first several principal components which contain sufficient information. Furthermore, a software tool is developed for this kind of problems. And with sample data from a surface mounting device(SMD) process, it is demonstrated that the T^2 control chart with PCA gets the same conclusion as without PCA, but the problem is transformed from high-dimensional one to a lower dimensional one, i.e., from 5 to 2 in this demonstration.展开更多
Panicle swarm optimization (PSO) is an optimization algorithm based on the swarm intelligent principle. In this paper the modified PSO is applied to a kernel principal component analysis ( KPCA ) for an optimal ke...Panicle swarm optimization (PSO) is an optimization algorithm based on the swarm intelligent principle. In this paper the modified PSO is applied to a kernel principal component analysis ( KPCA ) for an optimal kernel function parameter. We first comprehensively considered within-class scatter and between-class scatter of the sample features. Then, the fitness function of an optimized kernel function parameter is constructed, and the particle swarm optimization algorithm with adaptive acceleration (CPSO) is applied to optimizing it. It is used for gearbox condi- tion recognition, and the result is compared with the recognized results based on principal component analysis (PCA). The results show that KPCA optimized by CPSO can effectively recognize fault conditions of the gearbox by reducing bind set-up of the kernel function parameter, and its results of fault recognition outperform those of PCA. We draw the conclusion that KPCA based on CPSO has an advantage in nonlinear feature extraction of mechanical failure, and is helpful for fault condition recognition of complicated machines.展开更多
[Objectives] This study was conducted to give play to the yield-increasing potential of new broad bean varieties. [Methods] The correlation analysis and principal component analysis of eight main agronomic traits of 1...[Objectives] This study was conducted to give play to the yield-increasing potential of new broad bean varieties. [Methods] The correlation analysis and principal component analysis of eight main agronomic traits of 15 autumn-sown broad bean varieties in 2017-2018 were carried out. [Results] There were extremely complicated correlations between the agronomic traits. Among the various agronomic traits, the number of pods per plant showed a significant negative correlation from the number of seeds per pod(P<0.05), and significant negative correlations with pod length and 100-grain weight(P<0.01). The number of seeds per pod was significantly positively correlated with pod length(P<0.01). The 100-grain weight of broad beans was significantly positively correlated with the pod length(P<0.01), and was significantly negatively correlated with the number of pods per plant(P<0.01). Three principal components were established by principal component analysis, and the contribution rate of the first principal component was the highest, which was 40.18%. The contribution rates of the second principal component and the third principal component to the total variance were 24.08% and 13.55%, respectively. The score equations of the principal components were derived. [Conclusions] This study provides a theoretical reference for the application and promotion of new varieties of autumn broad beans.展开更多
Gold mining is now widely acknowledged as one of the significant sources of soil pollution in developed countries. In developing countries, the sources and levels of soil contamination have not been thoroughly address...Gold mining is now widely acknowledged as one of the significant sources of soil pollution in developed countries. In developing countries, the sources and levels of soil contamination have not been thoroughly addressed. Thus, this study was intended to determine the source of soil pollution and the level of contamination in the active and closed gold mining areas. The research paper presents the pollution load of heavy metals (lead-Pb, chromium-Cr, cadmium-Cd, copper-Cu, arsenic-As, manganese-Mn, and nickel-Ni) in 90 soil samples collected from the studied sites. Multivariate statistical analysis, including Principal Component Analysis (PCA) and Cluster Analysis (CA), coupled with correlation coefficient analysis, was performed to determine the possible sources of pollution in the study areas. The results indicated that Pb, Cr, Cu and Mn come from different sources than Cd, As and Ni. The results obtained from the metal pollution assessment using the Pollution Index (PI) and the Geoaccumulation Index (Igeo) confirmed that soils in the mining areas were contaminated in the range from moderately through strongly to highly contaminated soils. This study verified that soil contamination in the gold mining areas results from natural and anthropogenic processes. The current study findings would enhance our knowledge regarding the soil contamination level in the mining areas and the source of contamination. It is recommended to use PCA, CA, PI and Igeo to assess and monitor the heavy metal contaminated soil in gold mining areas.展开更多
文摘We investigated the parametric optimization on incremental sheet forming of stainless steel using Grey Relational Analysis(GRA) coupled with Principal Component Analysis(PCA). AISI 316L stainless steel sheets were used to develop double wall angle pyramid with aid of tungsten carbide tool. GRA coupled with PCA was used to plan the experiment conditions. Control factors such as Tool Diameter(TD), Step Depth(SD), Bottom Wall Angle(BWA), Feed Rate(FR) and Spindle Speed(SS) on Top Wall Angle(TWA) and Top Wall Angle Surface Roughness(TWASR) have been studied. Wall angle increases with increasing tool diameter due to large contact area between tool and workpiece. As the step depth, feed rate and spindle speed increase,TWASR decreases with increasing tool diameter. As the step depth increasing, the hydrostatic stress is raised causing severe cracks in the deformed surface. Hence it was concluded that the proposed hybrid method was suitable for optimizing the factors and response.
文摘Principal Component Analysis (PCA) is a widely used technique for data analysis and dimensionality reduction, but its sensitivity to feature scale and outliers limits its applicability. Robust Principal Component Analysis (RPCA) addresses these limitations by decomposing data into a low-rank matrix capturing the underlying structure and a sparse matrix identifying outliers, enhancing robustness against noise and outliers. This paper introduces a novel RPCA variant, Robust PCA Integrating Sparse and Low-rank Priors (RPCA-SL). Each prior targets a specific aspect of the data’s underlying structure and their combination allows for a more nuanced and accurate separation of the main data components from outliers and noise. Then RPCA-SL is solved by employing a proximal gradient algorithm for improved anomaly detection and data decomposition. Experimental results on simulation and real data demonstrate significant advancements.
基金This work was supported by the Pilot Seed Grant(Grant No.RES0049944)the Collaborative Research Project(Grant No.RES0043251)from the University of Alberta.
文摘Ore production is usually affected by multiple influencing inputs at open-pit mines.Nevertheless,the complex nonlinear relationships between these inputs and ore production remain unclear.This becomes even more challenging when training data(e.g.truck haulage information and weather conditions)are massive.In machine learning(ML)algorithms,deep neural network(DNN)is a superior method for processing nonlinear and massive data by adjusting the amount of neurons and hidden layers.This study adopted DNN to forecast ore production using truck haulage information and weather conditions at open-pit mines as training data.Before the prediction models were built,principal component analysis(PCA)was employed to reduce the data dimensionality and eliminate the multicollinearity among highly correlated input variables.To verify the superiority of DNN,three ANNs containing only one hidden layer and six traditional ML models were established as benchmark models.The DNN model with multiple hidden layers performed better than the ANN models with a single hidden layer.The DNN model outperformed the extensively applied benchmark models in predicting ore production.This can provide engineers and researchers with an accurate method to forecast ore production,which helps make sound budgetary decisions and mine planning at open-pit mines.
基金supported by the National Key Research and Development Program of China(No.2018YFA0702800)the National Natural Science Foundation of China(No.12072056)supported by National Defense Fundamental Scientific Research Project(XXXX2018204BXXX).
文摘The safety and integrity requirements of aerospace composite structures necessitate real-time health monitoring throughout their service life.To this end,distributed optical fiber sensors utilizing back Rayleigh scattering have been extensively deployed in structural health monitoring due to their advantages,such as lightweight and ease of embedding.However,identifying the precise location of damage from the optical fiber signals remains a critical challenge.In this paper,a novel approach which namely Modified Sliding Window Principal Component Analysis(MSWPCA)was proposed to facilitate automatic damage identification and localization via distributed optical fiber sensors.The proposed method is able to extract signal characteristics interfered by measurement noise to improve the accuracy of damage detection.Specifically,we applied the MSWPCA method to monitor and analyze the debonding propagation process in honeycomb sandwich panel structures.Our findings demonstrate that the training model exhibits high precision in detecting the location and size of honeycomb debonding,thereby facilitating reliable and efficient online assessment of the structural health state.
基金supported by the National Natural Science Foundation of China(No.51974023)State Key Laboratory of Advanced Metallurgy,University of Science and Technology Beijing(No.41621005)。
文摘The composition control of molten steel is one of the main functions in the ladle furnace(LF)refining process.In this study,a feasible model was established to predict the alloying element yield using principal component analysis(PCA)and deep neural network(DNN).The PCA was used to eliminate collinearity and reduce the dimension of the input variables,and then the data processed by PCA were used to establish the DNN model.The prediction hit ratios for the Si element yield in the error ranges of±1%,±3%,and±5%are 54.0%,93.8%,and98.8%,respectively,whereas those of the Mn element yield in the error ranges of±1%,±2%,and±3%are 77.0%,96.3%,and 99.5%,respectively,in the PCA-DNN model.The results demonstrate that the PCA-DNN model performs better than the known models,such as the reference heat method,multiple linear regression,modified backpropagation,and DNN model.Meanwhile,the accurate prediction of the alloying element yield can greatly contribute to realizing a“narrow window”control of composition in molten steel.The construction of the prediction model for the element yield can also provide a reference for the development of an alloying control model in LF intelligent refining in the modern iron and steel industry.
基金supported by the National Natural Science Foundation of China (61903326, 61933015)。
文摘The large blast furnace is essential equipment in the process of iron and steel manufacturing. Due to the complex operation process and frequent fluctuations of variables, conventional monitoring methods often bring false alarms. To address the above problem, an ensemble of greedy dynamic principal component analysis-Gaussian mixture model(EGDPCA-GMM) is proposed in this paper. First, PCA-GMM is introduced to deal with the collinearity and the non-Gaussian distribution of blast furnace data.Second, in order to explain the dynamics of data, the greedy algorithm is used to determine the extended variables and their corresponding time lags, so as to avoid introducing unnecessary noise. Then the bagging ensemble is adopted to cooperate with greedy extension to eliminate the randomness brought by the greedy algorithm and further reduce the false alarm rate(FAR) of monitoring results. Finally, the algorithm is applied to the blast furnace of a large iron and steel group in South China to verify performance.Compared with the basic algorithms, the proposed method achieves lowest FAR, while keeping missed alarm rate(MAR) remain stable.
基金This research was funded by the National Natural Science Foundation of China(Grant No.41504103).
文摘Total organic carbon(TOC)content is one of the most important parameters for characterizing the quality of source rocks and assessing the hydrocarbon-generating potential of shales.The Lucaogou Formation shale reservoirs in the Jimusaer Sag,Junggar Basin,NW China,is characterized by extremely complex lithology and a wide variety of mineral compositions with source rocks mainly consisting of carbonaceous mudstone and dolomitic mudstone.The logging responses of organic matter in the shale reservoirs is quite different from those in conventional reservoirs.Analyses show that the traditional△logR method is not suitable for evaluating the TOC content in the study area.Analysis of the sensitivity characteristics of TOC content to well logs reveals that the TOC content has good correlation with the separation degree of porosity logs.After a dimension reduction processing by the principal component analysis technology,the principal components are determined through correlation analysis of porosity logs.The results show that the TOC values obtained by the new method are in good agreement with that measured by core analysis.The average absolute error of the new method is only 0.555,much less when compared with 1.222 of using traditional△logR method.The proposed method can be used to produce more accurate TOC estimates,thus providing a reliable basis for source rock mapping.
基金Supported by the State Key Laboratory of Marine Environmental Science Visiting Fellowship(No.MELRS2233)the State Key Laboratory of Marine Geology,Tongji University(No.MGK202302)+4 种基金the Innovation Group Project of Southern Marine Science and Engineering Guangdong Laboratory(Zhuhai)(No.311021003)the Zhujiang Talent Project Foundation of Guangdong Province(No.2017ZT07Z066)the Fundamental Research Funds for the Central Universities,Sun Yat-sen University(Nos.22qntd2101,2021qntd23)the Major Projects of the National Natural Science Foundation of China(Nos.41790465,41590863)the National Natural Science Foundation of China(Nos.42102333,41806077,41904045)。
文摘Principal component analysis(PCA)was employed to determine the implications of geochemical and isotopic data from Cenozoic volcanic activities in the Southeast Asian region,including China(South China Sea(SCS),Hainan Island,Fujian-Zhejiang coast,Taiwan Island),and parts of Vietnam and Thailand.We analyzed 15 trace element indicators and 5 isotopic indicators for 623 volcanic rock samples collected from the study region.Two principal components(PCs)were extracted by PCA based on the trace elements and Sr-Nd-Pb isotopic ratios,which probably indicate an enriched oceanic island basalt-type mantle plume and a depleted mid-ocean ridge basalt-type spreading ridge.The results show that the influence of the Hainan mantle plume on younger volcanic activities(<13 Ma)is stronger than that on older ones(>13 Ma)at the same location in the Southeast Asian region.PCA was employed to verify the mantle-plume-ridge interaction model of volcanic activities beneath the expansion center of SCS and refute the hypothesis that the tension of SCS is triggered by the Hainan plume.This study reveals the efficiency and applicability of PCA in discussing mantle sources of volcanic activities;thus,PCA is a suitable research method for analyzing geochemical data.
基金funded by the National Natural Science Foundation of China(42174131)the Strategic Cooperation Technology Projects of CNPC and CUPB(ZLZX2020-03).
文摘In this research,an integrated classification method based on principal component analysis-simulated annealing genetic algorithm-fuzzy cluster means(PCA-SAGA-FCM)was proposed for the unsupervised classification of tight sandstone reservoirs which lack the prior information and core experiments.A variety of evaluation parameters were selected,including lithology characteristic parameters,poro-permeability quality characteristic parameters,engineering quality characteristic parameters,and pore structure characteristic parameters.The PCA was used to reduce the dimension of the evaluation pa-rameters,and the low-dimensional data was used as input.The unsupervised reservoir classification of tight sandstone reservoir was carried out by the SAGA-FCM,the characteristics of reservoir at different categories were analyzed and compared with the lithological profiles.The analysis results of numerical simulation and actual logging data show that:1)compared with FCM algorithm,SAGA-FCM has stronger stability and higher accuracy;2)the proposed method can cluster the reservoir flexibly and effectively according to the degree of membership;3)the results of reservoir integrated classification match well with the lithologic profle,which demonstrates the reliability of the classification method.
文摘This work utilizes a statistical approach of Principal Component Ana-lysis(PCA)towards the detection of Methane(CH_(4))-Carbon Monoxide(CO)Poi-soning occurring in coal mines,forestfires,drainage systems etc.where the CH_(4) and CO emissions are very high in closed buildings or confined spaces during oxi-dation processes.Both methane and carbon monoxide are highly toxic,colorless and odorless gases.Both of the gases have their own toxic levels to be detected.But during their combined presence,the toxicity of the either one goes unidentified may be due to their low levels which may lead to an explosion.By using PCA,the correlation of CO and CH_(4) data is carried out and by identifying the areas of high correlation(along the principal component axis)the explosion suppression action can be triggered earlier thus avoiding adverse effects of massive explosions.Wire-less Sensor Network is deployed and simulations are carried with heterogeneous sensors(Carbon Monoxide and Methane sensors)in NS-2 Mannasim framework.The rise in the value of CO even when CH_(4) is below the toxic level may become hazardous to the people around.Thus our proposed methodology will detect the combined presence of both the gases(CH_(4) and CO)and provide an early warning in order to avoid any human losses or toxic effects.
文摘Machine learning algorithms (MLs) can potentially improve disease diagnostics, leading to early detection and treatment of these diseases. As a malignant tumor whose primary focus is located in the bronchial mucosal epithelium, lung cancer has the highest mortality and morbidity among cancer types, threatening health and life of patients suffering from the disease. Machine learning algorithms such as Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbor (KNN) and Naïve Bayes (NB) have been used for lung cancer prediction. However they still face challenges such as high dimensionality of the feature space, over-fitting, high computational complexity, noise and missing data, low accuracies, low precision and high error rates. Ensemble learning, which combines classifiers, may be helpful to boost prediction on new data. However, current ensemble ML techniques rarely consider comprehensive evaluation metrics to evaluate the performance of individual classifiers. The main purpose of this study was to develop an ensemble classifier that improves lung cancer prediction. An ensemble machine learning algorithm is developed based on RF, SVM, NB, and KNN. Feature selection is done based on Principal Component Analysis (PCA) and Analysis of Variance (ANOVA). This algorithm is then executed on lung cancer data and evaluated using execution time, true positives (TP), true negatives (TN), false positives (FP), false negatives (FN), false positive rate (FPR), recall (R), precision (P) and F-measure (FM). Experimental results show that the proposed ensemble classifier has the best classification of 0.9825% with the lowest error rate of 0.0193. This is followed by SVM in which the probability of having the best classification is 0.9652% at an error rate of 0.0206. On the other hand, NB had the worst performance of 0.8475% classification at 0.0738 error rate.
文摘Principal component analysis (PCA) was employed to examine the effect of nutritional and bioactive compounds of legume milk chocolate as well as the sensory to document the extend of variations and their significance with plant sources. PCA identified eight significant principle components, that reduce the size of the variables into one principal component in physiochemical analysis interpreting 73.5% of the total variability with/and 78.6% of total variability explained in sensory evaluation. Score plot indicates that Double Bean milk chocolate in-corporated with MOL and CML in nutritional profile have high positive correlations. In nutritional evaluation, carbohydrates and fat content shows negative/minimal correlations whereas no negative correlations were found in sensory evaluation which implies every sensorial variable had high correlation with each other.
基金supported by the National Natural Science Foundation of China(71401052)the Key Project of National Social Science Fund of China(12AZD108)+2 种基金the Doctoral Fund of Ministry of Education(20120094120024)the Philosophy and Social Science Fund of Jiangsu Province Universities(2013SJD630073)the Central University Basic Service Project Fee of Hohai University(2011B09914)
文摘To overcome the too fine-grained granularity problem of multivariate grey incidence analysis and to explore the comprehensive incidence analysis model, three multivariate grey incidences degree models based on principal component analysis (PCA) are proposed. Firstly, the PCA method is introduced to extract the feature sequences of a behavioral matrix. Then, the grey incidence analysis between two behavioral matrices is transformed into the similarity and nearness measure between their feature sequences. Based on the classic grey incidence analysis theory, absolute and relative incidence degree models for feature sequences are constructed, and a comprehensive grey incidence model is proposed. Furthermore, the properties of models are researched. It proves that the proposed models satisfy the properties of translation invariance, multiple transformation invariance, and axioms of the grey incidence analysis, respectively. Finally, a case is studied. The results illustrate that the model is effective than other multivariate grey incidence analysis models.
基金Supported by Special Fund for Agro-scientific Research in the Public Interest(nyhyzx07-004-06)National Science and Technology Support Program of China(2006BAD521B01-3)
文摘Genetic variation analysis, correlation analysis and principal component analysis were conducted to investigate the relationship between 12 yield-related agronomic traits of 87 summer sowing soybean eultivars in Huang-Huai-Hai region. According to the experimental results, effective branch number showed the maxi- mum variation coefficient and growth duration showed the minimum variation coefficient. The variation coefficient of bottom pod height, pod number per plant, grain number per plant, grain weight per plant and 100-grain weight of semi-determinate summer sowing soybean ranged between 18.38% -27.56. The variation coeffi- eient of plant height, bottom pod height, pod number per plant, grain number per plant and grain weight per plant of determinate summer sowing soybean ranged from 21.02% to 8.04%. In semi-determinate summer sowing soybean, yield showed extremely significantly positive correlation with grain number per pod, grain weight per plant and 100-grain weight, but extremely significantly negative correlation with effective branch number and significantly negative correlation with growth duration. In determinate summer sowing soybean, yidd showed extremely significantly positive correlation with stem diameter but significantly positive correlation with bottom pod height, while it showed no significant correlation with other agronomic traits. Principal component analysis of yield-rdated agronomic traits showed that cumulative contribution rates of the former four principal components to the variation of seml-determinate and determinate summer sowing soybean were 79.92% and 79.50%, respectively. Agronomic traits with the greatest variation should be selected first. Semi-determinate and determinate summer sowing soybean individu- als in Huang-Hnai-Hai region should be selected according to different podding habits. In addition, semi-determlnate soybean varieties with moderate plant height and growth duration, fewer effective branches, more grains per pod and greater 100-grain weight should be selected; determinate soybean varieties with thicker stem diameter, higher plant height and bottom pod height, more nodes on main stem, fewer grains per pod, more pods per plant and grains per plant should be selected.
基金Funded by the National Natural Science Foundation of China(Nos.51408287 and 51668038)the Rolls Supported by Program for Changjiang Scholars and Innovative Research Team in University(IRT_15R29)+2 种基金the Distinguished Young Scholars Fund of Gansu Province(1606RJDA318)the Natural Science Foundation of Gansu Province(1506RJZA064)the Excellent Program of Lanzhou Jiaotong University(201606)
文摘The objective of this paper was to develop a comprehensive evaluation method and index to evaluate the performance of sealants and fillers for cracks in asphalt concrete pavements using the method of principal component analysis. The performance experiments including cone penetration, softening point, flow, resilience and tension at low temperature respectively were conducted by reference of ASTM D5329 for eight sealants and fillers often used in China. There by a principal component model was developed and weight of every index was calculated. The experimental results show that there are significantly different performances for sealants and fillers often used in China. Principal component analysis is an objective method that evaluates and selects the performance of sealants and fillers for cracks in asphalt concrete pavements.
基金supported by Zhejiang Provincial Natural Science Foundation of China(LY19F030003)Key Research and Development Project of Zhejiang Province(2021C04030)+1 种基金the National Natural Science Foundation of China(62003306)Educational Commission Research Program of Zhejiang Province(Y202044842)。
文摘In practical process industries,a variety of online and offline sensors and measuring instruments have been used for process control and monitoring purposes,which indicates that the measurements coming from different sources are collected at different sampling rates.To build a complete process monitoring strategy,all these multi-rate measurements should be considered for data-based modeling and monitoring.In this paper,a novel kernel multi-rate probabilistic principal component analysis(K-MPPCA)model is proposed to extract the nonlinear correlations among different sampling rates.In the proposed model,the model parameters are calibrated using the kernel trick and the expectation-maximum(EM)algorithm.Also,the corresponding fault detection methods based on the nonlinear features are developed.Finally,a simulated nonlinear case and an actual pre-decarburization unit in the ammonia synthesis process are tested to demonstrate the efficiency of the proposed method.
基金This project is supported by National Natural Science Foundation of China (No.70372062)Hi-Tech Program of Tianjin city,China (No.04310881R).
文摘5 critical quality characteristics must be controlled in the surface mount and wire-bond process in semiconductor packaging. And these characteristics are correlated with each other. So the principal components analysis(PCA) is used in the analysis of the sample data firstly. And then the process is controlled with hotelling T^2 control chart for the first several principal components which contain sufficient information. Furthermore, a software tool is developed for this kind of problems. And with sample data from a surface mounting device(SMD) process, it is demonstrated that the T^2 control chart with PCA gets the same conclusion as without PCA, but the problem is transformed from high-dimensional one to a lower dimensional one, i.e., from 5 to 2 in this demonstration.
基金supported by National Natural Science Foundation under Grant No.50875247Shanxi Province Natural Science Foundation under Grant No.2009011026-1
文摘Panicle swarm optimization (PSO) is an optimization algorithm based on the swarm intelligent principle. In this paper the modified PSO is applied to a kernel principal component analysis ( KPCA ) for an optimal kernel function parameter. We first comprehensively considered within-class scatter and between-class scatter of the sample features. Then, the fitness function of an optimized kernel function parameter is constructed, and the particle swarm optimization algorithm with adaptive acceleration (CPSO) is applied to optimizing it. It is used for gearbox condi- tion recognition, and the result is compared with the recognized results based on principal component analysis (PCA). The results show that KPCA optimized by CPSO can effectively recognize fault conditions of the gearbox by reducing bind set-up of the kernel function parameter, and its results of fault recognition outperform those of PCA. We draw the conclusion that KPCA based on CPSO has an advantage in nonlinear feature extraction of mechanical failure, and is helpful for fault condition recognition of complicated machines.
基金Supported by Hubei Agricultural Science and Technology Innovation Center Fund(2016-620-000-001-006)Hubei Agricultural Germplasm Resource Sharing Platform(PTJX2015000008)National Modern Agriculture Industrial Technology System Construction Project(CARS-09)
文摘[Objectives] This study was conducted to give play to the yield-increasing potential of new broad bean varieties. [Methods] The correlation analysis and principal component analysis of eight main agronomic traits of 15 autumn-sown broad bean varieties in 2017-2018 were carried out. [Results] There were extremely complicated correlations between the agronomic traits. Among the various agronomic traits, the number of pods per plant showed a significant negative correlation from the number of seeds per pod(P<0.05), and significant negative correlations with pod length and 100-grain weight(P<0.01). The number of seeds per pod was significantly positively correlated with pod length(P<0.01). The 100-grain weight of broad beans was significantly positively correlated with the pod length(P<0.01), and was significantly negatively correlated with the number of pods per plant(P<0.01). Three principal components were established by principal component analysis, and the contribution rate of the first principal component was the highest, which was 40.18%. The contribution rates of the second principal component and the third principal component to the total variance were 24.08% and 13.55%, respectively. The score equations of the principal components were derived. [Conclusions] This study provides a theoretical reference for the application and promotion of new varieties of autumn broad beans.
文摘Gold mining is now widely acknowledged as one of the significant sources of soil pollution in developed countries. In developing countries, the sources and levels of soil contamination have not been thoroughly addressed. Thus, this study was intended to determine the source of soil pollution and the level of contamination in the active and closed gold mining areas. The research paper presents the pollution load of heavy metals (lead-Pb, chromium-Cr, cadmium-Cd, copper-Cu, arsenic-As, manganese-Mn, and nickel-Ni) in 90 soil samples collected from the studied sites. Multivariate statistical analysis, including Principal Component Analysis (PCA) and Cluster Analysis (CA), coupled with correlation coefficient analysis, was performed to determine the possible sources of pollution in the study areas. The results indicated that Pb, Cr, Cu and Mn come from different sources than Cd, As and Ni. The results obtained from the metal pollution assessment using the Pollution Index (PI) and the Geoaccumulation Index (Igeo) confirmed that soils in the mining areas were contaminated in the range from moderately through strongly to highly contaminated soils. This study verified that soil contamination in the gold mining areas results from natural and anthropogenic processes. The current study findings would enhance our knowledge regarding the soil contamination level in the mining areas and the source of contamination. It is recommended to use PCA, CA, PI and Igeo to assess and monitor the heavy metal contaminated soil in gold mining areas.