The fruits of leguminous plants Cercis Chinensis Bunge are still overlooked although they have been reported to be antioxidative because of the limited information on the phytochemicals of C.chinensis fruits.A simple,...The fruits of leguminous plants Cercis Chinensis Bunge are still overlooked although they have been reported to be antioxidative because of the limited information on the phytochemicals of C.chinensis fruits.A simple,rapid and sensitive HPLC-MS/MS method was developed for the identification and quantitation of the major bioactive components in C.chinensis fruits.Eighteen polyphenols were identified,which are first reported in C.chinensis fruits.Moreover,ten components were simultaneously quantified.The validated quantitative method was proved to be sensitive,reproducible and accurate.Then,it was applied to analyze batches of C.chinensis fruits from different phytomorph and areas.The principal components analysis(PCA)realized visualization and reduction of data set dimension while the hierarchical cluster analysis(HCA)indicated that the content of phenolic acids or all ten components might be used to differentiate C.chinensis fruits of different phytomorph.展开更多
With recent advances in biotechnology, genome-wide association study (GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistica...With recent advances in biotechnology, genome-wide association study (GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistical strategy is traditional logistical regression (LR) based on single-locus analysis. However, such a single-locus analysis leads to the well-known multiplicity problem, with a risk of inflating type I error and reducing power. Dimension reduction-based techniques, such as principal component-based logistic regression (PC-LR), partial least squares-based logistic regression (PLS-LR), have recently gained much attention in the analysis of high dimensional genomic data. However, the perfor- mance of these methods is still not clear, especially in GWAS. We conducted simulations and real data application to compare the type I error and power of PC-LR, PLS-LR and LR applicable to GWAS within a defined single nucleotide polymorphism (SNP) set region. We found that PC-LR and PLS can reasonably control type I error under null hypothesis. On contrast, LR, which is corrected by Bonferroni method, was more conserved in all simulation settings. In particular, we found that PC-LR and PLS-LR had comparable power and they both outperformed LR, especially when the causal SNP was in high linkage disequilibrium with genotyped ones and with a small effective size in simulation. Based on SNP set analysis, we applied all three methods to analyze non-small cell lung cancer GWAS data.展开更多
5 critical quality characteristics must be controlled in the surface mount and wire-bond process in semiconductor packaging. And these characteristics are correlated with each other. So the principal components analy...5 critical quality characteristics must be controlled in the surface mount and wire-bond process in semiconductor packaging. And these characteristics are correlated with each other. So the principal components analysis(PCA) is used in the analysis of the sample data firstly. And then the process is controlled with hotelling T^2 control chart for the first several principal components which contain sufficient information. Furthermore, a software tool is developed for this kind of problems. And with sample data from a surface mounting device(SMD) process, it is demonstrated that the T^2 control chart with PCA gets the same conclusion as without PCA, but the problem is transformed from high-dimensional one to a lower dimensional one, i.e., from 5 to 2 in this demonstration.展开更多
Principal component analysis(PCA)is employed to extract the principal components(PCs)present in nuclear mass models for the first time.The effects from different nuclear mass models are reintegrated and reorganized in...Principal component analysis(PCA)is employed to extract the principal components(PCs)present in nuclear mass models for the first time.The effects from different nuclear mass models are reintegrated and reorganized in the extracted PCs.These PCs are recombined to build new mass models,which achieve better accuracy than the original theoretical mass models.This comparison indicates that using the PCA approach,the effects contained in different mass models can be collaborated to improve nuclear mass predictions.展开更多
We investigated the parametric optimization on incremental sheet forming of stainless steel using Grey Relational Analysis(GRA) coupled with Principal Component Analysis(PCA). AISI 316L stainless steel sheets were use...We investigated the parametric optimization on incremental sheet forming of stainless steel using Grey Relational Analysis(GRA) coupled with Principal Component Analysis(PCA). AISI 316L stainless steel sheets were used to develop double wall angle pyramid with aid of tungsten carbide tool. GRA coupled with PCA was used to plan the experiment conditions. Control factors such as Tool Diameter(TD), Step Depth(SD), Bottom Wall Angle(BWA), Feed Rate(FR) and Spindle Speed(SS) on Top Wall Angle(TWA) and Top Wall Angle Surface Roughness(TWASR) have been studied. Wall angle increases with increasing tool diameter due to large contact area between tool and workpiece. As the step depth, feed rate and spindle speed increase,TWASR decreases with increasing tool diameter. As the step depth increasing, the hydrostatic stress is raised causing severe cracks in the deformed surface. Hence it was concluded that the proposed hybrid method was suitable for optimizing the factors and response.展开更多
Ore production is usually affected by multiple influencing inputs at open-pit mines.Nevertheless,the complex nonlinear relationships between these inputs and ore production remain unclear.This becomes even more challe...Ore production is usually affected by multiple influencing inputs at open-pit mines.Nevertheless,the complex nonlinear relationships between these inputs and ore production remain unclear.This becomes even more challenging when training data(e.g.truck haulage information and weather conditions)are massive.In machine learning(ML)algorithms,deep neural network(DNN)is a superior method for processing nonlinear and massive data by adjusting the amount of neurons and hidden layers.This study adopted DNN to forecast ore production using truck haulage information and weather conditions at open-pit mines as training data.Before the prediction models were built,principal component analysis(PCA)was employed to reduce the data dimensionality and eliminate the multicollinearity among highly correlated input variables.To verify the superiority of DNN,three ANNs containing only one hidden layer and six traditional ML models were established as benchmark models.The DNN model with multiple hidden layers performed better than the ANN models with a single hidden layer.The DNN model outperformed the extensively applied benchmark models in predicting ore production.This can provide engineers and researchers with an accurate method to forecast ore production,which helps make sound budgetary decisions and mine planning at open-pit mines.展开更多
The safety and integrity requirements of aerospace composite structures necessitate real-time health monitoring throughout their service life.To this end,distributed optical fiber sensors utilizing back Rayleigh scatt...The safety and integrity requirements of aerospace composite structures necessitate real-time health monitoring throughout their service life.To this end,distributed optical fiber sensors utilizing back Rayleigh scattering have been extensively deployed in structural health monitoring due to their advantages,such as lightweight and ease of embedding.However,identifying the precise location of damage from the optical fiber signals remains a critical challenge.In this paper,a novel approach which namely Modified Sliding Window Principal Component Analysis(MSWPCA)was proposed to facilitate automatic damage identification and localization via distributed optical fiber sensors.The proposed method is able to extract signal characteristics interfered by measurement noise to improve the accuracy of damage detection.Specifically,we applied the MSWPCA method to monitor and analyze the debonding propagation process in honeycomb sandwich panel structures.Our findings demonstrate that the training model exhibits high precision in detecting the location and size of honeycomb debonding,thereby facilitating reliable and efficient online assessment of the structural health state.展开更多
Principal Component Analysis (PCA) is a widely used technique for data analysis and dimensionality reduction, but its sensitivity to feature scale and outliers limits its applicability. Robust Principal Component Anal...Principal Component Analysis (PCA) is a widely used technique for data analysis and dimensionality reduction, but its sensitivity to feature scale and outliers limits its applicability. Robust Principal Component Analysis (RPCA) addresses these limitations by decomposing data into a low-rank matrix capturing the underlying structure and a sparse matrix identifying outliers, enhancing robustness against noise and outliers. This paper introduces a novel RPCA variant, Robust PCA Integrating Sparse and Low-rank Priors (RPCA-SL). Each prior targets a specific aspect of the data’s underlying structure and their combination allows for a more nuanced and accurate separation of the main data components from outliers and noise. Then RPCA-SL is solved by employing a proximal gradient algorithm for improved anomaly detection and data decomposition. Experimental results on simulation and real data demonstrate significant advancements.展开更多
With increasing global demand for energy,the importance of unconventional shale oil and gas research cannot be over-emphasized.The oil and gas industry requires rapid and reliable means of forecasting production.Exist...With increasing global demand for energy,the importance of unconventional shale oil and gas research cannot be over-emphasized.The oil and gas industry requires rapid and reliable means of forecasting production.Existing traditional decline curve analysis(DCA)methods have been limited in their ability to satisfactorily forecast production from unconventional liquid-rich shale(LRS)reservoirs.This is due to several causes ranging from the complicated production mechanisms to the ultra-low permeability in shales.The use of hybrid(combination)DCA models can improve results.However,complexities associated with these techniques can still make their application quite tedious without proper diagnostic plots,correct use of model parameters and some knowledge of the production mechanisms involved.This work,therefore,presents a new statistical data-driven approach of forecasting production from LRS reservoirs called the Principal Components Methodology(PCM).PCM is a technique that bypasses a lot of the difficulties associated with existing methods of forecasting and forecasts production with reasonable certainty.PCM is a data-driven method of forecasting based on the statistical technique of principal components analysis(PCA).In our study,we simulated production of fluids with different compositions for 30 years with the aid of a commercial compositional simulator.We then applied the Principal Components Methodology(PCM)to the production data from several representative wells by using Singular Value Decomposition(SVD)to calculate the principal components.These principal components were then used to forecast oil production from wells with production histories ranging from 0.5 to 3 years,and the results were compared to simulated data.Application of the PCM to field data is also included in this work.展开更多
The aim of this work is to describe and compare three exploratory chemometrical tools,principal components analysis,independent components analysis and common components analysis,the last one being a modification of t...The aim of this work is to describe and compare three exploratory chemometrical tools,principal components analysis,independent components analysis and common components analysis,the last one being a modification of the multi-block statistical method known as common components and specific weights analysis.The three methods were applied to a set of data to show the differences and similarities of the results obtained,highlighting their complementarity.展开更多
Intended for good productivity and perfect operation of the solar power grid a failure-free system is required.Therefore,thermal image processing with the thermal camera is the latest non-invasive(without manual conta...Intended for good productivity and perfect operation of the solar power grid a failure-free system is required.Therefore,thermal image processing with the thermal camera is the latest non-invasive(without manual contact)type fault identification technique which may give good precision in all aspects.The soiling issue,which is major productivity affecting factor may import from several reasons such as dust on the wind,bird mucks,etc.The efficient power production sufferers due to accumulated soil deposits reaching from 1%–7%in the county,such as India,to more than 25%in middle-east countries country,such as Dubai,Kuwait,etc.This research offers a solar panel soiling detection system built on thermal imaging which powers the inspection method and mitigates the requirement for physical panel inspection in a large solar production place.Hence,in this method,solar panels can be verified by working without disturbing production operation and it will save time and price of recognition.India ranks 3rd worldwide in the usage use age of Photovoltaic(PV)panels now and it is supported about 8.6%of the Nation’s electricity need in the year 2020.In the meantime,the installed PV production areas in India are aged 4–5 years old.Hence the need for inspection and maintenance of installed PV is growing fast day by day.As a result,this research focuses on finding the soiling hotspot exactly of the working solar panels with the help of Principal Components Thermal Analysis(PCTA)on MATLAB Environment.展开更多
The composition control of molten steel is one of the main functions in the ladle furnace(LF)refining process.In this study,a feasible model was established to predict the alloying element yield using principal compon...The composition control of molten steel is one of the main functions in the ladle furnace(LF)refining process.In this study,a feasible model was established to predict the alloying element yield using principal component analysis(PCA)and deep neural network(DNN).The PCA was used to eliminate collinearity and reduce the dimension of the input variables,and then the data processed by PCA were used to establish the DNN model.The prediction hit ratios for the Si element yield in the error ranges of±1%,±3%,and±5%are 54.0%,93.8%,and98.8%,respectively,whereas those of the Mn element yield in the error ranges of±1%,±2%,and±3%are 77.0%,96.3%,and 99.5%,respectively,in the PCA-DNN model.The results demonstrate that the PCA-DNN model performs better than the known models,such as the reference heat method,multiple linear regression,modified backpropagation,and DNN model.Meanwhile,the accurate prediction of the alloying element yield can greatly contribute to realizing a“narrow window”control of composition in molten steel.The construction of the prediction model for the element yield can also provide a reference for the development of an alloying control model in LF intelligent refining in the modern iron and steel industry.展开更多
The large blast furnace is essential equipment in the process of iron and steel manufacturing. Due to the complex operation process and frequent fluctuations of variables, conventional monitoring methods often bring f...The large blast furnace is essential equipment in the process of iron and steel manufacturing. Due to the complex operation process and frequent fluctuations of variables, conventional monitoring methods often bring false alarms. To address the above problem, an ensemble of greedy dynamic principal component analysis-Gaussian mixture model(EGDPCA-GMM) is proposed in this paper. First, PCA-GMM is introduced to deal with the collinearity and the non-Gaussian distribution of blast furnace data.Second, in order to explain the dynamics of data, the greedy algorithm is used to determine the extended variables and their corresponding time lags, so as to avoid introducing unnecessary noise. Then the bagging ensemble is adopted to cooperate with greedy extension to eliminate the randomness brought by the greedy algorithm and further reduce the false alarm rate(FAR) of monitoring results. Finally, the algorithm is applied to the blast furnace of a large iron and steel group in South China to verify performance.Compared with the basic algorithms, the proposed method achieves lowest FAR, while keeping missed alarm rate(MAR) remain stable.展开更多
Telomere assumes intra-molecular G-quadruplex that is a significant drug target for inhibiting telomerase main- tenance of telomeres in cancer. Metal cations have been recognized as playing important roles in stabiliz...Telomere assumes intra-molecular G-quadruplex that is a significant drug target for inhibiting telomerase main- tenance of telomeres in cancer. Metal cations have been recognized as playing important roles in stabilizing G-quadruplex, but their binding processes to human telomeric G-quadruplex remain uncharacterized. To in- vestigate the detailed binding procedures, molecular dynamics simulations were conducted on the hybrid [3 + 1] form-one human telomeric intra-molecular G-quadruplex. We show here that the binding of a potas- sium ion to a G-tetrad core is mediated by two alternative pathways. Principal component analysis illustrated the dominant concerted motions of G-quadruplex occurred at the loop domains. MM-PBSA calculations revealed that binding was energetically favorable and driven by the electrostatic interactions. The lower binding site was found more constructive favorable for binding. Our data provide useful information on a potassium-mediated stable structure of human telomeric intra-molecular G-quadruplex, implicating in ion disorder associated conformationa| changes and targeted drug design.展开更多
The purpose of many microarray studies is to find the association between gene expression and sample characteristics such as treatment type or sample phenotype. There has been a surge of efforts developing different m...The purpose of many microarray studies is to find the association between gene expression and sample characteristics such as treatment type or sample phenotype. There has been a surge of efforts developing different methods for delineating the association. Aside from the high dimensionality of microarray data, one well recognized challenge is the fact that genes could be complicatedly inter-related, thus making many statistical methods inappropriate to use directly on the expression data. Multivariate methods such as principal component analysis (PCA) and clustering are often used as a part of the effort to capture the gene correlation, and the derived components or clusters are used to describe the association between gene expression and sample phenotype. We propose a method for patient population dichotomization using maximally selected test statistics in combination with the PCA method, which shows favorable results. The proposed method is compared with a currently well-recognized method.展开更多
Principal component analysis(PCA) is fundamental in many pattern recognition applications.Much research has been performed to minimize the reconstruction error in L1-norm based reconstruction error minimization(L1-PCA...Principal component analysis(PCA) is fundamental in many pattern recognition applications.Much research has been performed to minimize the reconstruction error in L1-norm based reconstruction error minimization(L1-PCA-REM) since conventional L2-norm based PCA(L2-PCA) is sensitive to outliers.Recently,the variance maximization formulation of PCA with L1-norm(L1-PCA-VM) has been proposed,where new greedy and nongreedy solutions are developed.Armed with the gradient ascent perspective for optimization,we show that the L1-PCA-VM formulation is problematic in learning principal components and that only a greedy solution can achieve robustness motivation,which are verified by experiments on synthetic and real-world datasets.展开更多
Total organic carbon(TOC)content is one of the most important parameters for characterizing the quality of source rocks and assessing the hydrocarbon-generating potential of shales.The Lucaogou Formation shale reservo...Total organic carbon(TOC)content is one of the most important parameters for characterizing the quality of source rocks and assessing the hydrocarbon-generating potential of shales.The Lucaogou Formation shale reservoirs in the Jimusaer Sag,Junggar Basin,NW China,is characterized by extremely complex lithology and a wide variety of mineral compositions with source rocks mainly consisting of carbonaceous mudstone and dolomitic mudstone.The logging responses of organic matter in the shale reservoirs is quite different from those in conventional reservoirs.Analyses show that the traditional△logR method is not suitable for evaluating the TOC content in the study area.Analysis of the sensitivity characteristics of TOC content to well logs reveals that the TOC content has good correlation with the separation degree of porosity logs.After a dimension reduction processing by the principal component analysis technology,the principal components are determined through correlation analysis of porosity logs.The results show that the TOC values obtained by the new method are in good agreement with that measured by core analysis.The average absolute error of the new method is only 0.555,much less when compared with 1.222 of using traditional△logR method.The proposed method can be used to produce more accurate TOC estimates,thus providing a reliable basis for source rock mapping.展开更多
Principal component analysis(PCA)was employed to determine the implications of geochemical and isotopic data from Cenozoic volcanic activities in the Southeast Asian region,including China(South China Sea(SCS),Hainan ...Principal component analysis(PCA)was employed to determine the implications of geochemical and isotopic data from Cenozoic volcanic activities in the Southeast Asian region,including China(South China Sea(SCS),Hainan Island,Fujian-Zhejiang coast,Taiwan Island),and parts of Vietnam and Thailand.We analyzed 15 trace element indicators and 5 isotopic indicators for 623 volcanic rock samples collected from the study region.Two principal components(PCs)were extracted by PCA based on the trace elements and Sr-Nd-Pb isotopic ratios,which probably indicate an enriched oceanic island basalt-type mantle plume and a depleted mid-ocean ridge basalt-type spreading ridge.The results show that the influence of the Hainan mantle plume on younger volcanic activities(<13 Ma)is stronger than that on older ones(>13 Ma)at the same location in the Southeast Asian region.PCA was employed to verify the mantle-plume-ridge interaction model of volcanic activities beneath the expansion center of SCS and refute the hypothesis that the tension of SCS is triggered by the Hainan plume.This study reveals the efficiency and applicability of PCA in discussing mantle sources of volcanic activities;thus,PCA is a suitable research method for analyzing geochemical data.展开更多
In this research,an integrated classification method based on principal component analysis-simulated annealing genetic algorithm-fuzzy cluster means(PCA-SAGA-FCM)was proposed for the unsupervised classification of tig...In this research,an integrated classification method based on principal component analysis-simulated annealing genetic algorithm-fuzzy cluster means(PCA-SAGA-FCM)was proposed for the unsupervised classification of tight sandstone reservoirs which lack the prior information and core experiments.A variety of evaluation parameters were selected,including lithology characteristic parameters,poro-permeability quality characteristic parameters,engineering quality characteristic parameters,and pore structure characteristic parameters.The PCA was used to reduce the dimension of the evaluation pa-rameters,and the low-dimensional data was used as input.The unsupervised reservoir classification of tight sandstone reservoir was carried out by the SAGA-FCM,the characteristics of reservoir at different categories were analyzed and compared with the lithological profiles.The analysis results of numerical simulation and actual logging data show that:1)compared with FCM algorithm,SAGA-FCM has stronger stability and higher accuracy;2)the proposed method can cluster the reservoir flexibly and effectively according to the degree of membership;3)the results of reservoir integrated classification match well with the lithologic profle,which demonstrates the reliability of the classification method.展开更多
This work utilizes a statistical approach of Principal Component Ana-lysis(PCA)towards the detection of Methane(CH_(4))-Carbon Monoxide(CO)Poi-soning occurring in coal mines,forestfires,drainage systems etc.where the ...This work utilizes a statistical approach of Principal Component Ana-lysis(PCA)towards the detection of Methane(CH_(4))-Carbon Monoxide(CO)Poi-soning occurring in coal mines,forestfires,drainage systems etc.where the CH_(4) and CO emissions are very high in closed buildings or confined spaces during oxi-dation processes.Both methane and carbon monoxide are highly toxic,colorless and odorless gases.Both of the gases have their own toxic levels to be detected.But during their combined presence,the toxicity of the either one goes unidentified may be due to their low levels which may lead to an explosion.By using PCA,the correlation of CO and CH_(4) data is carried out and by identifying the areas of high correlation(along the principal component axis)the explosion suppression action can be triggered earlier thus avoiding adverse effects of massive explosions.Wire-less Sensor Network is deployed and simulations are carried with heterogeneous sensors(Carbon Monoxide and Methane sensors)in NS-2 Mannasim framework.The rise in the value of CO even when CH_(4) is below the toxic level may become hazardous to the people around.Thus our proposed methodology will detect the combined presence of both the gases(CH_(4) and CO)and provide an early warning in order to avoid any human losses or toxic effects.展开更多
基金supported by the National Natural Science Foundation of China(Grant Nos.82073808,81872828,and 81573384)。
文摘The fruits of leguminous plants Cercis Chinensis Bunge are still overlooked although they have been reported to be antioxidative because of the limited information on the phytochemicals of C.chinensis fruits.A simple,rapid and sensitive HPLC-MS/MS method was developed for the identification and quantitation of the major bioactive components in C.chinensis fruits.Eighteen polyphenols were identified,which are first reported in C.chinensis fruits.Moreover,ten components were simultaneously quantified.The validated quantitative method was proved to be sensitive,reproducible and accurate.Then,it was applied to analyze batches of C.chinensis fruits from different phytomorph and areas.The principal components analysis(PCA)realized visualization and reduction of data set dimension while the hierarchical cluster analysis(HCA)indicated that the content of phenolic acids or all ten components might be used to differentiate C.chinensis fruits of different phytomorph.
基金founded by the National Natural Science Foundation of China(81202283,81473070,81373102 and81202267)Key Grant of Natural Science Foundation of the Jiangsu Higher Education Institutions of China(10KJA330034 and11KJA330001)+1 种基金the Research Fund for the Doctoral Program of Higher Education of China(20113234110002)the Priority Academic Program for the Development of Jiangsu Higher Education Institutions(Public Health and Preventive Medicine)
文摘With recent advances in biotechnology, genome-wide association study (GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistical strategy is traditional logistical regression (LR) based on single-locus analysis. However, such a single-locus analysis leads to the well-known multiplicity problem, with a risk of inflating type I error and reducing power. Dimension reduction-based techniques, such as principal component-based logistic regression (PC-LR), partial least squares-based logistic regression (PLS-LR), have recently gained much attention in the analysis of high dimensional genomic data. However, the perfor- mance of these methods is still not clear, especially in GWAS. We conducted simulations and real data application to compare the type I error and power of PC-LR, PLS-LR and LR applicable to GWAS within a defined single nucleotide polymorphism (SNP) set region. We found that PC-LR and PLS can reasonably control type I error under null hypothesis. On contrast, LR, which is corrected by Bonferroni method, was more conserved in all simulation settings. In particular, we found that PC-LR and PLS-LR had comparable power and they both outperformed LR, especially when the causal SNP was in high linkage disequilibrium with genotyped ones and with a small effective size in simulation. Based on SNP set analysis, we applied all three methods to analyze non-small cell lung cancer GWAS data.
基金This project is supported by National Natural Science Foundation of China (No.70372062)Hi-Tech Program of Tianjin city,China (No.04310881R).
文摘5 critical quality characteristics must be controlled in the surface mount and wire-bond process in semiconductor packaging. And these characteristics are correlated with each other. So the principal components analysis(PCA) is used in the analysis of the sample data firstly. And then the process is controlled with hotelling T^2 control chart for the first several principal components which contain sufficient information. Furthermore, a software tool is developed for this kind of problems. And with sample data from a surface mounting device(SMD) process, it is demonstrated that the T^2 control chart with PCA gets the same conclusion as without PCA, but the problem is transformed from high-dimensional one to a lower dimensional one, i.e., from 5 to 2 in this demonstration.
基金supported by the State Key Laboratory of Nuclear Physics and Technology,Peking University(Grant No.NPT2023KFY02)the China Postdoctoral Science Foundation(Grant No.2021M700256)+2 种基金the National Key R&D Program of China(Grant No.2018YFA0404400)the National Natural Science Foundation of China(Grant Nos.11935003,11975031,12141501,and 12070131001)the High-performance Computing Platform of Peking University。
文摘Principal component analysis(PCA)is employed to extract the principal components(PCs)present in nuclear mass models for the first time.The effects from different nuclear mass models are reintegrated and reorganized in the extracted PCs.These PCs are recombined to build new mass models,which achieve better accuracy than the original theoretical mass models.This comparison indicates that using the PCA approach,the effects contained in different mass models can be collaborated to improve nuclear mass predictions.
文摘We investigated the parametric optimization on incremental sheet forming of stainless steel using Grey Relational Analysis(GRA) coupled with Principal Component Analysis(PCA). AISI 316L stainless steel sheets were used to develop double wall angle pyramid with aid of tungsten carbide tool. GRA coupled with PCA was used to plan the experiment conditions. Control factors such as Tool Diameter(TD), Step Depth(SD), Bottom Wall Angle(BWA), Feed Rate(FR) and Spindle Speed(SS) on Top Wall Angle(TWA) and Top Wall Angle Surface Roughness(TWASR) have been studied. Wall angle increases with increasing tool diameter due to large contact area between tool and workpiece. As the step depth, feed rate and spindle speed increase,TWASR decreases with increasing tool diameter. As the step depth increasing, the hydrostatic stress is raised causing severe cracks in the deformed surface. Hence it was concluded that the proposed hybrid method was suitable for optimizing the factors and response.
基金This work was supported by the Pilot Seed Grant(Grant No.RES0049944)the Collaborative Research Project(Grant No.RES0043251)from the University of Alberta.
文摘Ore production is usually affected by multiple influencing inputs at open-pit mines.Nevertheless,the complex nonlinear relationships between these inputs and ore production remain unclear.This becomes even more challenging when training data(e.g.truck haulage information and weather conditions)are massive.In machine learning(ML)algorithms,deep neural network(DNN)is a superior method for processing nonlinear and massive data by adjusting the amount of neurons and hidden layers.This study adopted DNN to forecast ore production using truck haulage information and weather conditions at open-pit mines as training data.Before the prediction models were built,principal component analysis(PCA)was employed to reduce the data dimensionality and eliminate the multicollinearity among highly correlated input variables.To verify the superiority of DNN,three ANNs containing only one hidden layer and six traditional ML models were established as benchmark models.The DNN model with multiple hidden layers performed better than the ANN models with a single hidden layer.The DNN model outperformed the extensively applied benchmark models in predicting ore production.This can provide engineers and researchers with an accurate method to forecast ore production,which helps make sound budgetary decisions and mine planning at open-pit mines.
基金supported by the National Key Research and Development Program of China(No.2018YFA0702800)the National Natural Science Foundation of China(No.12072056)supported by National Defense Fundamental Scientific Research Project(XXXX2018204BXXX).
文摘The safety and integrity requirements of aerospace composite structures necessitate real-time health monitoring throughout their service life.To this end,distributed optical fiber sensors utilizing back Rayleigh scattering have been extensively deployed in structural health monitoring due to their advantages,such as lightweight and ease of embedding.However,identifying the precise location of damage from the optical fiber signals remains a critical challenge.In this paper,a novel approach which namely Modified Sliding Window Principal Component Analysis(MSWPCA)was proposed to facilitate automatic damage identification and localization via distributed optical fiber sensors.The proposed method is able to extract signal characteristics interfered by measurement noise to improve the accuracy of damage detection.Specifically,we applied the MSWPCA method to monitor and analyze the debonding propagation process in honeycomb sandwich panel structures.Our findings demonstrate that the training model exhibits high precision in detecting the location and size of honeycomb debonding,thereby facilitating reliable and efficient online assessment of the structural health state.
文摘Principal Component Analysis (PCA) is a widely used technique for data analysis and dimensionality reduction, but its sensitivity to feature scale and outliers limits its applicability. Robust Principal Component Analysis (RPCA) addresses these limitations by decomposing data into a low-rank matrix capturing the underlying structure and a sparse matrix identifying outliers, enhancing robustness against noise and outliers. This paper introduces a novel RPCA variant, Robust PCA Integrating Sparse and Low-rank Priors (RPCA-SL). Each prior targets a specific aspect of the data’s underlying structure and their combination allows for a more nuanced and accurate separation of the main data components from outliers and noise. Then RPCA-SL is solved by employing a proximal gradient algorithm for improved anomaly detection and data decomposition. Experimental results on simulation and real data demonstrate significant advancements.
文摘With increasing global demand for energy,the importance of unconventional shale oil and gas research cannot be over-emphasized.The oil and gas industry requires rapid and reliable means of forecasting production.Existing traditional decline curve analysis(DCA)methods have been limited in their ability to satisfactorily forecast production from unconventional liquid-rich shale(LRS)reservoirs.This is due to several causes ranging from the complicated production mechanisms to the ultra-low permeability in shales.The use of hybrid(combination)DCA models can improve results.However,complexities associated with these techniques can still make their application quite tedious without proper diagnostic plots,correct use of model parameters and some knowledge of the production mechanisms involved.This work,therefore,presents a new statistical data-driven approach of forecasting production from LRS reservoirs called the Principal Components Methodology(PCM).PCM is a technique that bypasses a lot of the difficulties associated with existing methods of forecasting and forecasts production with reasonable certainty.PCM is a data-driven method of forecasting based on the statistical technique of principal components analysis(PCA).In our study,we simulated production of fluids with different compositions for 30 years with the aid of a commercial compositional simulator.We then applied the Principal Components Methodology(PCM)to the production data from several representative wells by using Singular Value Decomposition(SVD)to calculate the principal components.These principal components were then used to forecast oil production from wells with production histories ranging from 0.5 to 3 years,and the results were compared to simulated data.Application of the PCM to field data is also included in this work.
文摘The aim of this work is to describe and compare three exploratory chemometrical tools,principal components analysis,independent components analysis and common components analysis,the last one being a modification of the multi-block statistical method known as common components and specific weights analysis.The three methods were applied to a set of data to show the differences and similarities of the results obtained,highlighting their complementarity.
文摘Intended for good productivity and perfect operation of the solar power grid a failure-free system is required.Therefore,thermal image processing with the thermal camera is the latest non-invasive(without manual contact)type fault identification technique which may give good precision in all aspects.The soiling issue,which is major productivity affecting factor may import from several reasons such as dust on the wind,bird mucks,etc.The efficient power production sufferers due to accumulated soil deposits reaching from 1%–7%in the county,such as India,to more than 25%in middle-east countries country,such as Dubai,Kuwait,etc.This research offers a solar panel soiling detection system built on thermal imaging which powers the inspection method and mitigates the requirement for physical panel inspection in a large solar production place.Hence,in this method,solar panels can be verified by working without disturbing production operation and it will save time and price of recognition.India ranks 3rd worldwide in the usage use age of Photovoltaic(PV)panels now and it is supported about 8.6%of the Nation’s electricity need in the year 2020.In the meantime,the installed PV production areas in India are aged 4–5 years old.Hence the need for inspection and maintenance of installed PV is growing fast day by day.As a result,this research focuses on finding the soiling hotspot exactly of the working solar panels with the help of Principal Components Thermal Analysis(PCTA)on MATLAB Environment.
基金supported by the National Natural Science Foundation of China(No.51974023)State Key Laboratory of Advanced Metallurgy,University of Science and Technology Beijing(No.41621005)。
文摘The composition control of molten steel is one of the main functions in the ladle furnace(LF)refining process.In this study,a feasible model was established to predict the alloying element yield using principal component analysis(PCA)and deep neural network(DNN).The PCA was used to eliminate collinearity and reduce the dimension of the input variables,and then the data processed by PCA were used to establish the DNN model.The prediction hit ratios for the Si element yield in the error ranges of±1%,±3%,and±5%are 54.0%,93.8%,and98.8%,respectively,whereas those of the Mn element yield in the error ranges of±1%,±2%,and±3%are 77.0%,96.3%,and 99.5%,respectively,in the PCA-DNN model.The results demonstrate that the PCA-DNN model performs better than the known models,such as the reference heat method,multiple linear regression,modified backpropagation,and DNN model.Meanwhile,the accurate prediction of the alloying element yield can greatly contribute to realizing a“narrow window”control of composition in molten steel.The construction of the prediction model for the element yield can also provide a reference for the development of an alloying control model in LF intelligent refining in the modern iron and steel industry.
基金supported by the National Natural Science Foundation of China (61903326, 61933015)。
文摘The large blast furnace is essential equipment in the process of iron and steel manufacturing. Due to the complex operation process and frequent fluctuations of variables, conventional monitoring methods often bring false alarms. To address the above problem, an ensemble of greedy dynamic principal component analysis-Gaussian mixture model(EGDPCA-GMM) is proposed in this paper. First, PCA-GMM is introduced to deal with the collinearity and the non-Gaussian distribution of blast furnace data.Second, in order to explain the dynamics of data, the greedy algorithm is used to determine the extended variables and their corresponding time lags, so as to avoid introducing unnecessary noise. Then the bagging ensemble is adopted to cooperate with greedy extension to eliminate the randomness brought by the greedy algorithm and further reduce the false alarm rate(FAR) of monitoring results. Finally, the algorithm is applied to the blast furnace of a large iron and steel group in South China to verify performance.Compared with the basic algorithms, the proposed method achieves lowest FAR, while keeping missed alarm rate(MAR) remain stable.
基金This work was supported by grants from the the National Basic Re- search Program (973 Program) (Nos. 2012CB911200 and 2012CB910104) and National Natural Science Foundation of China (Grant No. 31100584).
文摘Telomere assumes intra-molecular G-quadruplex that is a significant drug target for inhibiting telomerase main- tenance of telomeres in cancer. Metal cations have been recognized as playing important roles in stabilizing G-quadruplex, but their binding processes to human telomeric G-quadruplex remain uncharacterized. To in- vestigate the detailed binding procedures, molecular dynamics simulations were conducted on the hybrid [3 + 1] form-one human telomeric intra-molecular G-quadruplex. We show here that the binding of a potas- sium ion to a G-tetrad core is mediated by two alternative pathways. Principal component analysis illustrated the dominant concerted motions of G-quadruplex occurred at the loop domains. MM-PBSA calculations revealed that binding was energetically favorable and driven by the electrostatic interactions. The lower binding site was found more constructive favorable for binding. Our data provide useful information on a potassium-mediated stable structure of human telomeric intra-molecular G-quadruplex, implicating in ion disorder associated conformationa| changes and targeted drug design.
文摘The purpose of many microarray studies is to find the association between gene expression and sample characteristics such as treatment type or sample phenotype. There has been a surge of efforts developing different methods for delineating the association. Aside from the high dimensionality of microarray data, one well recognized challenge is the fact that genes could be complicatedly inter-related, thus making many statistical methods inappropriate to use directly on the expression data. Multivariate methods such as principal component analysis (PCA) and clustering are often used as a part of the effort to capture the gene correlation, and the derived components or clusters are used to describe the association between gene expression and sample phenotype. We propose a method for patient population dichotomization using maximally selected test statistics in combination with the PCA method, which shows favorable results. The proposed method is compared with a currently well-recognized method.
基金Project supported by the National Natural Science Foundation of China (Nos. 61071131 and 61271388)the Beijing Natural Science Foundation (No. 4122040)+1 种基金the Research Project of Tsinghua University (No. 2012Z01011)the United Technologies Research Center (UTRC)
文摘Principal component analysis(PCA) is fundamental in many pattern recognition applications.Much research has been performed to minimize the reconstruction error in L1-norm based reconstruction error minimization(L1-PCA-REM) since conventional L2-norm based PCA(L2-PCA) is sensitive to outliers.Recently,the variance maximization formulation of PCA with L1-norm(L1-PCA-VM) has been proposed,where new greedy and nongreedy solutions are developed.Armed with the gradient ascent perspective for optimization,we show that the L1-PCA-VM formulation is problematic in learning principal components and that only a greedy solution can achieve robustness motivation,which are verified by experiments on synthetic and real-world datasets.
基金This research was funded by the National Natural Science Foundation of China(Grant No.41504103).
文摘Total organic carbon(TOC)content is one of the most important parameters for characterizing the quality of source rocks and assessing the hydrocarbon-generating potential of shales.The Lucaogou Formation shale reservoirs in the Jimusaer Sag,Junggar Basin,NW China,is characterized by extremely complex lithology and a wide variety of mineral compositions with source rocks mainly consisting of carbonaceous mudstone and dolomitic mudstone.The logging responses of organic matter in the shale reservoirs is quite different from those in conventional reservoirs.Analyses show that the traditional△logR method is not suitable for evaluating the TOC content in the study area.Analysis of the sensitivity characteristics of TOC content to well logs reveals that the TOC content has good correlation with the separation degree of porosity logs.After a dimension reduction processing by the principal component analysis technology,the principal components are determined through correlation analysis of porosity logs.The results show that the TOC values obtained by the new method are in good agreement with that measured by core analysis.The average absolute error of the new method is only 0.555,much less when compared with 1.222 of using traditional△logR method.The proposed method can be used to produce more accurate TOC estimates,thus providing a reliable basis for source rock mapping.
基金Supported by the State Key Laboratory of Marine Environmental Science Visiting Fellowship(No.MELRS2233)the State Key Laboratory of Marine Geology,Tongji University(No.MGK202302)+4 种基金the Innovation Group Project of Southern Marine Science and Engineering Guangdong Laboratory(Zhuhai)(No.311021003)the Zhujiang Talent Project Foundation of Guangdong Province(No.2017ZT07Z066)the Fundamental Research Funds for the Central Universities,Sun Yat-sen University(Nos.22qntd2101,2021qntd23)the Major Projects of the National Natural Science Foundation of China(Nos.41790465,41590863)the National Natural Science Foundation of China(Nos.42102333,41806077,41904045)。
文摘Principal component analysis(PCA)was employed to determine the implications of geochemical and isotopic data from Cenozoic volcanic activities in the Southeast Asian region,including China(South China Sea(SCS),Hainan Island,Fujian-Zhejiang coast,Taiwan Island),and parts of Vietnam and Thailand.We analyzed 15 trace element indicators and 5 isotopic indicators for 623 volcanic rock samples collected from the study region.Two principal components(PCs)were extracted by PCA based on the trace elements and Sr-Nd-Pb isotopic ratios,which probably indicate an enriched oceanic island basalt-type mantle plume and a depleted mid-ocean ridge basalt-type spreading ridge.The results show that the influence of the Hainan mantle plume on younger volcanic activities(<13 Ma)is stronger than that on older ones(>13 Ma)at the same location in the Southeast Asian region.PCA was employed to verify the mantle-plume-ridge interaction model of volcanic activities beneath the expansion center of SCS and refute the hypothesis that the tension of SCS is triggered by the Hainan plume.This study reveals the efficiency and applicability of PCA in discussing mantle sources of volcanic activities;thus,PCA is a suitable research method for analyzing geochemical data.
基金funded by the National Natural Science Foundation of China(42174131)the Strategic Cooperation Technology Projects of CNPC and CUPB(ZLZX2020-03).
文摘In this research,an integrated classification method based on principal component analysis-simulated annealing genetic algorithm-fuzzy cluster means(PCA-SAGA-FCM)was proposed for the unsupervised classification of tight sandstone reservoirs which lack the prior information and core experiments.A variety of evaluation parameters were selected,including lithology characteristic parameters,poro-permeability quality characteristic parameters,engineering quality characteristic parameters,and pore structure characteristic parameters.The PCA was used to reduce the dimension of the evaluation pa-rameters,and the low-dimensional data was used as input.The unsupervised reservoir classification of tight sandstone reservoir was carried out by the SAGA-FCM,the characteristics of reservoir at different categories were analyzed and compared with the lithological profiles.The analysis results of numerical simulation and actual logging data show that:1)compared with FCM algorithm,SAGA-FCM has stronger stability and higher accuracy;2)the proposed method can cluster the reservoir flexibly and effectively according to the degree of membership;3)the results of reservoir integrated classification match well with the lithologic profle,which demonstrates the reliability of the classification method.
文摘This work utilizes a statistical approach of Principal Component Ana-lysis(PCA)towards the detection of Methane(CH_(4))-Carbon Monoxide(CO)Poi-soning occurring in coal mines,forestfires,drainage systems etc.where the CH_(4) and CO emissions are very high in closed buildings or confined spaces during oxi-dation processes.Both methane and carbon monoxide are highly toxic,colorless and odorless gases.Both of the gases have their own toxic levels to be detected.But during their combined presence,the toxicity of the either one goes unidentified may be due to their low levels which may lead to an explosion.By using PCA,the correlation of CO and CH_(4) data is carried out and by identifying the areas of high correlation(along the principal component axis)the explosion suppression action can be triggered earlier thus avoiding adverse effects of massive explosions.Wire-less Sensor Network is deployed and simulations are carried with heterogeneous sensors(Carbon Monoxide and Methane sensors)in NS-2 Mannasim framework.The rise in the value of CO even when CH_(4) is below the toxic level may become hazardous to the people around.Thus our proposed methodology will detect the combined presence of both the gases(CH_(4) and CO)and provide an early warning in order to avoid any human losses or toxic effects.