[Objectives] This study aimed to establish HPLC fingerprint and conduct cluster analysis and principle component analysis for Citri Reticulatae Pericarpium Viride. [Methods] Using the HPLC method, the determination wa...[Objectives] This study aimed to establish HPLC fingerprint and conduct cluster analysis and principle component analysis for Citri Reticulatae Pericarpium Viride. [Methods] Using the HPLC method, the determination was performed on XSelect~® HSS T3-C_(18) column with mobile phase of acetonitrile-0.5% acetic acid solution(gradient elution) at the flow rate of 1.0 mL/min. The detection wavelength was 360 nm. The column temperature was 25℃. The sample size was 10 μL. With peak of hesperidin as the reference, HPLC fingerprints of 10 batches of Citri Reticulatae Pericarpium Viride were determined. The similarity of the 10 batches of samples was evaluated by Similarity Evaluation System for Chromatographic Fingerprint of TCM(2012 edition) to determine the common peaks. Cluster analysis and principal component analysis were performed by using SPSS 17.0 statistical software. [Results] The HPLC fingerprints of the 10 batches of medicinal materials had total 11 common peaks, and the similarity was 0.919-1.000, indicating that the chemical composition of the 10 batches of medicinal materials was consistent. There were 11 common components in the 10 batches of medicinal materials, but their contents were different. When the Euclidean distance was 20, the 10 batches of samples were divided into two categories, S4 in the first category, and the others in the second one. When the Euclidean distance was 5, the second category could be further divided into two sub-categories, S1 and S10 in one sub-category, and S2, S3, S5, S6, S7, S8 and S9 in the other one. The principle component analysis showed that cumulative contribution rate of the two main component factors was 92.797%, and the comprehensive score of S7 was the highest with the best quality. [Conclusions] The results of HPLC fingerprinting, cluster analysis and principle component analysis can provide reference for the quality control of Citri Reticulatae Pericarpium Viride.展开更多
This paper aims to deepen the quality of life of people with celiac disease with a focus on compliance to the diet through Principle Component Analysis and Analyse des Données. In particular, we will try to under...This paper aims to deepen the quality of life of people with celiac disease with a focus on compliance to the diet through Principle Component Analysis and Analyse des Données. In particular, we will try to understand whether these analyzes are also applicable in the context of research web2.0 carried out with web-survey.展开更多
Ground roll is an interference wave that severely degrades the signal-to-noise ratio of seismic data and affects its subsequent processing and interpretation.In this study,according to differences in morphological cha...Ground roll is an interference wave that severely degrades the signal-to-noise ratio of seismic data and affects its subsequent processing and interpretation.In this study,according to differences in morphological characteristics between ground roll and reflected waves,we use morphological component analysis based on two-dimensional dictionaries to separate ground roll and reflected waves.Because ground roll is characterized by lowfrequency,low-velocity,and dispersion,we select two-dimensional undecimated discrete wavelet transform as a sparse representation dictionary of ground roll.Because of a strong local correlation of the reflected wave,we select two-dimensional local discrete cosine transform as the sparse representation dictionary of reflected waves.A sparse representation model of seismic data is constructed based on a two-dimensional joint dictionary then a block coordinate relaxation algorithm is used to solve the model and decompose seismic record into reflected wave part and ground roll part.The good effects for the synthetic seismic data and application of real seismic data indicate that when using the model,strong-energy ground roll is considerably suppressed and the waveform of the reflected wave is effectively protected.展开更多
With the development of power grid, as one of the key equipment, the transformer’s condition assessment method has always receive attention from experts, scholars concern more and more about the method’s practicalit...With the development of power grid, as one of the key equipment, the transformer’s condition assessment method has always receive attention from experts, scholars concern more and more about the method’s practicality and reliability. In the traditional condition assessment method, due to the characteristics of the transformer’s complex structure, the assessment system is not comprehensive enough, or the assessment system is too complex, the indexes are not easy to quantify, such problems are emerging. The traditional method is complex and the degree of quantification is not enough. Therefore it is necessary to propose a condition assessment method that is easy to carry out the condition assessment work and does not affect the assessment results. In this paper, we propose a method to assess the state of the transformer’s complex structure. First, we establish a comprehensive assessment system, then apply the method of principal component analysis to optimize the index system, and then use the theory of cloud-matter-element. Finally the reliability and rationality of the method are verified by an example.展开更多
NonLocal Means(NLM),taking fully advantage of image redundancy,has been proved to be very effective in noise removal.However,high computational load limits its wide application.Based on Principle Component Analysis(PC...NonLocal Means(NLM),taking fully advantage of image redundancy,has been proved to be very effective in noise removal.However,high computational load limits its wide application.Based on Principle Component Analysis(PCA),Principle Neighborhood Dictionary(PND) was proposed to reduce the computational load of NLM.Nevertheless,as the principle components in PND method are computed directly from noisy image neighborhoods,they are prone to be inaccurate due to the presence of noise.In this paper,an improved scheme for image denoising is proposed.This scheme is based on PND and uses preprocessing via Gaussian filter to eliminate the influence of noise.PCA is then used to project those filtered image neighborhood vectors onto a lower-dimensional space.With the preproc-essing process,the principle components computed are more accurate resulting in an improved de-noising performance.A comparison with some NLM based and state-of-art denoising methods shows that the proposed method performs well in terms of Peak Signal to Noise Ratio(PSNR) as well as image visual fidelity.The experimental results demonstrate that our method outperforms existing methods both subjectively and objectively.展开更多
Liquid state methanol and ethanol under different temperatures have been investigated by FT-NIR(Fourier transform nearinfrared) spectroscopy,generalized two-dimensional(2D) correlation spectroscopy,and PCA(principal c...Liquid state methanol and ethanol under different temperatures have been investigated by FT-NIR(Fourier transform nearinfrared) spectroscopy,generalized two-dimensional(2D) correlation spectroscopy,and PCA(principal component analysis) . First,the FT-NIR spectra were measured over a temperature range of 30-64(or 30-71) °C,and then the 2D correlation spectra were computed.Combining near-infrared spectroscopy,generalized 2D correlation spectroscopy,and references,we analyzed the molecular structures(especially the hydrogen bond) of methanol and ethanol,and performed the NIR band assignments. The PCA method was employed to verify the results of the 2D analysis.This study will be helpful to the understanding of these reagents.展开更多
Block principle and pattern classification component analysis (BPCA) is a recently developed technique in computer vision In this paper, we propose a robust and sparse BPCA with Lp-norm, referred to as BPCALp-S, whi...Block principle and pattern classification component analysis (BPCA) is a recently developed technique in computer vision In this paper, we propose a robust and sparse BPCA with Lp-norm, referred to as BPCALp-S, which inherits the robustness of BPCA-L1 due to the employment of adjustable Lp-norm. In order to perform a sparse modelling, the elastic net is integrated into the objective function. An iterative algorithm which extracts feature vectors one by one greedily is elaborately designed. The monotonicity of the proposed iterative procedure is theoretically guaranteed. Experiments of image classification and reconstruction on several benchmark sets show the effectiveness of the proposed approach.展开更多
This document presents an extension of the multiple factorial analysis to symbolic data and especially to space data. The analysis makes use of the characteristic coding method to obtain active individuals and the rec...This document presents an extension of the multiple factorial analysis to symbolic data and especially to space data. The analysis makes use of the characteristic coding method to obtain active individuals and the reconstitutive coding method for additional individuals in order to conserve the variability of assertion objects. Traditional analysis methods of the main components are applied to coded objects. Certain interpretation aids are presented after the coding process. This method was applied to poverty data.展开更多
Traditionally, Chinese indigenous cattle is geographically widespread. The present study analyzed based on genome-wide variants to evaluate the genetic background among 157 individuals from four representative indigen...Traditionally, Chinese indigenous cattle is geographically widespread. The present study analyzed based on genome-wide variants to evaluate the genetic background among 157 individuals from four representative indigenous cattle breeds of Hubei Province of China: Yiling yellow cattle (YL), Bashan cattle (BS), Wuling cattle (WL), Zaobei cattle (ZB), and 21 indi- viduals of Qinchuan cattle (QC) from the nearby Shanxi Province of China. Linkage disequilibrium (LD) analysis showed the LD of YL was the lowest (~=0.32) when the distance between markers was approximately 2 kb. Principle component analysis (PCA), and neighbor-joining (NJ)-tree revealed a separation of Yiling yellow cattle from other geographic nearby local cattle breeds. In PCA plot, the YL and QC groups were segregated as expected; moreover, YL individuals clustered together more obviously. In the N J-tree, the YL group formed an independent branch and BS, WL, ZB groups were mixed. We then used the FST statistic approach to reveal long-term selection sweep of YL and other 4 cattle breeds. According to the selective sweep, we identified the unique pathways of YL, associated with production traits. Based on the results, it can be proposed that YL has its unique genetic characteristics of excellence resource, and it is an indispensable cattle breed in China.展开更多
Optical coherence tomography(OCT)provides significant advantages of high resolution(approaching the histopathology level)realtime imaging of tsess without use of contrast agents.Based on these advantages,the microstru...Optical coherence tomography(OCT)provides significant advantages of high resolution(approaching the histopathology level)realtime imaging of tsess without use of contrast agents.Based on these advantages,the microstructural features of tumors can be visualized and detected intra-operatively.However,it is still not clinically accepted for tumor margin delin-eation due to poor specificity and accuracy.In contrast,Raman spectroscopy(RS)can obtain tissue information at the molecular level,but does not provide real-time inaging capability.Therefore,combining OCT and RS could provide synergy.To this end,we present a tissue analysis and dassification method using both the slope of OCT intensity signal Vs depth and the principle components from the RS spectrum as the indicators for tissuse characterization.The goal of this study was to understand prediction accuracy of OCT and combined OCT/RS method for dassification of optically similar tisues and organs.Our pilot experiments were performed on mouse kidneys,livers,and small intestines(SIs).The prediction accuracy with five-fold cross validation of the method has been evaluated by the support vector machine(SVM)method.The results demonstrate that tissue characterization based on the OCT/RS method was superior compared to using OCT structural information alone.This combined OCT/RS method is potentially useful as a noninvasive optical biopsy technique for rapid and automatic tissue characterization during surgery.展开更多
In this study,multivariate analysis methods,including a principal component analysis(PCA)and partial least square(PLS)analysis,were applied to reveal the inner relationship of the key variables in the process of H_(2)...In this study,multivariate analysis methods,including a principal component analysis(PCA)and partial least square(PLS)analysis,were applied to reveal the inner relationship of the key variables in the process of H_(2)O_(2)-assisted Na_(2)CO_(3)(HSC)pretreatment of corn stover.A total of 120 pretreatment experiments were implemented at the lab scale under different conditions by varying the particle size of the corn stover and process variables.The results showed that the Na_(2)CO_(3) dosage and pretreatment temperature had a strong influence on lignin removal,whereas pulp refining instrument(PFI)refining and Na_(2)CO_(3) dosage played positive roles in the final total sugar yield.Furthermore,it was found that pretreatment conditions had a more significant impact on the amelioration of pretreatment effectiveness compared with the properties of raw corn stover.In addition,a prediction of the effectiveness of the corn stover HSC pretreatment based on a PLS analysis was conducted for the first time,and the test results of the predictability based on additional pretreatment experiments proved that the developed PLS model achieved a good predictive performance(particularly for the final total sugar yield),indicating that the developed PLS model can be used to predict the effectiveness of HSC pretreatment.Therefore,multivariate analysis can be potentially used to monitor and control the pretreatment process in future large-scale biorefinery applications.展开更多
Background:Stem hardness is one of the major influencing factors for plant architecture in upland cotton(Gossypium hirsutum L.).Evaluating hardness phenotypic traits is very important for the selection of elite lines ...Background:Stem hardness is one of the major influencing factors for plant architecture in upland cotton(Gossypium hirsutum L.).Evaluating hardness phenotypic traits is very important for the selection of elite lines for resistance to lodging in Gossypium hirsutum L.Cotton breeders are interested in using diverse genotypes to enhance fiber quality and high-yield.Few pieces of research for hardness and its relationship with fiber quality and yield were found.This study was designed to find the relationship of stem hardness traits with fiber quality and yield contributing traits of upland cotton.Results:Experiments were carried out to measure the bending,acupuncture,and compression properties of the stem from a collection of upland cotton genotypes,comprising 237 accessions.The results showed that the genotypic difference in stem hardness was highly significant among the genotypes,and the stem hardness traits(BL,BU,AL,AU,CL,and CU)have a positive association with fiber quality traits and yield-related traits.Statistical analyses of the results showed that in descriptive statistics result bending(BL,BU)has a maximum coefficient of variance,but fiber length and fiber strength have less coefficient of variance among the genotypes.Principal component analysis(PCA)trimmed quantitative characters into nine principal components.The first nine principal components(PC)with Eigenvalues>1 explained 86%of the variation among 237 accessions of cotton.Both 2017 and 2018,PCA results indicated that BL,BU,FL,FE,and LI contributed to their variability in PC1,and BU,AU,CU,FD,LP,and FWPB have shown their variability in PC2.Conclusion:We describe here the systematic study of the mechanism involved in the regulation of enhancing fiber quality and yield by stem bending strength,acupuncture,and compression properties of G.hirsutum.展开更多
Brain arteriovenous malformation(BAVM) is frequently described as vascular malformation. Although computer tomography(CT), magnetic resonance imaging(MRI) and angiography can clearly detect lesions, there are no diagn...Brain arteriovenous malformation(BAVM) is frequently described as vascular malformation. Although computer tomography(CT), magnetic resonance imaging(MRI) and angiography can clearly detect lesions, there are no diagnostic biological markers of BAVM available. Current study demonstrated that micro RNA(mi RNA)showed a feasible marker for vascular disease. To find key correlations between these mi RNAs and the onset of BAVM, we carried out chip analysis of serum mi RNAs by identifying 18 potential markers of BAVM. We then constructed a principle component analysis and logistic regression(PCA-LR) model to analyze the 18 mi RNAs collected from 77 patients. Another 9 independent samples were used to test the resulting model. The results showed that mi RNAs hsa-mir-126-3p and hsa-mir-140 are important protective factors, while hsa-mir-338 is a dominating risk factor, all of which have stronger correlation with BAVM than others. We also compared the testing results using PCA-LR model with those using LR model. The comparison revealed that PCA-LR model is better in predicting the disease.展开更多
Based on 8 indicators of economic development and according to the economic situation of cities in Shaanxi Province,this paper analyzes the data using PCA( Principle Component Analysis) and FA( Factor Analysis) with t...Based on 8 indicators of economic development and according to the economic situation of cities in Shaanxi Province,this paper analyzes the data using PCA( Principle Component Analysis) and FA( Factor Analysis) with the help of R software.Using the test of Kendal's W coefficient of concordance,the consistency of results from both methods is tested.Finally,Cluster Analysis is used to classify the results and some advices are proposed for the development of cites in Shaanxi Province.展开更多
Serial Analysis of Gene Expression (SAGE) is a powerful tool to analyze whole-genome expression profiles. SAGE data, characterized by large quantity and high dimensions, need reducing their dimensions and extract feat...Serial Analysis of Gene Expression (SAGE) is a powerful tool to analyze whole-genome expression profiles. SAGE data, characterized by large quantity and high dimensions, need reducing their dimensions and extract feature to improve the accuracy and efficiency when they are used for pattern recognition and clustering analysis. A Poisson Model-based Kernel (PMK) was proposed based on the Poisson distribution of the SAGE data. Kernel Principle Component Analysis (KPCA) with PMK was proposed and used in feature-extract analysis of mouse retinal SAGE data. The computa-tional results show that this algorithm can extract feature effectively and reduce dimensions of SAGE data.展开更多
In this paper, we apply clustering analysis of data mining into power system. We adapt K-means clustering algorithm to analyze customer load, analyzing similar behavior between customer of electricity, and we adapt pr...In this paper, we apply clustering analysis of data mining into power system. We adapt K-means clustering algorithm to analyze customer load, analyzing similar behavior between customer of electricity, and we adapt principal component analysis to get the clustering result visible, Simulation and analysis using matlab, and this well verify cluster rationality. The conclusion of this paper can provide important basis to the peak for the power system, stable operation the power system security.展开更多
Online demand prediction plays an important role in transport network services from operations,controls to management,and information provision.However,the online prediction models are impacted by streaming data quali...Online demand prediction plays an important role in transport network services from operations,controls to management,and information provision.However,the online prediction models are impacted by streaming data quality issues with noise measurements and missing data.To address these,we develop a robust prediction method for online network-level demand prediction in public transport.It consists of a PCA method to extract eigen demand images and an optimization-based pattern recognition model to predict the weights of eigen demand images by making use of the partially observed real-time data up to the prediction time in a day.The prediction model is robust to data quality issues given that the eigen demand images are stable and the predicted weights of them are optimized using the network level data(less impacted by local data quality issues).In the case study,we validate the accuracy and transferability of the model by comparing it with benchmark models and evaluate the robustness in tolerating data quality issues of the proposed model.The experimental results demonstrate that the proposed Pattern Recognition Prediction based on PCA(PRP-PCA)consistently outperforms other benchmark models in accuracy and transferability.Moreover,the model shows high robustness in accommodating data quality issues.For example,the PRP-PCA model is robust to missing data up to 50%regardless of the noise level.We also discuss the hidden patterns behind the network level demand.The visualization analysis shows that eigen demand images are significantly connected to the network structure and station activity variabilities.Though the demand changes dramatically before and after the pandemic,the eigen demand images are consistent over time in Stockholm.展开更多
Principle component analysis (PCA) based chi-square test is more sensitive to subtle gross errors and has greater power to correctly detect gross errors than classical chi-square test. However, classical principal c...Principle component analysis (PCA) based chi-square test is more sensitive to subtle gross errors and has greater power to correctly detect gross errors than classical chi-square test. However, classical principal com- ponent test (PCT) is non-robust and can be very sensitive to one or more outliers. In this paper, a Huber function liked robust weight factor was added in the collective chi-square test to eliminate the influence of gross errors on the PCT. Meanwhile, robust chi-square test was applied to modified simultaneous estimation of gross error (MSEGE) strategy to detect and identify multiple gross errors. Simulation results show that the proposed robust test can reduce the possibility of type Ⅱ errors effectively. Adding robust chi-square test into MSEGE does not obviously improve the power of multiple gross error identification, the proposed approach considers the influence of outliers on hypothesis statistic test and is more reasonable.展开更多
Banana is one of the main economic agrotypes in Zhangzhou, Fujian Province. The multitemporal ENVlSAT ASAR data with different polarization are used to classify the banana fields in this paper. Principal component ana...Banana is one of the main economic agrotypes in Zhangzhou, Fujian Province. The multitemporal ENVlSAT ASAR data with different polarization are used to classify the banana fields in this paper. Principal component analysis (PCA) was applied for six pairs of ASAR dual-polarization data. For its large leaves, banana has high backscatter. So the value of banana fields is high and shows very bright in the 1st component, which makes it much easier for banana fields extraction. Dual-polarization data provide more information, and the W and VH backscatter of banana show different characters with other land covers. Based on the analysis of the radar signature of banana fields and other land covers and the 1st compo- nent, banana fields are classified using object-oriented classifier. Compared to the field survey data and ASTER data, the accuracy of banana fields in the study area is 83.5%. It shows that the principal component analysis provides the useful information in SAR images analysis and makes the extraction of banana fields easier.展开更多
基金Supported by National Natural Science Foundation of China(81603251)Key Research and Development Plan of Shanxi Province(201603D3113021)Project of Collaborative Innovation Center for the Comprehensive Development and Utilization of Medicinal Herbs in Shanxi Province(2017-JYXT-05)
文摘[Objectives] This study aimed to establish HPLC fingerprint and conduct cluster analysis and principle component analysis for Citri Reticulatae Pericarpium Viride. [Methods] Using the HPLC method, the determination was performed on XSelect~® HSS T3-C_(18) column with mobile phase of acetonitrile-0.5% acetic acid solution(gradient elution) at the flow rate of 1.0 mL/min. The detection wavelength was 360 nm. The column temperature was 25℃. The sample size was 10 μL. With peak of hesperidin as the reference, HPLC fingerprints of 10 batches of Citri Reticulatae Pericarpium Viride were determined. The similarity of the 10 batches of samples was evaluated by Similarity Evaluation System for Chromatographic Fingerprint of TCM(2012 edition) to determine the common peaks. Cluster analysis and principal component analysis were performed by using SPSS 17.0 statistical software. [Results] The HPLC fingerprints of the 10 batches of medicinal materials had total 11 common peaks, and the similarity was 0.919-1.000, indicating that the chemical composition of the 10 batches of medicinal materials was consistent. There were 11 common components in the 10 batches of medicinal materials, but their contents were different. When the Euclidean distance was 20, the 10 batches of samples were divided into two categories, S4 in the first category, and the others in the second one. When the Euclidean distance was 5, the second category could be further divided into two sub-categories, S1 and S10 in one sub-category, and S2, S3, S5, S6, S7, S8 and S9 in the other one. The principle component analysis showed that cumulative contribution rate of the two main component factors was 92.797%, and the comprehensive score of S7 was the highest with the best quality. [Conclusions] The results of HPLC fingerprinting, cluster analysis and principle component analysis can provide reference for the quality control of Citri Reticulatae Pericarpium Viride.
文摘This paper aims to deepen the quality of life of people with celiac disease with a focus on compliance to the diet through Principle Component Analysis and Analyse des Données. In particular, we will try to understand whether these analyzes are also applicable in the context of research web2.0 carried out with web-survey.
基金supported by the National Scientific Equipment Development Project,"Deep Resource Exploration Core Equipment Research and Development"(Grant No.ZDYZ2012-1)06 Subproject,"Metal Mine Earthquake Detection System"and 05 Subject,"System Integration Field Test and Processing Software Development"
文摘Ground roll is an interference wave that severely degrades the signal-to-noise ratio of seismic data and affects its subsequent processing and interpretation.In this study,according to differences in morphological characteristics between ground roll and reflected waves,we use morphological component analysis based on two-dimensional dictionaries to separate ground roll and reflected waves.Because ground roll is characterized by lowfrequency,low-velocity,and dispersion,we select two-dimensional undecimated discrete wavelet transform as a sparse representation dictionary of ground roll.Because of a strong local correlation of the reflected wave,we select two-dimensional local discrete cosine transform as the sparse representation dictionary of reflected waves.A sparse representation model of seismic data is constructed based on a two-dimensional joint dictionary then a block coordinate relaxation algorithm is used to solve the model and decompose seismic record into reflected wave part and ground roll part.The good effects for the synthetic seismic data and application of real seismic data indicate that when using the model,strong-energy ground roll is considerably suppressed and the waveform of the reflected wave is effectively protected.
文摘With the development of power grid, as one of the key equipment, the transformer’s condition assessment method has always receive attention from experts, scholars concern more and more about the method’s practicality and reliability. In the traditional condition assessment method, due to the characteristics of the transformer’s complex structure, the assessment system is not comprehensive enough, or the assessment system is too complex, the indexes are not easy to quantify, such problems are emerging. The traditional method is complex and the degree of quantification is not enough. Therefore it is necessary to propose a condition assessment method that is easy to carry out the condition assessment work and does not affect the assessment results. In this paper, we propose a method to assess the state of the transformer’s complex structure. First, we establish a comprehensive assessment system, then apply the method of principal component analysis to optimize the index system, and then use the theory of cloud-matter-element. Finally the reliability and rationality of the method are verified by an example.
基金Supported by the National Natural Science Foundation of China (No. 60776795,60736043,60902031,and 60805012)the Research Fund for the Doctoral Program of Higher Education of China (No. 200807010004,20070701023)the Fundamental Research Funds for the Central Universities of China (No. JY10000902028)
文摘NonLocal Means(NLM),taking fully advantage of image redundancy,has been proved to be very effective in noise removal.However,high computational load limits its wide application.Based on Principle Component Analysis(PCA),Principle Neighborhood Dictionary(PND) was proposed to reduce the computational load of NLM.Nevertheless,as the principle components in PND method are computed directly from noisy image neighborhoods,they are prone to be inaccurate due to the presence of noise.In this paper,an improved scheme for image denoising is proposed.This scheme is based on PND and uses preprocessing via Gaussian filter to eliminate the influence of noise.PCA is then used to project those filtered image neighborhood vectors onto a lower-dimensional space.With the preproc-essing process,the principle components computed are more accurate resulting in an improved de-noising performance.A comparison with some NLM based and state-of-art denoising methods shows that the proposed method performs well in terms of Peak Signal to Noise Ratio(PSNR) as well as image visual fidelity.The experimental results demonstrate that our method outperforms existing methods both subjectively and objectively.
基金supported by the Medical Scientific Research Foundation of Guangdong Province,China(B2009043)
文摘Liquid state methanol and ethanol under different temperatures have been investigated by FT-NIR(Fourier transform nearinfrared) spectroscopy,generalized two-dimensional(2D) correlation spectroscopy,and PCA(principal component analysis) . First,the FT-NIR spectra were measured over a temperature range of 30-64(or 30-71) °C,and then the 2D correlation spectra were computed.Combining near-infrared spectroscopy,generalized 2D correlation spectroscopy,and references,we analyzed the molecular structures(especially the hydrogen bond) of methanol and ethanol,and performed the NIR band assignments. The PCA method was employed to verify the results of the 2D analysis.This study will be helpful to the understanding of these reagents.
基金the National Natural Science Foundation of China(No.61572033)the Natural Science Foundation of Education Department of Anhui Province of China(No.KJ2015ZD08)the Higher Education Promotion Plan of Anhui Province of China(No.TSKJ2015B14)
文摘Block principle and pattern classification component analysis (BPCA) is a recently developed technique in computer vision In this paper, we propose a robust and sparse BPCA with Lp-norm, referred to as BPCALp-S, which inherits the robustness of BPCA-L1 due to the employment of adjustable Lp-norm. In order to perform a sparse modelling, the elastic net is integrated into the objective function. An iterative algorithm which extracts feature vectors one by one greedily is elaborately designed. The monotonicity of the proposed iterative procedure is theoretically guaranteed. Experiments of image classification and reconstruction on several benchmark sets show the effectiveness of the proposed approach.
文摘This document presents an extension of the multiple factorial analysis to symbolic data and especially to space data. The analysis makes use of the characteristic coding method to obtain active individuals and the reconstitutive coding method for additional individuals in order to conserve the variability of assertion objects. Traditional analysis methods of the main components are applied to coded objects. Certain interpretation aids are presented after the coding process. This method was applied to poverty data.
基金funded in part by the National Natural Science Foundation of China (31402039,31472079,31372294)the Beijing Natural Science Foundation (6154032)+2 种基金the Species and Breed Resources Conservation of the Ministry of Agriculture of China (2017-2019)the Cattle Breeding Innovative Research Team of Chinese Academy of Agricultural Sciences (cxgc-ias-03)the National Beef Cattle Industrial Technology System (CARS-37)
文摘Traditionally, Chinese indigenous cattle is geographically widespread. The present study analyzed based on genome-wide variants to evaluate the genetic background among 157 individuals from four representative indigenous cattle breeds of Hubei Province of China: Yiling yellow cattle (YL), Bashan cattle (BS), Wuling cattle (WL), Zaobei cattle (ZB), and 21 indi- viduals of Qinchuan cattle (QC) from the nearby Shanxi Province of China. Linkage disequilibrium (LD) analysis showed the LD of YL was the lowest (~=0.32) when the distance between markers was approximately 2 kb. Principle component analysis (PCA), and neighbor-joining (NJ)-tree revealed a separation of Yiling yellow cattle from other geographic nearby local cattle breeds. In PCA plot, the YL and QC groups were segregated as expected; moreover, YL individuals clustered together more obviously. In the N J-tree, the YL group formed an independent branch and BS, WL, ZB groups were mixed. We then used the FST statistic approach to reveal long-term selection sweep of YL and other 4 cattle breeds. According to the selective sweep, we identified the unique pathways of YL, associated with production traits. Based on the results, it can be proposed that YL has its unique genetic characteristics of excellence resource, and it is an indispensable cattle breed in China.
基金supported in part by the grants to Kirill Larin from NIH 1R01EY022362,1R01HL120140,U54HG006348,and DOD PRJ71Tsupported by grants to Wei-Chuan Shih from NSF CAREER Award (CBET-1151154)+1 种基金NASA Early Career Faculty Grant (NNX12AQ44G)Gulf of Mexico Research Initiative (GoMRI-030).
文摘Optical coherence tomography(OCT)provides significant advantages of high resolution(approaching the histopathology level)realtime imaging of tsess without use of contrast agents.Based on these advantages,the microstructural features of tumors can be visualized and detected intra-operatively.However,it is still not clinically accepted for tumor margin delin-eation due to poor specificity and accuracy.In contrast,Raman spectroscopy(RS)can obtain tissue information at the molecular level,but does not provide real-time inaging capability.Therefore,combining OCT and RS could provide synergy.To this end,we present a tissue analysis and dassification method using both the slope of OCT intensity signal Vs depth and the principle components from the RS spectrum as the indicators for tissuse characterization.The goal of this study was to understand prediction accuracy of OCT and combined OCT/RS method for dassification of optically similar tisues and organs.Our pilot experiments were performed on mouse kidneys,livers,and small intestines(SIs).The prediction accuracy with five-fold cross validation of the method has been evaluated by the support vector machine(SVM)method.The results demonstrate that tissue characterization based on the OCT/RS method was superior compared to using OCT structural information alone.This combined OCT/RS method is potentially useful as a noninvasive optical biopsy technique for rapid and automatic tissue characterization during surgery.
基金This work was financially supported by the National Natural Science Foundation of China(No.31870568)Shandong Provincial Natural Science Foundation for Distinguished Young Scholars(China)(No.ZR2019JQ10)+1 种基金the Major Program of the Shandong Province Natural Science Foundation(No.ZR2018ZB0208)the"Transformational Technologies for Clean Energy and Demonstration"Strategic Priority Research Program of the Chinese Academy of Sciences(No.XDA21060201).
文摘In this study,multivariate analysis methods,including a principal component analysis(PCA)and partial least square(PLS)analysis,were applied to reveal the inner relationship of the key variables in the process of H_(2)O_(2)-assisted Na_(2)CO_(3)(HSC)pretreatment of corn stover.A total of 120 pretreatment experiments were implemented at the lab scale under different conditions by varying the particle size of the corn stover and process variables.The results showed that the Na_(2)CO_(3) dosage and pretreatment temperature had a strong influence on lignin removal,whereas pulp refining instrument(PFI)refining and Na_(2)CO_(3) dosage played positive roles in the final total sugar yield.Furthermore,it was found that pretreatment conditions had a more significant impact on the amelioration of pretreatment effectiveness compared with the properties of raw corn stover.In addition,a prediction of the effectiveness of the corn stover HSC pretreatment based on a PLS analysis was conducted for the first time,and the test results of the predictability based on additional pretreatment experiments proved that the developed PLS model achieved a good predictive performance(particularly for the final total sugar yield),indicating that the developed PLS model can be used to predict the effectiveness of HSC pretreatment.Therefore,multivariate analysis can be potentially used to monitor and control the pretreatment process in future large-scale biorefinery applications.
基金National Key Technology R&D Program,Ministry of Science and Technology(2016YFD0100306,2016YFD0100203)National Natural Science Foundation of China(grants 31671746).
文摘Background:Stem hardness is one of the major influencing factors for plant architecture in upland cotton(Gossypium hirsutum L.).Evaluating hardness phenotypic traits is very important for the selection of elite lines for resistance to lodging in Gossypium hirsutum L.Cotton breeders are interested in using diverse genotypes to enhance fiber quality and high-yield.Few pieces of research for hardness and its relationship with fiber quality and yield were found.This study was designed to find the relationship of stem hardness traits with fiber quality and yield contributing traits of upland cotton.Results:Experiments were carried out to measure the bending,acupuncture,and compression properties of the stem from a collection of upland cotton genotypes,comprising 237 accessions.The results showed that the genotypic difference in stem hardness was highly significant among the genotypes,and the stem hardness traits(BL,BU,AL,AU,CL,and CU)have a positive association with fiber quality traits and yield-related traits.Statistical analyses of the results showed that in descriptive statistics result bending(BL,BU)has a maximum coefficient of variance,but fiber length and fiber strength have less coefficient of variance among the genotypes.Principal component analysis(PCA)trimmed quantitative characters into nine principal components.The first nine principal components(PC)with Eigenvalues>1 explained 86%of the variation among 237 accessions of cotton.Both 2017 and 2018,PCA results indicated that BL,BU,FL,FE,and LI contributed to their variability in PC1,and BU,AU,CU,FD,LP,and FWPB have shown their variability in PC2.Conclusion:We describe here the systematic study of the mechanism involved in the regulation of enhancing fiber quality and yield by stem bending strength,acupuncture,and compression properties of G.hirsutum.
文摘Brain arteriovenous malformation(BAVM) is frequently described as vascular malformation. Although computer tomography(CT), magnetic resonance imaging(MRI) and angiography can clearly detect lesions, there are no diagnostic biological markers of BAVM available. Current study demonstrated that micro RNA(mi RNA)showed a feasible marker for vascular disease. To find key correlations between these mi RNAs and the onset of BAVM, we carried out chip analysis of serum mi RNAs by identifying 18 potential markers of BAVM. We then constructed a principle component analysis and logistic regression(PCA-LR) model to analyze the 18 mi RNAs collected from 77 patients. Another 9 independent samples were used to test the resulting model. The results showed that mi RNAs hsa-mir-126-3p and hsa-mir-140 are important protective factors, while hsa-mir-338 is a dominating risk factor, all of which have stronger correlation with BAVM than others. We also compared the testing results using PCA-LR model with those using LR model. The comparison revealed that PCA-LR model is better in predicting the disease.
基金Supported by Natural Science Foundation for Fundamental Research in Shaanxi Province(2011JM1019)
文摘Based on 8 indicators of economic development and according to the economic situation of cities in Shaanxi Province,this paper analyzes the data using PCA( Principle Component Analysis) and FA( Factor Analysis) with the help of R software.Using the test of Kendal's W coefficient of concordance,the consistency of results from both methods is tested.Finally,Cluster Analysis is used to classify the results and some advices are proposed for the development of cites in Shaanxi Province.
基金Supported by the National Natural Science Foundation of China (No. 50877004)
文摘Serial Analysis of Gene Expression (SAGE) is a powerful tool to analyze whole-genome expression profiles. SAGE data, characterized by large quantity and high dimensions, need reducing their dimensions and extract feature to improve the accuracy and efficiency when they are used for pattern recognition and clustering analysis. A Poisson Model-based Kernel (PMK) was proposed based on the Poisson distribution of the SAGE data. Kernel Principle Component Analysis (KPCA) with PMK was proposed and used in feature-extract analysis of mouse retinal SAGE data. The computa-tional results show that this algorithm can extract feature effectively and reduce dimensions of SAGE data.
文摘In this paper, we apply clustering analysis of data mining into power system. We adapt K-means clustering algorithm to analyze customer load, analyzing similar behavior between customer of electricity, and we adapt principal component analysis to get the clustering result visible, Simulation and analysis using matlab, and this well verify cluster rationality. The conclusion of this paper can provide important basis to the peak for the power system, stable operation the power system security.
文摘Online demand prediction plays an important role in transport network services from operations,controls to management,and information provision.However,the online prediction models are impacted by streaming data quality issues with noise measurements and missing data.To address these,we develop a robust prediction method for online network-level demand prediction in public transport.It consists of a PCA method to extract eigen demand images and an optimization-based pattern recognition model to predict the weights of eigen demand images by making use of the partially observed real-time data up to the prediction time in a day.The prediction model is robust to data quality issues given that the eigen demand images are stable and the predicted weights of them are optimized using the network level data(less impacted by local data quality issues).In the case study,we validate the accuracy and transferability of the model by comparing it with benchmark models and evaluate the robustness in tolerating data quality issues of the proposed model.The experimental results demonstrate that the proposed Pattern Recognition Prediction based on PCA(PRP-PCA)consistently outperforms other benchmark models in accuracy and transferability.Moreover,the model shows high robustness in accommodating data quality issues.For example,the PRP-PCA model is robust to missing data up to 50%regardless of the noise level.We also discuss the hidden patterns behind the network level demand.The visualization analysis shows that eigen demand images are significantly connected to the network structure and station activity variabilities.Though the demand changes dramatically before and after the pandemic,the eigen demand images are consistent over time in Stockholm.
基金The National Natural Science Foundation of China(No 60504033)
文摘Principle component analysis (PCA) based chi-square test is more sensitive to subtle gross errors and has greater power to correctly detect gross errors than classical chi-square test. However, classical principal com- ponent test (PCT) is non-robust and can be very sensitive to one or more outliers. In this paper, a Huber function liked robust weight factor was added in the collective chi-square test to eliminate the influence of gross errors on the PCT. Meanwhile, robust chi-square test was applied to modified simultaneous estimation of gross error (MSEGE) strategy to detect and identify multiple gross errors. Simulation results show that the proposed robust test can reduce the possibility of type Ⅱ errors effectively. Adding robust chi-square test into MSEGE does not obviously improve the power of multiple gross error identification, the proposed approach considers the influence of outliers on hypothesis statistic test and is more reasonable.
基金Supported by the Program for New Century Excellent Talents in University (NCET-05-0573)Fujian Science and Technology Project (No2006I0018)the Science Project of the Education Department of Fujian Province(No 2006F5022)
文摘Banana is one of the main economic agrotypes in Zhangzhou, Fujian Province. The multitemporal ENVlSAT ASAR data with different polarization are used to classify the banana fields in this paper. Principal component analysis (PCA) was applied for six pairs of ASAR dual-polarization data. For its large leaves, banana has high backscatter. So the value of banana fields is high and shows very bright in the 1st component, which makes it much easier for banana fields extraction. Dual-polarization data provide more information, and the W and VH backscatter of banana show different characters with other land covers. Based on the analysis of the radar signature of banana fields and other land covers and the 1st compo- nent, banana fields are classified using object-oriented classifier. Compared to the field survey data and ASTER data, the accuracy of banana fields in the study area is 83.5%. It shows that the principal component analysis provides the useful information in SAR images analysis and makes the extraction of banana fields easier.