A method for identification of pulsations in time series of magnetic field data which are simultaneously present in multiple channels of data at one or more sensor locations is described. Candidate pulsations of inter...A method for identification of pulsations in time series of magnetic field data which are simultaneously present in multiple channels of data at one or more sensor locations is described. Candidate pulsations of interest are first identified in geomagnetic time series by inspection. Time series of these "training events" are represented in matrix form and transpose-multiplied to generate time- domain covariance matrices. The ranked eigenvectors of this matrix are stored as a feature of the pulsation. In the second stage of the algorithm, a sliding window (approxi- mately the width of the training event) is moved across the vector-valued time-series comprising the channels on which the training event was observed. At each window position, the data covariance matrix and associated eigen- vectors are calculated. We compare the orientation of the dominant eigenvectors of the training data to those from the windowed data and flag windows where the dominant eigenvectors directions are similar. This was successful in automatically identifying pulses which share polarization and appear to be from the same source process. We apply the method to a case study of continuously sampled (50 Hz) data from six observatories, each equipped with three- component induction coil magnetometers. We examine a 90-day interval of data associated with a cluster of four observatories located within 50 km of Napa, California, together with two remote reference stations-one 100 km to the north of the cluster and the other 350 km south. When the training data contains signals present in the remote reference observatories, we are reliably able to identify and extract global geomagnetic signals such as solar-generated noise. When training data contains pulsations only observed in the cluster of local observatories, we identify several types of non-plane wave signals having similar polarization.展开更多
Financial time series forecasting could be beneficial for individual as well as institutional investors. But, the high noise and complexity residing in the financial data make this job extremely challenging. Over the ...Financial time series forecasting could be beneficial for individual as well as institutional investors. But, the high noise and complexity residing in the financial data make this job extremely challenging. Over the years, many researchers have used support vector regression (SVR) quite successfully to conquer this challenge. In this paper, an SVR based forecasting model is proposed which first uses the principal component analysis (PCA) to extract the low-dimensional and efficient feature information, and then uses the independent component analysis (ICA) to preprocess the extracted features to nullify the influence of noise in the features. Experiments were carried out based on 16 years’ historical data of three prominent stocks from three different sectors listed in Dhaka Stock Exchange (DSE), Bangladesh. The predictions were made for 1 to 4 days in advance targeting the short term prediction. For comparison, the integration of PCA with SVR (PCA-SVR), ICA with SVR (ICA-SVR) and single SVR approaches were applied to evaluate the prediction accuracy of the proposed approach. Experimental results show that the proposed model (PCA-ICA-SVR) outperforms the PCA-SVR, ICA-SVR and single SVR methods.展开更多
In order to directly construct the mapping between multiple state parameters and remaining useful life(RUL),and reduce the interference of random error on prediction accuracy,a RUL prediction model of aeroengine based...In order to directly construct the mapping between multiple state parameters and remaining useful life(RUL),and reduce the interference of random error on prediction accuracy,a RUL prediction model of aeroengine based on principal component analysis(PCA)and one-dimensional convolution neural network(1D-CNN)is proposed in this paper.Firstly,multiple state parameters corresponding to massive cycles of aeroengine are collected and brought into PCA for dimensionality reduction,and principal components are extracted for further time series prediction.Secondly,the 1D-CNN model is constructed to directly study the mapping between principal components and RUL.Multiple convolution and pooling operations are applied for deep feature extraction,and the end-to-end RUL prediction of aeroengine can be realized.Experimental results show that the most effective principal component from the multiple state parameters can be obtained by PCA,and the long time series of multiple state parameters can be directly mapped to RUL by 1D-CNN,so as to improve the efficiency and accuracy of RUL prediction.Compared with other traditional models,the proposed method also has lower prediction error and better robustness.展开更多
On the basis of machine leaning,suitable algorithms can make advanced time series analysis.This paper proposes a complex k-nearest neighbor(KNN)model for predicting financial time series.This model uses a complex feat...On the basis of machine leaning,suitable algorithms can make advanced time series analysis.This paper proposes a complex k-nearest neighbor(KNN)model for predicting financial time series.This model uses a complex feature extraction process integrating a forward rolling empirical mode decomposition(EMD)for financial time series signal analysis and principal component analysis(PCA)for the dimension reduction.The information-rich features are extracted then input to a weighted KNN classifier where the features are weighted with PCA loading.Finally,prediction is generated via regression on the selected nearest neighbors.The structure of the model as a whole is original.The test results on real historical data sets confirm the effectiveness of the models for predicting the Chinese stock index,an individual stock,and the EUR/USD exchange rate.展开更多
To extract features of fabric defects effectively and reduce dimension of feature space,a feature extraction method of fabric defects based on complex contourlet transform (CCT) and principal component analysis (PC...To extract features of fabric defects effectively and reduce dimension of feature space,a feature extraction method of fabric defects based on complex contourlet transform (CCT) and principal component analysis (PCA) is proposed.Firstly,training samples of fabric defect images are decomposed by CCT.Secondly,PCA is applied in the obtained low-frequency component and part of highfrequency components to get a lower dimensional feature space.Finally,components of testing samples obtained by CCT are projected onto the feature space where different types of fabric defects are distinguished by the minimum Euclidean distance method.A large number of experimental results show that,compared with PCA,the method combining wavdet low-frequency component with PCA (WLPCA),the method combining contourlet transform with PCA (CPCA),and the method combining wavelet low-frequency and highfrequency components with PCA (WPCA),the proposed method can extract features of common fabric defect types effectively.The recognition rate is greatly improved while the dimension is reduced.展开更多
Currently, some fault prognosis technology occasionally has relatively unsatisfied performance especially for in- cipient faults in nonlinear processes duo to their large time delay and complex internal connection. To...Currently, some fault prognosis technology occasionally has relatively unsatisfied performance especially for in- cipient faults in nonlinear processes duo to their large time delay and complex internal connection. To overcome this deficiency, multivariate time delay analysis is incorporated into the high sensitive local kernel principal component analysis. In this approach, mutual information estimation and Bayesian information criterion (BIC) are separately used to acquire the correlation degree and time delay of the process variables. Moreover, in order to achieve prediction, time series prediction by back propagation (BP) network is applied whose input is multivar- iate correlated time series other than the original time series. Then the multivariate time delayed series and future values obtained by time series prediction are combined to construct the input of local kernel principal component analysis (LKPCA) model for incipient fault prognosis. The new method has been exemplified in a sim- ple nonlinear process and the complicated Tennessee Eastman (TE) benchmark process. The results indicate that the new method has suoerioritv in the fault prognosis sensitivity over other traditional fault prognosis methods.展开更多
In this paper we propose an approach of prin-cipal component cluster analysis based on Lyapunov expo-nent spectrum (LES) to analyze the ECG time series. Analy-sis results of 22 sample-files of ECG from the MIT-BIH da-...In this paper we propose an approach of prin-cipal component cluster analysis based on Lyapunov expo-nent spectrum (LES) to analyze the ECG time series. Analy-sis results of 22 sample-files of ECG from the MIT-BIH da-tabase confirmed the validity of our approach. Another technique named improved teacher selecting student (TSS) algorithm is presented to analyze unknown samples by means of some known ones, which is of better accuracy. This technique combines the advantages of both statistical and nonlinear dynamical methods and is shown to be significant to the analysis of nonlinear ECG time series.展开更多
Ti-6A1-4V has a wide range of applications, especially in the aerospace field;however, it is a difficultto- cut material. In order to achieve sustainable machining of Ti?6A1-4V, multiple objectives considering not onl...Ti-6A1-4V has a wide range of applications, especially in the aerospace field;however, it is a difficultto- cut material. In order to achieve sustainable machining of Ti?6A1-4V, multiple objectives considering not only economic and technical requirements but also the environmental requirement need to be optimized simultaneously. In this work, the optimization design of process parameters such as type of inserts, feed rate, and depth of cut for Ti-6A1-4V turning under dry condition was investigated experimentally. The major performance indexes chosen to evaluate this sustainable process were radial thrust, cutting power, and coefficient of friction at the toolchip interface. Considering the nonlinearity between the various objectives, grey relational analysis (GRA) was first performed to transform these indexes into the corresponding grey relational coefficients, and then kernel principal component analysis (KPCA) was applied to extract the kernel principal components and determine the corresponding weights which showed their relative importance. Eventually, kernel grey relational grade (KGRG) was proposed as the optimization criterion to identify the optimal combination of process parameters. The results of the range analysis show that the depth of cut has the most significant effect, followed by the feed rate and type of inserts. Confirmation tests clearly show that the modified method combining GRA with KPCA outperforms the traditional GRA method with equal weights and the hybrid method based on GRA and PCA.展开更多
目的比较江西特色炮制技术对升麻化学成分的影响,筛选优质饮片品种。方法采用超高效液相色谱-四极杆-飞行时间串联质谱(ultra performance liquid chromatography-quadrupole-time of flight tandem mass spectrometry,UPLC-Q-TOF-MS)技...目的比较江西特色炮制技术对升麻化学成分的影响,筛选优质饮片品种。方法采用超高效液相色谱-四极杆-飞行时间串联质谱(ultra performance liquid chromatography-quadrupole-time of flight tandem mass spectrometry,UPLC-Q-TOF-MS)技术,在正、负离子模式下分析升麻不同炮制品的化学成分,通过对照品、相对分子质量、质谱裂解规律和文献信息进行鉴定。利用SIMCA-P13.0软件建立升麻各炮制品主成分分析(principal component analysis,PCA)和偏最小二乘法-判别分析(partial least squares discriminant analysis,PLS-DA)模型,获取PCA得分图、PLA-DA得分图和变量重要性投影(variable importance plot,VIP)值,筛选造成升麻炮制前后主要差异的物质基础。利用MetaboAnatyst网页绘图工具,制作得到热图,可更直观地观察升麻化学成分经炮制后的变化趋势。结果鉴定出71个化学成分,PCA显示经不同方法炮制后升麻组间差异性大,PLS-DA筛选出VIP值>1的33个化学成分作为炮制前后差异性的主要化学标记物。其中生品和蜜炙升麻中三萜类含量较高,蜜麸、蜜糠炒升麻中酚酸类物质含量较高,蜜麸升麻中阿魏酸含量较高。结论酚酸类和三萜皂苷类是区分升麻不同炮制品最重要的化合物类别,为江西特色升麻饮片的药效物质基础及优势品种研究提供了依据。展开更多
With the continuous development of urbanization in China,the country’s growing population brings great challenges to urban development.By mastering the refined population spatial distribution in administrative units,...With the continuous development of urbanization in China,the country’s growing population brings great challenges to urban development.By mastering the refined population spatial distribution in administrative units,the quantity and agglomeration of population distribution can be estimated and visualized.It will provide a basis for a more rational urban planning.This paper takes Beijing as the research area and uses a new Luojia1-01 nighttime light image with high resolution,land use type data,Points of Interest(POI)data,and other data to construct the population spatial index system,establishing the index weight based on the principal component analysis.The comprehensive weight value of population distribution in the study area was then used to calculate the street population distribution of Beijing in 2018.Then the population spatial distribution was visualize using GIS technology.After accuracy assessments by comparing the result with the WorldPop data,the accuracy has reached 0.74.The proposed method was validated as a qualified method to generate population spatial maps.By contrast of local areas,Luojia 1-01 data is more suitable for population distribution estimation than the NPP/VIIRS(Net Primary Productivity/Visible infrared Imaging Radiometer)nighttime light data.More geospatial big data and mathematical models can be combined to create more accurate population maps in the future.展开更多
Adapting daily meteorological data provided by China International Exchange Station, and using principal component analysis of meteorological index for dimension reduction comprehensive, the regression analysis model ...Adapting daily meteorological data provided by China International Exchange Station, and using principal component analysis of meteorological index for dimension reduction comprehensive, the regression analysis model between PM2.5 and comprehensive index is established, by making use of Eviews time series modeling of the comprehensive principal component, finally puts forward opinions and suggestions aim at the regression analysis results of using artificial rainfall to ease haze.展开更多
文摘A method for identification of pulsations in time series of magnetic field data which are simultaneously present in multiple channels of data at one or more sensor locations is described. Candidate pulsations of interest are first identified in geomagnetic time series by inspection. Time series of these "training events" are represented in matrix form and transpose-multiplied to generate time- domain covariance matrices. The ranked eigenvectors of this matrix are stored as a feature of the pulsation. In the second stage of the algorithm, a sliding window (approxi- mately the width of the training event) is moved across the vector-valued time-series comprising the channels on which the training event was observed. At each window position, the data covariance matrix and associated eigen- vectors are calculated. We compare the orientation of the dominant eigenvectors of the training data to those from the windowed data and flag windows where the dominant eigenvectors directions are similar. This was successful in automatically identifying pulses which share polarization and appear to be from the same source process. We apply the method to a case study of continuously sampled (50 Hz) data from six observatories, each equipped with three- component induction coil magnetometers. We examine a 90-day interval of data associated with a cluster of four observatories located within 50 km of Napa, California, together with two remote reference stations-one 100 km to the north of the cluster and the other 350 km south. When the training data contains signals present in the remote reference observatories, we are reliably able to identify and extract global geomagnetic signals such as solar-generated noise. When training data contains pulsations only observed in the cluster of local observatories, we identify several types of non-plane wave signals having similar polarization.
文摘Financial time series forecasting could be beneficial for individual as well as institutional investors. But, the high noise and complexity residing in the financial data make this job extremely challenging. Over the years, many researchers have used support vector regression (SVR) quite successfully to conquer this challenge. In this paper, an SVR based forecasting model is proposed which first uses the principal component analysis (PCA) to extract the low-dimensional and efficient feature information, and then uses the independent component analysis (ICA) to preprocess the extracted features to nullify the influence of noise in the features. Experiments were carried out based on 16 years’ historical data of three prominent stocks from three different sectors listed in Dhaka Stock Exchange (DSE), Bangladesh. The predictions were made for 1 to 4 days in advance targeting the short term prediction. For comparison, the integration of PCA with SVR (PCA-SVR), ICA with SVR (ICA-SVR) and single SVR approaches were applied to evaluate the prediction accuracy of the proposed approach. Experimental results show that the proposed model (PCA-ICA-SVR) outperforms the PCA-SVR, ICA-SVR and single SVR methods.
基金supported by Jiangsu Social Science Foundation(No.20GLD008)Science,Technology Projects of Jiangsu Provincial Department of Communications(No.2020Y14)Joint Fund for Civil Aviation Research(No.U1933202)。
文摘In order to directly construct the mapping between multiple state parameters and remaining useful life(RUL),and reduce the interference of random error on prediction accuracy,a RUL prediction model of aeroengine based on principal component analysis(PCA)and one-dimensional convolution neural network(1D-CNN)is proposed in this paper.Firstly,multiple state parameters corresponding to massive cycles of aeroengine are collected and brought into PCA for dimensionality reduction,and principal components are extracted for further time series prediction.Secondly,the 1D-CNN model is constructed to directly study the mapping between principal components and RUL.Multiple convolution and pooling operations are applied for deep feature extraction,and the end-to-end RUL prediction of aeroengine can be realized.Experimental results show that the most effective principal component from the multiple state parameters can be obtained by PCA,and the long time series of multiple state parameters can be directly mapped to RUL by 1D-CNN,so as to improve the efficiency and accuracy of RUL prediction.Compared with other traditional models,the proposed method also has lower prediction error and better robustness.
基金supported by the Social Science Foundation of China under Grant No.17BGL231。
文摘On the basis of machine leaning,suitable algorithms can make advanced time series analysis.This paper proposes a complex k-nearest neighbor(KNN)model for predicting financial time series.This model uses a complex feature extraction process integrating a forward rolling empirical mode decomposition(EMD)for financial time series signal analysis and principal component analysis(PCA)for the dimension reduction.The information-rich features are extracted then input to a weighted KNN classifier where the features are weighted with PCA loading.Finally,prediction is generated via regression on the selected nearest neighbors.The structure of the model as a whole is original.The test results on real historical data sets confirm the effectiveness of the models for predicting the Chinese stock index,an individual stock,and the EUR/USD exchange rate.
基金National Natural Science Foundation of China(No.60872065)the Key Laboratory of Textile Science&Technology,Ministry of Education,China(No.P1111)+1 种基金the Key Laboratory of Advanced Textile Materials and Manufacturing Technology,Ministry of Education,China(No.2010001)the Priority Academic Program Development of Jiangsu Higher Education Institution,China
文摘To extract features of fabric defects effectively and reduce dimension of feature space,a feature extraction method of fabric defects based on complex contourlet transform (CCT) and principal component analysis (PCA) is proposed.Firstly,training samples of fabric defect images are decomposed by CCT.Secondly,PCA is applied in the obtained low-frequency component and part of highfrequency components to get a lower dimensional feature space.Finally,components of testing samples obtained by CCT are projected onto the feature space where different types of fabric defects are distinguished by the minimum Euclidean distance method.A large number of experimental results show that,compared with PCA,the method combining wavdet low-frequency component with PCA (WLPCA),the method combining contourlet transform with PCA (CPCA),and the method combining wavelet low-frequency and highfrequency components with PCA (WPCA),the proposed method can extract features of common fabric defect types effectively.The recognition rate is greatly improved while the dimension is reduced.
基金Supported by the National Natural Science Foundation of China(61573051,61472021)the Natural Science Foundation of Beijing(4142039)+1 种基金Open Fund of the State Key Laboratory of Software Development Environment(SKLSDE-2015KF-01)Fundamental Research Funds for the Central Universities(PT1613-05)
文摘Currently, some fault prognosis technology occasionally has relatively unsatisfied performance especially for in- cipient faults in nonlinear processes duo to their large time delay and complex internal connection. To overcome this deficiency, multivariate time delay analysis is incorporated into the high sensitive local kernel principal component analysis. In this approach, mutual information estimation and Bayesian information criterion (BIC) are separately used to acquire the correlation degree and time delay of the process variables. Moreover, in order to achieve prediction, time series prediction by back propagation (BP) network is applied whose input is multivar- iate correlated time series other than the original time series. Then the multivariate time delayed series and future values obtained by time series prediction are combined to construct the input of local kernel principal component analysis (LKPCA) model for incipient fault prognosis. The new method has been exemplified in a sim- ple nonlinear process and the complicated Tennessee Eastman (TE) benchmark process. The results indicate that the new method has suoerioritv in the fault prognosis sensitivity over other traditional fault prognosis methods.
文摘In this paper we propose an approach of prin-cipal component cluster analysis based on Lyapunov expo-nent spectrum (LES) to analyze the ECG time series. Analy-sis results of 22 sample-files of ECG from the MIT-BIH da-tabase confirmed the validity of our approach. Another technique named improved teacher selecting student (TSS) algorithm is presented to analyze unknown samples by means of some known ones, which is of better accuracy. This technique combines the advantages of both statistical and nonlinear dynamical methods and is shown to be significant to the analysis of nonlinear ECG time series.
文摘Ti-6A1-4V has a wide range of applications, especially in the aerospace field;however, it is a difficultto- cut material. In order to achieve sustainable machining of Ti?6A1-4V, multiple objectives considering not only economic and technical requirements but also the environmental requirement need to be optimized simultaneously. In this work, the optimization design of process parameters such as type of inserts, feed rate, and depth of cut for Ti-6A1-4V turning under dry condition was investigated experimentally. The major performance indexes chosen to evaluate this sustainable process were radial thrust, cutting power, and coefficient of friction at the toolchip interface. Considering the nonlinearity between the various objectives, grey relational analysis (GRA) was first performed to transform these indexes into the corresponding grey relational coefficients, and then kernel principal component analysis (KPCA) was applied to extract the kernel principal components and determine the corresponding weights which showed their relative importance. Eventually, kernel grey relational grade (KGRG) was proposed as the optimization criterion to identify the optimal combination of process parameters. The results of the range analysis show that the depth of cut has the most significant effect, followed by the feed rate and type of inserts. Confirmation tests clearly show that the modified method combining GRA with KPCA outperforms the traditional GRA method with equal weights and the hybrid method based on GRA and PCA.
文摘目的比较江西特色炮制技术对升麻化学成分的影响,筛选优质饮片品种。方法采用超高效液相色谱-四极杆-飞行时间串联质谱(ultra performance liquid chromatography-quadrupole-time of flight tandem mass spectrometry,UPLC-Q-TOF-MS)技术,在正、负离子模式下分析升麻不同炮制品的化学成分,通过对照品、相对分子质量、质谱裂解规律和文献信息进行鉴定。利用SIMCA-P13.0软件建立升麻各炮制品主成分分析(principal component analysis,PCA)和偏最小二乘法-判别分析(partial least squares discriminant analysis,PLS-DA)模型,获取PCA得分图、PLA-DA得分图和变量重要性投影(variable importance plot,VIP)值,筛选造成升麻炮制前后主要差异的物质基础。利用MetaboAnatyst网页绘图工具,制作得到热图,可更直观地观察升麻化学成分经炮制后的变化趋势。结果鉴定出71个化学成分,PCA显示经不同方法炮制后升麻组间差异性大,PLS-DA筛选出VIP值>1的33个化学成分作为炮制前后差异性的主要化学标记物。其中生品和蜜炙升麻中三萜类含量较高,蜜麸、蜜糠炒升麻中酚酸类物质含量较高,蜜麸升麻中阿魏酸含量较高。结论酚酸类和三萜皂苷类是区分升麻不同炮制品最重要的化合物类别,为江西特色升麻饮片的药效物质基础及优势品种研究提供了依据。
基金Under the auspices of Natural Science Foundation of China(No.42071342,31870713)Beijing Natural Science Foundation Program(No.8182038)Fundamental Research Funds for the Central Universities(No.2015ZCQ-LX-01,2018ZY06)。
文摘With the continuous development of urbanization in China,the country’s growing population brings great challenges to urban development.By mastering the refined population spatial distribution in administrative units,the quantity and agglomeration of population distribution can be estimated and visualized.It will provide a basis for a more rational urban planning.This paper takes Beijing as the research area and uses a new Luojia1-01 nighttime light image with high resolution,land use type data,Points of Interest(POI)data,and other data to construct the population spatial index system,establishing the index weight based on the principal component analysis.The comprehensive weight value of population distribution in the study area was then used to calculate the street population distribution of Beijing in 2018.Then the population spatial distribution was visualize using GIS technology.After accuracy assessments by comparing the result with the WorldPop data,the accuracy has reached 0.74.The proposed method was validated as a qualified method to generate population spatial maps.By contrast of local areas,Luojia 1-01 data is more suitable for population distribution estimation than the NPP/VIIRS(Net Primary Productivity/Visible infrared Imaging Radiometer)nighttime light data.More geospatial big data and mathematical models can be combined to create more accurate population maps in the future.
文摘Adapting daily meteorological data provided by China International Exchange Station, and using principal component analysis of meteorological index for dimension reduction comprehensive, the regression analysis model between PM2.5 and comprehensive index is established, by making use of Eviews time series modeling of the comprehensive principal component, finally puts forward opinions and suggestions aim at the regression analysis results of using artificial rainfall to ease haze.
文摘比较不同乳酸菌发酵工艺对桑叶化学成分的影响,筛选优势菌种。采用超高效液相色谱-四极杆-飞行时间串联质谱检测桑叶中的化学成分,通过对照品、分子质量、质谱裂解规律和文献信息鉴定桑叶甲醇提取液的化学成分。利用SIMCA 14.1软件建立桑叶各发酵品的主成分分析(principal component analysis,PCA)模型和偏最小二乘判别分析(partial least squares-discriminant analysis,PLS-DA)模型,获取PCA得分图、PLS-DA载荷图和变量投影重要性(variable importance in projection,VIP)值,筛选桑叶发酵前后的差异物质。本实验鉴定出了41个化学成分,PCA结果提示经不同发酵工艺发酵后的桑叶组间差异性较大,PLS-DA筛选出VIP值>1的11个化学成分,可作为发酵前后差异性的化学标记物,分别为大波斯菊苷、异鼠李素-3-O-新橙皮苷、金丝桃苷、3,4-二咖啡酰奎宁酸、橙黄决明素、水杨酸、染料木苷、矢车菊素-3-O-葡萄糖苷、鸟苷、异麦角甾苷、毛蕊花糖苷;将其综合加权评分后发现鼠李糖乳杆菌发酵桑叶中有效成分含量最高。桑叶发酵前后化学成分含量发生显著变化,黄酮类成分是区分桑叶不同发酵品最重要的化合物类别,鼠李糖乳杆菌为桑叶的优势发酵菌种。