The large blast furnace is essential equipment in the process of iron and steel manufacturing. Due to the complex operation process and frequent fluctuations of variables, conventional monitoring methods often bring f...The large blast furnace is essential equipment in the process of iron and steel manufacturing. Due to the complex operation process and frequent fluctuations of variables, conventional monitoring methods often bring false alarms. To address the above problem, an ensemble of greedy dynamic principal component analysis-Gaussian mixture model(EGDPCA-GMM) is proposed in this paper. First, PCA-GMM is introduced to deal with the collinearity and the non-Gaussian distribution of blast furnace data.Second, in order to explain the dynamics of data, the greedy algorithm is used to determine the extended variables and their corresponding time lags, so as to avoid introducing unnecessary noise. Then the bagging ensemble is adopted to cooperate with greedy extension to eliminate the randomness brought by the greedy algorithm and further reduce the false alarm rate(FAR) of monitoring results. Finally, the algorithm is applied to the blast furnace of a large iron and steel group in South China to verify performance.Compared with the basic algorithms, the proposed method achieves lowest FAR, while keeping missed alarm rate(MAR) remain stable.展开更多
A novel approach named aligned mixture probabilistic principal component analysis(AMPPCA) is proposed in this study for fault detection of multimode chemical processes. In order to exploit within-mode correlations,the...A novel approach named aligned mixture probabilistic principal component analysis(AMPPCA) is proposed in this study for fault detection of multimode chemical processes. In order to exploit within-mode correlations,the AMPPCA algorithm first estimates a statistical description for each operating mode by applying mixture probabilistic principal component analysis(MPPCA). As a comparison, the combined MPPCA is employed where monitoring results are softly integrated according to posterior probabilities of the test sample in each local model. For exploiting the cross-mode correlations, which may be useful but are inadvertently neglected due to separately held monitoring approaches, a global monitoring model is constructed by aligning all local models together. In this way, both within-mode and cross-mode correlations are preserved in this integrated space. Finally, the utility and feasibility of AMPPCA are demonstrated through a non-isothermal continuous stirred tank reactor and the TE benchmark process.展开更多
For plant-wide processes with multiple operating conditions,the multimode feature imposes some challenges to conventional monitoring techniques.Hence,to solve this problem,this paper provides a novel local component b...For plant-wide processes with multiple operating conditions,the multimode feature imposes some challenges to conventional monitoring techniques.Hence,to solve this problem,this paper provides a novel local component based principal component analysis(LCPCA)approach for monitoring the status of a multimode process.In LCPCA,the process prior knowledge of mode division is not required and it purely based on the process data.Firstly,LCPCA divides the processes data into multiple local components using finite Gaussian mixture model mixture(FGMM).Then,calculating the posterior probability is applied to determine each sample belonging to which local component.After that,the local component information(such as mean and standard deviation)is used to standardize each sample of local component.Finally,the standardized samples of each local component are combined to train PCA monitoring model.Based on the PCA monitoring model,two monitoring statistics T^(2) and SPE are used for monitoring multimode processes.Through a numerical example and the Tennessee Eastman(TE)process,the monitoring result demonstrates that LCPCA outperformed conventional PCA and LNS-PCA in the fault detection rate.展开更多
A technology for unintended lane departure warning was proposed. As crucial information, lane boundaries were detected based on principal component analysis of grayscale distribution in search bars of given number and...A technology for unintended lane departure warning was proposed. As crucial information, lane boundaries were detected based on principal component analysis of grayscale distribution in search bars of given number and then each search bar was tracked using Kalman filter between frames. The lane detection performance was evaluated and demonstrated in ways of receiver operating characteristic, dice similarity coefficient and real-time performance. For lane departure detection, a lane departure risk evaluation model based on lasting time and frequency was effectively executed on the ARM-based platform. Experimental results indicate that the algorithm generates satisfactory lane detection results under different traffic and lighting conditions, and the proposed warning mechanism sends effective warning signals, avoiding most false warning.展开更多
Referring to GB5618-1995 about heavy metal pollution,and using statistical analysis SPSS,the major pollutants of mine area farmland heavy metal pollution were identified by variable clustering analysis.Assessment and ...Referring to GB5618-1995 about heavy metal pollution,and using statistical analysis SPSS,the major pollutants of mine area farmland heavy metal pollution were identified by variable clustering analysis.Assessment and classification were done to the mine area farmland heavy metal pollution situation by synthetic principal components analysis (PCA).The results show that variable clustering analysis is efficient to identify the principal components of mine area farmland heavy metal pollution.Sort and clustering were done to the synthetic principal components scores of soil sample,which is given by synthetic principal components analysis.Data structure of soil heavy metal contaminations relationships and pollution level of different soil samples are discovered.The results of mine area farmland heavy metal pollution quality assessed and classified with synthetic component scores reflect the influence of both the major and compound heavy metal pol- lutants.Identification and assessment results of mine area farmland heavy metal pollution can provide reference and guide to propose control measures of mine area farmland heavy metal pollution and focus on the key treatment region.展开更多
Ensemble-based analyses are useful to compare equiprobable scenarios of the reservoir models.However,they require a large suite of reservoir models to cover high uncertainty in heterogeneous and complex reservoir mode...Ensemble-based analyses are useful to compare equiprobable scenarios of the reservoir models.However,they require a large suite of reservoir models to cover high uncertainty in heterogeneous and complex reservoir models.For stable convergence in ensemble Kalman filter(EnKF),increasing ensemble size can be one of the solutions,but it causes high computational cost in large-scale reservoir systems.In this paper,we propose a preprocessing of good initial model selection to reduce the ensemble size,and then,EnKF is utilized to predict production performances stochastically.In the model selection scheme,representative models are chosen by using principal component analysis(PCA)and clustering analysis.The dimension of initial models is reduced using PCA,and the reduced models are grouped by clustering.Then,we choose and simulate representative models from the cluster groups to compare errors of production predictions with historical observation data.One representative model with the minimum error is considered as the best model,and we use the ensemble members near the best model in the cluster plane for applying EnKF.We demonstrate the proposed scheme for two 3D models that EnKF provides reliable assimilation results with much reduced computation time.展开更多
Glass is the precious material evidence of the trade of the early Silk Road. The ancient glass was easily affected by the environmental impact and weathering, and the change of composition ratios affected the correct ...Glass is the precious material evidence of the trade of the early Silk Road. The ancient glass was easily affected by the environmental impact and weathering, and the change of composition ratios affected the correct judgment of its category. In this paper, mathematical models and methods such as Chi-square test, weighted average method, principal component analysis, cluster analysis, binary classification model and grey correlation analysis were used comprehensively to analyze the data of sample glass products combined with their categories. The results showed that the weathered high-potassium glass could be divided into 12, 9, 10 and 27, 7, 22 and so on.展开更多
The accurate extraction and classification of leather defects is an important guarantee for the automation and quality evaluation of leather industry. Aiming at the problem of data classification of leather defects,a ...The accurate extraction and classification of leather defects is an important guarantee for the automation and quality evaluation of leather industry. Aiming at the problem of data classification of leather defects,a hierarchical classification for defects is proposed.Firstly,samples are collected according to the method of minimum rectangle,and defects are extracted by image processing method.According to the geometric features of representation, they are divided into dot,line and surface for rough classification. From analysing the data which extracting the defects of geometry,gray and texture,the dominating characteristics can be acquired. Each type of defect by choosing different and representative characteristics,reducing the dimension of the data,and through these characteristics of clustering to achieve convergence effectively,realize extracted accurately,and digitized the defect characteristics,eventually establish the database. The results showthat this method can achieve more than 90% accuracy and greatly improve the accuracy of classification.展开更多
Spatio-temporal assessment of the above ground biomass (AGB) is a cumbersome task due to the difficulties associated with the measurement of different tree parameters such as girth at breast height and height of tre...Spatio-temporal assessment of the above ground biomass (AGB) is a cumbersome task due to the difficulties associated with the measurement of different tree parameters such as girth at breast height and height of trees. The present research was conducted in the campus of Birla Institute of Technology, Mesra, Ranchi, India, which is predomi- nantly covered by Sal (Shorea robusta C. F. Gaertn). Two methods of regression analysis was employed to determine the potential of remote sensing parameters with the AGB measured in the field such as linear regression analysis between the AGB and the individual bands, principal components (PCs) of the bands, vegetation indices (VI), and the PCs of the VIs respectively and multiple linear regression (MLR) analysis be- tween the AGB and all the variables in each category of data. From the linear regression analysis, it was found that only the NDVI exhibited regression coefficient value above 0.80 with the remaining parameters showing very low values. On the other hand, the MLR based analysis revealed significantly improved results as evidenced by the occurrence of very high correlation coefficient values of greater than 0.90 determined between the computed AGB from the MLR equations and field-estimated AGB thereby ascertaining their superiority in providing reliable estimates of AGB. The highest correlation coefficient of 0.99 is found with the MLR involving PCs of VIs.展开更多
Principal component analysis(PCA)is employed to extract the principal components(PCs)present in nuclear mass models for the first time.The effects from different nuclear mass models are reintegrated and reorganized in...Principal component analysis(PCA)is employed to extract the principal components(PCs)present in nuclear mass models for the first time.The effects from different nuclear mass models are reintegrated and reorganized in the extracted PCs.These PCs are recombined to build new mass models,which achieve better accuracy than the original theoretical mass models.This comparison indicates that using the PCA approach,the effects contained in different mass models can be collaborated to improve nuclear mass predictions.展开更多
Motivated by the Bagging Partial Least Squares(Bagging PLS)and Principal Component Analysis(PCA)algorithms,a novel approach known as Principal Model Analysis(PMA)method is introduced in this paper.In the proposed PMA ...Motivated by the Bagging Partial Least Squares(Bagging PLS)and Principal Component Analysis(PCA)algorithms,a novel approach known as Principal Model Analysis(PMA)method is introduced in this paper.In the proposed PMA algorithm,the PCA and the Bagging PLS are combined.In this method,multiple PLS models are trained on sub-training sets,derived from the training set using the random sampling with replacement approach.The regression coefficients of all the sub-PLS models are fused in a joint regression coefficient matrix.The final projection direction is then estimated by performing the PCA on the joint regression coefficient matrix.Subsequently,the proposed PMA method is compared with other traditional dimension reduction methods,such as PLS,Bagging PLS,Linear discriminant analysis(LDA)and PLS-LDA.Experimental results on six public datasets demonstrate that our proposed method consistently outperforms other approaches in terms of classification performance and exhibits greater stability.Additionally,it is employed in the application of financial statement fraud identification.PMA and other five algorithms are utilized to financial statement fraud which concerned by the academic community,and the results indicate that the classification of PMA surpassed that of the other methods.展开更多
Forward radiative transfer(RT)models are essential for atmospheric applications such as remote sensing and weather and climate models,where computational efficiency becomes equally as important as accuracy for high-re...Forward radiative transfer(RT)models are essential for atmospheric applications such as remote sensing and weather and climate models,where computational efficiency becomes equally as important as accuracy for high-resolution hyperspectral measurements that need rigorous RT simulations for thousands of channels.This study introduces a fast and accurate RT model for the hyperspectral infrared(HIR)sounder based on principal component analysis(PCA)or machine learning(i.e.,neural network,NN).The Geosynchronous Interferometric Infrared Sounder(GIIRS),the first HIR sounder onboard the geostationary Fengyun-4 satellites,is considered to be a candidate example for model development and validation.Our method uses either PCA or NN(PCA/NN)twice for the atmospheric transmittance and radiance,respectively,to reduce the number of independent but similar simulations to accelerate RT simulations;thereby,it is referred to as a multi-domain compression model.The first PCA/NN gives monochromatic gas transmittance in both spectral and atmospheric pressure domains for each gas independently.The second PCA/NN is performed in the traditional spectral radiance domain.Meanwhile,a new method is introduced to choose representative variables for the PCA/NN scheme developments.The model is three orders of magnitude faster than the standard line-by-line-based simulations with averaged brightness temperature difference(BTD)less than 0.1 K,and the compressions based on PCA or NN methods result in comparable efficiency and accuracy.Our fast model not only avoids an excessively complicated transmittance scheme by using PCA/NN but is also highly flexible for hyperspectral instruments with similar spectral ranges simply by updating the corresponding spectral response functions.展开更多
A new algorithm for unsupervised hyperspectral data unmixing is investigated, which includes a modified minimum noise fraction (MNF) transformation and independent component analysis (ICA). The modified MNF transf...A new algorithm for unsupervised hyperspectral data unmixing is investigated, which includes a modified minimum noise fraction (MNF) transformation and independent component analysis (ICA). The modified MNF transformation is used to reduce noise and remove correlation between neighboring bands. Then the ICA is applied to unmix hyperspectral images, and independent endmembers are obtained from unmixed images by using post-processing which includes image segmentation based on statistical histograms and morphological operations. The experimental results demonstrate that this algorithm can identify endmembers resident in mixed pixels. Meanwhile, the results show the high computational efficiency of the modified MNF transformation. The time consumed by the modified method is almost one fifth of the traditional MNF transformation.展开更多
在茶园水资源管理中,蒸散量(Evapotranspiration,ET)是评估作物水分需求的关键指标,由于茶园蒸散量预测具有时序性、不稳定性以及非线性耦合等特点,目前的茶园蒸散量预测模型存在预测精度较低的问题,针对此问题本文提出了一种新型的茶...在茶园水资源管理中,蒸散量(Evapotranspiration,ET)是评估作物水分需求的关键指标,由于茶园蒸散量预测具有时序性、不稳定性以及非线性耦合等特点,目前的茶园蒸散量预测模型存在预测精度较低的问题,针对此问题本文提出了一种新型的茶园蒸散量预测模型。首先使用互信息算法(Mutual information,MI)与主成分分析算法(Principal component analysis,PCA)相融合的数据处理算法(MIPCA),筛选强相关的特征并提取主成分;其次将时域卷积网络(Temporal convolutional network,TCN)与Transformer融合,利用灰狼算法(Grey wolf optimization,GWO)优化超参数,捕捉茶园数据的全局依赖关系;最后整合2个网络构建了MIPCA-TCN-GWO-Transformer模型,通过消融试验和对比试验验证了模型性能,并对模型在不同时间步长下的性能进行测试。结果表明,该模型平均绝对百分比误差(Mean absolute percentage error,MAPE)、均方根误差(Root mean square error,RMSE)和决定系数(Coefficient of determination,R^(2))3个评价指标分别为0.015 mm/d、0.312 mm/d和0.962,优于长短期记忆模型(Long short term memory,LSTM)等传统预测模型。在小时尺度、日尺度和月尺度下的R^(2)分别为0.986、0.978和0.946,在不同时间步长下展现了良好的适应性和准确性。本文构建的MIPCA-TCN-GWO-Transformer模型具有较高的预测精度和稳定性,可为茶园水资源优化管理和灌溉制度制定提供科学参考。展开更多
基金supported by the National Natural Science Foundation of China (61903326, 61933015)。
文摘The large blast furnace is essential equipment in the process of iron and steel manufacturing. Due to the complex operation process and frequent fluctuations of variables, conventional monitoring methods often bring false alarms. To address the above problem, an ensemble of greedy dynamic principal component analysis-Gaussian mixture model(EGDPCA-GMM) is proposed in this paper. First, PCA-GMM is introduced to deal with the collinearity and the non-Gaussian distribution of blast furnace data.Second, in order to explain the dynamics of data, the greedy algorithm is used to determine the extended variables and their corresponding time lags, so as to avoid introducing unnecessary noise. Then the bagging ensemble is adopted to cooperate with greedy extension to eliminate the randomness brought by the greedy algorithm and further reduce the false alarm rate(FAR) of monitoring results. Finally, the algorithm is applied to the blast furnace of a large iron and steel group in South China to verify performance.Compared with the basic algorithms, the proposed method achieves lowest FAR, while keeping missed alarm rate(MAR) remain stable.
基金Supported by the National Natural Science Foundation of China(61374140)Shanghai Pujiang Program(12PJ1402200)
文摘A novel approach named aligned mixture probabilistic principal component analysis(AMPPCA) is proposed in this study for fault detection of multimode chemical processes. In order to exploit within-mode correlations,the AMPPCA algorithm first estimates a statistical description for each operating mode by applying mixture probabilistic principal component analysis(MPPCA). As a comparison, the combined MPPCA is employed where monitoring results are softly integrated according to posterior probabilities of the test sample in each local model. For exploiting the cross-mode correlations, which may be useful but are inadvertently neglected due to separately held monitoring approaches, a global monitoring model is constructed by aligning all local models together. In this way, both within-mode and cross-mode correlations are preserved in this integrated space. Finally, the utility and feasibility of AMPPCA are demonstrated through a non-isothermal continuous stirred tank reactor and the TE benchmark process.
基金National Natural Science Foundation of China(61673279)。
文摘For plant-wide processes with multiple operating conditions,the multimode feature imposes some challenges to conventional monitoring techniques.Hence,to solve this problem,this paper provides a novel local component based principal component analysis(LCPCA)approach for monitoring the status of a multimode process.In LCPCA,the process prior knowledge of mode division is not required and it purely based on the process data.Firstly,LCPCA divides the processes data into multiple local components using finite Gaussian mixture model mixture(FGMM).Then,calculating the posterior probability is applied to determine each sample belonging to which local component.After that,the local component information(such as mean and standard deviation)is used to standardize each sample of local component.Finally,the standardized samples of each local component are combined to train PCA monitoring model.Based on the PCA monitoring model,two monitoring statistics T^(2) and SPE are used for monitoring multimode processes.Through a numerical example and the Tennessee Eastman(TE)process,the monitoring result demonstrates that LCPCA outperformed conventional PCA and LNS-PCA in the fault detection rate.
基金Project(51175159)supported by the National Natural Science Foundation of ChinaProject(2013WK3024)supported by the Science andTechnology Planning Program of Hunan Province,ChinaProject(CX2013B146)supported by the Hunan Provincial InnovationFoundation for Postgraduate,China
文摘A technology for unintended lane departure warning was proposed. As crucial information, lane boundaries were detected based on principal component analysis of grayscale distribution in search bars of given number and then each search bar was tracked using Kalman filter between frames. The lane detection performance was evaluated and demonstrated in ways of receiver operating characteristic, dice similarity coefficient and real-time performance. For lane departure detection, a lane departure risk evaluation model based on lasting time and frequency was effectively executed on the ARM-based platform. Experimental results indicate that the algorithm generates satisfactory lane detection results under different traffic and lighting conditions, and the proposed warning mechanism sends effective warning signals, avoiding most false warning.
文摘Referring to GB5618-1995 about heavy metal pollution,and using statistical analysis SPSS,the major pollutants of mine area farmland heavy metal pollution were identified by variable clustering analysis.Assessment and classification were done to the mine area farmland heavy metal pollution situation by synthetic principal components analysis (PCA).The results show that variable clustering analysis is efficient to identify the principal components of mine area farmland heavy metal pollution.Sort and clustering were done to the synthetic principal components scores of soil sample,which is given by synthetic principal components analysis.Data structure of soil heavy metal contaminations relationships and pollution level of different soil samples are discovered.The results of mine area farmland heavy metal pollution quality assessed and classified with synthetic component scores reflect the influence of both the major and compound heavy metal pol- lutants.Identification and assessment results of mine area farmland heavy metal pollution can provide reference and guide to propose control measures of mine area farmland heavy metal pollution and focus on the key treatment region.
基金supported by The Ministry of Trade,Industry,and Energy(20172510102090,20142520100440,20162010201980)Global PhD Fellowship Program through the National Research Foundation of Korea(NRF)funded by the Ministry of Education(2015H1A2A1030756)supported by the National Research Foundation of Korea(NRF)Grant(No.2018R1C1B5045260).
文摘Ensemble-based analyses are useful to compare equiprobable scenarios of the reservoir models.However,they require a large suite of reservoir models to cover high uncertainty in heterogeneous and complex reservoir models.For stable convergence in ensemble Kalman filter(EnKF),increasing ensemble size can be one of the solutions,but it causes high computational cost in large-scale reservoir systems.In this paper,we propose a preprocessing of good initial model selection to reduce the ensemble size,and then,EnKF is utilized to predict production performances stochastically.In the model selection scheme,representative models are chosen by using principal component analysis(PCA)and clustering analysis.The dimension of initial models is reduced using PCA,and the reduced models are grouped by clustering.Then,we choose and simulate representative models from the cluster groups to compare errors of production predictions with historical observation data.One representative model with the minimum error is considered as the best model,and we use the ensemble members near the best model in the cluster plane for applying EnKF.We demonstrate the proposed scheme for two 3D models that EnKF provides reliable assimilation results with much reduced computation time.
文摘Glass is the precious material evidence of the trade of the early Silk Road. The ancient glass was easily affected by the environmental impact and weathering, and the change of composition ratios affected the correct judgment of its category. In this paper, mathematical models and methods such as Chi-square test, weighted average method, principal component analysis, cluster analysis, binary classification model and grey correlation analysis were used comprehensively to analyze the data of sample glass products combined with their categories. The results showed that the weathered high-potassium glass could be divided into 12, 9, 10 and 27, 7, 22 and so on.
文摘The accurate extraction and classification of leather defects is an important guarantee for the automation and quality evaluation of leather industry. Aiming at the problem of data classification of leather defects,a hierarchical classification for defects is proposed.Firstly,samples are collected according to the method of minimum rectangle,and defects are extracted by image processing method.According to the geometric features of representation, they are divided into dot,line and surface for rough classification. From analysing the data which extracting the defects of geometry,gray and texture,the dominating characteristics can be acquired. Each type of defect by choosing different and representative characteristics,reducing the dimension of the data,and through these characteristics of clustering to achieve convergence effectively,realize extracted accurately,and digitized the defect characteristics,eventually establish the database. The results showthat this method can achieve more than 90% accuracy and greatly improve the accuracy of classification.
文摘Spatio-temporal assessment of the above ground biomass (AGB) is a cumbersome task due to the difficulties associated with the measurement of different tree parameters such as girth at breast height and height of trees. The present research was conducted in the campus of Birla Institute of Technology, Mesra, Ranchi, India, which is predomi- nantly covered by Sal (Shorea robusta C. F. Gaertn). Two methods of regression analysis was employed to determine the potential of remote sensing parameters with the AGB measured in the field such as linear regression analysis between the AGB and the individual bands, principal components (PCs) of the bands, vegetation indices (VI), and the PCs of the VIs respectively and multiple linear regression (MLR) analysis be- tween the AGB and all the variables in each category of data. From the linear regression analysis, it was found that only the NDVI exhibited regression coefficient value above 0.80 with the remaining parameters showing very low values. On the other hand, the MLR based analysis revealed significantly improved results as evidenced by the occurrence of very high correlation coefficient values of greater than 0.90 determined between the computed AGB from the MLR equations and field-estimated AGB thereby ascertaining their superiority in providing reliable estimates of AGB. The highest correlation coefficient of 0.99 is found with the MLR involving PCs of VIs.
基金supported by the State Key Laboratory of Nuclear Physics and Technology,Peking University(Grant No.NPT2023KFY02)the China Postdoctoral Science Foundation(Grant No.2021M700256)+2 种基金the National Key R&D Program of China(Grant No.2018YFA0404400)the National Natural Science Foundation of China(Grant Nos.11935003,11975031,12141501,and 12070131001)the High-performance Computing Platform of Peking University。
文摘Principal component analysis(PCA)is employed to extract the principal components(PCs)present in nuclear mass models for the first time.The effects from different nuclear mass models are reintegrated and reorganized in the extracted PCs.These PCs are recombined to build new mass models,which achieve better accuracy than the original theoretical mass models.This comparison indicates that using the PCA approach,the effects contained in different mass models can be collaborated to improve nuclear mass predictions.
基金Supported by the Beijing Municipal Social Science Foundation(SZ202210005004)Beijing Natural Science Foundation(9242004)。
文摘Motivated by the Bagging Partial Least Squares(Bagging PLS)and Principal Component Analysis(PCA)algorithms,a novel approach known as Principal Model Analysis(PMA)method is introduced in this paper.In the proposed PMA algorithm,the PCA and the Bagging PLS are combined.In this method,multiple PLS models are trained on sub-training sets,derived from the training set using the random sampling with replacement approach.The regression coefficients of all the sub-PLS models are fused in a joint regression coefficient matrix.The final projection direction is then estimated by performing the PCA on the joint regression coefficient matrix.Subsequently,the proposed PMA method is compared with other traditional dimension reduction methods,such as PLS,Bagging PLS,Linear discriminant analysis(LDA)and PLS-LDA.Experimental results on six public datasets demonstrate that our proposed method consistently outperforms other approaches in terms of classification performance and exhibits greater stability.Additionally,it is employed in the application of financial statement fraud identification.PMA and other five algorithms are utilized to financial statement fraud which concerned by the academic community,and the results indicate that the classification of PMA surpassed that of the other methods.
基金supported by the National Natural Science Foundation of China(Grant No.42122038)。
文摘Forward radiative transfer(RT)models are essential for atmospheric applications such as remote sensing and weather and climate models,where computational efficiency becomes equally as important as accuracy for high-resolution hyperspectral measurements that need rigorous RT simulations for thousands of channels.This study introduces a fast and accurate RT model for the hyperspectral infrared(HIR)sounder based on principal component analysis(PCA)or machine learning(i.e.,neural network,NN).The Geosynchronous Interferometric Infrared Sounder(GIIRS),the first HIR sounder onboard the geostationary Fengyun-4 satellites,is considered to be a candidate example for model development and validation.Our method uses either PCA or NN(PCA/NN)twice for the atmospheric transmittance and radiance,respectively,to reduce the number of independent but similar simulations to accelerate RT simulations;thereby,it is referred to as a multi-domain compression model.The first PCA/NN gives monochromatic gas transmittance in both spectral and atmospheric pressure domains for each gas independently.The second PCA/NN is performed in the traditional spectral radiance domain.Meanwhile,a new method is introduced to choose representative variables for the PCA/NN scheme developments.The model is three orders of magnitude faster than the standard line-by-line-based simulations with averaged brightness temperature difference(BTD)less than 0.1 K,and the compressions based on PCA or NN methods result in comparable efficiency and accuracy.Our fast model not only avoids an excessively complicated transmittance scheme by using PCA/NN but is also highly flexible for hyperspectral instruments with similar spectral ranges simply by updating the corresponding spectral response functions.
基金Sponsored by the National Natural Science Foundation of China(Grant No. 60272073).
文摘A new algorithm for unsupervised hyperspectral data unmixing is investigated, which includes a modified minimum noise fraction (MNF) transformation and independent component analysis (ICA). The modified MNF transformation is used to reduce noise and remove correlation between neighboring bands. Then the ICA is applied to unmix hyperspectral images, and independent endmembers are obtained from unmixed images by using post-processing which includes image segmentation based on statistical histograms and morphological operations. The experimental results demonstrate that this algorithm can identify endmembers resident in mixed pixels. Meanwhile, the results show the high computational efficiency of the modified MNF transformation. The time consumed by the modified method is almost one fifth of the traditional MNF transformation.
文摘在茶园水资源管理中,蒸散量(Evapotranspiration,ET)是评估作物水分需求的关键指标,由于茶园蒸散量预测具有时序性、不稳定性以及非线性耦合等特点,目前的茶园蒸散量预测模型存在预测精度较低的问题,针对此问题本文提出了一种新型的茶园蒸散量预测模型。首先使用互信息算法(Mutual information,MI)与主成分分析算法(Principal component analysis,PCA)相融合的数据处理算法(MIPCA),筛选强相关的特征并提取主成分;其次将时域卷积网络(Temporal convolutional network,TCN)与Transformer融合,利用灰狼算法(Grey wolf optimization,GWO)优化超参数,捕捉茶园数据的全局依赖关系;最后整合2个网络构建了MIPCA-TCN-GWO-Transformer模型,通过消融试验和对比试验验证了模型性能,并对模型在不同时间步长下的性能进行测试。结果表明,该模型平均绝对百分比误差(Mean absolute percentage error,MAPE)、均方根误差(Root mean square error,RMSE)和决定系数(Coefficient of determination,R^(2))3个评价指标分别为0.015 mm/d、0.312 mm/d和0.962,优于长短期记忆模型(Long short term memory,LSTM)等传统预测模型。在小时尺度、日尺度和月尺度下的R^(2)分别为0.986、0.978和0.946,在不同时间步长下展现了良好的适应性和准确性。本文构建的MIPCA-TCN-GWO-Transformer模型具有较高的预测精度和稳定性,可为茶园水资源优化管理和灌溉制度制定提供科学参考。