Boreal forests play an important role in global environment systems. Understanding boreal forest ecosystem structure and function requires accurate monitoring and estimating of forest canopy and biomass. We used parti...Boreal forests play an important role in global environment systems. Understanding boreal forest ecosystem structure and function requires accurate monitoring and estimating of forest canopy and biomass. We used partial least square regression (PLSR) models to relate forest parameters, i.e. canopy closure density and above ground tree biomass, to Landsat ETM+ data. The established models were optimized according to the variable importance for projection (VIP) criterion and the bootstrap method, and their performance was compared using several statistical indices. All variables selected by the VIP criterion passed the bootstrap test (p〈0.05). The simplified models without insignificant variables (VIP 〈1) performed as well as the full model but with less computation time. The relative root mean square error (RMSE%) was 29% for canopy closure density, and 58% for above ground tree biomass. We conclude that PLSR can be an effective method for estimating canopy closure density and above ground biomass.展开更多
With the development of mid-infrared (MIR) photoelectric devices, mid-infrared spectroscopy has become one of the important methods for non-invasive detection of blood glucose. The mid-infrared region (4000 - 400 cm&l...With the development of mid-infrared (MIR) photoelectric devices, mid-infrared spectroscopy has become one of the important methods for non-invasive detection of blood glucose. The mid-infrared region (4000 - 400 cm<sup>-1</sup>) has the well-known fingerprint region (1200 - 800 cm<sup>-1</sup>) of glucose, which has clearer characteristic absorption peaks and better specificity. There is a lot of molecular information about glucose in the MIR. The non-invasive detection of blood glucose by mid-infrared spectroscopy needs to achieve certain accuracy, and the quantitative model is an important factor affecting the accuracy of glucose detection. In this paper, the samples of imitation solution containing only glucose and the samples of imitation mixed solution are taken as the research objects, and the mid-infrared spectral data of the samples are collected. The full spectrum partial least squares Regression (PLSR) model, SNV + Ctr-PLSR model, MSC + Ctr-PLSR model, and convolutional neural networks (CNN) model of 3000 - 900 cm<sup>-1</sup> band were constructed. Full spectrum PLS model and CNN model of 1200 - 900 cm<sup>-1</sup> band were constructed. The experimental results show that the optimal model of the two bands is CNN, then the correlation coefficient of prediction set (Rp) of 3000 - 900 cm<sup>-1</sup> band is 0.95, and the root mean square error of pre-diction set (RMSEP) value is 22.10. The Rp of 1200 - 900 cm<sup>-1</sup> band is 0.95, and the RMSEP value is 22.54. The research results show that CNN is a promising method, which has higher accuracy than PLSR, and is especially suitable for modeling human complex environment. In addition, the study provides a theoretical and practical basis for CNN in feature selection and model interpretation.展开更多
为解决多级制造过程关键质量特性识别中多质量特性之间的相关性问题,将偏最小二乘回归方法(Partial Least Squares Regression,PLSR)引入模型构建与分析中。首先应用状态空间方法建立多级制造过程关键质量特性识别模型,进而利用PLSR方...为解决多级制造过程关键质量特性识别中多质量特性之间的相关性问题,将偏最小二乘回归方法(Partial Least Squares Regression,PLSR)引入模型构建与分析中。首先应用状态空间方法建立多级制造过程关键质量特性识别模型,进而利用PLSR方法解决质量特性间的多重共线性问题并进行模型分析,识别关键质量特性,最后以卷烟生产过程为例介绍了该方法的应用。实例表明,该方法不仅可以有效识别多级制造过程关键质量特性,而且能够建立各级过程的输出质量对最终产品质量的影响及其质量特性之间相互关系的模型,反映多级生产过程的结构特征和各级过程质量特性之间的因果关系,为多级制造过程质量分析与控制提供依据。展开更多
基金supported by the 948 Program of the State Forestry Administration (2009-4-43)the National Natura Science Foundation of China (No.30870420)
文摘Boreal forests play an important role in global environment systems. Understanding boreal forest ecosystem structure and function requires accurate monitoring and estimating of forest canopy and biomass. We used partial least square regression (PLSR) models to relate forest parameters, i.e. canopy closure density and above ground tree biomass, to Landsat ETM+ data. The established models were optimized according to the variable importance for projection (VIP) criterion and the bootstrap method, and their performance was compared using several statistical indices. All variables selected by the VIP criterion passed the bootstrap test (p〈0.05). The simplified models without insignificant variables (VIP 〈1) performed as well as the full model but with less computation time. The relative root mean square error (RMSE%) was 29% for canopy closure density, and 58% for above ground tree biomass. We conclude that PLSR can be an effective method for estimating canopy closure density and above ground biomass.
文摘With the development of mid-infrared (MIR) photoelectric devices, mid-infrared spectroscopy has become one of the important methods for non-invasive detection of blood glucose. The mid-infrared region (4000 - 400 cm<sup>-1</sup>) has the well-known fingerprint region (1200 - 800 cm<sup>-1</sup>) of glucose, which has clearer characteristic absorption peaks and better specificity. There is a lot of molecular information about glucose in the MIR. The non-invasive detection of blood glucose by mid-infrared spectroscopy needs to achieve certain accuracy, and the quantitative model is an important factor affecting the accuracy of glucose detection. In this paper, the samples of imitation solution containing only glucose and the samples of imitation mixed solution are taken as the research objects, and the mid-infrared spectral data of the samples are collected. The full spectrum partial least squares Regression (PLSR) model, SNV + Ctr-PLSR model, MSC + Ctr-PLSR model, and convolutional neural networks (CNN) model of 3000 - 900 cm<sup>-1</sup> band were constructed. Full spectrum PLS model and CNN model of 1200 - 900 cm<sup>-1</sup> band were constructed. The experimental results show that the optimal model of the two bands is CNN, then the correlation coefficient of prediction set (Rp) of 3000 - 900 cm<sup>-1</sup> band is 0.95, and the root mean square error of pre-diction set (RMSEP) value is 22.10. The Rp of 1200 - 900 cm<sup>-1</sup> band is 0.95, and the RMSEP value is 22.54. The research results show that CNN is a promising method, which has higher accuracy than PLSR, and is especially suitable for modeling human complex environment. In addition, the study provides a theoretical and practical basis for CNN in feature selection and model interpretation.
文摘为解决多级制造过程关键质量特性识别中多质量特性之间的相关性问题,将偏最小二乘回归方法(Partial Least Squares Regression,PLSR)引入模型构建与分析中。首先应用状态空间方法建立多级制造过程关键质量特性识别模型,进而利用PLSR方法解决质量特性间的多重共线性问题并进行模型分析,识别关键质量特性,最后以卷烟生产过程为例介绍了该方法的应用。实例表明,该方法不仅可以有效识别多级制造过程关键质量特性,而且能够建立各级过程的输出质量对最终产品质量的影响及其质量特性之间相互关系的模型,反映多级生产过程的结构特征和各级过程质量特性之间的因果关系,为多级制造过程质量分析与控制提供依据。