Spectroscopy can be used for detecting crop characteristics. A goal of crop spectrum analysis is to extract effective features from spectral data for establishing a detection model. An ideal spectral feature set shoul...Spectroscopy can be used for detecting crop characteristics. A goal of crop spectrum analysis is to extract effective features from spectral data for establishing a detection model. An ideal spectral feature set should have high sensitivity to target parameters but low information redundancy among features.However, feature-selection methods that satisfy both requirements are lacking. To address this issue,in this study, a novel method, the continuous wavelet projections algorithm(CWPA), was developed,which has advantages of both continuous wavelet analysis(CWA) and the successive projections algorithm(SPA) for generating optimal spectral feature set for crop detection. Three datasets collected for crop stress detection and retrieval of biochemical properties were used to validate the CWPA under both classification and regression scenarios. The CWPA generated a feature set with fewer features yet achieving accuracy comparable to or even higher than those of CWA and SPA. With only two to three features identified by CWPA, an overall accuracy of 98% in classifying tea plant stresses was achieved, and high coefficients of determination were obtained in retrieving corn leaf chlorophyll content(R^(2)= 0.8521)and equivalent water thickness(R^(2)= 0.9508). The mechanism of the CWPA ensures that the novel algorithm discovers the most sensitive features while retaining complementarity among features. Its ability to reduce the data dimension suggests its potential for crop monitoring and phenotyping with hyperspectral data.展开更多
为提高白酒固态发酵的副产物黄水中淀粉含量预测模型精度和建模效率。采用傅里叶变换近红外光谱仪采集黄水光谱信息,利用一阶导数对光谱进行预处理,并结合偏最小二乘回归(partial least squares regression,PLSR)建立黄水淀粉定量预测...为提高白酒固态发酵的副产物黄水中淀粉含量预测模型精度和建模效率。采用傅里叶变换近红外光谱仪采集黄水光谱信息,利用一阶导数对光谱进行预处理,并结合偏最小二乘回归(partial least squares regression,PLSR)建立黄水淀粉定量预测模型。使用决定系数(R^(2))和预测均方误差(root mean square error of prediction,RMSEP)评价模型性能。光谱中含有大量冗余信息,为有效提升黄水淀粉含量检测精度和优化模型效率,将不同特征提取方法的优点结合,发现使用竞争性自适应重加权算法(competitive adaptive reweighted sampling,CARS)结合连续投影算法(successive projections algorithm,SPA)提取的光谱特征所建立的PLSR模型,相较于未使用特征提取或仅使用单一特征提取所建立的模型均有明显提升。在单一使用CARS时,模型的R^(2)为0.9654,RMSEP为0.2012%,而结合SPA后,R2为0.9738,RMSEP为0.1748%。此外,光谱维度从2203个减少到了126个,不仅提高了预测精度,也提升了建模效率。本研究提出的方法可作为黄水近红外定量模型优化的有效途径。展开更多
基金supported by the National Natural Science Foundation of China (42071420)the Major Special Project for 2025 Scientific,Technological Innovation (Major Scientific and Technological Task Project in Ningbo City)(2021Z048)the National Key Research and Development Program of China(2019YFE0125300)。
文摘Spectroscopy can be used for detecting crop characteristics. A goal of crop spectrum analysis is to extract effective features from spectral data for establishing a detection model. An ideal spectral feature set should have high sensitivity to target parameters but low information redundancy among features.However, feature-selection methods that satisfy both requirements are lacking. To address this issue,in this study, a novel method, the continuous wavelet projections algorithm(CWPA), was developed,which has advantages of both continuous wavelet analysis(CWA) and the successive projections algorithm(SPA) for generating optimal spectral feature set for crop detection. Three datasets collected for crop stress detection and retrieval of biochemical properties were used to validate the CWPA under both classification and regression scenarios. The CWPA generated a feature set with fewer features yet achieving accuracy comparable to or even higher than those of CWA and SPA. With only two to three features identified by CWPA, an overall accuracy of 98% in classifying tea plant stresses was achieved, and high coefficients of determination were obtained in retrieving corn leaf chlorophyll content(R^(2)= 0.8521)and equivalent water thickness(R^(2)= 0.9508). The mechanism of the CWPA ensures that the novel algorithm discovers the most sensitive features while retaining complementarity among features. Its ability to reduce the data dimension suggests its potential for crop monitoring and phenotyping with hyperspectral data.
文摘为提高白酒固态发酵的副产物黄水中淀粉含量预测模型精度和建模效率。采用傅里叶变换近红外光谱仪采集黄水光谱信息,利用一阶导数对光谱进行预处理,并结合偏最小二乘回归(partial least squares regression,PLSR)建立黄水淀粉定量预测模型。使用决定系数(R^(2))和预测均方误差(root mean square error of prediction,RMSEP)评价模型性能。光谱中含有大量冗余信息,为有效提升黄水淀粉含量检测精度和优化模型效率,将不同特征提取方法的优点结合,发现使用竞争性自适应重加权算法(competitive adaptive reweighted sampling,CARS)结合连续投影算法(successive projections algorithm,SPA)提取的光谱特征所建立的PLSR模型,相较于未使用特征提取或仅使用单一特征提取所建立的模型均有明显提升。在单一使用CARS时,模型的R^(2)为0.9654,RMSEP为0.2012%,而结合SPA后,R2为0.9738,RMSEP为0.1748%。此外,光谱维度从2203个减少到了126个,不仅提高了预测精度,也提升了建模效率。本研究提出的方法可作为黄水近红外定量模型优化的有效途径。