期刊文献+

Vis-NIR光谱模式识别结合SG平滑用于转基因甘蔗育种筛查 被引量:17

Vis-NIR Spectroscopic Pattern Recognition Combined with SG Smoothing Applied to Breed Screening of Transgenic Sugarcane
下载PDF
导出
摘要 以Savitzky-Golay(SG)平滑筛选,主成分分析(PCA)分别结合有监督的线性判别分析(LDA)、无监督的系统聚类分析(HCA),应用于转基因甘蔗育种筛查的可见-近红外(Vis-NIR)无损检测。提出兼顾随机性、稳定性的定标、预测、检验框架;取田间种植处于伸长期甘蔗叶样品456个,具有Bt基因和Bar基因的转基因样品(阳)306个,非转基因样品(阴)150个;随机选取156个为检验集(阴性50、阳性106),余下为建模集(阴性100、阳性200,共300),建模集再随机划分为定标集(阴性50、阳性100,共150)、预测集(阴性50、阳性100,共150)共50次;扩充SG平滑点数,同时删除绝对值偏小的高阶导数模式,共264个平滑模式用于模型筛选;采用前3个主成分两两组合,再根据模型效果选出最优主成分组合;基于所有定标、预测集划分和SG平滑模式,建立SG-PCA-LDA和SG-PCA-HCA模型,根据平均预测效果优选参数,使模型具有稳定性;最后用检验集进行模型检验。经SG平滑后,PCA-LDA和PCA-HCA的建模精度、稳定性均显著改善;最优SG-PCA-LDA模型阳性、阴性样品检验识别率分别达到94.3%和96.0%;最优SG-PCA-HCA模型阳性、阴性样品检验识别率分别达到92.5%和98.0%。结果表明:Vis-NIR光谱模式识别结合SG平滑可用于转基因甘蔗叶的准确识别,提供了一种简便的转基因甘蔗育种筛查方法。 Based on Savitzky-Golay(SG)smoothing screening,principal component analysis(PCA)combined with separately supervised linear discriminant analysis(LDA)and unsupervised hierarchical clustering analysis(HCA)were used for non-destructive visible and near-infrared(Vis-NIR)detection for breed screening of transgenic sugarcane.A random and stability-dependent framework of calibration,prediction,and validation was proposed.A total of 456 samples of sugarcane leaves planting in the elongating stage were collected from the field,which was composed of 306transgenic(positive)samples containing Bt and Bar gene and 150non-transgenic(negative)samples.A total of 156samples(negative 50 and positive 106)were randomly selected as the validation set;the remaining samples(negative 100 and positive 200,a total of 300samples)were used as the modeling set,and then the modeling set was subdivided into calibration(negative 50 and positive 100,a total of 150samples)and prediction sets(negative 50 and positive 100,a total of 150samples)for 50 times.The number of SG smoothing points was expanded,while some modes of higher derivative were removed because of small absolute value,and a total of 264 smoothing modes were used for screening.The pairwise combinations of first three principal components were used,and then the optimal combination of principal components was selected according to the model effect.Based on all divisions of calibration and prediction sets and all SG smoothing modes,the SG-PCA-LDA and SG-PCA-HCA models were established,the model parameters were optimized based on the average prediction effect for all divisions to produce modeling stability.Finally,the model validation was performed by validation set.With SG smoothing,the modeling accuracy and stability of PCA-LDA,PCA-HCA were significantly improved.For the optimal SG-PCA-LDA model,the recognition rate of positive and negative validation samples were94.3%,96.0%;and were 92.5%,98.0%for the optimal SG-PCA-LDA model,respectively.Conclusion:Vis-NIR spectroscopic pattern recognition combined with SG smoothing could be used for accurate recognition of transgenic sugarcane leaves,and provided a convenient screening method for transgenic sugarcane breeding.
出处 《光谱学与光谱分析》 SCIE EI CAS CSCD 北大核心 2014年第10期2701-2706,共6页 Spectroscopy and Spectral Analysis
基金 国家自然科学基金项目(61078040) 广东省科技计划项目(2012B031800917) 广州市科技计划珠江科技新星专项(2104J2200073) 广州市科技计划项目(2014Y2-00002)资助
关键词 转基因甘蔗育种筛查 Vis-NIR光谱 SG平滑 PCA-LDA PCA-HCA Breed screening of transgenic sugarcane Vis-NIR spectroscopy SG smoothing PCA-LDA PCA-HCA
  • 相关文献

参考文献8

二级参考文献148

共引文献754

同被引文献242

引证文献17

二级引证文献88

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部