Application of Boosting Ensemble Regression for Near-Infrared Spectroscopic Quantitative Calibration (Boosting集成回归在近红外光谱定量校正中的应用)


Abstract: To model the relationship between physical or chemical property parameters and near-infrared (NIR) spectroscopic data, a Boosting-PLS algorithm for multivariate calibration is presented, based on the idea of building a series of regressors. Each weak (base) regressor is trained on a subset of the original calibration set. Each subset is drawn from the original calibration set by probability-weighted sampling with replacement, and the sample probabilities are determined by the prediction errors of the previous regressor. Samples with large errors are given higher probabilities so that subsequent regressors concentrate on them. The final ensemble model is the weighted median of the weak regressors. An NIR application example and a comparison with partial least squares (PLS) indicate that Boosting-PLS yields a more accurate and more robust calibration model that is less sensitive to overfitting.
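The procedure summarized in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state the exact probability-update rule, so the sketch assumes an AdaBoost.R2-style update with a linear loss, and it uses a small NIPALS PLS1 routine as the base regressor. All function names and parameters (`pls1_fit`, `boosting_pls_fit`, `n_components`, `n_estimators`, `seed`) are hypothetical.

```python
import numpy as np


def pls1_fit(X, y, n_components):
    """Fit a single-response PLS model (NIPALS); return (x_mean, y_mean, coef)."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xk, yk = X - x_mean, y - y_mean
    W, P, q = [], [], []
    for _ in range(n_components):
        w = Xk.T @ yk
        norm = np.linalg.norm(w)
        if norm < 1e-12:              # no covariance left to model
            break
        w = w / norm
        t = Xk @ w                    # scores
        tt = t @ t
        p = Xk.T @ t / tt             # X loadings
        c = yk @ t / tt               # y loading
        Xk = Xk - np.outer(t, p)      # deflate X
        yk = yk - c * t               # deflate y
        W.append(w)
        P.append(p)
        q.append(c)
    if not W:
        return x_mean, y_mean, np.zeros(X.shape[1])
    W, P, q = np.array(W).T, np.array(P).T, np.array(q)
    coef = W @ np.linalg.solve(P.T @ W, q)
    return x_mean, y_mean, coef


def pls1_predict(model, X):
    x_mean, y_mean, coef = model
    return (X - x_mean) @ coef + y_mean


def weighted_median(preds, weights):
    """Column-wise weighted median of preds with shape (n_models, n_samples)."""
    order = np.argsort(preds, axis=0)
    sorted_preds = np.take_along_axis(preds, order, axis=0)
    cum = np.cumsum(weights[order], axis=0)
    idx = (cum >= 0.5 * weights.sum()).argmax(axis=0)
    return sorted_preds[idx, np.arange(preds.shape[1])]


def boosting_pls_fit(X, y, n_components=3, n_estimators=20, seed=0):
    """Train weak PLS regressors on resampled subsets (assumed AdaBoost.R2 update)."""
    rng = np.random.default_rng(seed)
    n = len(y)
    w = np.full(n, 1.0 / n)           # start with equal sample weights
    models, alphas = [], []
    for _ in range(n_estimators):
        p = w / w.sum()
        idx = rng.choice(n, size=n, replace=True, p=p)   # sampling with replacement
        model = pls1_fit(X[idx], y[idx], n_components)
        err = np.abs(pls1_predict(model, X) - y)
        loss = err / err.max() if err.max() > 0 else err  # linear loss in [0, 1]
        avg_loss = float(p @ loss)
        if avg_loss == 0.0 or avg_loss >= 0.5:
            # Perfect fit, or a regressor too weak to help: stop boosting.
            if avg_loss == 0.0 or not models:
                models.append(model)
                alphas.append(1.0)
            break
        beta = avg_loss / (1.0 - avg_loss)
        models.append(model)
        alphas.append(np.log(1.0 / beta))   # more accurate regressors weigh more
        w = w * beta ** (1.0 - loss)        # poorly predicted samples gain probability
    return models, np.array(alphas)


def boosting_pls_predict(models, alphas, X):
    preds = np.array([pls1_predict(m, X) for m in models])
    return weighted_median(preds, alphas)
```

In this sketch, a calibration set `X` (spectra, one row per sample) and `y` (reference values) would be passed to `boosting_pls_fit`, and new spectra to `boosting_pls_predict`. The number of PLS components and the number of weak regressors are tuning choices that the abstract does not specify.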

Authors: Tan Chao (谭超), Qin Xin (覃鑫)

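Affiliations: Department of Chemistry and Chemical Engineering, Yibin University; Key Laboratory of Computational Physics, Yibin University; China Tobacco Chuanyu Group (川渝中烟集团)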

Source: Computers and Applied Chemistry (《计算机与应用化学》), 2010, No. 2, pp. 241-244 (4 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.

Funding: Sichuan Province Youth Science and Technology Foundation (09ZQ026-066); Yibin University Doctoral Research Start-up Fund (2008B06)

Keywords: boosting; near-infrared; calibration; regression

Classification codes: O65 [Science: Analytical Chemistry]; TQ015.9 [Chemical Engineering]
