The construction method of background value is improved in the original multi-variable grey model (MGM(1,m)) from its source of construction errors. The MGM(1,m) with optimized background value is used to elimin...The construction method of background value is improved in the original multi-variable grey model (MGM(1,m)) from its source of construction errors. The MGM(1,m) with optimized background value is used to eliminate the random fluctuations or errors of the observational data of all variables, and the combined prediction model together with the multiple linear regression is established in order to improve the simulation and prediction accuracy of the combined model. Finally, a combined model of the MGM(1,2) with optimized background value and the binary linear regression is constructed by an example. The results show that the model has good effects for simulation and prediction.展开更多
This paper selects seven indicators of financial revenue and housing sales price in recent 19 years in China,and uses SPSS and Excel to carry out descriptive statistics,independent sample t-test,correlation analysis a...This paper selects seven indicators of financial revenue and housing sales price in recent 19 years in China,and uses SPSS and Excel to carry out descriptive statistics,independent sample t-test,correlation analysis and regression analysis to comprehensively study the correlation between financial revenue and housing sales price in China,and establishes the relationship between financial revenue and housing sales price When the average selling price of commercial housing increases by one unit,the fiscal revenue will increase by 27.855 points.展开更多
Multiple linear regression (MLR) method was applied to quantify the effects of the net heat flux (NHF), the net freshwater flux (NFF) and the wind stress on the mixed layer depth (MLD) of the South China Sea ...Multiple linear regression (MLR) method was applied to quantify the effects of the net heat flux (NHF), the net freshwater flux (NFF) and the wind stress on the mixed layer depth (MLD) of the South China Sea (SCS) based on the simple ocean data assimilation (SODA) dataset. The spatio-temporal distributions of the MLD, the buoyancy flux (combining the NHF and the NFF) and the wind stress of the SCS were presented. Then using an oceanic vertical mixing model, the MLD after a certain time under the same initial conditions but various pairs of boundary conditions (the three factors) was simulated. Applying the MLR method to the results, regression equations which modeling the relationship between the simulated MLD and the three factors were calculated. The equations indicate that when the NHF was negative, it was the primary driver of the mixed layer deepening; and when the NHF was positive, the wind stress played a more important role than that of the NHF while the NFF had the least effect. When the NHF was positive, the relative quantitative effects of the wind stress, the NHF, and the NFF were about i0, 6 and 2. The above conclusions were applied to explaining the spatio-temporal distributions of the MLD in the SCS and thus proved to be valid.展开更多
Based on protein-DNA complex crystal structural data in up-to-date Nucleic Acid Database,the related parameters of DNA Kinetic Structure were investigated by Monte-Carlo Multiple Integrals on the base of modified DNA ...Based on protein-DNA complex crystal structural data in up-to-date Nucleic Acid Database,the related parameters of DNA Kinetic Structure were investigated by Monte-Carlo Multiple Integrals on the base of modified DNA structure statistical mechanical model,and time complexity and precision were analyzed on the calculated results.展开更多
Rivers are important systems which provide water to fulfill human needs. However, excessive human uses over the years have led to deterioration in quality of river causing, causing health problems from contaminated wa...Rivers are important systems which provide water to fulfill human needs. However, excessive human uses over the years have led to deterioration in quality of river causing, causing health problems from contaminated water. This study focuses on the application of statistical techniques, Multiple Linear Regression model and MANOVA to assess health impacts due to pollution in Cauvery river stretch in Srirangapatna. In this study, using Multiple Linear Regression, it is found that health impact level is 60.8% dependent on water quality parameters of BOD, COD, TDS, TC and FC. The t-statistics and their associated 2-tailed p-values indicate that COD and TDS produces health impacts compared to BOD, TC and FC, when their effects are put together across all the six sampling stations in Srirangapatna. Further Pearson correlation Matrix shows highly significant positive correlation amongst parameters across all stations indicating possibility of common sources of origin that might be anthropogenic. Also graphs are plotted for individual parameters across all stations and it reveals that COD and TDS values are significant across all sampling stations, though their values are higher in impact stations, causing health impacts.展开更多
Predictive Emission Monitoring Systems (PEMS) offer a cost-effective and environmentally friendly alternative to Continuous Emission Monitoring Systems (CEMS) for monitoring pollution from industrial sources. Multiple...Predictive Emission Monitoring Systems (PEMS) offer a cost-effective and environmentally friendly alternative to Continuous Emission Monitoring Systems (CEMS) for monitoring pollution from industrial sources. Multiple regression is one of the fundamental statistical techniques to describe the relationship between dependent and independent variables. This model can be effectively used to develop a PEMS, to estimate the amount of pollution emitted by industrial sources, where the fuel composition and other process-related parameters are available. It often makes them sufficient to predict the emission discharge with acceptable accuracy. In cases where PEMS are accepted as an alternative method to CEMS, which use gas analyzers, they can provide cost savings and substantial benefits for ongoing system support and maintenance. The described mathematical concept is based on the matrix algebra representation in multiple regression involving multiple precision arithmetic techniques. Challenging numerical examples for statistical big data analysis, are investigated. Numerical examples illustrate computational accuracy and efficiency of statistical analysis due to increasing the precision level. The programming language C++ is used for mathematical model implementation. The data for research and development, including the dependent fuel and independent NOx emissions data, were obtained from CEMS software installed on a petrochemical plant.展开更多
In this paper we consider a linear regression model with fixed design. A new rule for the selection of a relevant submodel is introduced on the basis of parameter tests. One particular feature of the rule is that subj...In this paper we consider a linear regression model with fixed design. A new rule for the selection of a relevant submodel is introduced on the basis of parameter tests. One particular feature of the rule is that subjective grading of the model complexity can be incorporated. We provide bounds for the mis-selection error. Simulations show that by using the proposed selection rule, the mis-selection error can be controlled uniformly.展开更多
The purpose of this study was to determine a suitable model for investigating the effects of climate factors on the area burned by forest fire in the Tahe forest region, Daxing'an Mountains, in northeast China. The r...The purpose of this study was to determine a suitable model for investigating the effects of climate factors on the area burned by forest fire in the Tahe forest region, Daxing'an Mountains, in northeast China. The response variables were the area burned by lightning- caused fire, human-caused fire, and total burned area. The predictor variables were nine climate variables collected from the local weather station. Three regression models were utilized, including multiple linear regression, log- linear model (log-transformation on both response and predictor variables), and gamma-generalized linear model. The goodness-of-fit of the models were compared based on model fitting statistics such as R2, AIC, and RMSE. The results revealed that the gamma-generalized linear model was generally superior to both multiple linear regressionmodel and log-linear model for fitting the fire data. Further, the best models were selected based on the criteria that the climate variables were statistically significant at at = 0.05. The gamma best models indicated that maximum wind speed, precipitation, and days that rainfall greater than 0.1 mm had significant impacts on the area burned by the lightning-caused fire, while the mean temperature and minimum relative humidity were the .main drivers of the burned area caused by human activities. Overall, the total burned area by forest fire was significantly influenced by days that rainfall greater than 0.1 mm and minimum rela- tive humidity, indicating that the moisture condition of forest stands determine the burned area by forest fire.展开更多
Although friction characteristics of fault gouge are important to understand reactivation of fault,behavior of earthquake,and mechanism of slope failure,analysis results of fault gouge have low accuracy mostly than th...Although friction characteristics of fault gouge are important to understand reactivation of fault,behavior of earthquake,and mechanism of slope failure,analysis results of fault gouge have low accuracy mostly than those of soils or rocks due to its high heterogeneity and low strength.Therefore,to improve the accuracy of analysis results,we conducted simple regression analysis and structural equation model analysis and selected major influential factors of friction characteristics among many factors,and then we deduced advanced regression model with the highest coefficient of determination(R^(2)) via multiple regression analysis.Whereas most coefficients of determination in simple regression analysis are below0.3-0.4,coefficient of determination in multiple regression analysis is remarkably large as 0.657.展开更多
In current paper, a quantitative structure-activity relationship (QSAR) study was performed for the prediction of acute toxicity of aromatic amines. A set of 56 compounds was randomly divided into a training set of ...In current paper, a quantitative structure-activity relationship (QSAR) study was performed for the prediction of acute toxicity of aromatic amines. A set of 56 compounds was randomly divided into a training set of 46 compounds and a test set of 10 compounds. The electronic and topological descriptors computed by the Scigress package and Dragon software were used as predictor variables. Multiple linear regression (MLR) and support vector machine (SVM) were utilized to build the linear and nonlinear QSAR models, respectively. The obtained models with five descriptors show strong predictive ability. The linear model fits the training set with R2 = 0.71, with higher SVM values of R2 = 0.77. The validation results obtained from the test set indicate that the SVM model is comparable or superior to that obtained by MLR, both in terms of prediction ability and robustness.展开更多
Based on the quantum chemical descriptors,quantitative structure-property relationship(QSPR) models have been developed to estimate and predict the photodegradation rate constant(logK) of polycyclic aromatic hydro...Based on the quantum chemical descriptors,quantitative structure-property relationship(QSPR) models have been developed to estimate and predict the photodegradation rate constant(logK) of polycyclic aromatic hydrocarbons(PAHs) by use of linear method(multiple linear regression,MLR) and non-linear method(back propagation artificial neural network,BP-ANN).A BP-ANN with 3-3-1 architecture was generated by using three quantum chemical descriptors appearing in the MLR model.The standard heat of formation(HOF),the gap of frontier molecular orbital energies(ΔELH) and total energy(TE) were inputs and its output was logK.Leave-One-Out(LOO) Cross-Validated correlation coefficient(R^2CV) of the established MLR and BP-ANN models were 0.6383 and 0.7843,respectively.The nonlinear BP-ANN model has better predictive ability compared to the linear MLR model with the root mean square error(RMSE) for training and validation sets to be 0.1071,0.1514 and the squared correlation coefficient(R^2) of 0.9791,0.9897,respectively.In addition,some insights into the molecular structural features affecting the photodegradation of PAHs were also discussed.展开更多
The reactivity parameters,Q and e,in the Q-e scheme reflect the reactivities of a monomer(or a radical)in free-radical copolymerizations.By applying multiple linear regression(MLR)analysis,the optimal quantitative str...The reactivity parameters,Q and e,in the Q-e scheme reflect the reactivities of a monomer(or a radical)in free-radical copolymerizations.By applying multiple linear regression(MLR)analysis,the optimal quantitative structure-activity relationship(QSAR)model for the reactivity parameter lnQ was developed based on five descriptors(NAF,NOF,EαLUMO,EβHOMO,and EβLUMO)and 69 monomers with the root mean square(rms)error of 0.61.The optimal MLR model of the parameter e obtained from five descriptors(TOcl,NpN,NSO,EαHOMO and DH)and 68 monomers produced rms error of 0.42.Compared with previous models,the two optimal MLR models in this paper show satisfactory statistical characteristics.The feasibility of combining 2D descriptors obtained from the monomers and 3D descriptors calculated from the radical structures(formed from monomers+H )to predict parameters Q and e has been demonstrated.展开更多
目的科学评价芙蓉李果实成熟期间的营养品质,建立色度值表观特征与营养品质的关系。方法以福建省主栽品种芙蓉李为研究对象,对其成熟期间果糖、葡萄糖、蔗糖、苹果酸、奎尼酸、琥珀酸、柠檬酸、富马酸、矢车菊素-3-芸香糖苷、矢车菊素-3...目的科学评价芙蓉李果实成熟期间的营养品质,建立色度值表观特征与营养品质的关系。方法以福建省主栽品种芙蓉李为研究对象,对其成熟期间果糖、葡萄糖、蔗糖、苹果酸、奎尼酸、琥珀酸、柠檬酸、富马酸、矢车菊素-3-芸香糖苷、矢车菊素-3-葡萄糖苷、多酚、黄酮、类胡萝卜素等13个品质指标进行分析和综合评价。结果芙蓉李成熟期间,各品质指标的含量变化存在显著差异(P<0.05),综合运用相关分析、因子分析、绝对因子分析-多元线性回归(absolute principal component scores-multiple linear regression,APCS-MLR)分析筛选可反映芙蓉李综合品质的主要指标。因子分析提取出3个主因子,贡献率分别为52.677%、23.468%、11.649%,累计贡献率为87.794%。综合APCS-MLR等数理统计分析,主因子1主要对果糖、矢车菊素-3-芸香糖苷、矢车菊素-3-葡萄糖苷贡献较大,贡献率分别为53.00%、73.85%、55.54%;主因子2主要对蔗糖、富马酸、果糖、柠檬酸的贡献率较大,分别为28.26%、18.70%、16.14%、15.59%;主因子3主要对多酚(29.13%)和黄酮(28.28%)有较大贡献率;选取3个主因子总贡献率高于60%的果糖、葡萄糖、矢车菊素-3-芸香糖苷、矢车菊素-3-葡萄糖苷作为综合品质评价的主要指标。分别对已筛选出的4个主要评价指标与色度值进行多元线性逐步回归分析,建立4个主要指标与色度值的表观预测模型,各模型均具有较好的拟合度,预测值与实测值的均方根误差较小;进一步验证结果表明,通过色度值对4个指标的预测具有较高的可靠性和准确性。结论本研究筛选出的主要指标及预测模型可更加简单、便捷地评价芙蓉李果实成熟期间的综合品质。展开更多
基金supported by the National Natural Science Foundation of China(71071077)the Ministry of Education Key Project of National Educational Science Planning(DFA090215)+1 种基金China Postdoctoral Science Foundation(20100481137)Funding of Jiangsu Innovation Program for Graduate Education(CXZZ11-0226)
文摘The construction method of background value is improved in the original multi-variable grey model (MGM(1,m)) from its source of construction errors. The MGM(1,m) with optimized background value is used to eliminate the random fluctuations or errors of the observational data of all variables, and the combined prediction model together with the multiple linear regression is established in order to improve the simulation and prediction accuracy of the combined model. Finally, a combined model of the MGM(1,2) with optimized background value and the binary linear regression is constructed by an example. The results show that the model has good effects for simulation and prediction.
基金Thank you for your valuable comments and suggestions.This research was supported by Yunnan applied basic research project(NO.2017FD150)Chuxiong Normal University General Research Project(NO.XJYB2001).
文摘This paper selects seven indicators of financial revenue and housing sales price in recent 19 years in China,and uses SPSS and Excel to carry out descriptive statistics,independent sample t-test,correlation analysis and regression analysis to comprehensively study the correlation between financial revenue and housing sales price in China,and establishes the relationship between financial revenue and housing sales price When the average selling price of commercial housing increases by one unit,the fiscal revenue will increase by 27.855 points.
基金The National Natural Science Foundation of China under contract No.11174235the Science and Technology Development Project of Shaanxi Province of China under contract No.2010KJXX-02+2 种基金the Program for New Century Excellent Talents in University of China under contract No. NCET-08-0455the Science and Technology Innovation Foundation of Northwestern Polytechnical University of Chinathe Doctorate Foundation of Northwestern Polytechnical University of China under contract No.CX201226.
文摘Multiple linear regression (MLR) method was applied to quantify the effects of the net heat flux (NHF), the net freshwater flux (NFF) and the wind stress on the mixed layer depth (MLD) of the South China Sea (SCS) based on the simple ocean data assimilation (SODA) dataset. The spatio-temporal distributions of the MLD, the buoyancy flux (combining the NHF and the NFF) and the wind stress of the SCS were presented. Then using an oceanic vertical mixing model, the MLD after a certain time under the same initial conditions but various pairs of boundary conditions (the three factors) was simulated. Applying the MLR method to the results, regression equations which modeling the relationship between the simulated MLD and the three factors were calculated. The equations indicate that when the NHF was negative, it was the primary driver of the mixed layer deepening; and when the NHF was positive, the wind stress played a more important role than that of the NHF while the NFF had the least effect. When the NHF was positive, the relative quantitative effects of the wind stress, the NHF, and the NFF were about i0, 6 and 2. The above conclusions were applied to explaining the spatio-temporal distributions of the MLD in the SCS and thus proved to be valid.
基金Supported by Inner Mongolia Natural Science Foundation(200711020112)Innovation Fundation of Inner Mongolia University of Science and Technology (2009NC064)~~
文摘Based on protein-DNA complex crystal structural data in up-to-date Nucleic Acid Database,the related parameters of DNA Kinetic Structure were investigated by Monte-Carlo Multiple Integrals on the base of modified DNA structure statistical mechanical model,and time complexity and precision were analyzed on the calculated results.
文摘Rivers are important systems which provide water to fulfill human needs. However, excessive human uses over the years have led to deterioration in quality of river causing, causing health problems from contaminated water. This study focuses on the application of statistical techniques, Multiple Linear Regression model and MANOVA to assess health impacts due to pollution in Cauvery river stretch in Srirangapatna. In this study, using Multiple Linear Regression, it is found that health impact level is 60.8% dependent on water quality parameters of BOD, COD, TDS, TC and FC. The t-statistics and their associated 2-tailed p-values indicate that COD and TDS produces health impacts compared to BOD, TC and FC, when their effects are put together across all the six sampling stations in Srirangapatna. Further Pearson correlation Matrix shows highly significant positive correlation amongst parameters across all stations indicating possibility of common sources of origin that might be anthropogenic. Also graphs are plotted for individual parameters across all stations and it reveals that COD and TDS values are significant across all sampling stations, though their values are higher in impact stations, causing health impacts.
文摘Predictive Emission Monitoring Systems (PEMS) offer a cost-effective and environmentally friendly alternative to Continuous Emission Monitoring Systems (CEMS) for monitoring pollution from industrial sources. Multiple regression is one of the fundamental statistical techniques to describe the relationship between dependent and independent variables. This model can be effectively used to develop a PEMS, to estimate the amount of pollution emitted by industrial sources, where the fuel composition and other process-related parameters are available. It often makes them sufficient to predict the emission discharge with acceptable accuracy. In cases where PEMS are accepted as an alternative method to CEMS, which use gas analyzers, they can provide cost savings and substantial benefits for ongoing system support and maintenance. The described mathematical concept is based on the matrix algebra representation in multiple regression involving multiple precision arithmetic techniques. Challenging numerical examples for statistical big data analysis, are investigated. Numerical examples illustrate computational accuracy and efficiency of statistical analysis due to increasing the precision level. The programming language C++ is used for mathematical model implementation. The data for research and development, including the dependent fuel and independent NOx emissions data, were obtained from CEMS software installed on a petrochemical plant.
文摘In this paper we consider a linear regression model with fixed design. A new rule for the selection of a relevant submodel is introduced on the basis of parameter tests. One particular feature of the rule is that subjective grading of the model complexity can be incorporated. We provide bounds for the mis-selection error. Simulations show that by using the proposed selection rule, the mis-selection error can be controlled uniformly.
基金funded by Asia-Pacific Forests Net(APFNET/2010/FPF/001)National Natural Science Foundation of China(Grant No.31400552)Forestry industry research special funds for public welfare projects(201404402)
文摘The purpose of this study was to determine a suitable model for investigating the effects of climate factors on the area burned by forest fire in the Tahe forest region, Daxing'an Mountains, in northeast China. The response variables were the area burned by lightning- caused fire, human-caused fire, and total burned area. The predictor variables were nine climate variables collected from the local weather station. Three regression models were utilized, including multiple linear regression, log- linear model (log-transformation on both response and predictor variables), and gamma-generalized linear model. The goodness-of-fit of the models were compared based on model fitting statistics such as R2, AIC, and RMSE. The results revealed that the gamma-generalized linear model was generally superior to both multiple linear regressionmodel and log-linear model for fitting the fire data. Further, the best models were selected based on the criteria that the climate variables were statistically significant at at = 0.05. The gamma best models indicated that maximum wind speed, precipitation, and days that rainfall greater than 0.1 mm had significant impacts on the area burned by the lightning-caused fire, while the mean temperature and minimum relative humidity were the .main drivers of the burned area caused by human activities. Overall, the total burned area by forest fire was significantly influenced by days that rainfall greater than 0.1 mm and minimum rela- tive humidity, indicating that the moisture condition of forest stands determine the burned area by forest fire.
基金supported by Postdoctoral Fellowship Program funded by the Ministry of Education of the Republic of Korea through the Chungbuk National University in 2020。
文摘Although friction characteristics of fault gouge are important to understand reactivation of fault,behavior of earthquake,and mechanism of slope failure,analysis results of fault gouge have low accuracy mostly than those of soils or rocks due to its high heterogeneity and low strength.Therefore,to improve the accuracy of analysis results,we conducted simple regression analysis and structural equation model analysis and selected major influential factors of friction characteristics among many factors,and then we deduced advanced regression model with the highest coefficient of determination(R^(2)) via multiple regression analysis.Whereas most coefficients of determination in simple regression analysis are below0.3-0.4,coefficient of determination in multiple regression analysis is remarkably large as 0.657.
基金Supported by the Ministry of Environmental Protection of China(No.2011467037)
文摘In current paper, a quantitative structure-activity relationship (QSAR) study was performed for the prediction of acute toxicity of aromatic amines. A set of 56 compounds was randomly divided into a training set of 46 compounds and a test set of 10 compounds. The electronic and topological descriptors computed by the Scigress package and Dragon software were used as predictor variables. Multiple linear regression (MLR) and support vector machine (SVM) were utilized to build the linear and nonlinear QSAR models, respectively. The obtained models with five descriptors show strong predictive ability. The linear model fits the training set with R2 = 0.71, with higher SVM values of R2 = 0.77. The validation results obtained from the test set indicate that the SVM model is comparable or superior to that obtained by MLR, both in terms of prediction ability and robustness.
基金supported by the Natural Science Foundation of Fujian Province (D0710019)the Natural Science Foundation of Overseas Chinese Affairs Office of the State Council (06QZR09)
文摘Based on the quantum chemical descriptors,quantitative structure-property relationship(QSPR) models have been developed to estimate and predict the photodegradation rate constant(logK) of polycyclic aromatic hydrocarbons(PAHs) by use of linear method(multiple linear regression,MLR) and non-linear method(back propagation artificial neural network,BP-ANN).A BP-ANN with 3-3-1 architecture was generated by using three quantum chemical descriptors appearing in the MLR model.The standard heat of formation(HOF),the gap of frontier molecular orbital energies(ΔELH) and total energy(TE) were inputs and its output was logK.Leave-One-Out(LOO) Cross-Validated correlation coefficient(R^2CV) of the established MLR and BP-ANN models were 0.6383 and 0.7843,respectively.The nonlinear BP-ANN model has better predictive ability compared to the linear MLR model with the root mean square error(RMSE) for training and validation sets to be 0.1071,0.1514 and the squared correlation coefficient(R^2) of 0.9791,0.9897,respectively.In addition,some insights into the molecular structural features affecting the photodegradation of PAHs were also discussed.
基金supported by the National Natural Science Foundation of China(No.21472040)the Scientific Research Fund of Hunan Education Department(Nos.16A047 and 18A344)the Open Project Program of Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration(Hunan Institute of Engineering)(2018KF11)
文摘The reactivity parameters,Q and e,in the Q-e scheme reflect the reactivities of a monomer(or a radical)in free-radical copolymerizations.By applying multiple linear regression(MLR)analysis,the optimal quantitative structure-activity relationship(QSAR)model for the reactivity parameter lnQ was developed based on five descriptors(NAF,NOF,EαLUMO,EβHOMO,and EβLUMO)and 69 monomers with the root mean square(rms)error of 0.61.The optimal MLR model of the parameter e obtained from five descriptors(TOcl,NpN,NSO,EαHOMO and DH)and 68 monomers produced rms error of 0.42.Compared with previous models,the two optimal MLR models in this paper show satisfactory statistical characteristics.The feasibility of combining 2D descriptors obtained from the monomers and 3D descriptors calculated from the radical structures(formed from monomers+H )to predict parameters Q and e has been demonstrated.
文摘目的科学评价芙蓉李果实成熟期间的营养品质,建立色度值表观特征与营养品质的关系。方法以福建省主栽品种芙蓉李为研究对象,对其成熟期间果糖、葡萄糖、蔗糖、苹果酸、奎尼酸、琥珀酸、柠檬酸、富马酸、矢车菊素-3-芸香糖苷、矢车菊素-3-葡萄糖苷、多酚、黄酮、类胡萝卜素等13个品质指标进行分析和综合评价。结果芙蓉李成熟期间,各品质指标的含量变化存在显著差异(P<0.05),综合运用相关分析、因子分析、绝对因子分析-多元线性回归(absolute principal component scores-multiple linear regression,APCS-MLR)分析筛选可反映芙蓉李综合品质的主要指标。因子分析提取出3个主因子,贡献率分别为52.677%、23.468%、11.649%,累计贡献率为87.794%。综合APCS-MLR等数理统计分析,主因子1主要对果糖、矢车菊素-3-芸香糖苷、矢车菊素-3-葡萄糖苷贡献较大,贡献率分别为53.00%、73.85%、55.54%;主因子2主要对蔗糖、富马酸、果糖、柠檬酸的贡献率较大,分别为28.26%、18.70%、16.14%、15.59%;主因子3主要对多酚(29.13%)和黄酮(28.28%)有较大贡献率;选取3个主因子总贡献率高于60%的果糖、葡萄糖、矢车菊素-3-芸香糖苷、矢车菊素-3-葡萄糖苷作为综合品质评价的主要指标。分别对已筛选出的4个主要评价指标与色度值进行多元线性逐步回归分析,建立4个主要指标与色度值的表观预测模型,各模型均具有较好的拟合度,预测值与实测值的均方根误差较小;进一步验证结果表明,通过色度值对4个指标的预测具有较高的可靠性和准确性。结论本研究筛选出的主要指标及预测模型可更加简单、便捷地评价芙蓉李果实成熟期间的综合品质。