Funding: This work was supported by the State Key Laboratory of Chemo/Biosensing and Chemometrics Foundation (No. 05-12-1), the Fok-Yingtung Educational Foundation (No. 98-7-6) and the Chongqing University Innovation Foundation of Science and Technology (No. 06-1-1).
Abstract: Six atomic fragment types of organic compounds are defined, and a multilevel atom-pair frequency matrix is constructed from the number of occurrences of pairs of atomic fragments at different bond lengths in the molecule. On this basis, a novel molecular coding technique, the characteristic atom-pair holographic code (CAHC), is obtained. The method combines several benefits: like 2D molecular topological descriptors, it is easy to calculate and simple to operate; like 3D molecular structural characterization methods, it has a definite physicochemical meaning; and it can capture complex molecular information. It is therefore appropriate for studies of quantitative structure-property/activity relationships (QSPR/QSAR) of drugs and biological molecules. In this paper, CAHC is applied to the quantitative prediction of reversed-phase liquid chromatography (RPLC) retention data of 33 purine derivatives and 24 steroids. For the resulting partial least-squares (PLS) regression models, the fitted multiple correlation coefficient R2, the cross-validated multiple correlation coefficient Q2 and the predictive ability Q2pred over the test-set samples are 0.990, 0.893 and 0.977, 0.897, 0.941, respectively.
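The counting step behind an atom-pair frequency matrix can be illustrated with a minimal sketch. It is not the authors' CAHC implementation: the abstract does not list the six fragment types, so plain element symbols stand in for them, and "bond length" is read here as topological distance (the number of bonds on the shortest path), which is an assumption. The function names and the toy molecule are hypothetical.

```python
from collections import Counter, deque


def shortest_path_lengths(adjacency, start):
    """Breadth-first search: number of bonds from `start` to each reachable atom."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbour in adjacency[node]:
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    return dist


def atom_pair_counts(atom_types, bonds, max_distance):
    """Count unordered (type_a, type_b) pairs at each topological distance.

    atom_types: one label per atom (assumed stand-ins for fragment types).
    bonds: list of (i, j) index pairs describing the molecular graph.
    """
    adjacency = {i: [] for i in range(len(atom_types))}
    for a, b in bonds:
        adjacency[a].append(b)
        adjacency[b].append(a)

    counts = Counter()
    for i in range(len(atom_types)):
        for j, d in shortest_path_lengths(adjacency, i).items():
            # j > i counts each pair once and skips the atom itself.
            if j > i and d <= max_distance:
                pair = tuple(sorted((atom_types[i], atom_types[j])))
                counts[(pair, d)] += 1
    return counts


# Toy example (hypothetical): the heavy-atom skeleton C-C-O.
atom_types = ["C", "C", "O"]
bonds = [(0, 1), (1, 2)]
counts = atom_pair_counts(atom_types, bonds, max_distance=2)
for (pair, distance), n in sorted(counts.items()):
    print(pair, distance, n)
```

Flattening such counts into a fixed-order vector (one entry per type pair and distance level) gives a descriptor row that can be fed to a regression method such as PLS.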
Funding: Supported by the Shanghai Education Committee Project (No. 11YZ224) and the Shanghai Leading Academic Discipline Project (No. J51503).
Abstract: In the present study, a quantitative structure-retention relationship (QSRR) study was carried out for volatile components from Rosa banksiae Ait. based on various quantum-chemical and physicochemical descriptors derived by the B3LYP method. The QSRR models were built by stepwise multiple linear regression (MLR). The resulting models have good predictive ability and high statistical significance, with correlation coefficients R2 ≥ 0.734 and p values far below 0.05. Preliminary results indicate that the models will be helpful, especially for predicting the GC retention time and linear retention index of volatile components from Rosa banksiae Ait. The models also help identify the quantum-chemical and physicochemical descriptors responsible for the retention time and linear retention index. The shape attribute (ShpA) and logP were found to play a vital role in determining a component's GC retention time and linear retention index, both of which increase with the lipophilicity of the volatile components. The larger the shape attribute of an analyte, the larger its deformability, the stronger its interaction with the stationary phase, the longer its GC retention time and the larger its linear retention index. EHOMO, q+ and SEV also appear in the models, but they are not dominant.
Funding: Supported by the Foundation of Returned Scholars (Main Program) of Shanxi Province (200902).
Abstract: Polychlorinated dibenzothiophenes (PCDTs) are classified as persistent organic pollutants in the environment, so the analysis of PCDTs by their gas-chromatographic behavior is of great significance. Quantitative structure-retention relationship (QSRR) analysis is a useful technique for relating chromatographic retention time to molecular structure. In this paper, a QSRR study of 37 PCDTs was carried out using molecular electronegativity distance vector (MEDV) descriptors with multiple linear regression (MLR) and partial least-squares (PLS) regression. The correlation coefficient R of the fitted model, the leave-one-out (LOO) cross-validation (CV) value and Q2ext were 0.9951, 0.9942 and 0.9839 for MLR and 0.9925, 0.9915 and 0.9833 for PLS, respectively. The results show that the models estimate the internal sample set very well and predict the external sample set well. With MEDV descriptors, the QSRR model provides a simple and rapid way to predict the gas-chromatographic retention indices of polychlorinated dibenzothiophenes when standard samples are lacking or experimental conditions are poor.
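Leave-one-out cross-validation of an MLR model, as used in this abstract, can be sketched as follows. The data are synthetic stand-ins (37 samples and 3 descriptors chosen only for illustration), the function names are hypothetical, and Q2 is computed with one common definition, 1 - PRESS/SS_total; the abstract does not state which definition the authors used.

```python
import numpy as np


def fit_mlr(X, y):
    """Ordinary least squares with an intercept; returns the coefficient vector."""
    design = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef


def predict_mlr(coef, X):
    """Apply fitted coefficients (intercept first) to a descriptor matrix."""
    design = np.column_stack([np.ones(len(X)), X])
    return design @ coef


def loo_q2(X, y):
    """Leave-one-out Q^2: refit without each sample, then predict that sample."""
    n = len(y)
    preds = np.empty(n)
    for i in range(n):
        keep = np.arange(n) != i
        coef = fit_mlr(X[keep], y[keep])
        preds[i] = predict_mlr(coef, X[i:i + 1])[0]
    press = np.sum((y - preds) ** 2)       # predictive residual sum of squares
    ss_total = np.sum((y - y.mean()) ** 2)
    return 1.0 - press / ss_total


# Synthetic descriptors and responses (assumed, not data from the paper).
rng = np.random.default_rng(0)
X = rng.normal(size=(37, 3))
y = 2.0 + X @ np.array([1.5, -0.7, 0.3]) + rng.normal(scale=0.05, size=37)

coef = fit_mlr(X, y)
residual_ss = np.sum((y - predict_mlr(coef, X)) ** 2)
r2 = 1.0 - residual_ss / np.sum((y - y.mean()) ** 2)
q2 = loo_q2(X, y)
print(f"R2 = {r2:.4f}, Q2(LOO) = {q2:.4f}")
```

For ordinary least squares the LOO value can never exceed the fitted R2, so a small gap between the two, as reported in the abstract, is the usual sign of a stable model.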
Funding: Supported by the Science and Technology Project of Zhejiang Province (2016C33039), the Public Technology Research Project (Analysis and Measurement) of Zhejiang Province (LGC19B070004), the State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences (KF2018-15), and the Natural Science Foundation of Zhejiang Province (LY18C030003).
Abstract: Polychlorinated dibenzothiophenes (PCDTs) and their corresponding sulfone (PCDTO2) compounds are an important group of persistent organic pollutants. In the present study, geometry optimization and subsequent calculation of electrostatic potentials (ESPs) on the molecular surface were performed for all 135 PCDT and 135 PCDTO2 congeners at the HF/6-31G* level of theory, and a number of statistically based parameters were extracted. Linear relationships between the gas-chromatographic retention index (RI) and the structural descriptors were established by multiple linear regression. The results show that two descriptors derived from the positive electrostatic potential on the molecular surface, ■ and π, together with the molecular volume (Vmc) and the energy of the lowest unoccupied molecular orbital (ELUMO), express the quantitative structure-retention relationship (QSRR) of PCDTs and PCDTO2 well. The predictive capability of the two models was demonstrated by leave-one-out cross-validation, with cross-validated correlation coefficients (RCV) of 0.996 and 0.997, respectively. The predictive power of the models was further examined on an external test set: the correlation coefficients (R) between the observed and predicted RI values are 0.997 and 0.998, respectively, confirming the robustness and good predictive ability of the models. The established QSRR models may provide a powerful method for predicting the chromatographic properties of aromatic organosulfur compounds.
Abstract: The capacity factors (k') of fourteen halogenated thiophenols in methanol-water eluents of different compositions were determined by reversed-phase high-performance liquid chromatography (RP-HPLC), and the relationship between the logarithm of the capacity factor, lgK', and the methanol ratio ψ was analyzed. A good linear relationship was found between lgK' and ψ, with the correlation coefficients R2 of the fitted linear equations all greater than 0.990. The chromatographic parameter lgKw', obtained by extrapolation to pure water, shows a good linear correlation (R2 = 0.956) with the n-octanol/water partition coefficient lgKow obtained by the group contribution method. The structural parameters of the fourteen halogenated thiophenols were calculated by DFT, and the correlation equation between lgKw' and the structural parameters was obtained with SPSS: lgKw' = -0.409 + 0.039α, with R2 = 0.981, meaning that lgKw' is mainly determined by the polarizability α.
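The extrapolation to pure water described in this abstract amounts to fitting a straight line of lgK' against ψ and reading off the intercept at ψ = 0. A minimal sketch, using made-up measurements for a single hypothetical compound (none of the numbers come from the paper):

```python
import numpy as np

# Hypothetical measurements: methanol volume fraction and lgK' at each composition.
psi = np.array([0.50, 0.60, 0.70, 0.80])
lg_k = np.array([1.21, 0.84, 0.51, 0.15])

# Straight-line fit lgK' = intercept + slope * psi.
slope, intercept = np.polyfit(psi, lg_k, 1)

# Coefficient of determination of the linear fit.
fitted = intercept + slope * psi
r2 = 1.0 - np.sum((lg_k - fitted) ** 2) / np.sum((lg_k - lg_k.mean()) ** 2)

# The intercept is the value extrapolated to pure water (psi = 0), i.e. lgKw'.
lg_kw = intercept
print(f"slope = {slope:.3f}, lgKw' = {lg_kw:.3f}, R2 = {r2:.4f}")
```

Repeating the fit for each compound yields one lgKw' per compound, and those values are what the abstract then correlates with lgKow and with the polarizability α.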
Funding: This work was financially supported by the National Basic Research Program of China (2003CB415002), the China Postdoctoral Science Foundation (No. 2003033486) and the Natural Science Research Fund of University in Jiangsu (04KJB150149).
Abstract: Twenty-eight alkyl(1-phenylsulfonyl) cycloalkane carboxylates were computed at the B3LYP/6-31G* level. Based on linear solvation energy theory, two quantitative correlation equations were developed that relate the molecular structures of the alkyl(1-phenylsulfonyl) cycloalkane carboxylate compounds to their chromatographic retention (capacity factor lgKW) and to their toxicity to Photobacterium phosphoreum (-lgEC50), using the molecular structural parameters as theoretical descriptors (r2 = 0.9501 and 0.9488). The two equations were then cross-validated by the leave-one-out (LOO) method, with q2 of 0.9113 and 0.9281, respectively. The results show that both equations obtained in this work with B3LYP/6-31G* are superior to those obtained with AM1 and can be used to predict the lgKW and -lgEC50 of congeneric organic compounds.
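Abstract: Fatty acids in edible vegetable oils were converted to the corresponding fatty acid methyl esters, whose molecular structures were characterized with the Steric and Electronic Descriptors (SEDs). Multiple linear regression (MLR) was then used to build a quantitative structure-retention relationship (QSRR) model for the fatty acids (methyl esters) in edible vegetable oils, and the stability and predictive ability of the model were analyzed and verified by both internal and external validation. The correlation coefficients for the fitted values, the leave-one-out (LOO) cross-validation (CV) predictions and the external-sample predictions (R, R LOO and Q2ext) were 0.9990, 0.9970 and 0.9860, respectively. The results show that the SEDs characterize the structural information of fatty acid methyl ester molecules in edible vegetable oils well, and that the QSRR model has good stability and predictive ability, providing a convenient and effective new route for the indirect analysis and identification of fatty acids in edible vegetable oils.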