Polychlorinated dibenzothiophenes(PCDTs) are classified as persistent organic pollutants in the environment,so the analysis of PCDTs by their gas chromatographic behaviors is of great significance.Quantitative struc...Polychlorinated dibenzothiophenes(PCDTs) are classified as persistent organic pollutants in the environment,so the analysis of PCDTs by their gas chromatographic behaviors is of great significance.Quantitative structure-retention relationship(QSRR) analysis is a useful technique capable of relating chromatographic retention time to the molecular structure.In this paper,a QSRR study of 37 PCDTs was carried out by using molecular electronegativity distance vector(MEDV) descriptors and multiple linear regression(MLR) and partial least-squares regression(PLS) methods.The correlation coefficient R of established MLR,PLS models,leave-one-out(LOO) cross-validation(CV),Q2ext were 0.9951,0.9942,0.9839(MLR) and 0.9925,0.9915,0.9833(PLS),respectively.Results showed that the model exhibited excellent estimate capability for internal sample set and good predictive capability for external sample set.By using MEDV descriptors,the QSRR model can provide a simple and rapid way to predict the gas-chromatographic retention indices of polychlorinated dibenzothiophenes in conditions of lacking standard samples or poor experimental conditions.展开更多
In the present study,(QSRR) study had been carried out for volatile components from Rosa banksiae Ait.based on various quantum-chemical and physicochemical descriptors derived by B3LYP method.To build QSRR models,a ...In the present study,(QSRR) study had been carried out for volatile components from Rosa banksiae Ait.based on various quantum-chemical and physicochemical descriptors derived by B3LYP method.To build QSRR models,a multiple linear regression (MLR) stepwise method was used.The generated models have good predictive ability and are of high statistical significance with good correlation coefficients (R2≥0.734) and p values far less than 0.05.Preliminary results indicated that the application of the models,especially the prediction of GC retention time and linear retention index of volatile components from Rosa banksiae Ait.,will be helpful.The models contribute also to the identification of important quantum-chemical and physicochemical descriptors responsible for the retention time and linear retention index.It was found that the shape attribute (ShpA) and logP value play a vital role in determining component’s GC retention time and linear retention index which increase with the lipophilicity of volatile components.The larger the shape attribute of analyte is,the larger the deformability is,the stronger the interaction between analyte and stationary phase is,and the longer the GC retention time is,the larger the linear retention index is.The importance of E HOMO,q+,and SEV is also embodied in models,but they are not dominant.展开更多
Polychlorinated dibenzothiophenes(PCDTs)and their corresponding sulfone(PCDTO2)compounds are a group of important persistent organic pollutants.In the present study,geometrical optimization and subsequent calculat...Polychlorinated dibenzothiophenes(PCDTs)and their corresponding sulfone(PCDTO2)compounds are a group of important persistent organic pollutants.In the present study,geometrical optimization and subsequent calculations of electrostatic potentials(ESPs)on molecular surface have been performed for all 135 PCDTs and 135 PCDTO2 congeners at the HF/6-31G*level of theory.A number of statistically-based parameters have been extracted.Linear relationship between gas-chromatographic retention index(RI)and the structural descriptors have been established by multiple linear regression.The result shows that two descriptors derived from positive electrostatic potential on molecular surface, ■ and π,together with the molecular volume(Vmc)and the energy of the lowest unoccupied molecular orbital(ELUMO)can be well used to express the quantitative structure-retention relationship(QSRR)of PCDTs and PCDTO2.Predictive capability of the two models has been demonstrated by leave-one-out cross-validation with the cross-validated correlation coefficient(RCV)of 0.996 and 0.997,respectively.Furthermore,the predictive power of the models is further examined for the external test set.Correlation coefficients(R)between the observed and predicted RI values for the external test set are 0.997 and0.998,respectively,validating the robustness and good prediction of our model.The QSRR model established may provide again a powerful method for predicting chromatographic properties of aromatic organosulfur compounds.展开更多
A newly developed descriptor, three- dimensional holographic vector of atomic interaction field (3D-HoVAIF), was used to describe the chemical structures of purine bases. After variable screening by stepwise multiple ...A newly developed descriptor, three- dimensional holographic vector of atomic interaction field (3D-HoVAIF), was used to describe the chemical structures of purine bases. After variable screening by stepwise multiple regression (SMR) technique, a partial least square (PLS) regression model was built with 3D-HoVAIF. The model was satisfactory com- paring to reference since correlation coefficients of molecular modeling ( Rc 2um), cross- validation ( Qc 2um) and standard deviation of estimation (SD) were 0.966, 0.860 and 0.112, respectively, showing that the model had favorable estimation and prediction capa- bilities. It was illustrated that information related to retention data of purine bases could preferably be expressed by 3D-HoVAIF with definite physico- chemical meanings and easy structural interpretation for purine bases. It was illustrated that 3D-HoVAIF was to preferably express retention data of purine bases and had definite physicochemical significance. So 3D-HoVAIF was a useful structural expression technique for quantitative structure activity (or prop- erty or retention) relationships (QSAR/QSPR/QSRR) study, such as structural characterization and chro- matographic retention prediction.展开更多
An integrated approach is proposed to predict the chromatographic retention time of oligonucleotides based on quantitative structure-retention relationships (QSRR) models. First, the primary base sequences of oligon...An integrated approach is proposed to predict the chromatographic retention time of oligonucleotides based on quantitative structure-retention relationships (QSRR) models. First, the primary base sequences of oligonucleotides are translated into vectors based on scores of generalized base properties (SGBP), involving physicochemical, quantum chemical, topological, spatial structural properties, etc.; thereafter, the sequence data are transformed into a uniform matrix by auto cross covariance (ACC). ACC accounts for the interactions between bases at a certain distance apart in an oligonucleotide sequence; hence, this method adequately takes the neighboring effect into account. Then, a genetic algorithm is used to select the variables related to chromatographic retention behavior of oligonuclcotides. Finally, a support vector machine is used to develop QSRR models to predict chromatographic retention behavior. The whole dataset is divided into pairs of training sets and test sets with different proportions; as a result, it has been found that the QSRR models using more than 26 training samples have an appropriate external power, and can accurately represent the relationship between the features of sequences and structures, and the retention times. The results indicate that the SGBP-ACC approach is a useful structural representation method in QSRR of oligonucleotides due to its many advantages such as plentiful structural information, easy manipulation and high characterization competence. Moreover, the method can further be applied to predict chromatographic retention behavior of oligonucleotides.展开更多
Based on the identical group as a pseudo atom instead of a typical atom, a novel modified molecular dis-tance-edge (MDE) vector μ was developed in our laboratory to characterize chemical structure of polychlorinated ...Based on the identical group as a pseudo atom instead of a typical atom, a novel modified molecular dis-tance-edge (MDE) vector μ was developed in our laboratory to characterize chemical structure of polychlorinated diben-zofurans (PCDFs) congeners and/or isomers. Quantitative structure-retention relationships (QSRRs) between the new VMDE parameters and gas chromatographic (GC) retention behavior of PCDFs were then generated by multiple linear regression (MLR) method for non-polar, moderately polar, and polar stationary phases. Four excellent models with high correlation coefficients, R=0.984-0.995, were proposed for non-polar columns (DB-5, SE-54, OV-101). For the moder-ately polar columns (OV-1701), the correlation coefficient of the developed good model is only 0.958. For the polar col-umns (SP-2300), the QSRR model is poor with R=0.884. Then cross validation with leave-one out of procedure (CV) is performed in high correlation with the non-polar (Rcv=992-0.974) and weakly polar (Rcv=921) columns and in little cor-relation (Rcv=0.834) with the polar columns. These results show that the new μ vector is suitable for describing the re-tention behaviors of PCDFs on non-polar and moderately polar stationary phases and not for the various gas chroma-tographic retention behaviors of PCDFs on the different po-larity-varying stationary phases.展开更多
Acylcarnitines are metabolic intermediates of fatty acids and branched-chain amino acids having vital biofunctions and pathophysiological significances. Here, we developed a high-throughput method for quantifying hund...Acylcarnitines are metabolic intermediates of fatty acids and branched-chain amino acids having vital biofunctions and pathophysiological significances. Here, we developed a high-throughput method for quantifying hundreds of acylcarnitines in one run using ultrahigh performance liquid chromatography and tandem mass spectrometry (UPLC-MS/MS). This enabled simultaneous quantification of 1136 acylcarnitines (C0–C26) within 10-min with good sensitivity (limit of detection < 0.7 fmol), linearity (correlation coefficient > 0.992), accuracy (relative error < 20%), precision (coefficient of variation (CV), CV < 15%), stability (CV < 15%), and inter-technician consistency (CV < 20%, n = 6). We also established a quantitative structure-retention relationship (goodness of fit > 0.998) for predicting retention time (tR) of acylcarnitines with no standards and built a database of their multiple reaction monitoring parameters (tR, ion-pairs, and collision energy). Furthermore, we quantified 514 acylcarnitines in human plasma and urine, mouse kidney, liver, heart, lung, and muscle. This provides a rapid method for quantifying acylcarnitines in multiple biological matrices.展开更多
Atoms in most organic molecules are often carbon,oxygen,nitrogen,sulfur,halogens,etc. Based on the three-dimensional structure of a molecule,a molecular structural characterization(MSC) method called improved molecu...Atoms in most organic molecules are often carbon,oxygen,nitrogen,sulfur,halogens,etc. Based on the three-dimensional structure of a molecule,a molecular structural characterization(MSC) method called improved molecular electronegativity-distance vector(I-MEDV) was developed. It was used to describe the structures of 37 compounds of styrax japonicus sieb flowers. Through multiple linear regression(MLR),a QSRR model was built up. The correlation coefficient(R1) of the model was 0.980. Then,4 vectors were selected to build another model through the method of stepwise multiple regression(SMR) ,and the correlation coefficient(R2) of the model was 0.975. Moreover,all the two models were evaluated by performing the crossvalidation with the leave-one-out(LOO) procedure and the correlation coefficients(Rcv) were 0.948 and 0.968,respectively. The results show that the I-MEDV could successfully describe the structures of organic compounds. The stability and predictability of the models were good.展开更多
将食用植物油中的脂肪酸转化为相应的脂肪酸甲酯,并采用立体结构参数Steric and Electronic Descriptors(SEDs)表征其分子结构,然后运用多元线性回归(MLR)方法,建立了预测食用植物油中脂肪酸(甲酯)的定量结构-色谱保留相关(QSRR)模型,...将食用植物油中的脂肪酸转化为相应的脂肪酸甲酯,并采用立体结构参数Steric and Electronic Descriptors(SEDs)表征其分子结构,然后运用多元线性回归(MLR)方法,建立了预测食用植物油中脂肪酸(甲酯)的定量结构-色谱保留相关(QSRR)模型,同时采用内部及外部双重验证的方法对所建模型的稳定性能和预测能力进行了分析和验证。建模计算值、留一法(LOO)交互检验(CV)预测值和外部样本预测值的相关系数R、R LOO、Q2ext分别为0.9990、0.9970、0.9860。结果表明,SEDs参数能较好地表征食用植物油中的脂肪酸甲酯分子的结构信息,所建立的QSRR模型具有良好的稳定性和预测能力,为间接分析鉴定食用植物油中脂肪酸提供了一种方便有效的新途径。展开更多
基金supported by the Foundation of Returned Scholars (Main Program) of Shanxi Province (200902)
文摘Polychlorinated dibenzothiophenes(PCDTs) are classified as persistent organic pollutants in the environment,so the analysis of PCDTs by their gas chromatographic behaviors is of great significance.Quantitative structure-retention relationship(QSRR) analysis is a useful technique capable of relating chromatographic retention time to the molecular structure.In this paper,a QSRR study of 37 PCDTs was carried out by using molecular electronegativity distance vector(MEDV) descriptors and multiple linear regression(MLR) and partial least-squares regression(PLS) methods.The correlation coefficient R of established MLR,PLS models,leave-one-out(LOO) cross-validation(CV),Q2ext were 0.9951,0.9942,0.9839(MLR) and 0.9925,0.9915,0.9833(PLS),respectively.Results showed that the model exhibited excellent estimate capability for internal sample set and good predictive capability for external sample set.By using MEDV descriptors,the QSRR model can provide a simple and rapid way to predict the gas-chromatographic retention indices of polychlorinated dibenzothiophenes in conditions of lacking standard samples or poor experimental conditions.
基金Supported by Shanghai Education Committee Project (No. 11YZ224)Shanghai Leading Academic Discipline Project (No. J51503)
文摘In the present study,(QSRR) study had been carried out for volatile components from Rosa banksiae Ait.based on various quantum-chemical and physicochemical descriptors derived by B3LYP method.To build QSRR models,a multiple linear regression (MLR) stepwise method was used.The generated models have good predictive ability and are of high statistical significance with good correlation coefficients (R2≥0.734) and p values far less than 0.05.Preliminary results indicated that the application of the models,especially the prediction of GC retention time and linear retention index of volatile components from Rosa banksiae Ait.,will be helpful.The models contribute also to the identification of important quantum-chemical and physicochemical descriptors responsible for the retention time and linear retention index.It was found that the shape attribute (ShpA) and logP value play a vital role in determining component’s GC retention time and linear retention index which increase with the lipophilicity of volatile components.The larger the shape attribute of analyte is,the larger the deformability is,the stronger the interaction between analyte and stationary phase is,and the longer the GC retention time is,the larger the linear retention index is.The importance of E HOMO,q+,and SEV is also embodied in models,but they are not dominant.
基金supported by the Science and Technology Project of Zhejiang Province(2016C33039)the Public Technology Research Project(Analysis and Measurement)of Zhejiang Province(LGC19B070004)+1 种基金State Key Laboratory of Environmental Chemistry and Ecotoxicology,Research Center for Eco-Environmental Sciences,Chinese Academy of Sciences(KF2018-15)Natural Science Foundation of Zhejiang Province(LY18C030003)
文摘Polychlorinated dibenzothiophenes(PCDTs)and their corresponding sulfone(PCDTO2)compounds are a group of important persistent organic pollutants.In the present study,geometrical optimization and subsequent calculations of electrostatic potentials(ESPs)on molecular surface have been performed for all 135 PCDTs and 135 PCDTO2 congeners at the HF/6-31G*level of theory.A number of statistically-based parameters have been extracted.Linear relationship between gas-chromatographic retention index(RI)and the structural descriptors have been established by multiple linear regression.The result shows that two descriptors derived from positive electrostatic potential on molecular surface, ■ and π,together with the molecular volume(Vmc)and the energy of the lowest unoccupied molecular orbital(ELUMO)can be well used to express the quantitative structure-retention relationship(QSRR)of PCDTs and PCDTO2.Predictive capability of the two models has been demonstrated by leave-one-out cross-validation with the cross-validated correlation coefficient(RCV)of 0.996 and 0.997,respectively.Furthermore,the predictive power of the models is further examined for the external test set.Correlation coefficients(R)between the observed and predicted RI values for the external test set are 0.997 and0.998,respectively,validating the robustness and good prediction of our model.The QSRR model established may provide again a powerful method for predicting chromatographic properties of aromatic organosulfur compounds.
基金supported by the Industry Innovation Foundation of Shanxi Province(Grant No.2006031204)the Chongqing Applied Fundamental Science Foundation(Grant No.01-3-6).
文摘A newly developed descriptor, three- dimensional holographic vector of atomic interaction field (3D-HoVAIF), was used to describe the chemical structures of purine bases. After variable screening by stepwise multiple regression (SMR) technique, a partial least square (PLS) regression model was built with 3D-HoVAIF. The model was satisfactory com- paring to reference since correlation coefficients of molecular modeling ( Rc 2um), cross- validation ( Qc 2um) and standard deviation of estimation (SD) were 0.966, 0.860 and 0.112, respectively, showing that the model had favorable estimation and prediction capa- bilities. It was illustrated that information related to retention data of purine bases could preferably be expressed by 3D-HoVAIF with definite physico- chemical meanings and easy structural interpretation for purine bases. It was illustrated that 3D-HoVAIF was to preferably express retention data of purine bases and had definite physicochemical significance. So 3D-HoVAIF was a useful structural expression technique for quantitative structure activity (or prop- erty or retention) relationships (QSAR/QSPR/QSRR) study, such as structural characterization and chro- matographic retention prediction.
基金supported by the National Natural Science Foundation of China (10901169)National 111 Programme of Introducing Talents of Discipline to Universities (0507111106)+2 种基金Innovation Ability Training Foundation of Chongqing University (CDCX008)Innovative Group Program for Graduates of Chongqing University,ScienceInnovation Fund (200711C1A0010260)
文摘An integrated approach is proposed to predict the chromatographic retention time of oligonucleotides based on quantitative structure-retention relationships (QSRR) models. First, the primary base sequences of oligonucleotides are translated into vectors based on scores of generalized base properties (SGBP), involving physicochemical, quantum chemical, topological, spatial structural properties, etc.; thereafter, the sequence data are transformed into a uniform matrix by auto cross covariance (ACC). ACC accounts for the interactions between bases at a certain distance apart in an oligonucleotide sequence; hence, this method adequately takes the neighboring effect into account. Then, a genetic algorithm is used to select the variables related to chromatographic retention behavior of oligonuclcotides. Finally, a support vector machine is used to develop QSRR models to predict chromatographic retention behavior. The whole dataset is divided into pairs of training sets and test sets with different proportions; as a result, it has been found that the QSRR models using more than 26 training samples have an appropriate external power, and can accurately represent the relationship between the features of sequences and structures, and the retention times. The results indicate that the SGBP-ACC approach is a useful structural representation method in QSRR of oligonucleotides due to its many advantages such as plentiful structural information, easy manipulation and high characterization competence. Moreover, the method can further be applied to predict chromatographic retention behavior of oligonucleotides.
基金This work was supported by the Chunhui Project Fund of the Ministry of Education(Grant No.SCPF99-4-4+37)Fok Ying-Tung Educational Foundation(Grant No.FYTF98-7-6)+1 种基金Chongqing Applied Science Fund(Grant No,CASF01-3-6)Chongqing University ZYXT Innovation Fund(Grant No.CUIF03-5-6+04-10-10).
文摘Based on the identical group as a pseudo atom instead of a typical atom, a novel modified molecular dis-tance-edge (MDE) vector μ was developed in our laboratory to characterize chemical structure of polychlorinated diben-zofurans (PCDFs) congeners and/or isomers. Quantitative structure-retention relationships (QSRRs) between the new VMDE parameters and gas chromatographic (GC) retention behavior of PCDFs were then generated by multiple linear regression (MLR) method for non-polar, moderately polar, and polar stationary phases. Four excellent models with high correlation coefficients, R=0.984-0.995, were proposed for non-polar columns (DB-5, SE-54, OV-101). For the moder-ately polar columns (OV-1701), the correlation coefficient of the developed good model is only 0.958. For the polar col-umns (SP-2300), the QSRR model is poor with R=0.884. Then cross validation with leave-one out of procedure (CV) is performed in high correlation with the non-polar (Rcv=992-0.974) and weakly polar (Rcv=921) columns and in little cor-relation (Rcv=0.834) with the polar columns. These results show that the new μ vector is suitable for describing the re-tention behaviors of PCDFs on non-polar and moderately polar stationary phases and not for the various gas chroma-tographic retention behaviors of PCDFs on the different po-larity-varying stationary phases.
基金financial supports from the National Key R&D Program of China(Grant Nos.:2022YFC3400700,2022YFA0806400,and 2020YFE0201600)Shanghai Municipal Science and Technology Major Project(Grant No.:2017SHZDZX01)the National Natural Science Foundation of China(Grant No.:31821002).
文摘Acylcarnitines are metabolic intermediates of fatty acids and branched-chain amino acids having vital biofunctions and pathophysiological significances. Here, we developed a high-throughput method for quantifying hundreds of acylcarnitines in one run using ultrahigh performance liquid chromatography and tandem mass spectrometry (UPLC-MS/MS). This enabled simultaneous quantification of 1136 acylcarnitines (C0–C26) within 10-min with good sensitivity (limit of detection < 0.7 fmol), linearity (correlation coefficient > 0.992), accuracy (relative error < 20%), precision (coefficient of variation (CV), CV < 15%), stability (CV < 15%), and inter-technician consistency (CV < 20%, n = 6). We also established a quantitative structure-retention relationship (goodness of fit > 0.998) for predicting retention time (tR) of acylcarnitines with no standards and built a database of their multiple reaction monitoring parameters (tR, ion-pairs, and collision energy). Furthermore, we quantified 514 acylcarnitines in human plasma and urine, mouse kidney, liver, heart, lung, and muscle. This provides a rapid method for quantifying acylcarnitines in multiple biological matrices.
基金supported by the Youth Foundation of Education Bureau,Sichuan Province (09ZB036)Technology Bureau,Sichuan Province (2006j13-141)
文摘Atoms in most organic molecules are often carbon,oxygen,nitrogen,sulfur,halogens,etc. Based on the three-dimensional structure of a molecule,a molecular structural characterization(MSC) method called improved molecular electronegativity-distance vector(I-MEDV) was developed. It was used to describe the structures of 37 compounds of styrax japonicus sieb flowers. Through multiple linear regression(MLR),a QSRR model was built up. The correlation coefficient(R1) of the model was 0.980. Then,4 vectors were selected to build another model through the method of stepwise multiple regression(SMR) ,and the correlation coefficient(R2) of the model was 0.975. Moreover,all the two models were evaluated by performing the crossvalidation with the leave-one-out(LOO) procedure and the correlation coefficients(Rcv) were 0.948 and 0.968,respectively. The results show that the I-MEDV could successfully describe the structures of organic compounds. The stability and predictability of the models were good.
文摘将食用植物油中的脂肪酸转化为相应的脂肪酸甲酯,并采用立体结构参数Steric and Electronic Descriptors(SEDs)表征其分子结构,然后运用多元线性回归(MLR)方法,建立了预测食用植物油中脂肪酸(甲酯)的定量结构-色谱保留相关(QSRR)模型,同时采用内部及外部双重验证的方法对所建模型的稳定性能和预测能力进行了分析和验证。建模计算值、留一法(LOO)交互检验(CV)预测值和外部样本预测值的相关系数R、R LOO、Q2ext分别为0.9990、0.9970、0.9860。结果表明,SEDs参数能较好地表征食用植物油中的脂肪酸甲酯分子的结构信息,所建立的QSRR模型具有良好的稳定性和预测能力,为间接分析鉴定食用植物油中脂肪酸提供了一种方便有效的新途径。