6 Atomic fragment types of organic compound have been defined, and the multilevel atom-pair frequency matrix has been constructed according to the occurrence number in pairs of atomic fragments with different bond len...6 Atomic fragment types of organic compound have been defined, and the multilevel atom-pair frequency matrix has been constructed according to the occurrence number in pairs of atomic fragments with different bond lengths in the molecule. On the basis of them, a novel molecular coding technique: characteristic atom-pair holographic code (CAHC), is obtained. To some extent, this method exhibits a large number of benefits at the same time. For example, it can calculate 2D molecular topological descriptor easily, operate without difficulty and possess definite physicochemical meaning of 3D molecular structural characterization methods, and may fetch the complicated information of molecule, etc. Therefore, it is appropriate for the study on quantitative structure-property/activity relationship (QSPR/QSAR) of medicines and biological molecules. We attempt in this paper to utilize the method of CAHC to the quantitative prediction of reversed-phase liquid chromatogram (RPLC) retention data of 33 purine derivatives and 24 steroids. The fitting multiple correlation coefficient R2, cross-validated multiple correlation coefficient Q2 and predicted ability Q^2 pred over test set's samples of obtained partial least-square (PLS) regression model are respectively 0.990, 0.893 and 0.977, 0.897, 0.941.展开更多
The molecular electronegativity interaction vector (MEIV) was used to describe the molecular structure of 30 selected esters. Two excellent QSTR models were built up by using multiple linear regression (MLR) and p...The molecular electronegativity interaction vector (MEIV) was used to describe the molecular structure of 30 selected esters. Two excellent QSTR models were built up by using multiple linear regression (MLR) and partial least-squares regression (PLS). The correlation coefficients (R) of the two models were 0.945 and 0.941, respectively. The models were evaluated by performing the cross validation with the leave-one-out (LOO) procedure. The cross-verification correlation coefficients (RCV) of the two models were 0.921 and 0.919, respectively. The results showed that the models constructed in this work could provide estimation stability and favorable predictive ability.展开更多
基金This work was supported by the State Key Laboratory of Chemo/Biosensing and Chemometrics Foundation (No. 05-12-1), Fok-Yingtung Educational Foundation (No. 98-7-6) and Chongqing University Innovation Foundation of Science and Technology ( No. 06-1-1)
文摘6 Atomic fragment types of organic compound have been defined, and the multilevel atom-pair frequency matrix has been constructed according to the occurrence number in pairs of atomic fragments with different bond lengths in the molecule. On the basis of them, a novel molecular coding technique: characteristic atom-pair holographic code (CAHC), is obtained. To some extent, this method exhibits a large number of benefits at the same time. For example, it can calculate 2D molecular topological descriptor easily, operate without difficulty and possess definite physicochemical meaning of 3D molecular structural characterization methods, and may fetch the complicated information of molecule, etc. Therefore, it is appropriate for the study on quantitative structure-property/activity relationship (QSPR/QSAR) of medicines and biological molecules. We attempt in this paper to utilize the method of CAHC to the quantitative prediction of reversed-phase liquid chromatogram (RPLC) retention data of 33 purine derivatives and 24 steroids. The fitting multiple correlation coefficient R2, cross-validated multiple correlation coefficient Q2 and predicted ability Q^2 pred over test set's samples of obtained partial least-square (PLS) regression model are respectively 0.990, 0.893 and 0.977, 0.897, 0.941.
基金supported by the Youth Foundation of Education Bureau, Sichuan Province (09ZB036)Technology Bureau, Sichuan Province (2006j13-141)
文摘The molecular electronegativity interaction vector (MEIV) was used to describe the molecular structure of 30 selected esters. Two excellent QSTR models were built up by using multiple linear regression (MLR) and partial least-squares regression (PLS). The correlation coefficients (R) of the two models were 0.945 and 0.941, respectively. The models were evaluated by performing the cross validation with the leave-one-out (LOO) procedure. The cross-verification correlation coefficients (RCV) of the two models were 0.921 and 0.919, respectively. The results showed that the models constructed in this work could provide estimation stability and favorable predictive ability.