Support vector machine (SVM), partial least squares (PLS), and Back-Propagation artificial neural net- work (ANN) were employed to establish QSAR models of 2 dipeptide datasets. In order to validate predictive capabil...Support vector machine (SVM), partial least squares (PLS), and Back-Propagation artificial neural net- work (ANN) were employed to establish QSAR models of 2 dipeptide datasets. In order to validate predictive capabilities on external dataset of the resulting models, both internal and external validations were performed. The division of dataset into both training and test sets was carried out by D-optimal design. The results showed that support vector machine (SVM) behaved well in both calibration and prediction. For the dataset of 48 bitter tasting dipeptides (BTD), the results obtained by support vector regression (SVR) were superior to that by PLS in both calibration and prediction. When compared with BP artificial neural network, SVR showed less calibration power but more predictive capability. For the dataset of angiotensin-converting enzyme (ACE) inhibitors, the results obtained by support vector machine (SVM) re- gression were equivalent to those by PLS and BP artificial neural network. In both datasets, SVR using linear kernel function behaved well as that using radial basis kernel func- tion. The results showed that there is wide prospect for the application of support vector machine (SVM) into QSAR modeling.展开更多
基金This work was supported by the Fok-Yingtung Educational Foundation(FYEF)(Grant No.98-7-6)the National Chun-hui Project Foundation(NCPF)(Grant No.99-04+99-37)Chongqing Applied Fundamental Science Fund(CAFS)(Grant No.01-3-6).
文摘Support vector machine (SVM), partial least squares (PLS), and Back-Propagation artificial neural net- work (ANN) were employed to establish QSAR models of 2 dipeptide datasets. In order to validate predictive capabilities on external dataset of the resulting models, both internal and external validations were performed. The division of dataset into both training and test sets was carried out by D-optimal design. The results showed that support vector machine (SVM) behaved well in both calibration and prediction. For the dataset of 48 bitter tasting dipeptides (BTD), the results obtained by support vector regression (SVR) were superior to that by PLS in both calibration and prediction. When compared with BP artificial neural network, SVR showed less calibration power but more predictive capability. For the dataset of angiotensin-converting enzyme (ACE) inhibitors, the results obtained by support vector machine (SVM) re- gression were equivalent to those by PLS and BP artificial neural network. In both datasets, SVR using linear kernel function behaved well as that using radial basis kernel func- tion. The results showed that there is wide prospect for the application of support vector machine (SVM) into QSAR modeling.