摘要
The identification of DNA binding proteins(DNABPs)is considered a major challenge in genome annotation because they are linked to several important applied and research applications of cellular functions e.g.,in the study of the biological,biophysical,and biochemical effects of antibiotics,drugs,and steroids on DNA.This paper presents an efficient approach for DNABPs identification based on deep transfer learning,named“DTLM-DBP.”Two transfer learning methods are used in the identification process.The first is based on the pre-trained deep learning model as a feature’s extractor and classifier.Two different pre-trained Convolutional Neural Networks(CNN),AlexNet 8 and VGG 16,are tested and compared.The second method uses the deep learning model as a feature’s extractor only and two different classifiers for the identification process.Two classifiers,Support Vector Machine(SVM)and Random Forest(RF),are tested and compared.The proposed approach is tested using different DNA proteins datasets.The performance of the identification process is evaluated in terms of identification accuracy,sensitivity,specificity and MCC,with four available DNA proteins datasets:PDB1075,PDB186,PDNA-543,and PDNA-316.The results show that the RF classifier,with VGG-Net pre-trained deep transfer learning features,gives the highest performance.DTLM-DBP was compared with other published methods and it provides a considerable improvement in the performance of DNABPs identification.
基金
This paper was funded under the 2020–2021 Industry-International Incentive Grant by Universiti Teknologi Malaysia(Grant Number:Q.K130000.3043.02M12)which was granted to U.Khairuddin,F.Behrooz and R.Yusof.