Protein subcellular localization prediction is im- portant for studying the function of proteins. Recently, as significant progress has been witnessed in the field of mi- croscopic imaging, automatically determining t...Protein subcellular localization prediction is im- portant for studying the function of proteins. Recently, as significant progress has been witnessed in the field of mi- croscopic imaging, automatically determining the subcellular localization of proteins from bio-images is becoming a new research hotspot. One of the central themes in this field is to determine what features are suitable for describing the pro- tein images. Existing feature extraction methods are usually hand-crafted designed, by which only one layer of features will be extracted, which may not be sufficient to represent the complex protein images. To this end, we propose a deep model based descriptor (DMD) to extract the high-level fea- tures from protein images. Specifically, in order to make the extracted features more generic, we firstly trained a convolu- tion neural network (i.e., AlexNe0 by using a natural image set with millions of labels, and then used the partial parame- ter transfer strategy to fine-tnne the parameters from natural images to protein images. After that, we applied the Lasso model to select the most distinguishing features from the last fully connected layer of the CNN (Convolution Neural Net- work), and used these selected features for final classifica- tions. Experimental results on a protein image dataset vali- date the efficacy of our method.展开更多
基金This work was supported in part by the National Nat- ural Science Foundation of China (Grant Nos. 61422204, 61473149 and 61671288), Jiangsu Natural Science Foundation for Distinguished Young Scholar (BK20130034), and Science and Technology Commission of Shang- hai Municipality (16JC1404300).
文摘Protein subcellular localization prediction is im- portant for studying the function of proteins. Recently, as significant progress has been witnessed in the field of mi- croscopic imaging, automatically determining the subcellular localization of proteins from bio-images is becoming a new research hotspot. One of the central themes in this field is to determine what features are suitable for describing the pro- tein images. Existing feature extraction methods are usually hand-crafted designed, by which only one layer of features will be extracted, which may not be sufficient to represent the complex protein images. To this end, we propose a deep model based descriptor (DMD) to extract the high-level fea- tures from protein images. Specifically, in order to make the extracted features more generic, we firstly trained a convolu- tion neural network (i.e., AlexNe0 by using a natural image set with millions of labels, and then used the partial parame- ter transfer strategy to fine-tnne the parameters from natural images to protein images. After that, we applied the Lasso model to select the most distinguishing features from the last fully connected layer of the CNN (Convolution Neural Net- work), and used these selected features for final classifica- tions. Experimental results on a protein image dataset vali- date the efficacy of our method.