Canonical correlation analysis (CCA) is one of the most well-known methods to extract features from multi- view data and has attracted much attention in recent years. However, classical CCA is unsupervised and does ...Canonical correlation analysis (CCA) is one of the most well-known methods to extract features from multi- view data and has attracted much attention in recent years. However, classical CCA is unsupervised and does not take discriminant information into account. In this paper, we add discriminant information into CCA by using random cross- view correlations between within-class samples and propose a new method for multi-view dimensionality reduction called canonical random correlation analysis (RCA). In RCA, two approaches for randomly generating cross-view correlation samples are developed on the basis of bootstrap technique. Furthermore, kernel RCA (KRCA) is proposed to extract nonlinear correlations between different views. Experiments on several multi-view data sets show the effectiveness of the proposed methods.展开更多
Human limb movement imagery,which can be used in limb neural disorders rehabilitation and brain-controlled external devices,has become a significant control paradigm in the domain of brain-computer interface(BCI).Alth...Human limb movement imagery,which can be used in limb neural disorders rehabilitation and brain-controlled external devices,has become a significant control paradigm in the domain of brain-computer interface(BCI).Although numerous pioneering studies have been devoted to motor imagery classification based on electroencephalography(EEG)signal,their performance is somewhat limited due to insufficient analysis of key effective frequency bands of EEG signals.In this paper,we propose a model of multiband decomposition and spectral discriminative analysis for motor imagery classification,which is called variational sample-long short term memory(VS-LSTM)network.Specifically,we first use a channel fusion operator to reduce the signal channels of the raw EEG signal.Then,we use the variational mode decomposition(VMD)model to decompose the EEG signal into six band-limited intrinsic mode functions(BIMFs)for further signal noise reduction.In order to select discriminative frequency bands,we calculate the sample entropy(SampEn)value of each frequency band and select the maximum value.Finally,to predict the classification of motor imagery,a LSTM model is used to predict the class of frequency band with the largest SampEn value.An open-access public data is used to evaluated the effectiveness of the proposed model.In the data,15 subjects performed motor imagery tasks with elbow flexion/extension,forearm supination/pronation and hand open/close of right upper limb.The experiment results show that the average classification result of seven kinds of motor imagery was 76.2%,the average accuracy of motor imagery binary classification is 96.6%(imagery vs.rest),respectively,which outperforms the state-of-the-art deep learning-based models.This framework significantly improves the accuracy of motor imagery by selecting effective frequency bands.This research is very meaningful for BCIs,and it is inspiring for end-to-end learning research.展开更多
1 Introduction and main contributions Differential transcript usage(DTU),which refers to the event that the relative transcript abundance within a gene changes between conditions.To detect DTU,various methods have bee...1 Introduction and main contributions Differential transcript usage(DTU),which refers to the event that the relative transcript abundance within a gene changes between conditions.To detect DTU,various methods have been proposed,which can be classified into exon-based models and gene-based models.These approaches either cannot estimate the relative transcript abundance,or they cannot deal properly with the multi-source mapping problems of reads.Besides,few methods currently consider sample-to-sample variability under multiple conditions[1].展开更多
Background Brain network describing interconnections between brain regions contains abundant topological information.It is a challenge for the existing statistical methods(e.g.,t test)to investigate the topological di...Background Brain network describing interconnections between brain regions contains abundant topological information.It is a challenge for the existing statistical methods(e.g.,t test)to investigate the topological differences of brain networks.Methods We proposed a kernel based statistic framework for identifying topological differences in brain networks.In our framework,the topological similarities between paired brain networks were measured by graph kernels.Then,graph kernels are embedded into maximum mean discrepancy for calculating kernel based test statistic.Based on this test statistic,we adopted conditional Monte Carlo simulation to compute the statistical significance(i.e.,P value)and statistical power.We recruited 33 patients with Alzheimer’s disease(AD),33 patients with early mild cognitive impairment(EMCI),33 patients with late mild cognitive impairment(LMCI)and 33 normal controls(NC)in our experiment.There are no statistical differences in demographic information between patients and NC.The compared state-of-the-art statistical methods include t test,t squared test,two-sample permutation test and non-normal test.Results We applied the proposed shortest path matched kernel to our framework for investigating the statistical differences of shortest path topological structures in brain networks of AD and NC.We compared our method with the existing state-of-the-art statistical methods in brain network characteristic including clustering coefficient and functional connection among EMCI,LMCI,AD,and NC.The results indicate that our framework can capture the statistically discriminative shortest path topological structures,such as shortest path from right rolandic operculum to right supplementary motor area(P=0.00314,statistical power=0.803).In clustering coefficient and functional connection,our framework outperforms the state-of-the-art statistical methods,such as P=0.0013 and statistical power=0.83 in the analysis of AD and NC.Conclusion Our proposed kernel based statistic framework not only can be used to investigate the topological differences of brain network,but also can be used to investigate the static characteristics(e.g.,clustering coefficient and functional connection)of brain network.展开更多
Protein subcellular localization prediction is im- portant for studying the function of proteins. Recently, as significant progress has been witnessed in the field of mi- croscopic imaging, automatically determining t...Protein subcellular localization prediction is im- portant for studying the function of proteins. Recently, as significant progress has been witnessed in the field of mi- croscopic imaging, automatically determining the subcellular localization of proteins from bio-images is becoming a new research hotspot. One of the central themes in this field is to determine what features are suitable for describing the pro- tein images. Existing feature extraction methods are usually hand-crafted designed, by which only one layer of features will be extracted, which may not be sufficient to represent the complex protein images. To this end, we propose a deep model based descriptor (DMD) to extract the high-level fea- tures from protein images. Specifically, in order to make the extracted features more generic, we firstly trained a convolu- tion neural network (i.e., AlexNe0 by using a natural image set with millions of labels, and then used the partial parame- ter transfer strategy to fine-tnne the parameters from natural images to protein images. After that, we applied the Lasso model to select the most distinguishing features from the last fully connected layer of the CNN (Convolution Neural Net- work), and used these selected features for final classifica- tions. Experimental results on a protein image dataset vali- date the efficacy of our method.展开更多
文摘Canonical correlation analysis (CCA) is one of the most well-known methods to extract features from multi- view data and has attracted much attention in recent years. However, classical CCA is unsupervised and does not take discriminant information into account. In this paper, we add discriminant information into CCA by using random cross- view correlations between within-class samples and propose a new method for multi-view dimensionality reduction called canonical random correlation analysis (RCA). In RCA, two approaches for randomly generating cross-view correlation samples are developed on the basis of bootstrap technique. Furthermore, kernel RCA (KRCA) is proposed to extract nonlinear correlations between different views. Experiments on several multi-view data sets show the effectiveness of the proposed methods.
基金This work was supported in part by the National Natural Science Foundation of China(Grant Nos.61876082,61861130366,61732006)National Key R&D Program of China(2018YFC2001600,2018YFC2001602).
文摘Human limb movement imagery,which can be used in limb neural disorders rehabilitation and brain-controlled external devices,has become a significant control paradigm in the domain of brain-computer interface(BCI).Although numerous pioneering studies have been devoted to motor imagery classification based on electroencephalography(EEG)signal,their performance is somewhat limited due to insufficient analysis of key effective frequency bands of EEG signals.In this paper,we propose a model of multiband decomposition and spectral discriminative analysis for motor imagery classification,which is called variational sample-long short term memory(VS-LSTM)network.Specifically,we first use a channel fusion operator to reduce the signal channels of the raw EEG signal.Then,we use the variational mode decomposition(VMD)model to decompose the EEG signal into six band-limited intrinsic mode functions(BIMFs)for further signal noise reduction.In order to select discriminative frequency bands,we calculate the sample entropy(SampEn)value of each frequency band and select the maximum value.Finally,to predict the classification of motor imagery,a LSTM model is used to predict the class of frequency band with the largest SampEn value.An open-access public data is used to evaluated the effectiveness of the proposed model.In the data,15 subjects performed motor imagery tasks with elbow flexion/extension,forearm supination/pronation and hand open/close of right upper limb.The experiment results show that the average classification result of seven kinds of motor imagery was 76.2%,the average accuracy of motor imagery binary classification is 96.6%(imagery vs.rest),respectively,which outperforms the state-of-the-art deep learning-based models.This framework significantly improves the accuracy of motor imagery by selecting effective frequency bands.This research is very meaningful for BCIs,and it is inspiring for end-to-end learning research.
基金supported by the National Key R&D Program of China(2018 YFC2001600,2018 YFC2001602)。
文摘1 Introduction and main contributions Differential transcript usage(DTU),which refers to the event that the relative transcript abundance within a gene changes between conditions.To detect DTU,various methods have been proposed,which can be classified into exon-based models and gene-based models.These approaches either cannot estimate the relative transcript abundance,or they cannot deal properly with the multi-source mapping problems of reads.Besides,few methods currently consider sample-to-sample variability under multiple conditions[1].
基金supported by the National Natural Science Foundation of China(Grant Nos.61876082,61732006,and 61861130366)the National Key R&D Program of China(Grant Nos.2018YFC2001600,2018YFC2001602,and 2018ZX10201002)the Royal Society Academy of Medical Sciences Newton Advanced Fellowship(Grant No.NAF\R1\180371).
文摘Background Brain network describing interconnections between brain regions contains abundant topological information.It is a challenge for the existing statistical methods(e.g.,t test)to investigate the topological differences of brain networks.Methods We proposed a kernel based statistic framework for identifying topological differences in brain networks.In our framework,the topological similarities between paired brain networks were measured by graph kernels.Then,graph kernels are embedded into maximum mean discrepancy for calculating kernel based test statistic.Based on this test statistic,we adopted conditional Monte Carlo simulation to compute the statistical significance(i.e.,P value)and statistical power.We recruited 33 patients with Alzheimer’s disease(AD),33 patients with early mild cognitive impairment(EMCI),33 patients with late mild cognitive impairment(LMCI)and 33 normal controls(NC)in our experiment.There are no statistical differences in demographic information between patients and NC.The compared state-of-the-art statistical methods include t test,t squared test,two-sample permutation test and non-normal test.Results We applied the proposed shortest path matched kernel to our framework for investigating the statistical differences of shortest path topological structures in brain networks of AD and NC.We compared our method with the existing state-of-the-art statistical methods in brain network characteristic including clustering coefficient and functional connection among EMCI,LMCI,AD,and NC.The results indicate that our framework can capture the statistically discriminative shortest path topological structures,such as shortest path from right rolandic operculum to right supplementary motor area(P=0.00314,statistical power=0.803).In clustering coefficient and functional connection,our framework outperforms the state-of-the-art statistical methods,such as P=0.0013 and statistical power=0.83 in the analysis of AD and NC.Conclusion Our proposed kernel based statistic framework not only can be used to investigate the topological differences of brain network,but also can be used to investigate the static characteristics(e.g.,clustering coefficient and functional connection)of brain network.
基金This work was supported in part by the National Nat- ural Science Foundation of China (Grant Nos. 61422204, 61473149 and 61671288), Jiangsu Natural Science Foundation for Distinguished Young Scholar (BK20130034), and Science and Technology Commission of Shang- hai Municipality (16JC1404300).
文摘Protein subcellular localization prediction is im- portant for studying the function of proteins. Recently, as significant progress has been witnessed in the field of mi- croscopic imaging, automatically determining the subcellular localization of proteins from bio-images is becoming a new research hotspot. One of the central themes in this field is to determine what features are suitable for describing the pro- tein images. Existing feature extraction methods are usually hand-crafted designed, by which only one layer of features will be extracted, which may not be sufficient to represent the complex protein images. To this end, we propose a deep model based descriptor (DMD) to extract the high-level fea- tures from protein images. Specifically, in order to make the extracted features more generic, we firstly trained a convolu- tion neural network (i.e., AlexNe0 by using a natural image set with millions of labels, and then used the partial parame- ter transfer strategy to fine-tnne the parameters from natural images to protein images. After that, we applied the Lasso model to select the most distinguishing features from the last fully connected layer of the CNN (Convolution Neural Net- work), and used these selected features for final classifica- tions. Experimental results on a protein image dataset vali- date the efficacy of our method.