Automatic Speech Emotion Recognition(SER)is used to recognize emotion from speech automatically.Speech Emotion recognition is working well in a laboratory environment but real-time emotion recognition has been influen...Automatic Speech Emotion Recognition(SER)is used to recognize emotion from speech automatically.Speech Emotion recognition is working well in a laboratory environment but real-time emotion recognition has been influenced by the variations in gender,age,the cultural and acoustical background of the speaker.The acoustical resemblance between emotional expressions further increases the complexity of recognition.Many recent research works are concentrated to address these effects individually.Instead of addressing every influencing attribute individually,we would like to design a system,which reduces the effect that arises on any factor.We propose a two-level Hierarchical classifier named Interpreter of responses(IR).Thefirst level of IR has been realized using Support Vector Machine(SVM)and Gaussian Mixer Model(GMM)classifiers.In the second level of IR,a discriminative SVM classifier has been trained and tested with meta information offirst-level classifiers along with the input acoustical feature vector which is used in primary classifiers.To train the system with a corpus of versatile nature,an integrated emotion corpus has been composed using emotion samples of 5 speech corpora,namely;EMO-DB,IITKGP-SESC,SAVEE Corpus,Spanish emotion corpus,CMU's Woogle corpus.The hierarchical classifier has been trained and tested using MFCC and Low-Level Descriptors(LLD).The empirical analysis shows that the proposed classifier outperforms the traditional classifiers.The proposed ensemble design is very generic and can be adapted even when the number and nature of features change.Thefirst-level classifiers GMM or SVM may be replaced with any other learning algorithm.展开更多
In this Letter, we develop the Stokes space-based method for modulation format identification by combing power spectral density and a cluster analysis to identify quadrature amplitude modulation(QAM) and phase-shift...In this Letter, we develop the Stokes space-based method for modulation format identification by combing power spectral density and a cluster analysis to identify quadrature amplitude modulation(QAM) and phase-shift keying(PSK) signals. Fuzzy c-means and hierarchical clustering algorithms are used for the cluster analysis.Simulations are conducted for binary PSK, quadrature PSK, 8PSK, 16-QAM, and 32-QAM signals. The results demonstrate that the proposed technique can effectively classify all these modulation formats, and that the method is superior in lowering the threshold of the optical signal-to-noise ratio. Meanwhile, the proposed method is insensitive to phase offset and laser phase noise.展开更多
文摘Automatic Speech Emotion Recognition(SER)is used to recognize emotion from speech automatically.Speech Emotion recognition is working well in a laboratory environment but real-time emotion recognition has been influenced by the variations in gender,age,the cultural and acoustical background of the speaker.The acoustical resemblance between emotional expressions further increases the complexity of recognition.Many recent research works are concentrated to address these effects individually.Instead of addressing every influencing attribute individually,we would like to design a system,which reduces the effect that arises on any factor.We propose a two-level Hierarchical classifier named Interpreter of responses(IR).Thefirst level of IR has been realized using Support Vector Machine(SVM)and Gaussian Mixer Model(GMM)classifiers.In the second level of IR,a discriminative SVM classifier has been trained and tested with meta information offirst-level classifiers along with the input acoustical feature vector which is used in primary classifiers.To train the system with a corpus of versatile nature,an integrated emotion corpus has been composed using emotion samples of 5 speech corpora,namely;EMO-DB,IITKGP-SESC,SAVEE Corpus,Spanish emotion corpus,CMU's Woogle corpus.The hierarchical classifier has been trained and tested using MFCC and Low-Level Descriptors(LLD).The empirical analysis shows that the proposed classifier outperforms the traditional classifiers.The proposed ensemble design is very generic and can be adapted even when the number and nature of features change.Thefirst-level classifiers GMM or SVM may be replaced with any other learning algorithm.
基金supported by the National Natural Science Foundation of China under Grant No. 61205065
文摘In this Letter, we develop the Stokes space-based method for modulation format identification by combing power spectral density and a cluster analysis to identify quadrature amplitude modulation(QAM) and phase-shift keying(PSK) signals. Fuzzy c-means and hierarchical clustering algorithms are used for the cluster analysis.Simulations are conducted for binary PSK, quadrature PSK, 8PSK, 16-QAM, and 32-QAM signals. The results demonstrate that the proposed technique can effectively classify all these modulation formats, and that the method is superior in lowering the threshold of the optical signal-to-noise ratio. Meanwhile, the proposed method is insensitive to phase offset and laser phase noise.