The cognitive performance-based dimensional emotion recognition in whispered speech is studied.First,the whispered speech emotion databases and data collection methods are compared, and the character of emotion expres...The cognitive performance-based dimensional emotion recognition in whispered speech is studied.First,the whispered speech emotion databases and data collection methods are compared, and the character of emotion expression in whispered speech is studied,especially the basic types of emotions.Secondly,the emotion features for whispered speech is analyzed,and by reviewing the latest references,the related valence features and the arousal features are provided. The effectiveness of valence and arousal features in whispered speech emotion classification is studied.Finally,the Gaussian mixture model is studied and applied to whispered speech emotion recognition. The cognitive performance is also considered in emotion recognition so that the recognition errors of whispered speech emotion can be corrected.Based on the cognitive scores,the emotion recognition results can be improved.The results show that the formant features are not significantly related to arousal dimension,while the short-term energy features are related to the emotion changes in arousal dimension.Using the cognitive scores,the recognition results can be improved.展开更多
The perceptual effect of the phase information in speech has been studied by auditorysubjective tests. On the condition that the phase spectrum in speech is changed while amplitudespectrum is unchanged, the tests show...The perceptual effect of the phase information in speech has been studied by auditorysubjective tests. On the condition that the phase spectrum in speech is changed while amplitudespectrum is unchanged, the tests show that: (1) If the envelop of the reconstructed speech signalis unchanged, there is indistinctive auditory perception between the original speech and thereconstructed speech; (2) The auditory perception effect of the reconstructed speech mainly lieson the amplitude of the derivative of the additive phase; (3) td is the maximum relative time shiftbetween different frequency components of the reconstructed speech signal. The speech qualityis excellent while td <10ms; good while 10ms< td <20ms; common while 20ms< td <35ms, andpoor while td >35ms.展开更多
Languages differ in their phoneme inventories. Some phonemes exist in more than one language but others exist in relatively few languages. More specifically, English Language has some sounds that Arabic does not have ...Languages differ in their phoneme inventories. Some phonemes exist in more than one language but others exist in relatively few languages. More specifically, English Language has some sounds that Arabic does not have and vice versa. This paper focuses on the perception of the English bilabial stops/b/and/p/in contrast to the perception of the English alveolar stops/t/and/d/by some Saudi linguists who have been speaking English for more than six years and who are currently in an English speaking country, Australia. This phenomenon of perception of the English bilabial stops/b/and/p/will be tested mainly by virtue of minimal pairs and other words that may better help to investigate this perception. The paper uses some minimal pairs in which the bilabial and alveolar stops occur initially and finally. Also, it uses some verbs that end with the suffix/-ed/, but this/-ed/suffix is pronounced [t] or [d] when preceded by /p/ or /b/ respectively. Notice that [t] and [d] are allophones of the English past tense morpheme/-ed/(for example, Fromkin, Rodman, & Hyams, 2007). The pronunciation of the suffix as It] and [d] works as a clue for the subjects to know the preceding bilabial sound.展开更多
This paper explores three College English teachers' perceived difficulties in teaching content-based courses in the Chinese context and opportunities for their change in the knowledge base. Interviews and classroom o...This paper explores three College English teachers' perceived difficulties in teaching content-based courses in the Chinese context and opportunities for their change in the knowledge base. Interviews and classroom observation were used to collect data. After coding and recoding of the audio data, the researcher found that College English teachers face the following difficulties: positioning of themselves, commitment to the course, students' expectation, the balance between language and content, and administrative support. Meanwhile, the experience of teaching content-based courses offered them an opportunity to increase their knowledge of the content, the learners, and educational values. Some implications for CBI (content-based instruction) in curriculum reform were put forward at the end of the paper.展开更多
Based on the approximate sparseness of speech in wavelet basis,a compressed sensing theory is applied to compress and reconstruct speech signals.Compared with one-dimensional orthogonal wavelet transform(OWT),two-dime...Based on the approximate sparseness of speech in wavelet basis,a compressed sensing theory is applied to compress and reconstruct speech signals.Compared with one-dimensional orthogonal wavelet transform(OWT),two-dimensional OWT combined with Dmeyer and biorthogonal wavelet is firstly proposed to raise running efficiency in speech frame processing,furthermore,the threshold is set to improve the sparseness.Then an adaptive subgradient projection method(ASPM)is adopted for speech reconstruction in compressed sensing.Meanwhile,mechanism which adaptively adjusts inflation parameter in different iterations has been designed for fast convergence.Theoretical analysis and simulation results conclude that this algorithm has fast convergence,and lower reconstruction error,and also exhibits higher robustness in different noise intensities.展开更多
If there is no imagination, there is no music appreciation. It should create imaginary world for students in the music classroom teaching practice, and it should foster the students' musical imagination. Thus, collea...If there is no imagination, there is no music appreciation. It should create imaginary world for students in the music classroom teaching practice, and it should foster the students' musical imagination. Thus, colleagues can make discussion about strategies proposed including pilot background, context led, screen hygiene conditions and others. We use Cognitive Linguistic Theories to introduce idealized cognitive model and its theoretical basis and the intensified impact on student musical imagination.展开更多
In this paper, we conduct analysis on the optimization approaches of contemporary piano education system from the perspective of music knowledge integration. In the process of the piano teaching, teacher and school th...In this paper, we conduct analysis on the optimization approaches of contemporary piano education system from the perspective of music knowledge integration. In the process of the piano teaching, teacher and school the training of musicianship as the key point of teaching. This is because the skill can through student plays to display directly, appraises the student performance technique quality an important criterion. However, the student has the excellent adept technique merely also by far insufficient, but also needs the high artistic aesthetic ability, not only need be able to appreciate the piano performance, but must be able to reveal the artistic aesthetic feature in the process of piano performance. Piano education as one of the ways of music education, and other subjects, there is the subject of the universal education as its educational ideas, educational ideas and teaching methods, a direct impact on the generation after generation of the comprehensive quality of education. Under this basis, we propose the new idea on the issues that will be beneficial.展开更多
In order to investigate sample minimization for classification of supercritical and subcritical patterns in supersonic inlet, three optimization methods, namely, opposite one towards nearest method, closest one toward...In order to investigate sample minimization for classification of supercritical and subcritical patterns in supersonic inlet, three optimization methods, namely, opposite one towards nearest method, closest one towards the byper-plane method and random selection method, are proposed for investigation on minimization of classification samples for supercritical and subcritical patterns of supersonic inlet. The study has been carried out to analyze wind tunnel test data and to compare the classification accuracy based on those three methods with or without priori knowledge. Those three methods are different from each other by different selecting methods for samples. The results show that one of the optimization methods needs the minimization samples to get the highest classification accuracy without priori knowledge. Meanwhile, the number of minimization samples needed to get highest classification accuracy can be further reduced by introducing priori knowledge. Furthermore, it demonstrates that the best optimization method has been found by comparing all cases studied with or without introducing priori knowledge. This method can be applied to reduce the number of wind tunnel tests to obtain the inlet performance and to identify the supercritical/subcritical modes for supersonic inlet.展开更多
基金The National Natural Science Foundation of China(No.11401412)
文摘The cognitive performance-based dimensional emotion recognition in whispered speech is studied.First,the whispered speech emotion databases and data collection methods are compared, and the character of emotion expression in whispered speech is studied,especially the basic types of emotions.Secondly,the emotion features for whispered speech is analyzed,and by reviewing the latest references,the related valence features and the arousal features are provided. The effectiveness of valence and arousal features in whispered speech emotion classification is studied.Finally,the Gaussian mixture model is studied and applied to whispered speech emotion recognition. The cognitive performance is also considered in emotion recognition so that the recognition errors of whispered speech emotion can be corrected.Based on the cognitive scores,the emotion recognition results can be improved.The results show that the formant features are not significantly related to arousal dimension,while the short-term energy features are related to the emotion changes in arousal dimension.Using the cognitive scores,the recognition results can be improved.
基金the National Natural Science Foundation of China (No.60071029)
文摘The perceptual effect of the phase information in speech has been studied by auditorysubjective tests. On the condition that the phase spectrum in speech is changed while amplitudespectrum is unchanged, the tests show that: (1) If the envelop of the reconstructed speech signalis unchanged, there is indistinctive auditory perception between the original speech and thereconstructed speech; (2) The auditory perception effect of the reconstructed speech mainly lieson the amplitude of the derivative of the additive phase; (3) td is the maximum relative time shiftbetween different frequency components of the reconstructed speech signal. The speech qualityis excellent while td <10ms; good while 10ms< td <20ms; common while 20ms< td <35ms, andpoor while td >35ms.
文摘Languages differ in their phoneme inventories. Some phonemes exist in more than one language but others exist in relatively few languages. More specifically, English Language has some sounds that Arabic does not have and vice versa. This paper focuses on the perception of the English bilabial stops/b/and/p/in contrast to the perception of the English alveolar stops/t/and/d/by some Saudi linguists who have been speaking English for more than six years and who are currently in an English speaking country, Australia. This phenomenon of perception of the English bilabial stops/b/and/p/will be tested mainly by virtue of minimal pairs and other words that may better help to investigate this perception. The paper uses some minimal pairs in which the bilabial and alveolar stops occur initially and finally. Also, it uses some verbs that end with the suffix/-ed/, but this/-ed/suffix is pronounced [t] or [d] when preceded by /p/ or /b/ respectively. Notice that [t] and [d] are allophones of the English past tense morpheme/-ed/(for example, Fromkin, Rodman, & Hyams, 2007). The pronunciation of the suffix as It] and [d] works as a clue for the subjects to know the preceding bilabial sound.
文摘This paper explores three College English teachers' perceived difficulties in teaching content-based courses in the Chinese context and opportunities for their change in the knowledge base. Interviews and classroom observation were used to collect data. After coding and recoding of the audio data, the researcher found that College English teachers face the following difficulties: positioning of themselves, commitment to the course, students' expectation, the balance between language and content, and administrative support. Meanwhile, the experience of teaching content-based courses offered them an opportunity to increase their knowledge of the content, the learners, and educational values. Some implications for CBI (content-based instruction) in curriculum reform were put forward at the end of the paper.
基金Supported by the National Natural Science Foundation of China(No.60472058,60975017)the Fundamental Research Funds for the Central Universities(No.2009B32614,2009B32414)
文摘Based on the approximate sparseness of speech in wavelet basis,a compressed sensing theory is applied to compress and reconstruct speech signals.Compared with one-dimensional orthogonal wavelet transform(OWT),two-dimensional OWT combined with Dmeyer and biorthogonal wavelet is firstly proposed to raise running efficiency in speech frame processing,furthermore,the threshold is set to improve the sparseness.Then an adaptive subgradient projection method(ASPM)is adopted for speech reconstruction in compressed sensing.Meanwhile,mechanism which adaptively adjusts inflation parameter in different iterations has been designed for fast convergence.Theoretical analysis and simulation results conclude that this algorithm has fast convergence,and lower reconstruction error,and also exhibits higher robustness in different noise intensities.
文摘If there is no imagination, there is no music appreciation. It should create imaginary world for students in the music classroom teaching practice, and it should foster the students' musical imagination. Thus, colleagues can make discussion about strategies proposed including pilot background, context led, screen hygiene conditions and others. We use Cognitive Linguistic Theories to introduce idealized cognitive model and its theoretical basis and the intensified impact on student musical imagination.
文摘In this paper, we conduct analysis on the optimization approaches of contemporary piano education system from the perspective of music knowledge integration. In the process of the piano teaching, teacher and school the training of musicianship as the key point of teaching. This is because the skill can through student plays to display directly, appraises the student performance technique quality an important criterion. However, the student has the excellent adept technique merely also by far insufficient, but also needs the high artistic aesthetic ability, not only need be able to appreciate the piano performance, but must be able to reveal the artistic aesthetic feature in the process of piano performance. Piano education as one of the ways of music education, and other subjects, there is the subject of the universal education as its educational ideas, educational ideas and teaching methods, a direct impact on the generation after generation of the comprehensive quality of education. Under this basis, we propose the new idea on the issues that will be beneficial.
基金Academy of Fundamental and Interdisciplinary Sciences,Harbin Institute of Technology
文摘In order to investigate sample minimization for classification of supercritical and subcritical patterns in supersonic inlet, three optimization methods, namely, opposite one towards nearest method, closest one towards the byper-plane method and random selection method, are proposed for investigation on minimization of classification samples for supercritical and subcritical patterns of supersonic inlet. The study has been carried out to analyze wind tunnel test data and to compare the classification accuracy based on those three methods with or without priori knowledge. Those three methods are different from each other by different selecting methods for samples. The results show that one of the optimization methods needs the minimization samples to get the highest classification accuracy without priori knowledge. Meanwhile, the number of minimization samples needed to get highest classification accuracy can be further reduced by introducing priori knowledge. Furthermore, it demonstrates that the best optimization method has been found by comparing all cases studied with or without introducing priori knowledge. This method can be applied to reduce the number of wind tunnel tests to obtain the inlet performance and to identify the supercritical/subcritical modes for supersonic inlet.