The Perception Spectrogram Structure Boundary (PSSB) parameter is proposed for speech endpoint detection as a preprocessing step for speech or speaker recognition. First, speech enhancement based on auditory perception is carried out. Then a two-dimensional enhancement is applied to the spectrogram, exploiting the difference between the deterministic distribution of speech and the random distribution of noise. Finally, the endpoint decision is made using the PSSB parameter. Experimental results show that, in low-SNR environments from -10 dB to 10 dB, the proposed algorithm can achieve higher accuracy than existing endpoint detection algorithms. A detection accuracy of 75.2% is reached even at an extremely low SNR of -10 dB. The algorithm is therefore suitable for speech endpoint detection in low-SNR environments.
The perceptual representation of the prosodic structure of Chinese sentences was constructed statistically using multidimensional scaling analysis, based on the results of a discrimination experiment in which listeners were asked to compare the perceptual distances between two adjacent syllables in each of six sentences. Listeners' ability to resolve levels of the prosodic hierarchy and the relationship between prosodic and syntactic structures are discussed in relation to the perceptual representations.
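As a rough illustration of how multidimensional scaling turns pairwise distance judgments into a spatial representation, the following is a minimal sketch of classical (Torgerson) MDS. It is not the authors' procedure: the abstract does not say which MDS variant was used or how the distance matrix was assembled, and the function name `classical_mds`, the made-up boundary values, and the choice to accumulate adjacent-syllable distances into a full matrix are all assumptions made for this example.

```python
import numpy as np


def classical_mds(distances, n_components=2):
    """Embed points from a symmetric distance matrix (classical/Torgerson MDS)."""
    d = np.asarray(distances, dtype=float)
    n = d.shape[0]
    # Double-centre the squared distances to obtain a Gram (inner-product) matrix.
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (d ** 2) @ centering
    # eigh returns eigenvalues in ascending order; keep the largest ones.
    eigvals, eigvecs = np.linalg.eigh(gram)
    order = np.argsort(eigvals)[::-1][:n_components]
    # Clip tiny negative eigenvalues that arise from rounding or non-Euclidean data.
    scale = np.sqrt(np.clip(eigvals[order], 0.0, None))
    return eigvecs[:, order] * scale


# Made-up perceived distances between adjacent syllables of a 5-syllable
# sentence (illustrative values only, not data from the paper).
adjacent = np.array([1.0, 3.0, 1.0, 2.0])

# Assumption: distances between non-adjacent syllables are the sums of the
# adjacent distances lying between them.
positions = np.concatenate([[0.0], np.cumsum(adjacent)])
distances = np.abs(positions[:, None] - positions[None, :])

coords = classical_mds(distances, n_components=1)
print(coords.ravel())
```

In the resulting one-dimensional layout, larger gaps between neighbouring syllables correspond to larger perceived distances, which is the kind of pattern one might read as a stronger prosodic boundary. The sign of the coordinates is arbitrary, since MDS solutions are only defined up to reflection and rotation.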
Funding: supported by the National Natural Science Foundation of China (61071215, 61271359, 61372146).