The perceptual effect of the phase information in speech has been studied by auditorysubjective tests. On the condition that the phase spectrum in speech is changed while amplitudespectrum is unchanged, the tests show...The perceptual effect of the phase information in speech has been studied by auditorysubjective tests. On the condition that the phase spectrum in speech is changed while amplitudespectrum is unchanged, the tests show that: (1) If the envelop of the reconstructed speech signalis unchanged, there is indistinctive auditory perception between the original speech and thereconstructed speech; (2) The auditory perception effect of the reconstructed speech mainly lieson the amplitude of the derivative of the additive phase; (3) td is the maximum relative time shiftbetween different frequency components of the reconstructed speech signal. The speech qualityis excellent while td <10ms; good while 10ms< td <20ms; common while 20ms< td <35ms, andpoor while td >35ms.展开更多
Based on the approximate sparseness of speech in wavelet basis,a compressed sensing theory is applied to compress and reconstruct speech signals.Compared with one-dimensional orthogonal wavelet transform(OWT),two-dime...Based on the approximate sparseness of speech in wavelet basis,a compressed sensing theory is applied to compress and reconstruct speech signals.Compared with one-dimensional orthogonal wavelet transform(OWT),two-dimensional OWT combined with Dmeyer and biorthogonal wavelet is firstly proposed to raise running efficiency in speech frame processing,furthermore,the threshold is set to improve the sparseness.Then an adaptive subgradient projection method(ASPM)is adopted for speech reconstruction in compressed sensing.Meanwhile,mechanism which adaptively adjusts inflation parameter in different iterations has been designed for fast convergence.Theoretical analysis and simulation results conclude that this algorithm has fast convergence,and lower reconstruction error,and also exhibits higher robustness in different noise intensities.展开更多
基金the National Natural Science Foundation of China (No.60071029)
文摘The perceptual effect of the phase information in speech has been studied by auditorysubjective tests. On the condition that the phase spectrum in speech is changed while amplitudespectrum is unchanged, the tests show that: (1) If the envelop of the reconstructed speech signalis unchanged, there is indistinctive auditory perception between the original speech and thereconstructed speech; (2) The auditory perception effect of the reconstructed speech mainly lieson the amplitude of the derivative of the additive phase; (3) td is the maximum relative time shiftbetween different frequency components of the reconstructed speech signal. The speech qualityis excellent while td <10ms; good while 10ms< td <20ms; common while 20ms< td <35ms, andpoor while td >35ms.
基金Supported by the National Natural Science Foundation of China(No.60472058,60975017)the Fundamental Research Funds for the Central Universities(No.2009B32614,2009B32414)
文摘Based on the approximate sparseness of speech in wavelet basis,a compressed sensing theory is applied to compress and reconstruct speech signals.Compared with one-dimensional orthogonal wavelet transform(OWT),two-dimensional OWT combined with Dmeyer and biorthogonal wavelet is firstly proposed to raise running efficiency in speech frame processing,furthermore,the threshold is set to improve the sparseness.Then an adaptive subgradient projection method(ASPM)is adopted for speech reconstruction in compressed sensing.Meanwhile,mechanism which adaptively adjusts inflation parameter in different iterations has been designed for fast convergence.Theoretical analysis and simulation results conclude that this algorithm has fast convergence,and lower reconstruction error,and also exhibits higher robustness in different noise intensities.