At medium or long distance (〉 10 kin) underwater acoustic speech communication, information transfer rate is constrained by the complicated, time varying channel and limited bandwidth. The bit rate of speech coding...At medium or long distance (〉 10 kin) underwater acoustic speech communication, information transfer rate is constrained by the complicated, time varying channel and limited bandwidth. The bit rate of speech coding is required to be as low as possible. The time delay of underwater acoustic wave propagation can be used for low bit rate speech coding. After investigating the Mixed Excitation Linear Prediction (MELP) standard and taking account of the auditory perceptual features, a variable and adjustable bit rate speech codec algorithm has been proposed, whose average bit rate is about 600 bps. The average Perceptual Evaluation of Speech Quality Mean Opinion Score (PESQ MOS) of synthesized speeches is about 2.8. It has been proved by the computer simulation and sea trial that the performance of the proposed algorithm is well and robust when bit error rate is no more than 10-3. The synthesized speech is vivid and intelligible, and keeps main individual characteristics of speaker.展开更多
基金supported by the National Natural Science Foundation of China(61102152)
文摘At medium or long distance (〉 10 kin) underwater acoustic speech communication, information transfer rate is constrained by the complicated, time varying channel and limited bandwidth. The bit rate of speech coding is required to be as low as possible. The time delay of underwater acoustic wave propagation can be used for low bit rate speech coding. After investigating the Mixed Excitation Linear Prediction (MELP) standard and taking account of the auditory perceptual features, a variable and adjustable bit rate speech codec algorithm has been proposed, whose average bit rate is about 600 bps. The average Perceptual Evaluation of Speech Quality Mean Opinion Score (PESQ MOS) of synthesized speeches is about 2.8. It has been proved by the computer simulation and sea trial that the performance of the proposed algorithm is well and robust when bit error rate is no more than 10-3. The synthesized speech is vivid and intelligible, and keeps main individual characteristics of speaker.