期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
WTASR:Wavelet Transformer for Automatic Speech Recognition of Indian Languages
1
作者 tripti choudhary Vishal Goyal Atul Bansal 《Big Data Mining and Analytics》 EI CSCD 2023年第1期85-91,共7页
Automatic speech recognition systems are developed for translating the speech signals into the corresponding text representation.This translation is used in a variety of applications like voice enabled commands,assist... Automatic speech recognition systems are developed for translating the speech signals into the corresponding text representation.This translation is used in a variety of applications like voice enabled commands,assistive devices and bots,etc.There is a significant lack of efficient technology for Indian languages.In this paper,an wavelet transformer for automatic speech recognition(WTASR)of Indian language is proposed.The speech signals suffer from the problem of high and low frequency over different times due to variation in speech of the speaker.Thus,wavelets enable the network to analyze the signal in multiscale.The wavelet decomposition of the signal is fed in the network for generating the text.The transformer network comprises an encoder decoder system for speech translation.The model is trained on Indian language dataset for translation of speech into corresponding text.The proposed method is compared with other state of the art methods.The results show that the proposed WTASR has a low word error rate and can be used for effective speech recognition for Indian language. 展开更多
关键词 TRANSFORMER WAVELET automatic speech recognition(ASR) Indian language
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部