An Innovative Approach Utilizing Binary-View Transformer for Speech Recognition Task 被引量：3

下载PDF

导出

摘要 The deep learning advancements have greatly improved the performance of speech recognition systems,and most recent systems are based on the Recurrent Neural Network(RNN).Overall,the RNN works fine with the small sequence data,but suffers from the gradient vanishing problem in case of large sequence.The transformer networks have neutralized this issue and have shown state-of-the-art results on sequential or speech-related data.Generally,in speech recognition,the input audio is converted into an image using Mel-spectrogram to illustrate frequencies and intensities.The image is classified by the machine learning mechanism to generate a classification transcript.However,the audio frequency in the image has low resolution and causing inaccurate predictions.This paper presents a novel end-to-end binary view transformer-based architecture for speech recognition to cope with the frequency resolution problem.Firstly,the input audio signal is transformed into a 2D image using Mel-spectrogram.Secondly,the modified universal transformers utilize the multi-head attention to derive contextual information and derive different speech-related features.Moreover,a feedforward neural network is also deployed for classification.The proposed system has generated robust results on Google’s speech command dataset with an accuracy of 95.16%and with minimal loss.The binary-view transformer eradicates the eventuality of the over-fitting problem by deploying a multiview mechanism to diversify the input data,and multi-head attention captures multiple contexts from the data’s feature map.

作者 Muhammad Babar Kamal Arfat Ahmad Khan Faizan Ahmed Khan Malik Muhammad Ali Shahid Chitapong Wechtaisong Muhammad Daud Kamal Muhammad Junaid Ali Peerapong Uthansakul

机构地区 COMSATS University Islamabad Suranaree University of Technology COMSATS University Islamabad COMSATS University Islamabad National University of Sciences&Technology Virtual University of Pakistan

出处《Computers, Materials & Continua》 SCIE EI 2022年第9期5547-5562,共16页 计算机、材料和连续体（英文）

基金 This research was supported by Suranaree University of Technology,Thailand,Grant Number:BRO7-709-62-12-03.

关键词 Convolution neural network multi-head attention MULTI-VIEW RNN self-attention speech recognition TRANSFORMER

分类号 TN9 [电子电信—信息与通信工程]

引文网络
相关文献

参考文献1

1Arfat Ahmad Khan,Chitapong Wechtaisong,Faizan Ahmed Khan,Nadeem Ahmad.A Cost-Efficient Environment Monitoring Robotic Vehicle for Smart Industries[J].Computers, Materials & Continua,2022(4):473-487. 被引量：5

二级参考文献2

1Jamal Mabrouki,Mourade Azrour,Driss Dhiba,Yousef Farhaoui,Souad El Hajjaji.IoT-Based Data Logger for Weather Monitoring Using Arduino-Based Wireless Sensor Networks with Remote Graphical Application and Alerts[J].Big Data Mining and Analytics,2021,4(1):25-32. 被引量：7
2Yong Chen,Hong Chen,Anjee Gorkhali,Yang Lu,Yiqian Ma,Ling Li.Big data analytics and big data science:a survey[J].Journal of Management Analytics,2016,3(1):1-42. 被引量：5

共引文献4

1Anuj Sharma,Deepak Prashar,Arfat Ahmad Khan,Faizan Ahmed Khan,Settawit Poochaya.Automatic Leukaemia Segmentation Approach for Blood Cancer Classification Using Microscopic Images[J].Computers, Materials & Continua,2022(11):3629-3648. 被引量：1
2Settawit Poochaya,Peerapong Uthansakul,Monthippa Uthansakul,Patikorn Anchuen,Kontorn Thammakul,Arfat Ahmad Khan,Niwat Punanwarakorn,Pech Sirivoratum,Aranya Kaewkrad,Panrawee Kanpan,Apichart Wantamee.A Multi-Mode Public Transportation System Using Vehicular to Network Architecture[J].Computers, Materials & Continua,2022(12):5845-5862.
3Dalwinder Singh,Deepak Prashar,Jimmy Singla,Arfat Ahmad Khan,Mohammed Al-Sarem,Neesrin Ali Kurdi.Intelligent Medical Diagnostic System for Hepatitis B[J].Computers, Materials & Continua,2022(12):6047-6068.
4SVinson Joshua,ASelwin Mich Priyadharson,Raju Kannadasan,Arfat Ahmad Khan,Worawat Lawanont,Faizan Ahmed Khan,Ateeq Ur Rehman,Muhammad Junaid Ali.Crop Yield Prediction Using Machine Learning Approaches on a Wide Spectrum[J].Computers, Materials & Continua,2022(9):5663-5679. 被引量：3

同被引文献17

1Lingyun Gao,Mingquan Ye,Xiaojie Lu,Daobin Huang.Hybrid Method Based on Information Gain and Support Vector Machine for Gene Selection in Cancer Classi?cation[J].Genomics, Proteomics & Bioinformatics,2017,15(6):389-395. 被引量：5
2Feng Li,Chaofeng Ou,Yan Gui,Lingyun Xiang.Instant Edit Propagation on Images Based on Bilateral Grid[J].Computers, Materials & Continua,2019(8):643-656. 被引量：6
3Shiming He,Zhuozhou Li,Yangning Tang,Zhuofan Liao,Feng Li,Se-Jung Lim.Parameters Compressing in Deep Learning[J].Computers, Materials & Continua,2020(1):321-336. 被引量：9
4Reham Alabduljabbar,Hala Alshamlan.Intelligent Multiclass Skin Cancer Detection Using Convolution Neural Networks[J].Computers, Materials & Continua,2021(10):831-847. 被引量：1
5Osamah Ibrahim Khalaf,Munsif Sokiyna,Youseef Alotaibi,Abdulmajeed Alsufyani,Saleh Alghamdi.Web Attack Detection Using the Input Validation Method:DPDA Theory[J].Computers, Materials & Continua,2021(9):3167-3184. 被引量：3
6Youseef Alotaibi,Muhammad Noman Malik,Huma Hayat Khan,Anab Batool,Saif ul Islam,Abdulmajeed Alsufyani,Saleh Alghamdi.Suggestion Mining from Opinionated Text of Big Social Media Data[J].Computers, Materials & Continua,2021(9):3323-3338. 被引量：6
7Anh Tuan Hoang,Xuan Phuong Nguyen,Osamah Ibrahim Khalaf,Thi Xuan Tran,Minh Quang Chau,Thi Minh Hao Dong,Duong Nam Nguyen.Thermodynamic Simulation on the Change in Phase for Carburizing Process[J].Computers, Materials & Continua,2021(7):1129-1145. 被引量：1
8Youseef Alotaibi.A New Database Intrusion Detection Approach Based on Hybrid Meta-Heuristics[J].Computers, Materials & Continua,2021(2):1879-1895. 被引量：9
9Dengyong Zhang,Jiawei Hu,Feng Li,Xiangling Ding,Arun Kumar Sangaiah,Victor SSheng.Small Object Detection via Precise Region-Based Fully Convolutional Networks[J].Computers, Materials & Continua,2021(11):1503-1517. 被引量：9
10Anuj Sharma,Deepak Prashar,Arfat Ahmad Khan,Faizan Ahmed Khan,Settawit Poochaya.Automatic Leukaemia Segmentation Approach for Blood Cancer Classification Using Microscopic Images[J].Computers, Materials & Continua,2022(11):3629-3648. 被引量：1

引证文献3

1Anuj Sharma,Deepak Prashar,Arfat Ahmad Khan,Faizan Ahmed Khan,Settawit Poochaya.Automatic Leukaemia Segmentation Approach for Blood Cancer Classification Using Microscopic Images[J].Computers, Materials & Continua,2022(11):3629-3648. 被引量：1
2Settawit Poochaya,Peerapong Uthansakul,Monthippa Uthansakul,Patikorn Anchuen,Kontorn Thammakul,Arfat Ahmad Khan,Niwat Punanwarakorn,Pech Sirivoratum,Aranya Kaewkrad,Panrawee Kanpan,Apichart Wantamee.A Multi-Mode Public Transportation System Using Vehicular to Network Architecture[J].Computers, Materials & Continua,2022(12):5845-5862.
3Dalwinder Singh,Deepak Prashar,Jimmy Singla,Arfat Ahmad Khan,Mohammed Al-Sarem,Neesrin Ali Kurdi.Intelligent Medical Diagnostic System for Hepatitis B[J].Computers, Materials & Continua,2022(12):6047-6068.

二级引证文献1

1Dalwinder Singh,Deepak Prashar,Jimmy Singla,Arfat Ahmad Khan,Mohammed Al-Sarem,Neesrin Ali Kurdi.Intelligent Medical Diagnostic System for Hepatitis B[J].Computers, Materials & Continua,2022(12):6047-6068.

1吴雅娟,牛甲奎,解红涛,马宁.基于词典与字向量融合的井控领域命名实体识别[J].海南大学学报（自然科学版）,2022,40(2):125-133. 被引量：1
2马骁鹏.基于UNet和注意力机制的太阳光球层磁场预测模型[J].科学技术创新,2022(18):74-77.
3余力,李慧媛,焦晨璐,冷友方,徐冠宇.基于多头注意力对抗机制的复杂场景行人轨迹预测[J].计算机学报,2022,45(6):1133-1146. 被引量：4
4Peter Kamp Busk.Accurate, automatic annotation of peptidases with hotpep-protease[J].Green Chemical Engineering,2020,1(2):124-130.
5刘楠,张凤荔,王瑞锦,张志扬,赖金山.融合元路径学习和胶囊网络的社交媒体谣言检测方法[J].电子科技大学学报,2022,51(4):608-614.
6肖贵明,丁德锐,梁伟,魏国亮.一种基于共享多头模块的轻量型一阶段网络[J].智能计算机与应用,2022,12(7):90-94.
7Baiyan Zhang,Hefei Ling,Ping Li,Qian Wang,Yuxuan Shi,Lei Wu,Runsheng Wang,Jialie Shen.Multi-Head Attention Graph Network for Few Shot Learning[J].Computers, Materials & Continua,2021(8):1505-1517. 被引量：1
8吴峰,周军,谢聪,姬少培.基于交互式学习与多头注意力机制的金融文本情感分类[J].现代计算机,2022,28(11):1-9.
9Qing Deng.A BP neural network optimisation method based on dynamical regularization[J].Journal of Control and Decision,2019,6(2):111-121. 被引量：3
10Asif Mahmood,Ahmad Irfan,Jin-Liang Wang.Machine Learning for Organic Photovoltaic Polymers:A Minireview[J].Chinese Journal of Polymer Science,2022,40(8):870-876. 被引量：1

Computers, Materials & Continua

2022年第9期

浏览历史

内容加载中请稍等...