44 articles found
Automatic recognition of sonar targets using feature selection in micro-Doppler signature
1
Authors: Abbas Saffari, Seyed-Hamid Zahiri, Mohammad Khishe. Defence Technology (防务技术), SCIE EI CAS CSCD, 2023, No. 2, pp. 58-71 (14 pages)
Currently, the use of intelligent systems for the automatic recognition of targets in the fields of defence and military has increased significantly. The primary advantage of these systems is that they do not need human participation in target recognition processes. This paper uses the particle swarm optimization (PSO) algorithm to select the optimal features in the micro-Doppler signature of sonar targets. The micro-Doppler effect refers to amplitude/phase modulation of the received signal by rotating parts of a target, such as propellers. Since different targets' geometric and physical properties are not the same, their micro-Doppler signatures differ. This inconsistency can be considered a practical issue (especially in the frequency domain) for sonar target recognition. Despite the use of a 128-point fast Fourier transform (FFT) for the feature extraction step, not all extracted features contain helpful information. As a result, PSO selects the most valuable features. To evaluate the micro-Doppler signature of sonar targets and the effect of feature selection on sonar target recognition, the simplest and most popular machine learning algorithm, k-nearest neighbor (k-NN), is used; the combination is called k-PSO in this paper because PSO is used for feature selection. The parameters measured are the correct recognition rate, reliability rate, and processing time. The simulation results show that k-PSO achieved a 100% correct recognition rate and reliability rate in 19.35 s when using simulated data at a 15 dB signal-to-noise ratio (SNR) and an angle of 40°. For the experimental dataset obtained from the cavitation tunnel, the correct recognition rate is 98.26% and the reliability rate is 99.69%, in 18.46 s. Therefore, k-PSO shows encouraging performance in automatically recognizing sonar targets on experimental datasets and for real-world use.
Keywords: micro-Doppler signature; automatic recognition; feature selection; k-NN; PSO
Automatic Recognition of Construction Worker Activities Using Deep Learning Approaches and Wearable Inertial Sensors
2
Authors: Sakorn Mekruksavanich, Anuchit Jitpattanakul. Intelligent Automation & Soft Computing, SCIE, 2023, No. 5, pp. 2111-2128 (18 pages)
The automated evaluation and analysis of employee behavior in an Industry 4.0-compliant manufacturing firm are vital for the rapid and accurate diagnosis of work performance, particularly during the training of a new worker. Various techniques for identifying and detecting worker performance in industrial applications are based on computer vision. Despite the widespread use of computer vision-based approaches, it is challenging to develop technologies that assist the automated monitoring of worker actions at external working sites where camera deployment is problematic. Through the use of wearable inertial sensors, we propose a deep learning method for automatically recognizing the activities of construction workers. The suggested method incorporates a convolutional neural network, residual connection blocks, and multi-branch aggregate transformation modules for high-performance recognition of complicated activities such as construction worker tasks. The proposed approach has been evaluated using standard performance measures, such as precision, F1-score, and AUC, on a publicly available benchmark dataset known as VTT-ConIoT, which contains genuine construction work activities. In addition, standard deep learning models (CNNs, RNNs, and hybrid models) were developed in different empirical circumstances for comparison with the proposed model. With an average accuracy of 99.71% and an average F1-score of 99.71%, the experimental findings revealed that the suggested model could accurately recognize the actions of construction workers. Furthermore, we examined the impact of window size and sensor position on the identification efficiency of the proposed method.
Keywords: complex human activity recognition; wearable inertial sensors; deep learning; construction workers; automatic recognition
Study on automatic recognition of the first motion in a seismic event
3
Authors: 谢永杰, 陶果. Acta Seismologica Sinica (English Edition), EI CSCD, 2000, No. 5, pp. 585-590 (6 pages)
In this paper, we have studied the waveforms of background noise in a seismograph and set up an AR model to characterize them. We then complete the modeling and the automatic recognition program. Finally, we provide the results from automatic recognition and the manual recognition of the first motion for 25 underground explosions.
Keywords: seismic signal; underground explosion; AR model; first motion; automatic recognition
Automatic Recognition Algorithm of AM Signals Based on Spectrum and Modulation Characters
4
Authors: Xiao-Fei Zhang, Liang Chang, Pei-Ming Ren, Rong Liu. Journal of Electronic Science and Technology, CAS, 2012, No. 2, pp. 163-166 (4 pages)
To meet the practical requirement of automatically monitoring shortwave signals over wide band ranges, a technique for automatic recognition is studied in this paper. Based on the spectrum and modulation characteristics of amplitude modulation (AM) signals, an automatic recognition scheme for AM signals is proposed. The proposed scheme makes a joint judgment using four different characteristic parameters. Experimental results indicate that the proposed scheme can effectively recognize AM signals in practice.
Keywords: amplitude modulation; automatic recognition; characteristic parameters; shortwave radio
Automatic modulation recognition of radiation source signals based on two-dimensional data matrix and improved residual neural network
5
Authors: Guanghua Yi, Xinhong Hao, Xiaopeng Yan, Jian Dai, Yangtian Liu, Yanwen Han. Defence Technology (防务技术), SCIE EI CAS CSCD, 2024, No. 3, pp. 364-373 (10 pages)
Automatic modulation recognition (AMR) of radiation source signals is a research focus in the field of cognitive radio. However, the AMR of radiation source signals at low SNRs still faces a great challenge. Therefore, an AMR method for radiation source signals based on a two-dimensional data matrix and an improved residual neural network is proposed in this paper. First, the time series of the radiation source signals are reconstructed into a two-dimensional data matrix, which greatly simplifies the signal preprocessing process. Second, a residual neural network based on depthwise convolution and large-size convolutional kernels (DLRNet) is proposed to improve the feature extraction capability of the AMR model. Finally, the model performs feature extraction and classification on the two-dimensional data matrix to obtain the recognition vector that represents the signal modulation type. Theoretical analysis and simulation results show that the AMR method based on the two-dimensional data matrix and improved residual network can significantly improve recognition accuracy. The recognition accuracy of the proposed method remains above 90% even at -14 dB SNR.
Keywords: automatic modulation recognition; radiation source signals; two-dimensional data matrix; residual neural network; depthwise convolution
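The first step named in the abstract above, rearranging a signal's time series into a two-dimensional data matrix, can be illustrated with a minimal sketch. The abstract does not state the matrix layout, so the row-major fill, the function name `to_matrix`, and the choice to drop trailing samples are all assumptions made here for illustration, not the authors' method.

```python
def to_matrix(samples, n_cols):
    """Rearrange a 1-D sample sequence into rows of n_cols samples.

    Assumption: row-major fill; samples that do not fill a whole
    final row are dropped.
    """
    if n_cols <= 0:
        raise ValueError("n_cols must be positive")
    n_rows = len(samples) // n_cols
    return [list(samples[r * n_cols:(r + 1) * n_cols]) for r in range(n_rows)]
```

A matrix built this way could then be passed to an image-style classifier, which is the role the abstract assigns to the residual network.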
Automatic modulation recognition of radio fuzes using a DR2D-based adaptive denoising method and textural feature extraction
6
Authors: Yangtian Liu, Xiaopeng Yan, Qiang Liu, Tai An, Jian Dai. Defence Technology (防务技术), SCIE EI CAS CSCD, 2024, No. 4, pp. 328-338 (11 pages)
The identification of intercepted radio fuze modulation types is a prerequisite for decision-making in interference systems. However, the electromagnetic environment of modern battlefields is complex, and the signal-to-noise ratio (SNR) of such environments is usually low, which makes it difficult to recognize radio fuzes accurately. To solve this problem, a radio fuze automatic modulation recognition (AMR) method for low-SNR environments is proposed. First, an adaptive denoising algorithm based on data rearrangement and the two-dimensional (2D) fast Fourier transform (FFT) (DR2D) is used to reduce the noise of the intercepted radio fuze intermediate frequency (IF) signal. Then, the textural features of the rearranged data matrix of the denoised IF signal are extracted from the statistical indicator vectors of gray-level co-occurrence matrices (GLCMs), and support vector machines (SVMs) are used for classification. The DR2D-based adaptive denoising algorithm achieves an average correlation coefficient of more than 0.76 for ten fuze types at SNRs of -10 dB and above, which is higher than that of other typical algorithms. The trained SVM classification model achieves an average recognition accuracy of more than 96% on seven modulation types and recognition accuracies of more than 94% on each modulation type at SNRs of -12 dB and above, which represents good AMR performance for radio fuzes at low SNRs.
Keywords: automatic modulation recognition; adaptive denoising; data rearrangement and the 2D FFT (DR2D); radio fuze
Automatic recognition and intelligent analysis of central shrinkage defects of continuous casting billets based on deep learning (cited by: 1)
7
Authors: Gong-hao Lian, Qi-hao Sun, Xiao-ming Liu, Wei-miao Kong, Ming Lv, Jian-jun Qi, Yong Liu, Ben-ming Yuan, Qiang Wang. Journal of Iron and Steel Research (International), SCIE EI CAS CSCD, 2023, No. 5, pp. 937-948 (12 pages)
The internal quality inspection of continuous casting billets is very important, and mis-inspection will seriously affect the subsequent production process. The UNet-VGG16 transfer learning model was used for semantic segmentation of the central shrinkage defects of continuous casting billets. The automatic recognition accuracy for these defects exceeds 0.9. We use the minimum circumscribed rectangle to quantify geometric dimensions such as the length, width and area of the central shrinkage defects, and use the threshold method to rate the defects. The results show that all the testing images are rated correctly, and that this method achieves automatic recognition and intelligent analysis of the central shrinkage defects of continuous casting billets.
Keywords: central shrinkage; deep learning; image segmentation; circumscribed rectangle; automatic recognition
Radar Signal Intra-Pulse Modulation Recognition Based on Deep Residual Network
8
Authors: Fuyuan Xu, Guangqing Shao, Jiazhan Lu, Zhiyin Wang, Zhipeng Wu, Shuhang Xia. Journal of Beijing Institute of Technology, EI CAS, 2024, No. 2, pp. 155-162 (8 pages)
In view of the low recognition rate of traditional methods for complex radar intra-pulse modulation signal types under low signal-to-noise ratio (SNR), the paper proposes an automatic recognition method for complex radar intra-pulse modulation signal types based on a deep residual network. The basic principle of the recognition method is to obtain the relationship between the time and frequency of the complex radar intra-pulse modulation signal through the short-time Fourier transform (STFT), and then to design an appropriate deep residual network that extracts the features of the time-frequency map and recognizes a variety of complex intra-pulse modulation signal types. In addition, label smoothing and L2 regularization are introduced to improve the generalization ability of the proposed method. The simulation results show that the proposed method has a recognition accuracy of more than 95% for complex radar intra-pulse modulation signal types under low SNR (2 dB).
Keywords: intra-pulse modulation; low signal-to-noise; deep residual network; automatic recognition
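The label smoothing mentioned in the abstract above can be illustrated with the standard uniform formulation, in which the one-hot target is mixed with a uniform distribution over the classes. This is a minimal sketch only: the abstract gives no smoothing factor, so `epsilon=0.1` and the function name `smooth_labels` are assumptions, and the authors may have used a different variant.

```python
def smooth_labels(class_index, num_classes, epsilon=0.1):
    """Return a smoothed target distribution for one training example.

    Standard uniform label smoothing: (1 - epsilon) * one_hot + epsilon / K.
    The value of epsilon is an illustrative assumption.
    """
    if not 0 <= class_index < num_classes:
        raise ValueError("class_index out of range")
    if not 0.0 <= epsilon <= 1.0:
        raise ValueError("epsilon must lie in [0, 1]")
    off_value = epsilon / num_classes
    target = [off_value] * num_classes
    target[class_index] += 1.0 - epsilon
    return target
```

Training against such softened targets discourages over-confident predictions, which is the usual motivation for using it to improve generalization.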
Joint On-Demand Pruning and Online Distillation in Automatic Speech Recognition Language Model Optimization
9
Authors: Soonshin Seo, Ji-Hwan Kim. Computers, Materials & Continua, SCIE EI, 2023, No. 12, pp. 2833-2856 (24 pages)
Automatic speech recognition (ASR) systems have emerged as indispensable tools across a wide spectrum of applications, ranging from transcription services to voice-activated assistants. To enhance the performance of these systems, it is important to deploy efficient models capable of adapting to diverse deployment conditions. In recent years, on-demand pruning methods have attracted significant attention within the ASR domain due to their adaptability in various deployment scenarios. However, these methods often confront substantial trade-offs, particularly in terms of unstable accuracy when reducing the model size. To address these challenges, this study introduces two crucial empirical findings. Firstly, it proposes the incorporation of an online distillation mechanism during on-demand pruning training, which holds the promise of maintaining more consistent accuracy levels. Secondly, it proposes the utilization of the Mogrifier long short-term memory (LSTM) language model (LM), an advanced iteration of the conventional LSTM LM, as an effective alternative for pruning targets within the ASR framework. Through rigorous experimentation on the ASR system, employing the Mogrifier LSTM LM and training it using the suggested joint on-demand pruning and online distillation method, this study provides compelling evidence. The results show that the proposed methods significantly outperform a benchmark model trained solely with on-demand pruning methods. The proposed configuration reduces the parameter count by approximately 39% while minimizing trade-offs.
Keywords: automatic speech recognition; neural language model; Mogrifier long short-term memory; pruning; distillation; efficient deployment; optimization; joint training
Audio-Text Multimodal Speech Recognition via Dual-Tower Architecture for Mandarin Air Traffic Control Communications
10
Authors: Shuting Ge, Jin Ren, Yihua Shi, Yujun Zhang, Shunzhi Yang, Jinfeng Yang. Computers, Materials & Continua, SCIE EI, 2024, No. 3, pp. 3215-3245 (31 pages)
In air traffic control communications (ATCC), misunderstandings between pilots and controllers could result in fatal aviation accidents. Fortunately, advanced automatic speech recognition technology has emerged as a promising means of preventing miscommunications and enhancing aviation safety. However, most existing speech recognition methods merely incorporate external language models on the decoder side, leading to insufficient semantic alignment between speech and text modalities during the encoding phase. Furthermore, it is challenging to model acoustic context dependencies over long distances because speech sequences are longer than text, especially for the extended ATCC data. To address these issues, we propose a speech-text multimodal dual-tower architecture for speech recognition. It employs cross-modal interactions to achieve close semantic alignment during the encoding stage and strengthen its capabilities in modeling auditory long-distance context dependencies. In addition, a two-stage training strategy is devised to derive semantics-aware acoustic representations effectively. The first stage focuses on pre-training the speech-text multimodal encoding module to enhance inter-modal semantic alignment and aural long-distance context dependencies. The second stage fine-tunes the entire network to bridge the input modality variation gap between the training and inference phases and boost generalization performance. Extensive experiments demonstrate the effectiveness of the proposed speech-text multimodal speech recognition method on the ATCC and AISHELL-1 datasets. It reduces the character error rate to 6.54% and 8.73%, respectively, and exhibits substantial performance gains of 28.76% and 23.82% compared with the best baseline model. The case studies indicate that the obtained semantics-aware acoustic representations aid in accurately recognizing terms with similar pronunciations but distinctive semantics. The research provides a novel modeling paradigm for semantics-aware speech recognition in air traffic control communications, which could contribute to the advancement of intelligent and efficient aviation safety management.
Keywords: speech-text multimodal; automatic speech recognition; semantic alignment; air traffic control communications; dual-tower architecture
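The character error rate (CER) quoted in the abstract above is conventionally the character-level edit distance between the recognized text and the reference, divided by the reference length. The sketch below shows that common definition; the function names are illustrative, and the paper's exact scoring rules (for example, text normalization before scoring) are not stated in the abstract.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance: minimum substitutions, insertions and deletions."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def cer(ref, hyp):
    """Character error rate relative to the reference transcript."""
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)
```

Because insertions are counted, a CER computed this way can exceed 1.0 when the hypothesis is much longer than the reference.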
Machine learning guided automatic recognition of crystal boundaries in bainitic/martensitic alloy and relationship between boundary types and ductile-to-brittle transition behavior (cited by: 4)
11
Authors: X.C. Li, J.X. Zhao, J.H. Cong, R.D.K. Misra, X.M. Wang, X.L. Wang, C.J. Shang. Journal of Materials Science & Technology, SCIE EI CAS CSCD, 2021, No. 25, pp. 49-58 (10 pages)
The gradient boosting decision tree (GBDT) machine learning (ML) method was adopted for the first time to automatically recognize and conduct quantitative statistical analysis of boundaries in bainitic microstructure using electron back-scatter diffraction (EBSD) data. In spite of the lack of large sets of EBSD data, we were successful in achieving the desired accuracy and accomplishing the objective of recognizing the boundaries. Compared with a low model accuracy of <50% when using Euler angles or the axis-angle pair as characteristic features, the accuracy of the model was significantly enhanced to about 88% when the Euler angles were converted to the overall misorientation angle (OMA) and specific misorientation angle (SMA) and these were used as features. In this model, the recall score for prior austenite grain (PAG) boundaries was ~93%, for high angle packet boundaries (OMA>40°) ~97%, and for block boundaries ~96%. The outcomes of ML were used to obtain insights into the ductile-to-brittle transition (DBTT) behavior. Interestingly, the ML modeling approach suggested that DBTT was not determined by the density of high angle grain boundaries, but was significantly influenced by the density of PAG and packet boundaries. The study underscores that ML has great potential for the detailed recognition of complex multi-hierarchical microstructures such as bainite and martensite and for relating them to material performance.
Keywords: machine learning; feature engineering; automatic recognition; lath structure; crystallography
Investigation into the automatic recognition of time series precursor of earthquakes
12
Authors: 黄汉明, 范洪顺, 边银菊, 邹立晔. Acta Seismologica Sinica (English Edition), EI CSCD, 1998, No. 5, pp. 87-96 (10 pages)
In this paper, a new method for the quantitative description of earthquake precursors is proposed; with this method, the precursory pattern of a time series can be quantitatively described by a two-dimensional matrix. On this basis, a method for the automatic recognition or automatic acquirement of precursory patterns, called simply the AA method, is put forward. Then, taking the North China region as an example, various seismological precursors such as the frequency, energy, b-value, etc. and various nonlinear parameter precursors such as the capacity dimension, information dimension, correlation dimension, Hurst index and its difference, etc. are analyzed, and the 8 time series so obtained are recognized automatically using the proposed precursory pattern and AA method. In addition, C-method tests and very rigorous HF (history and future) tests are made. The result shows that the R-value of prediction efficacy assessment is fairly high.
Keywords: earthquake prediction; precursory pattern; automatic recognition; C-method test; HF test
Research on PCA and KPCA Self-Fusion Based MSTAR SAR Automatic Target Recognition Algorithm (cited by: 6)
13
Authors: Chuang Lin, Fei Peng, Bing-Hui Wang, Wei-Feng Sun, Xiang-Jie Kong. Journal of Electronic Science and Technology, CAS, 2012, No. 4, pp. 352-357 (6 pages)
This paper proposes a PCA and KPCA self-fusion based MSTAR SAR automatic target recognition algorithm. The algorithm combines the linear features extracted by principal component analysis (PCA) with the nonlinear features extracted by kernel principal component analysis (KPCA), and then uses an adaptive feature fusion algorithm based on the weighted maximum margin criterion (WMMC) to fuse the features in order to achieve better performance. The linear regression classifier is used in the experiments. The experimental results indicate that the proposed self-fusion algorithm achieves a higher recognition rate than the traditional PCA and KPCA feature fusion algorithms.
Keywords: automatic target recognition; principal component analysis; self-fusion; synthetic aperture radar
Summed volume region selection based three-dimensional automatic target recognition for airborne LIDAR (cited by: 2)
14
Authors: Qi-shu Qian, Yi-hua Hu, Nan-xiang Zhao, Min-le Li, Fu-cai Shao. Defence Technology (防务技术), SCIE EI CAS CSCD, 2020, No. 3, pp. 535-542 (8 pages)
Airborne LIDAR can flexibly obtain point cloud data with three-dimensional structural information, which can improve the effectiveness of automatic target recognition in complex environments. Compared with 2D information, 3D information performs better in separating objects from the background. However, an aircraft platform can have a negative influence on the data obtained by LIDAR because of varying flight attitudes, flight heights and atmospheric disturbances. A global feature based 3D automatic target recognition method for airborne LIDAR is proposed, which is composed of an offline phase and an online phase. The performance of four global feature descriptors is compared. Considering the summed volume region (SVR) discrepancy in real objects, SVR selection is added to the pre-processing operations to eliminate clusters that do not match the target of interest. Highly reliable simulated data are obtained under various sensor altitudes, detection distances and atmospheric disturbances. The final experimental results show that the added step increases the recognition rate by more than 2.4% and decreases the execution time by about 33%.
Keywords: 3D automatic target recognition; point cloud; LIDAR; airborne; global feature descriptor
Automatic Speaker Recognition Using Mel-Frequency Cepstral Coefficients Through Machine Learning (cited by: 1)
15
Authors: Uğur Ayvaz, Hüseyin Gürüler, Faheem Khan, Naveed Ahmed, Taegkeun Whangbo, Abdusalomov Akmalbek Bobomirzaevich. Computers, Materials & Continua, SCIE EI, 2022, No. 6, pp. 5511-5521 (11 pages)
Automatic speaker recognition (ASR) systems belong to the field of human-machine interaction, and scientists have been using feature extraction and feature matching methods to analyze and synthesize voice signals. One of the most commonly used methods for feature extraction is Mel Frequency Cepstral Coefficients (MFCCs). Recent research shows that MFCCs are successful in processing the voice signal with high accuracy. MFCCs represent a sequence of voice signal-specific features. This experimental analysis is proposed to distinguish Turkish speakers by extracting the MFCCs from the speech recordings. Since the human perception of sound is not linear, after the filterbank step in the MFCC method, we converted the obtained log filterbanks into decibel (dB) feature-based spectrograms without applying the Discrete Cosine Transform (DCT). A new dataset was created by converting each spectrogram into a 2-D array. Several learning algorithms were implemented with a 10-fold cross-validation method to detect the speaker. The highest accuracy of 90.2% was achieved using a Multi-layer Perceptron (MLP) with the tanh activation function. The most important output of this study is the inclusion of human voice as a new feature set.
Keywords: automatic speaker recognition; human voice recognition; spatial pattern recognition; MFCCs; spectrogram; machine learning; artificial intelligence
Automatic Mexican Sign Language Recognition Using Normalized Moments and Artificial Neural Networks (cited by: 1)
16
Authors: Francisco Solís, David Martínez, Oscar Espinoza. Engineering (科研), 2016, No. 10, pp. 733-740 (8 pages)
This document presents a computer vision system for the automatic recognition of Mexican Sign Language (MSL), based on normalized moments as descriptors that are invariant to translation and scale transforms, using artificial neural networks as the pattern recognition model. An experimental feature selection was performed to reduce computational costs, because this work focuses on automatic recognition. The computer vision system includes four LED reflectors of 700 lumens each in order to improve image acquisition quality; this illumination system reduces shadows in each sign of the MSL. MSL contains 27 signs in total, but 6 of them are expressed with movement; this paper presents a framework for the automatic recognition of the 21 static signs of MSL. The proposed system achieved a recognition rate of 93%.
Keywords: Mexican Sign Language; automatic sign language recognition; normalized moments; computer vision system
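The translation and scale invariance that the abstract above attributes to normalized moments follows from the textbook definition: central moments are taken about the centroid and then divided by a power of the zeroth moment, eta_pq = mu_pq / mu_00^(1 + (p+q)/2). The sketch below implements that definition for an image given as a list of rows. The function name and the input format are assumptions, and the paper's exact set of moments is not given in the abstract.

```python
def normalized_moment(image, p, q):
    """Normalized central moment eta_pq of a 2-D intensity image.

    image: list of rows of non-negative intensities (assumed format).
    """
    m00 = sum(v for row in image for v in row)
    if m00 == 0:
        raise ValueError("image has no mass")
    # Centroid, so that the moment does not depend on where the shape sits.
    xc = sum(x * v for row in image for x, v in enumerate(row)) / m00
    yc = sum(y * v for y, row in enumerate(image) for v in row) / m00
    mu = sum(((x - xc) ** p) * ((y - yc) ** q) * v
             for y, row in enumerate(image)
             for x, v in enumerate(row))
    # Dividing by a power of m00 compensates for a change of scale.
    return mu / m00 ** (1 + (p + q) / 2)
```

On a pixel grid the translation invariance is exact up to floating-point rounding, while the scale invariance holds only approximately because of discretization.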
Deep Learning and SVM-Based Approach for Indian Licence Plate Character Recognition (cited by: 1)
17
Authors: Nitin Sharma, Mohd Anul Haq, Pawan Kumar Dahiya, B.R. Marwah, Reema Lalit, Nitin Mittal, Ismail Keshta. Computers, Materials & Continua, SCIE EI, 2023, No. 1, pp. 881-895 (15 pages)
Every developing country relies on transportation, and there has been an exponential expansion in the development of various sorts of vehicles with various configurations, which is a major component strengthening the automobile sector. India is a developing country with increasing road traffic, which has resulted in challenges such as increased road accidents and traffic oversight issues. In the absence of a parametric technique for accurate vehicle recognition, which is a major concern in terms of reliability, high traffic density also leads to mayhem at checkpoints and toll plazas. A system that combines an intelligent domain approach with more sustainability indices is a better way to handle traffic density and transparency issues. The Automatic Licence Plate Recognition (ALPR) system is one of the components of the intelligent transportation system for traffic monitoring. This study is based on a comprehensive and detailed literature evaluation in the field of ALPR. The major goal of this study is to create an automatic pattern recognition system with various combinations and higher accuracy in order to increase the reliability and accuracy of identifying digits and letters on a car plate. The research is founded on the idea that image processing opens up a diverse environment with allied fields when employing distinct soft techniques for recognition. The properties of characters are employed to recognise the Indian licence plate in this study. For licence plate recognition, more than 200 images were analysed with various parameters, and soft computing techniques were applied. In comparison to neural networks, a hybrid technique using a Convolutional Neural Network (CNN) and a Support Vector Machine (SVM) classifier achieves 98.45% efficiency.
Keywords: intelligent transportation system; automatic license plate recognition system; neural network; random forest; convolutional neural network; support vector machine
Speech Recognition via CTC-CNN Model
18
作者 Wen-Tsai Sung Hao-WeiKang Sung-Jung Hsiao 《Computers, Materials & Continua》 SCIE EI 2023年第9期3833-3858,共26页
In a speech recognition system, the acoustic model is an important underlying model, and its accuracy directly affects the performance of the entire system. This paper describes the construction and training of the acoustic model in detail and studies the connectionist temporal classification (CTC) algorithm, which plays an important role in the end-to-end framework. It establishes an acoustic model that combines a convolutional neural network (CNN) with CTC to improve the accuracy of speech recognition. The study uses a sound sensor, the ReSpeaker Mic Array v2.0.1, to convert the collected speech signals into text or corresponding speech signals, to improve communication and reduce noise and hardware interference. The baseline acoustic model faces challenges such as long training time, a high error rate, and a certain degree of overfitting. The model is trained through continuous design and improvement of the relevant acoustic model parameters, and the best-performing model is finally selected according to the evaluation index, reducing the error rate to about 18% and thus improving accuracy. Finally, comparative verification was carried out on the selection of acoustic feature parameters, the selection of modeling units, and the speaker's speech rate, which further verified the excellent performance of the CTC-CNN_5+BN+Residual model structure. In the experiments, to train and verify the CTC-CNN baseline acoustic model, the study uses the THCHS-30 and ST-CMDS speech datasets as training data; after 54 epochs of training, the word error rate is 31% on the training set and stable at about 43% on the test set. The experiments also consider surrounding environmental noise. At a noise level of 80–90 dB, the accuracy is 88.18%, the worst among all levels. In contrast, at 40–60 dB, the accuracy is as high as 97.33% owing to less noise pollution.
Keywords: artificial intelligence; speech recognition; speech to text; convolutional neural network; automatic speech recognition
A Robust Conformer-Based Speech Recognition Model for Mandarin Air Traffic Control
19
Authors: Peiyuan Jiang, Weijun Pan, Jian Zhang, Teng Wang, Junxiang Huang. Computers, Materials & Continua (SCIE, EI), 2023, No. 10, pp. 911-940 (30 pages)
This study aims to address the deviation in downstream tasks caused by inaccurate recognition results when applying Automatic Speech Recognition (ASR) technology in the Air Traffic Control (ATC) field. The paper presents a novel cascaded model architecture, Conformer-CTC/Attention-T5 (CCAT), to build a highly accurate and robust ATC speech recognition model. To tackle the challenges posed by noise and fast speech rate in ATC, the Conformer model is employed to extract robust and discriminative speech representations from raw waveforms. On the decoding side, the attention mechanism is integrated to facilitate precise alignment between input features and output characters. The Text-To-Text Transfer Transformer (T5) language model is also introduced to handle particular pronunciations and code-mixing issues, providing more accurate and concise textual output for downstream tasks. To enhance the model's robustness, transfer learning and data augmentation techniques are used in the training strategy. The model's performance is optimized through hyperparameter tuning, such as adjusting the number of attention heads, the number of encoder layers, and the weights of the loss function. The experimental results demonstrate the significant contributions of data augmentation, hyperparameter tuning, and error correction models to overall model performance. On the Our ATC Corpus dataset, the proposed model achieves a Character Error Rate (CER) of 3.44%, a 3.64% improvement over the baseline model. The effectiveness of the proposed model is also validated on two publicly available datasets. On the AISHELL-1 dataset, the CCAT model achieves a CER of 3.42%, a 1.23% improvement over the baseline model. Similarly, on the LibriSpeech dataset, the CCAT model achieves a Word Error Rate (WER) of 5.27%, a 7.67% improvement over the baseline model. Additionally, the paper proposes an evaluation criterion for assessing the robustness of ATC speech recognition systems. In robustness evaluation experiments based on this criterion, the proposed model shows a 22% improvement over the baseline model.
Keywords: air traffic control; automatic speech recognition; Conformer; robustness evaluation; T5 error correction model
Challenges and Limitations in Speech Recognition Technology: A Critical Review of Speech Signal Processing Algorithms, Tools and Systems
20
Authors: Sneha Basak, Himanshi Agrawal, Shreya Jena, Shilpa Gite, Mrinal Bachute, Biswajeet Pradhan, Mazen Assiri. Computer Modeling in Engineering & Sciences (SCIE, EI), 2023, No. 5, pp. 1053-1089 (37 pages)
Speech recognition systems have become a unique human-computer interaction (HCI) family. Speech is one of the most naturally developed human abilities; speech signal processing opens up a transparent and hands-free computation experience. This paper aims to present a retrospective yet modern approach to the world of speech recognition systems. The development of ASR (Automatic Speech Recognition) has seen quite a few milestones and breakthrough technologies, which are highlighted in this paper. A step-by-step rundown of the fundamental stages in developing speech recognition systems is presented, along with a brief discussion of various modern-day developments and applications in this domain. This review aims to summarize the field and provide a starting point for those beginning in the vast field of speech signal processing. Since speech recognition has vast potential in industries such as telecommunication, emotion recognition, and healthcare, this review would be helpful to researchers who aim to explore more applications that society can quickly adopt in the coming years.
Keywords: speech recognition; automatic speech recognition (ASR); mel-frequency cepstral coefficients (MFCC); hidden Markov model (HMM); artificial neural network (ANN)