期刊文献+
共找到55篇文章
< 1 2 3 >
每页显示 20 50 100
Automatic Recognition Method for Optical Measuring Instruments Based on Machine Vision 被引量:2
1
作者 宋乐 林玉池 郝立果 《Transactions of Tianjin University》 EI CAS 2008年第3期202-207,共6页
Based on a comprehensive study of various algorithms, the automatic recognition of traditional ocular optical measuring instruments is realized. Taking a universal tools microscope(UTM) lens view image as an example, ... Based on a comprehensive study of various algorithms, the automatic recognition of traditional ocular optical measuring instruments is realized. Taking a universal tools microscope(UTM) lens view image as an example, a 2-layer automatic recognition model for data reading is established after adopting a series of pre-processing algorithms. This model is an optimal combination of the correlation-based template matching method and a concurrent back propagation(BP) neural network. Multiple complementary feature extraction is used in generating the eigenvectors of the concurrent network. In order to improve fault-tolerance capacity, rotation invariant features based on Zernike moments are extracted from digit characters and a 4-dimensional group of the outline features is also obtained. Moreover, the operating time and reading accuracy can be adjusted dy-namically by setting the threshold value. The experimental result indicates that the newly developed algorithm has optimal recognition precision and working speed. The average reading ratio can achieve 97.23%. The recognition method can automatically obtain the results of optical measuring instruments rapidly and stably without modifying their original structure, which meets the application requirements. 展开更多
关键词 automatic recognition optical measuring instruments template matching neural network
下载PDF
Automatic recognition of sonar targets using feature selection in micro-Doppler signature 被引量:1
2
作者 Abbas Saffari Seyed-Hamid Zahiri Mohammad Khishe 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2023年第2期58-71,共14页
Currently,the use of intelligent systems for the automatic recognition of targets in the fields of defence and military has increased significantly.The primary advantage of these systems is that they do not need human... Currently,the use of intelligent systems for the automatic recognition of targets in the fields of defence and military has increased significantly.The primary advantage of these systems is that they do not need human participation in target recognition processes.This paper uses the particle swarm optimization(PSO)algorithm to select the optimal features in the micro-Doppler signature of sonar targets.The microDoppler effect is referred to amplitude/phase modulation on the received signal by rotating parts of a target such as propellers.Since different targets'geometric and physical properties are not the same,their micro-Doppler signature is different.This Inconsistency can be considered a practical issue(especially in the frequency domain)for sonar target recognition.Despite using 128-point fast Fourier transform(FFT)for the feature extraction step,not all extracted features contain helpful information.As a result,PSO selects the most optimum and valuable features.To evaluate the micro-Doppler signature of sonar targets and the effect of feature selection on sonar target recognition,the simplest and most popular machine learning algorithm,k-nearest neighbor(k-NN),is used,which is called k-PSO in this paper because of the use of PSO for feature selection.The parameters measured are the correct recognition rate,reliability rate,and processing time.The simulation results show that k-PSO achieved a 100%correct recognition rate and reliability rate at 19.35 s when using simulated data at a 15 dB signal-tonoise ratio(SNR)angle of 40°.Also,for the experimental dataset obtained from the cavitation tunnel,the correct recognition rate is 98.26%,and the reliability rate is 99.69%at 18.46s.Therefore,the k-PSO has an encouraging performance in automatically recognizing sonar targets when using experimental datasets and for real-world use. 展开更多
关键词 Micro-Doppler signature automatic recognition Feature selection K-NN PSO
下载PDF
Study on automatic recognition of the first motion in a seismic event
3
作者 谢永杰 陶果 《Acta Seismologica Sinica(English Edition)》 EI CSCD 2000年第5期585-590,共6页
In this paper, we have studied the waveforms of background noise in a seismograph and set up an AR model to characterize them. We then complete the modeling and the automatic recognition program. Finally, we provide t... In this paper, we have studied the waveforms of background noise in a seismograph and set up an AR model to characterize them. We then complete the modeling and the automatic recognition program. Finally, we provide the results from automatic recognition and the manual recognition of the first motion for 25 underground explosions. 展开更多
关键词 seismic signal underground explosion AR model first motion automatic recognition
下载PDF
Automatic Recognition Algorithm of AM Signals Based on Spectrum and Modulation Characters
4
作者 Xiao-Fei Zhang Liang Chang Pei-Ming Ren Rong Liu 《Journal of Electronic Science and Technology》 CAS 2012年第2期163-166,共4页
To meet the actual requirement of automatic monitoring of the shortwave signals under wide band ranges, a technique for automatic recognition is studied in this paper. And basing upon the spectrum and modulation chara... To meet the actual requirement of automatic monitoring of the shortwave signals under wide band ranges, a technique for automatic recognition is studied in this paper. And basing upon the spectrum and modulation characters of amplitude modulation (AM) signals, an automatic recognition scheme for AM signals is proposed. The proposed scheme is achieved by a joint judgment with four different characteristic parameters. Experiment results indicate that the proposed scheme can effectively recognize AM signals in practice. 展开更多
关键词 Amplitude modulation automatic recognition characteristic parameters shortwave radio.
下载PDF
Automatic Recognition of Construction Worker Activities Using Deep Learning Approaches and Wearable Inertial Sensors
5
作者 Sakorn Mekruksavanich Anuchit Jitpattanakul 《Intelligent Automation & Soft Computing》 SCIE 2023年第5期2111-2128,共18页
The automated evaluation and analysis of employee behavior in an Industry 4.0-compliant manufacturingfirm are vital for the rapid and accurate diagnosis of work performance,particularly during the training of a new wor... The automated evaluation and analysis of employee behavior in an Industry 4.0-compliant manufacturingfirm are vital for the rapid and accurate diagnosis of work performance,particularly during the training of a new worker.Various techniques for identifying and detecting worker performance in industrial applications are based on computer vision techniques.Despite widespread com-puter vision-based approaches,it is challenging to develop technologies that assist the automated monitoring of worker actions at external working sites where cam-era deployment is problematic.Through the use of wearable inertial sensors,we propose a deep learning method for automatically recognizing the activities of construction workers.The suggested method incorporates a convolutional neural network,residual connection blocks,and multi-branch aggregate transformation modules for high-performance recognition of complicated activities such as con-struction worker tasks.The proposed approach has been evaluated using standard performance measures,such as precision,F1-score,and AUC,using a publicly available benchmark dataset known as VTT-ConIoT,which contains genuine con-struction work activities.In addition,standard deep learning models(CNNs,RNNs,and hybrid models)were developed in different empirical circumstances to compare them to the proposed model.With an average accuracy of 99.71%and an average F1-score of 99.71%,the experimentalfindings revealed that the suggested model could accurately recognize the actions of construction workers.Furthermore,we examined the impact of window size and sensor position on the identification efficiency of the proposed method. 展开更多
关键词 Complex human activity recognition wearable inertial sensors deep learning construction workers automatic recognition
下载PDF
Automatic Recognition of Analog Modulated Signals Using Artificial Neural Networks
6
作者 Jide Julius Popoola Rex Van Olst 《Computer Technology and Application》 2011年第1期29-35,共7页
This paper presents work on modulated signal recognition using an artificial neural network (ANN) developed using the Python programme language. The study is basically on the analysis of analog modulated signals. Fo... This paper presents work on modulated signal recognition using an artificial neural network (ANN) developed using the Python programme language. The study is basically on the analysis of analog modulated signals. Four of the best-known analog modulation types are considered namely: amplitude modulation (AM), double sideband (DSB) modulation, single sideband (SSB) modulation and frequency modulation (FM). Computer simulations of the four modulated signals are carried out using MATLAB. MATLAB code is used in simulating the analog signals as well as the power spectral density of each of the analog modulated signals. In achieving an accurate classification of each of the modulated signals, extensive simulations are performed for the training of the artificial neural network. The results of the study show accurate and correct performance of the developed automatic modulation recognition with average success rate above 99.5%. 展开更多
关键词 automatic modulation recognition modulation schemes features extraction key artificial neural network (ANN).
下载PDF
Automatic modulation recognition of radio fuzes using a DR2D-based adaptive denoising method and textural feature extraction 被引量:1
7
作者 Yangtian Liu Xiaopeng Yan +2 位作者 Qiang Liu Tai An Jian Dai 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2024年第4期328-338,共11页
The identification of intercepted radio fuze modulation types is a prerequisite for decision-making in interference systems.However,the electromagnetic environment of modern battlefields is complex,and the signal-to-n... The identification of intercepted radio fuze modulation types is a prerequisite for decision-making in interference systems.However,the electromagnetic environment of modern battlefields is complex,and the signal-to-noise ratio(SNR)of such environments is usually low,which makes it difficult to implement accurate recognition of radio fuzes.To solve the above problem,a radio fuze automatic modulation recognition(AMR)method for low-SNR environments is proposed.First,an adaptive denoising algorithm based on data rearrangement and the two-dimensional(2D)fast Fourier transform(FFT)(DR2D)is used to reduce the noise of the intercepted radio fuze intermediate frequency(IF)signal.Then,the textural features of the denoised IF signal rearranged data matrix are extracted from the statistical indicator vectors of gray-level cooccurrence matrices(GLCMs),and support vector machines(SVMs)are used for classification.The DR2D-based adaptive denoising algorithm achieves an average correlation coefficient of more than 0.76 for ten fuze types under SNRs of-10 d B and above,which is higher than that of other typical algorithms.The trained SVM classification model achieves an average recognition accuracy of more than 96%on seven modulation types and recognition accuracies of more than 94%on each modulation type under SNRs of-12 d B and above,which represents a good AMR performance of radio fuzes under low SNRs. 展开更多
关键词 automatic modulation recognition Adaptive denoising Data rearrangement and the 2D FFT(DR2D) Radio fuze
下载PDF
Automatic modulation recognition of radiation source signals based on two-dimensional data matrix and improved residual neural network
8
作者 Guanghua Yi Xinhong Hao +3 位作者 Xiaopeng Yan Jian Dai Yangtian Liu Yanwen Han 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2024年第3期364-373,共10页
Automatic modulation recognition(AMR)of radiation source signals is a research focus in the field of cognitive radio.However,the AMR of radiation source signals at low SNRs still faces a great challenge.Therefore,the ... Automatic modulation recognition(AMR)of radiation source signals is a research focus in the field of cognitive radio.However,the AMR of radiation source signals at low SNRs still faces a great challenge.Therefore,the AMR method of radiation source signals based on two-dimensional data matrix and improved residual neural network is proposed in this paper.First,the time series of the radiation source signals are reconstructed into two-dimensional data matrix,which greatly simplifies the signal preprocessing process.Second,the depthwise convolution and large-size convolutional kernels based residual neural network(DLRNet)is proposed to improve the feature extraction capability of the AMR model.Finally,the model performs feature extraction and classification on the two-dimensional data matrix to obtain the recognition vector that represents the signal modulation type.Theoretical analysis and simulation results show that the AMR method based on two-dimensional data matrix and improved residual network can significantly improve the accuracy of the AMR method.The recognition accuracy of the proposed method maintains a high level greater than 90% even at -14 dB SNR. 展开更多
关键词 automatic modulation recognition Radiation source signals Two-dimensional data matrix Residual neural network Depthwise convolution
下载PDF
Machine learning guided automatic recognition of crystal boundaries in bainitic/martensitic alloy and relationship between boundary types and ductile-to-brittle transition behavior 被引量:8
9
作者 X.C.Li J.X.Zhao +4 位作者 J.H.Cong R.D.K.Misra X.M.Wang X.L.Wang C.J.Shang 《Journal of Materials Science & Technology》 SCIE EI CAS CSCD 2021年第25期49-58,共10页
Gradient boosting decision tree(GBDT)machine learning(ML)method was adopted for the first time to automatically recognize and conduct quantitative statistical analysis of boundaries in bainitic microstructure using el... Gradient boosting decision tree(GBDT)machine learning(ML)method was adopted for the first time to automatically recognize and conduct quantitative statistical analysis of boundaries in bainitic microstructure using electron back-scatter diffraction(EBSD)data.In spite of lack of large sets of EBSD data,we were successful in achieving the desired accuracy and accomplishing the objective of recognizing the boundaries.Compared with a low model accuracy of<50%as using Euler angles or axis-angle pair as characteristic features,the accuracy of the model was significantly enhanced to about 88%when the Euler angle was converted to overall misorientation angle(OMA)and specific misorientation angle(SMA)and considered as important features.In this model,the recall score of prior austenite grain(PAG)boundary was~93%,high angle packet boundary(OMA>40°)was~97%,and block boundary was~96%.The derived outcomes of ML were used to obtain insights into the ductile-to-brittle transition(DBTT)behavior.Interestingly,ML modeling approach suggested that DBTT was not determined by the density of high angle grain boundaries,but significantly influenced by the density of PAG and packet boundaries.The study underscores that ML has a great potential in detailed recognition of complex multi-hierarchical microstructure such as bainite and martensite and relates to material performance. 展开更多
关键词 Machine learning Feature engineering automatic recognition Lath structure CRYSTALLOGRAPHY
原文传递
Automatic recognition and intelligent analysis of central shrinkage defects of continuous casting billets based on deep learning 被引量:3
10
作者 Gong-hao Lian Qi-hao Sun +6 位作者 Xiao-ming Liu Wei-miao Kong Ming Lv Jian-jun Qi Yong Liu Ben-ming Yuan Qiang Wang 《Journal of Iron and Steel Research International》 SCIE EI CAS CSCD 2023年第5期937-948,共12页
The internal quality inspection of the continuous casting billets is very important,and mis-inspection will seriously affect the subsequent production process.The UNet-VGG16 transfer learning model was used for semant... The internal quality inspection of the continuous casting billets is very important,and mis-inspection will seriously affect the subsequent production process.The UNet-VGG16 transfer learning model was used for semantic segmentation of the central shrinkage defects of the continuous casting billets.The automatic recognition accuracy of the central shrinkage defects of the continuous casting billets reaches more than 0.9.We use the minimum circumscribed rectangle to quantify the geometric dimensions such as length,width and area of the central shrinkage defects and use the threshold method to rate the central shrinkage defects of the continuous casting billets.The results show that all the testing images are rated correctly,and this method achieves the automatic recognition and intelligent analysis of the central shrinkage defects of the continuous casting billets. 展开更多
关键词 Central shrinkage Deep learning Image segmentation Circumscribed rectangle automatic recognition
原文传递
Investigation into the automatic recognition of time series precursor of earthquakes
11
作者 黄汉明 范洪顺 +1 位作者 边银菊 邹立晔 《Acta Seismologica Sinica(English Edition)》 EI CSCD 1998年第5期87-96,共10页
In this paper, a new method of quantitative description of earthquake precursors is proposed; by this method, the precursory pattern of time series can be quantitatively described with a two-dimensional matrix. On thi... In this paper, a new method of quantitative description of earthquake precursors is proposed; by this method, the precursory pattern of time series can be quantitatively described with a two-dimensional matrix. On this basis, a method of automatic recognition or automatic acquirement of precursory pattern, called simply the AA method, is put forward. Then, taking North China region as an example, various seismological precursors such as the frequency, energy, b -value, etc . and various nonlinear parameter precursors such as the capacity dimension, information dimension, correlation dimension, Hurst index and its difference, etc. are analyzed and the 8 time series so obtained are recognized automatically using the proposed precursory pattern and AA method. Besides, C-method tests and very rigorous HF (history and future) tests are made. The result shows that the R-value of prediction efficacy assessment is fairly high. 展开更多
关键词 earthquake prediction precursory pattern automatic recognition C-method test HF test
下载PDF
Radar Signal Intra-Pulse Modulation Recognition Based on Deep Residual Network
12
作者 Fuyuan Xu Guangqing Shao +3 位作者 Jiazhan Lu Zhiyin Wang Zhipeng Wu Shuhang Xia 《Journal of Beijing Institute of Technology》 EI CAS 2024年第2期155-162,共8页
In view of low recognition rate of complex radar intra-pulse modulation signal type by traditional methods under low signal-to-noise ratio(SNR),the paper proposes an automatic recog-nition method of complex radar intr... In view of low recognition rate of complex radar intra-pulse modulation signal type by traditional methods under low signal-to-noise ratio(SNR),the paper proposes an automatic recog-nition method of complex radar intra-pulse modulation signal type based on deep residual network.The basic principle of the recognition method is to obtain the transformation relationship between the time and frequency of complex radar intra-pulse modulation signal through short-time Fourier transform(STFT),and then design an appropriate deep residual network to extract the features of the time-frequency map and complete a variety of complex intra-pulse modulation signal type recognition.In addition,in order to improve the generalization ability of the proposed method,label smoothing and L2 regularization are introduced.The simulation results show that the proposed method has a recognition accuracy of more than 95%for complex radar intra-pulse modulation sig-nal types under low SNR(2 dB). 展开更多
关键词 intra-pulse modulation low signal-to-noise deep residual network automatic recognition
下载PDF
Investigation on Analog and Digital Modulations Recognition Using Machine Learning Algorithms
13
作者 Jean Ndoumbe Ivan Basile Kabeina +1 位作者 Gaelle Patricia Talotsing Soubiel-Noël Nkomo Biloo 《World Journal of Engineering and Technology》 2024年第4期867-884,共18页
In the field of radiocommunication, modulation type identification is one of the most important characteristics in signal processing. This study aims to implement a modulation recognition system on two approaches to m... In the field of radiocommunication, modulation type identification is one of the most important characteristics in signal processing. This study aims to implement a modulation recognition system on two approaches to machine learning techniques, the K-Nearest Neighbors (KNN) and Artificial Neural Networks (ANN). From a statistical and spectral analysis of signals, nine key differentiation features are extracted and used as input vectors for each trained model. The feature extraction is performed by using the Hilbert transform, the forward and inverse Fourier transforms. The experiments with the AMC Master dataset classify ten (10) types of analog and digital modulations. AM_DSB_FC, AM_DSB_SC, AM_USB, AM_LSB, FM, MPSK, 2PSK, MASK, 2ASK, MQAM are put forward in this article. For the simulation of the chosen model, signals are polluted by the Additive White Gaussian Noise (AWGN). The simulation results show that the best identification rate is the MLP neuronal method with 90.5% of accuracy after 10 dB signal-to-noise ratio value, with a shift of more than 15% from the k-nearest neighbors’ algorithm. 展开更多
关键词 automatic recognition Artificial Neural Networks K-Nearest Neighbors Machine Learning Analog Modulations Digital Modulations
下载PDF
Research on PCA and KPCA Self-Fusion Based MSTAR SAR Automatic Target Recognition Algorithm 被引量:6
14
作者 Chuang Lin Fei Peng +2 位作者 Bing-Hui Wang Wei-Feng Sun Xiang-Jie Kong 《Journal of Electronic Science and Technology》 CAS 2012年第4期352-357,共6页
This paper proposes a PCA and KPCA self-fusion based MSTAR SAR automatic target recognition algorithm. This algorithm combines the linear feature extracted from principal component analysis (PCA) and nonlinear featu... This paper proposes a PCA and KPCA self-fusion based MSTAR SAR automatic target recognition algorithm. This algorithm combines the linear feature extracted from principal component analysis (PCA) and nonlinear feature extracted from kernel principal component analysis (KPCA) respectively, and then utilizes the adaptive feature fusion algorithm which is based on the weighted maximum margin criterion (WMMC) to fuse the features in order to achieve better performance. The linear regression classifier is used in the experiments. The experimental results indicate that the proposed self-fusion algorithm achieves higher recognition rate compared with the traditional PCA and KPCA feature fusion algorithms. 展开更多
关键词 automatic target recognition principal component analysis self-fusion syntheticaperture radar.
下载PDF
Summed volume region selection based three-dimensional automatic target recognition for airborne LIDAR 被引量:2
15
作者 Qi-shu Qian Yi-hua Hu +2 位作者 Nan-xiang Zhao Min-le Li Fu-cai Shao 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2020年第3期535-542,共8页
Airborne LIDAR can flexibly obtain point cloud data with three-dimensional structural information,which can improve its effectiveness of automatic target recognition in the complex environment.Compared with 2D informa... Airborne LIDAR can flexibly obtain point cloud data with three-dimensional structural information,which can improve its effectiveness of automatic target recognition in the complex environment.Compared with 2D information,3D information performs better in separating objects and background.However,an aircraft platform can have a negative influence on LIDAR obtained data because of various flight attitudes,flight heights and atmospheric disturbances.A structure of global feature based 3D automatic target recognition method for airborne LIDAR is proposed,which is composed of offline phase and online phase.The performance of four global feature descriptors is compared.Considering the summed volume region(SVR) discrepancy in real objects,SVR selection is added into the pre-processing operations to eliminate mismatching clusters compared with the interested target.Highly reliable simulated data are obtained under various sensor’s altitudes,detection distances and atmospheric disturbances.The final experiments results show that the added step increases the recognition rate by above 2.4% and decreases the execution time by about 33%. 展开更多
关键词 3D automatic target recognition Point cloud LIDAR AIRBORNE Global feature descriptor
下载PDF
Automatic Speaker Recognition Using Mel-Frequency Cepstral Coefficients Through Machine Learning 被引量:1
16
作者 U˘gur Ayvaz Hüseyin Gürüler +3 位作者 Faheem Khan Naveed Ahmed Taegkeun Whangbo Abdusalomov Akmalbek Bobomirzaevich 《Computers, Materials & Continua》 SCIE EI 2022年第6期5511-5521,共11页
Automatic speaker recognition(ASR)systems are the field of Human-machine interaction and scientists have been using feature extraction and feature matching methods to analyze and synthesize these signals.One of the mo... Automatic speaker recognition(ASR)systems are the field of Human-machine interaction and scientists have been using feature extraction and feature matching methods to analyze and synthesize these signals.One of the most commonly used methods for feature extraction is Mel Frequency Cepstral Coefficients(MFCCs).Recent researches show that MFCCs are successful in processing the voice signal with high accuracies.MFCCs represents a sequence of voice signal-specific features.This experimental analysis is proposed to distinguish Turkish speakers by extracting the MFCCs from the speech recordings.Since the human perception of sound is not linear,after the filterbank step in theMFCC method,we converted the obtained log filterbanks into decibel(dB)features-based spectrograms without applying the Discrete Cosine Transform(DCT).A new dataset was created with converted spectrogram into a 2-D array.Several learning algorithms were implementedwith a 10-fold cross-validationmethod to detect the speaker.The highest accuracy of 90.2%was achieved using Multi-layer Perceptron(MLP)with tanh activation function.The most important output of this study is the inclusion of human voice as a new feature set. 展开更多
关键词 automatic speaker recognition human voice recognition spatial pattern recognition MFCCs SPECTROGRAM machine learning artificial intelligence
下载PDF
Automatic target recognition of moving target based on empirical mode decomposition and genetic algorithm support vector machine 被引量:4
17
作者 张军 欧建平 占荣辉 《Journal of Central South University》 SCIE EI CAS CSCD 2015年第4期1389-1396,共8页
In order to improve measurement accuracy of moving target signals, an automatic target recognition model of moving target signals was established based on empirical mode decomposition(EMD) and support vector machine(S... In order to improve measurement accuracy of moving target signals, an automatic target recognition model of moving target signals was established based on empirical mode decomposition(EMD) and support vector machine(SVM). Automatic target recognition process on the nonlinear and non-stationary of Doppler signals of military target by using automatic target recognition model can be expressed as follows. Firstly, the nonlinearity and non-stationary of Doppler signals were decomposed into a set of intrinsic mode functions(IMFs) using EMD. After the Hilbert transform of IMF, the energy ratio of each IMF to the total IMFs can be extracted as the features of military target. Then, the SVM was trained through using the energy ratio to classify the military targets, and genetic algorithm(GA) was used to optimize SVM parameters in the solution space. The experimental results show that this algorithm can achieve the recognition accuracies of 86.15%, 87.93%, and 82.28% for tank, vehicle and soldier, respectively. 展开更多
关键词 automatic target recognition(ATR) moving target empirical mode decomposition genetic algorithm support vector machine
下载PDF
Automatic Mexican Sign Language Recognition Using Normalized Moments and Artificial Neural Networks 被引量:1
18
作者 Francisco Solís David Martínez Oscar Espinoza 《Engineering(科研)》 2016年第10期733-740,共8页
This document presents a computer vision system for the automatic recognition of Mexican Sign Language (MSL), based on normalized moments as invariant (to translation and scale transforms) descriptors, using artificia... This document presents a computer vision system for the automatic recognition of Mexican Sign Language (MSL), based on normalized moments as invariant (to translation and scale transforms) descriptors, using artificial neural networks as pattern recognition model. An experimental feature selection was performed to reduce computational costs due to this work focusing on automatic recognition. The computer vision system includes four LED-reflectors of 700 lumens each in order to improve image acquisition quality;this illumination system allows reducing shadows in each sign of the MSL. MSL contains 27 signs in total but 6 of them are expressed with movement;this paper presents a framework for the automatic recognition of 21 static signs of MSL. The proposed system achieved 93% of recognition rate. 展开更多
关键词 Mexican Sign Language automatic Sign Language recognition Normalized Moments Computer Vision System
下载PDF
Joint On-Demand Pruning and Online Distillation in Automatic Speech Recognition Language Model Optimization
19
作者 Soonshin Seo Ji-Hwan Kim 《Computers, Materials & Continua》 SCIE EI 2023年第12期2833-2856,共24页
Automatic speech recognition(ASR)systems have emerged as indispensable tools across a wide spectrum of applications,ranging from transcription services to voice-activated assistants.To enhance the performance of these... Automatic speech recognition(ASR)systems have emerged as indispensable tools across a wide spectrum of applications,ranging from transcription services to voice-activated assistants.To enhance the performance of these systems,it is important to deploy efficient models capable of adapting to diverse deployment conditions.In recent years,on-demand pruning methods have obtained significant attention within the ASR domain due to their adaptability in various deployment scenarios.However,these methods often confront substantial trade-offs,particularly in terms of unstable accuracy when reducing the model size.To address challenges,this study introduces two crucial empirical findings.Firstly,it proposes the incorporation of an online distillation mechanism during on-demand pruning training,which holds the promise of maintaining more consistent accuracy levels.Secondly,it proposes the utilization of the Mogrifier long short-term memory(LSTM)language model(LM),an advanced iteration of the conventional LSTM LM,as an effective alternative for pruning targets within the ASR framework.Through rigorous experimentation on the ASR system,employing the Mogrifier LSTM LM and training it using the suggested joint on-demand pruning and online distillation method,this study provides compelling evidence.The results exhibit that the proposed methods significantly outperform a benchmark model trained solely with on-demand pruning methods.Impressively,the proposed strategic configuration successfully reduces the parameter count by approximately 39%,all the while minimizing trade-offs. 展开更多
关键词 automatic speech recognition neural language model Mogrifier long short-term memory PRUNING DISTILLATION efficient deployment OPTIMIZATION joint training
下载PDF
Audio-Text Multimodal Speech Recognition via Dual-Tower Architecture for Mandarin Air Traffic Control Communications
20
作者 Shuting Ge Jin Ren +3 位作者 Yihua Shi Yujun Zhang Shunzhi Yang Jinfeng Yang 《Computers, Materials & Continua》 SCIE EI 2024年第3期3215-3245,共31页
In air traffic control communications (ATCC), misunderstandings between pilots and controllers could result in fatal aviation accidents. Fortunately, advanced automatic speech recognition technology has emerged as a p... In air traffic control communications (ATCC), misunderstandings between pilots and controllers could result in fatal aviation accidents. Fortunately, advanced automatic speech recognition technology has emerged as a promising means of preventing miscommunications and enhancing aviation safety. However, most existing speech recognition methods merely incorporate external language models on the decoder side, leading to insufficient semantic alignment between speech and text modalities during the encoding phase. Furthermore, it is challenging to model acoustic context dependencies over long distances due to the longer speech sequences than text, especially for the extended ATCC data. To address these issues, we propose a speech-text multimodal dual-tower architecture for speech recognition. It employs cross-modal interactions to achieve close semantic alignment during the encoding stage and strengthen its capabilities in modeling auditory long-distance context dependencies. In addition, a two-stage training strategy is elaborately devised to derive semantics-aware acoustic representations effectively. The first stage focuses on pre-training the speech-text multimodal encoding module to enhance inter-modal semantic alignment and aural long-distance context dependencies. The second stage fine-tunes the entire network to bridge the input modality variation gap between the training and inference phases and boost generalization performance. Extensive experiments demonstrate the effectiveness of the proposed speech-text multimodal speech recognition method on the ATCC and AISHELL-1 datasets. It reduces the character error rate to 6.54% and 8.73%, respectively, and exhibits substantial performance gains of 28.76% and 23.82% compared with the best baseline model. The case studies indicate that the obtained semantics-aware acoustic representations aid in accurately recognizing terms with similar pronunciations but distinctive semantics. The research provides a novel modeling paradigm for semantics-aware speech recognition in air traffic control communications, which could contribute to the advancement of intelligent and efficient aviation safety management. 展开更多
关键词 Speech-text multimodal automatic speech recognition semantic alignment air traffic control communications dual-tower architecture
下载PDF
上一页 1 2 3 下一页 到第
使用帮助 返回顶部