In-process damage to a cutting tool degrades the surfacenish of the job shaped by machining and causes a signicantnancial loss.This stimulates the need for Tool Condition Monitoring(TCM)t...In-process damage to a cutting tool degrades the surfacenish of the job shaped by machining and causes a signicantnancial loss.This stimulates the need for Tool Condition Monitoring(TCM)to assist detection of failure before it extends to the worse phase.Machine Learning(ML)based TCM has been extensively explored in the last decade.However,most of the research is now directed toward Deep Learning(DL).The“Deep”formulation,hierarchical compositionality,distributed representation and end-to-end learning of Neural Nets need to be explored to create a generalized TCM framework to perform eciently in a high-noise environment of cross-domain machining.With this motivation,the design of dierent CNN(Convolutional Neural Network)architectures such as AlexNet,ResNet-50,LeNet-5,and VGG-16 is presented in this paper.Real-time spindle vibrations corresponding to healthy and various faulty congurations of milling cutter were acquired.This data was transformed into the time-frequency domain and further processed by proposed architectures in graphical form,i.e.,spectrogram.The model is trained,tested,and validated considering dierent datasets and showcased promising results.展开更多
Acoustic array sensor device for partial discharge detection is widely used in power equipment inspection with the advantages of non-contact and precise positioning compared with partial discharge detection methods su...Acoustic array sensor device for partial discharge detection is widely used in power equipment inspection with the advantages of non-contact and precise positioning compared with partial discharge detection methods such as ultrasonic method and pulse current method.However,due to the sensitivity of the acoustic array sensor and the influence of the equipment operation site interference,the acoustic array sensor device for partial discharge type diagnosis by phase resolved partial discharge(PRPD)map might occasionally presents incorrect results,thus affecting the power equipment operation and maintenance strategy.The acoustic array sensor detection device for power equipment developed in this paper applies the array design model of equal-area multi-arm spiral with machine learning fast fourier transform clean(FFT-CLEAN)sound source localization identification algorithm to avoid the interference factors in the noise acquisition system using a single microphone and conventional beam forming algorithm,improves the spatial resolution of the acoustic array sensor device,and proposes an acoustic array sensor device based on the acoustic spectrogram.The analysis and diagnosis method of discharge type of acoustic array sensor device can effectively reduce the system misjudgment caused by factors such as the resolution of the acoustic imaging device and the time domain pulse of the digital signal,and reduce the false alarm rate of the acoustic array sensor device.The proposed method is tested by selecting power cables as the object,and its effectiveness is proved by laboratory verification and field verification.展开更多
Recently,user recognitionmethods to authenticate personal identity has attracted significant attention especially with increased availability of various internet of things(IoT)services through fifth-generation technol...Recently,user recognitionmethods to authenticate personal identity has attracted significant attention especially with increased availability of various internet of things(IoT)services through fifth-generation technology(5G)based mobile devices.The EMG signals generated inside the body with unique individual characteristics are being studied as a part of nextgeneration user recognition methods.However,there is a limitation when applying EMG signals to user recognition systems as the same operation needs to be repeated while maintaining a constant strength of muscle over time.Hence,it is necessary to conduct research on multidimensional feature transformation that includes changes in frequency features over time.In this paper,we propose a user recognition system that applies EMG signals to the short-time fourier transform(STFT),and converts the signals into EMG spectrogram images while adjusting the time-frequency resolution to extract multidimensional features.The proposed system is composed of a data pre-processing and normalization process,spectrogram image conversion process,and final classification process.The experimental results revealed that the proposed EMG spectrogram image-based user recognition system has a 95.4%accuracy performance,which is 13%higher than the EMGsignal-based system.Such a user recognition accuracy improvement was achieved by using multidimensional features,in the time-frequency domain.展开更多
A new lighting and enlargement on phase spectrogram(PS)and frequency spectrogram(FS)is presented in this paper.These representations result from the coupling of power spectrogram and short time Fourier transform(STFT)...A new lighting and enlargement on phase spectrogram(PS)and frequency spectrogram(FS)is presented in this paper.These representations result from the coupling of power spectrogram and short time Fourier transform(STFT).The main con-tribution is the construction of the 3D phase spectrogram(3DPS)and the 3D frequency spectrogram(3DFS).These new tools allow such specific test signals as small slope linear chirp,phase jump and small frequency jump to be analyzed.An application case of musical signal analysis is reported.The main objective is to detect small frequency and phase variations in order to characterize each type of sound attack without losing the amplitude information given by power spectrogram.展开更多
噪声环境下语音检测准确率偏低是短波通话面临的公开挑战。当前已有方法应用有限,其根源在于难以可靠地在噪音环境下提取准确且高效的语音特征。针对上述问题,提出了一个面向短波通信的低秩方向梯度直方图(Low-rank Histogram of Orient...噪声环境下语音检测准确率偏低是短波通话面临的公开挑战。当前已有方法应用有限,其根源在于难以可靠地在噪音环境下提取准确且高效的语音特征。针对上述问题,提出了一个面向短波通信的低秩方向梯度直方图(Low-rank Histogram of Oriented Gradient,LHOG)话音检测方法。首先,对目标音频源数据进行预处理,实现噪声环境下语音信息的可视化表征;然后,在HOG特征提取器中嵌入低秩化结构,缓解特征中的冗余信息,并降低噪声干扰,从而获得准确且高效的特征;最后,通过常用的SVM分类模型便可在噪声环境中准确快速地区分话音和噪声。测试结果表明,该方法的准确率达到了95.12%,误报率仅为0.96%,漏报率为13.14%。与现有主流方法的对比实验证明,该方法话音检测准确率高,资源占用少,能够有效提高短波通信侦控效率。展开更多
【目的】为解决群养环境下生猪音频难以分离与识别的问题,提出基于欠定盲源分离与E C A-EfficientNetV2的生猪状态音频识别方法。【方法】以仿真群养环境下4类生猪音频信号作为观测信号,将信号稀疏表示后,通过层次聚类估计出信号混合矩...【目的】为解决群养环境下生猪音频难以分离与识别的问题,提出基于欠定盲源分离与E C A-EfficientNetV2的生猪状态音频识别方法。【方法】以仿真群养环境下4类生猪音频信号作为观测信号,将信号稀疏表示后,通过层次聚类估计出信号混合矩阵,并利用lp范数重构算法求解lp范数最小值以完成生猪音频信号重构。将重构信号转化为声谱图,分为进食声、咆哮声、哼叫声和发情声4类,利用ECA-EfficientNetV2网络模型识别音频,获取生猪状态。【结果】混合矩阵估计的归一化均方误差最低为3.266×10^(−4),分离重构的音频信噪比在3.254~4.267 dB之间。声谱图经ECA-EfficientNetV2识别检测,准确率高达98.35%;与经典卷积神经网络ResNet50和VGG16对比,准确率分别提升2.88和1.81个百分点;与原EfficientNetV2相比,准确率降低0.52个百分点,但模型参数量减少33.56%,浮点运算量(FLOPs)降低1.86 G,推理时间减少9.40 ms。【结论】基于盲源分离及改进EfficientNetV2的方法,轻量且高效地实现了分离与识别群养生猪音频信号。展开更多
In order to eliminate the subjectivity of wheeze diagnosis and improve the accuracy of objective detecting methods,this paper introduces a wheeze detecting method based on spectrogram entropy analysis.This algorithm m...In order to eliminate the subjectivity of wheeze diagnosis and improve the accuracy of objective detecting methods,this paper introduces a wheeze detecting method based on spectrogram entropy analysis.This algorithm mainly comprises three steps which are preprocessing,features extracting and wheeze detecting based on support vector machine(SVM).Herein,the preprocessing consists of the short-time Fourier transform(STFT) decomposition and detrending.The features are extracted from the entropy of spectrograms.The step of detrending makes the difference of the features between wheeze and normal lung sounds more obvious.Moreover,compared with the method whose decision is based on the empirical threshold,there is no uncertain detecting result any more.Results of two testing experiments show that the detecting accuracy(AC) are 97.1%and 95.7%,respectively,which proves that the proposed method could be an efficient way to detect wheeze.展开更多
The Perception Spectrogram Structure Boundary(PSSB)parameter is proposed for speech endpoint detection as a preprocess of speech or speaker recognition.At first a hearing perception speech enhancement is carried out...The Perception Spectrogram Structure Boundary(PSSB)parameter is proposed for speech endpoint detection as a preprocess of speech or speaker recognition.At first a hearing perception speech enhancement is carried out.Then the two-dimensional enhancement is performed upon the sound spectrogram according to the difference between the determinacy distribution characteristic of speech and the random distribution characteristic of noise.Finally a decision for endpoint was made by the PSSB parameter.Experimental results show that,in a low SNR environment from-10 dB to 10 dB,the algorithm proposed in this paper may achieve higher accuracy than the extant endpoint detection algorithms.The detection accuracy of 75.2%can be reached even in the extremely low SNR at-10 dB.Therefore it is suitable for speech endpoint detection in low-SNRs environment.展开更多
提出一种基于高压放电声音识别快速评估电池健康状态的方法,目的在于以尽可能短的时间评估锂电池的健康状态(State of Health,SOH),以便电池的重组和梯次利用。研究主要从电子迁移能力方面建立电池高压放电与健康状态的联系,分析电池在...提出一种基于高压放电声音识别快速评估电池健康状态的方法,目的在于以尽可能短的时间评估锂电池的健康状态(State of Health,SOH),以便电池的重组和梯次利用。研究主要从电子迁移能力方面建立电池高压放电与健康状态的联系,分析电池在高压静电场中发生放电的声谱图特征,并将其作为电池SOH快速评估的依据。同时,针对方壳型磷酸铁锂电池搭建了一套自动化检测装置,并进行装置的结构和控制系统设计,实现了在5 min内完成单体电池的检测,极大地提高了锂电池的检测效率。展开更多
文摘In-process damage to a cutting tool degrades the surfacenish of the job shaped by machining and causes a signicantnancial loss.This stimulates the need for Tool Condition Monitoring(TCM)to assist detection of failure before it extends to the worse phase.Machine Learning(ML)based TCM has been extensively explored in the last decade.However,most of the research is now directed toward Deep Learning(DL).The“Deep”formulation,hierarchical compositionality,distributed representation and end-to-end learning of Neural Nets need to be explored to create a generalized TCM framework to perform eciently in a high-noise environment of cross-domain machining.With this motivation,the design of dierent CNN(Convolutional Neural Network)architectures such as AlexNet,ResNet-50,LeNet-5,and VGG-16 is presented in this paper.Real-time spindle vibrations corresponding to healthy and various faulty congurations of milling cutter were acquired.This data was transformed into the time-frequency domain and further processed by proposed architectures in graphical form,i.e.,spectrogram.The model is trained,tested,and validated considering dierent datasets and showcased promising results.
基金This work was supported by the science and technology project of State Grid Shanghai Municipal Electric Power Company(No.52090020007F)National Key R&D Program of China(2017YFB0902800).
文摘Acoustic array sensor device for partial discharge detection is widely used in power equipment inspection with the advantages of non-contact and precise positioning compared with partial discharge detection methods such as ultrasonic method and pulse current method.However,due to the sensitivity of the acoustic array sensor and the influence of the equipment operation site interference,the acoustic array sensor device for partial discharge type diagnosis by phase resolved partial discharge(PRPD)map might occasionally presents incorrect results,thus affecting the power equipment operation and maintenance strategy.The acoustic array sensor detection device for power equipment developed in this paper applies the array design model of equal-area multi-arm spiral with machine learning fast fourier transform clean(FFT-CLEAN)sound source localization identification algorithm to avoid the interference factors in the noise acquisition system using a single microphone and conventional beam forming algorithm,improves the spatial resolution of the acoustic array sensor device,and proposes an acoustic array sensor device based on the acoustic spectrogram.The analysis and diagnosis method of discharge type of acoustic array sensor device can effectively reduce the system misjudgment caused by factors such as the resolution of the acoustic imaging device and the time domain pulse of the digital signal,and reduce the false alarm rate of the acoustic array sensor device.The proposed method is tested by selecting power cables as the object,and its effectiveness is proved by laboratory verification and field verification.
基金supported by Basic Science Research Program through the National Research Foundation of Korea(NRF)funded by the Ministry of Education(No.2017R1A6A1A03015496)the National Research Foundation of Korea(NRF)grant funded by the Korea government(MSIT)(No.NRF-2021R1A2C1014033).
文摘Recently,user recognitionmethods to authenticate personal identity has attracted significant attention especially with increased availability of various internet of things(IoT)services through fifth-generation technology(5G)based mobile devices.The EMG signals generated inside the body with unique individual characteristics are being studied as a part of nextgeneration user recognition methods.However,there is a limitation when applying EMG signals to user recognition systems as the same operation needs to be repeated while maintaining a constant strength of muscle over time.Hence,it is necessary to conduct research on multidimensional feature transformation that includes changes in frequency features over time.In this paper,we propose a user recognition system that applies EMG signals to the short-time fourier transform(STFT),and converts the signals into EMG spectrogram images while adjusting the time-frequency resolution to extract multidimensional features.The proposed system is composed of a data pre-processing and normalization process,spectrogram image conversion process,and final classification process.The experimental results revealed that the proposed EMG spectrogram image-based user recognition system has a 95.4%accuracy performance,which is 13%higher than the EMGsignal-based system.Such a user recognition accuracy improvement was achieved by using multidimensional features,in the time-frequency domain.
文摘A new lighting and enlargement on phase spectrogram(PS)and frequency spectrogram(FS)is presented in this paper.These representations result from the coupling of power spectrogram and short time Fourier transform(STFT).The main con-tribution is the construction of the 3D phase spectrogram(3DPS)and the 3D frequency spectrogram(3DFS).These new tools allow such specific test signals as small slope linear chirp,phase jump and small frequency jump to be analyzed.An application case of musical signal analysis is reported.The main objective is to detect small frequency and phase variations in order to characterize each type of sound attack without losing the amplitude information given by power spectrogram.
文摘噪声环境下语音检测准确率偏低是短波通话面临的公开挑战。当前已有方法应用有限,其根源在于难以可靠地在噪音环境下提取准确且高效的语音特征。针对上述问题,提出了一个面向短波通信的低秩方向梯度直方图(Low-rank Histogram of Oriented Gradient,LHOG)话音检测方法。首先,对目标音频源数据进行预处理,实现噪声环境下语音信息的可视化表征;然后,在HOG特征提取器中嵌入低秩化结构,缓解特征中的冗余信息,并降低噪声干扰,从而获得准确且高效的特征;最后,通过常用的SVM分类模型便可在噪声环境中准确快速地区分话音和噪声。测试结果表明,该方法的准确率达到了95.12%,误报率仅为0.96%,漏报率为13.14%。与现有主流方法的对比实验证明,该方法话音检测准确率高,资源占用少,能够有效提高短波通信侦控效率。
文摘In order to eliminate the subjectivity of wheeze diagnosis and improve the accuracy of objective detecting methods,this paper introduces a wheeze detecting method based on spectrogram entropy analysis.This algorithm mainly comprises three steps which are preprocessing,features extracting and wheeze detecting based on support vector machine(SVM).Herein,the preprocessing consists of the short-time Fourier transform(STFT) decomposition and detrending.The features are extracted from the entropy of spectrograms.The step of detrending makes the difference of the features between wheeze and normal lung sounds more obvious.Moreover,compared with the method whose decision is based on the empirical threshold,there is no uncertain detecting result any more.Results of two testing experiments show that the detecting accuracy(AC) are 97.1%and 95.7%,respectively,which proves that the proposed method could be an efficient way to detect wheeze.
基金supported by the National Natural Science Foundation of China.(61071215,61271359,61372146)
文摘The Perception Spectrogram Structure Boundary(PSSB)parameter is proposed for speech endpoint detection as a preprocess of speech or speaker recognition.At first a hearing perception speech enhancement is carried out.Then the two-dimensional enhancement is performed upon the sound spectrogram according to the difference between the determinacy distribution characteristic of speech and the random distribution characteristic of noise.Finally a decision for endpoint was made by the PSSB parameter.Experimental results show that,in a low SNR environment from-10 dB to 10 dB,the algorithm proposed in this paper may achieve higher accuracy than the extant endpoint detection algorithms.The detection accuracy of 75.2%can be reached even in the extremely low SNR at-10 dB.Therefore it is suitable for speech endpoint detection in low-SNRs environment.
文摘提出一种基于高压放电声音识别快速评估电池健康状态的方法,目的在于以尽可能短的时间评估锂电池的健康状态(State of Health,SOH),以便电池的重组和梯次利用。研究主要从电子迁移能力方面建立电池高压放电与健康状态的联系,分析电池在高压静电场中发生放电的声谱图特征,并将其作为电池SOH快速评估的依据。同时,针对方壳型磷酸铁锂电池搭建了一套自动化检测装置,并进行装置的结构和控制系统设计,实现了在5 min内完成单体电池的检测,极大地提高了锂电池的检测效率。