Funding: This work was supported by the Program of the National Natural Science Foundation of China (61988101, 61773405, and 61751312).
Abstract: Cell voltage is a widely used signal that can be measured online from an industrial aluminum electrolysis cell. A variety of parameters for the analysis and control of industrial cells are calculated from the cell voltage. In this paper, the frequency segmentation of the cell voltage is used as the basis for designing filters to obtain these parameters. Based on a qualitative analysis of the cell voltage, the sub-band instantaneous energy spectrum (SIEP) is first proposed and then used to quantitatively represent the characteristics of designated frequency bands of the cell voltage under various cell conditions. Finally, a cell-condition-sensitive frequency segmentation method is given. The proposed method divides the effective frequency band into the [0, 0.001] Hz band of low-frequency signals and the [0.001, 0.050] Hz band of low-frequency noise, and subdivides the low-frequency noise into the [0.001, 0.010] Hz band of abnormal metal pad rolling and the [0.01, 0.05] Hz band of sub-low-frequency noise. Compared with the instantaneous energy spectrum based on empirical mode decomposition, the SIEP represents more finely how energy changes with time in any designated frequency band within the effective frequency band of the cell voltage. The proposed frequency segmentation method is more sensitive to cell condition changes and can obtain more detailed online cell condition information, thus providing a more reliable and accurate online basis for cell condition monitoring and control decisions.
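The band layout in this abstract can be illustrated with a small sketch. The abstract does not define how the SIEP is computed, so the code below is not the authors' method: it substitutes a plain sliding-window DFT and sums the squared bin magnitudes that fall inside each band. The band edges come from the abstract; the function names, the half-open treatment of band edges, the sampling rate, and the window and hop lengths are all assumptions made for illustration.

```python
import cmath
import math

# Band edges in Hz, taken from the abstract. The keys are illustrative names.
BANDS = {
    "low_frequency_signal": (0.0, 0.001),
    "metal_pad_rolling": (0.001, 0.010),
    "sub_low_frequency_noise": (0.010, 0.050),
}


def band_energies(window, fs, bands=BANDS):
    """Sum |X[k]|^2 over the DFT bins of one window that fall in each band.

    Assumption: bands are treated as half-open [lo, hi), so a bin exactly on
    a shared edge is counted once, in the upper band, and a bin exactly at
    the topmost edge is not counted.
    """
    n = len(window)
    f_max = max(hi for _, hi in bands.values())
    k_max = min(n // 2, int(f_max * n / fs))  # only bins up to the top edge
    energies = {name: 0.0 for name in bands}
    for k in range(k_max + 1):
        f = k * fs / n  # centre frequency of bin k in Hz
        xk = sum(v * cmath.exp(-2j * math.pi * k * t / n)
                 for t, v in enumerate(window))
        for name, (lo, hi) in bands.items():
            if lo <= f < hi:
                energies[name] += abs(xk) ** 2
                break
    return energies


def sliding_band_energies(signal, fs, window_len, hop, bands=BANDS):
    """Band energies for successive windows: a coarse energy-versus-time view."""
    return [
        band_energies(signal[start:start + window_len], fs, bands)
        for start in range(0, len(signal) - window_len + 1, hop)
    ]


if __name__ == "__main__":
    fs = 1.0  # assumed sampling rate in Hz; the abstract does not state one
    # Synthetic 0.005 Hz oscillation standing in for a cell-voltage record.
    voltage = [math.sin(2 * math.pi * 0.005 * t) for t in range(2000)]
    frame = band_energies(voltage, fs)
    print(max(frame, key=frame.get))
```

Resolving the 0.001 Hz edge requires windows of at least 1/0.001 = 1000 s, so any windowed estimate of this kind trades time resolution against frequency resolution in the lowest band.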
Abstract: To develop a more robust endpoint detection algorithm, this paper first proposes a fuzzy adaptive smoothing algorithm. The general idea underlying adaptive smoothing is to adapt the short-term sub-band mean of the amplitude to the local attributes of speech on the basis of discontinuity measures. The adaptive smoothing algorithm in this paper uses a scale-space framework based on the minimum description length (MDL). We recommend using a fuzzy multi-attribute decision-making approach to select the sub-bands in which the word boundary can be detected more reliably. The process and simulation of the fuzzy adaptive smoothing algorithm are given. The parameters are the mean amplitude of the audible frequency range (300-3700 Hz) and the sub-band mean of the amplitude (16-band filter bank). We selected the audible band energy because it is useful for detecting high-energy regions and for distinguishing speech from noise. In addition, the fuzzy adaptive smoothing algorithm is applied to sub-band speech to use the full range of frequency information.
Abstract: Because images are correlated, standard ICA performs unsatisfactorily in the blind source separation (BSS) of images. Therefore, a new method of sub-band ICA with a selection criterion is proposed for this problem. First, the sub-bands of the new method are formed from wavelet packet (WP) coefficients. Second, the selection criterion of the new method is a combination of mutual information (MI), kurtosis, and sparsity. A sub-band or a group of sub-bands obtained by the new method is more suitable as input to the ICA algorithm than the mixed images themselves. The new method has been applied successfully to the BSS of partially dependent images and highly dependent images. The separation experiments show that the new method separates more accurately and robustly.
Abstract: In this paper, we present a novel and efficient scheme for detecting the P300 component of the event-related potential in the Brain Computer Interface (BCI) speller paradigm that needs significantly fewer EEG channels and uses a minimal subset of effective features. Removing unnecessary channels and reducing the feature dimension resulted in lower cost and shorter time and thus improved the BCI implementation. The idea was to employ a suitable method to optimize the number of channels and feature vectors while keeping high classification accuracy. Optimal channel selection was based on both discriminative criteria and forward-backward investigation. In addition, we obtained a minimal subset of effective features by choosing the discriminant coefficients of the wavelet decomposition. Our algorithm was tested on dataset II of the BCI competition 2005. We achieved 92% accuracy using a simple LDA classifier, compared with the second-best result in BCI 2005, 90.5% accuracy using an SVM classifier that required more computation, and with the highest accuracy in BCI 2005, 96.5%, which used an SVM and many more channels and required excessive calculation. We also applied the proposed scheme to Hoffmann’s dataset to evaluate the effectiveness of channel reduction and achieved acceptable results.
Abstract: In this paper, a new mesh-based algorithm is applied to motion estimation and compensation in the wavelet domain. The first major contribution of this work is the introduction of a new active mesh-based method for motion estimation and compensation. The proposed algorithm is based on mesh energy minimization with novel sets of energy functions, whose features improve the accuracy of the motion estimation and compensation algorithm. We employ the proposed motion estimation algorithm for video compression in two different ways. In the first approach, the algorithm is used for motion estimation between consecutive frames. In the second approach, the algorithm is applied to motion estimation and compensation in the wavelet sub-bands. The experimental results reveal that incorporating active mesh-based motion-compensated temporal filtering into the wavelet sub-bands significantly improves the rate-distortion performance of the video compression. We also use a new wavelet coder, based on a retained-energy criterion, to code the 3D volume of coefficients. This coder gives the maximum retained energy in all sub-bands. The proposed algorithm was tested on several video sequences, and the results showed that the proposed active mesh method for motion compensation and its implementation in sub-bands yield a significant improvement in PSNR.
Funding: Supported by the National Natural Science Foundation of China (No. 51135001).
Abstract: A novel compression method for mechanical vibration signals is proposed that combines sub-band vector quantization (SVQ) with the wavelet packet transform (WPT) and the discrete cosine transform (DCT). First, the vibration signal is decomposed into sub-bands by the WPT. Then the DCT and adaptive bit allocation are applied per sub-band, and SVQ is performed in each sub-band. After the DCT, only the first components need to be coded, and their number is determined by the bits allocated to that sub-band. Tests on an actual signal show that the algorithm effectively improves the signal-to-noise ratio (SNR) of the reconstructed signal, especially for low-rate transmission.
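The step of coding only the first DCT components of a sub-band can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' codec: the wavelet packet decomposition, the bit-allocation rule, and the vector quantizer are all omitted, and the hypothetical parameter `n_keep` stands in for the number of components that the allocated bits would permit. The transform is the orthonormal DCT-II written out directly.

```python
import math


def dct(x):
    """Orthonormal DCT-II of a real sequence."""
    n = len(x)
    out = []
    for k in range(n):
        scale = math.sqrt((1 if k == 0 else 2) / n)
        out.append(scale * sum(v * math.cos(math.pi * (i + 0.5) * k / n)
                               for i, v in enumerate(x)))
    return out


def idct(coeffs):
    """Inverse of dct() above (orthonormal DCT-III)."""
    n = len(coeffs)
    return [
        sum(math.sqrt((1 if k == 0 else 2) / n) * c
            * math.cos(math.pi * (i + 0.5) * k / n)
            for k, c in enumerate(coeffs))
        for i in range(n)
    ]


def compress_subband(x, n_keep):
    """Keep the first n_keep DCT components, zero the rest, and reconstruct.

    n_keep is a hypothetical stand-in for the count derived from the bits
    allocated to this sub-band.
    """
    coeffs = dct(x)
    kept = coeffs[:n_keep] + [0.0] * (len(coeffs) - n_keep)
    return idct(kept)


def snr_db(original, reconstructed):
    """Signal-to-noise ratio of a reconstruction, in dB."""
    signal = sum(v * v for v in original)
    noise = sum((a - b) ** 2 for a, b in zip(original, reconstructed))
    return float("inf") if noise == 0 else 10 * math.log10(signal / noise)
```

Because the transform is orthonormal, the reconstruction error energy equals the energy of the discarded coefficients, so keeping more leading components can never lower the SNR.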
Abstract: To realize long-text information hiding and covert communication, a binary watermark sequence is first obtained from a text file and encoded with a redundant encoding method. Then, two neighboring blocks at a time are selected from the Hilbert scanning sequence of the carrier image blocks and transformed by a 1-level discrete wavelet transform (DWT). Next, the double-block-based just noticeable differences (JNDs) are calculated with a visual model. According to the code of each pair of watermark bits, the average values of two corresponding detail sub-bands are modified using one of the JNDs to hide the information in the carrier image. The experimental results show that the hidden information is invisible to the human eye and that the algorithm is robust to some common image processing operations. The algorithm is therefore effective and practical.
Abstract: The main cause of skin cancer is the ultraviolet radiation of the sun. Skin cancer spreads quickly to other parts of the body, so early diagnosis is required to decrease the mortality rate. In this study, an automatic system for Skin Lesion Classification (SLC) using Non-Subsampled Shearlet Transform (NSST) based energy features and a Support Vector Machine (SVM) classifier is proposed. First, the NSST is used to decompose the input skin lesion images with different numbers of directions (2, 4, 8, and 16). Energy features are extracted from the NSST sub-bands and stored in the feature database for training. An SVM classifier is used to classify the skin lesion images. The dermoscopic skin images are obtained from the PH² database, which comprises 200 dermoscopic color images with melanocytic lesions. The performance of the SLC system is evaluated using the confusion matrix and Receiver Operating Characteristic (ROC) curves. The SLC system achieves 96% classification accuracy using NSST energy features obtained from the 3rd level with 8 directions.
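The feature-extraction step can be sketched in a few lines. The abstract does not give the energy formula, so this sketch assumes a common choice, the mean of the squared coefficients of each sub-band. The NSST decomposition itself and the SVM are not shown, and the function names are hypothetical.

```python
def subband_energy(coeffs):
    """Mean squared coefficient of one sub-band (a 2-D list of numbers).

    Assumption: energy is defined as the average of the squared coefficients;
    the abstract does not state the exact formula.
    """
    total = sum(c * c for row in coeffs for c in row)
    count = sum(len(row) for row in coeffs)
    return total / count


def energy_feature_vector(subbands):
    """One energy value per sub-band, in the order the sub-bands are given."""
    return [subband_energy(band) for band in subbands]
```

For a decomposition with 8 directions at one level, `energy_feature_vector` would return 8 values, which could then be passed to any classifier.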
Funding: This work is supported by the Nanjing Institute of Technology (NIT) fund for Research Startup Projects of Introduced Talents under Grant No. YKJ202019; the Nature Science Research Project of Higher Education Institutions in Jiangsu Province under Grant No. 21KJB510018; the National Nature Science Foundation of China (NSFC) under Grant No. 62001215; and the NIT fund for Doctoral Research Projects under Grant No. ZKJ2020003.
Abstract: Speech separation is an active research topic that plays an important role in numerous applications, such as speaker recognition, hearing prostheses, and autonomous robots. Many algorithms have been put forward to improve separation performance. However, speech separation in reverberant, noisy environments is still a challenging task. To address this, a novel speech separation algorithm using a gated recurrent unit (GRU) network based on a microphone array is proposed in this paper. The main aim of the proposed algorithm is to improve separation performance and reduce computational cost. The algorithm extracts the sub-band steered response power-phase transform (SRP-PHAT) weighted by a gammatone filter as the speech separation feature because of its discriminative and robust spatial position information. Since the GRU network processes time-series data with faster training and fewer training parameters, the GRU model is adopted to process the separation features of several sequential frames in the same sub-band to estimate the ideal ratio mask (IRM). The algorithm decomposes the mixture signals into time-frequency (TF) units using a gammatone filter bank in the frequency domain, and the target speech is reconstructed in the frequency domain by masking the mixture signal according to the estimated IRM. Decomposing the mixture signal and reconstructing the target signal are both completed in the frequency domain, which reduces the total computational cost. Experimental results demonstrate that the proposed algorithm realizes omnidirectional speech separation in noisy and reverberant environments, provides good performance in terms of speech quality and intelligibility, and generalizes to reverberant conditions.
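The masking step can be illustrated with a short sketch. It assumes the commonly used form of the ideal ratio mask, the square root of speech energy over speech-plus-noise energy in each time-frequency unit; the abstract does not state which variant the authors use. The mask here is computed from known speech and noise energies, whereas in the paper it is estimated by the GRU network. The function names and the small constant `eps` are assumptions.

```python
import math


def ideal_ratio_mask(speech_energy, noise_energy, eps=1e-12):
    """IRM per time-frequency unit: sqrt(S / (S + N)).

    Inputs are 2-D lists (frames x sub-bands) of non-negative energies.
    eps is an assumed guard against division by zero in silent units.
    """
    return [
        [math.sqrt(s / (s + n + eps)) for s, n in zip(s_row, n_row)]
        for s_row, n_row in zip(speech_energy, noise_energy)
    ]


def apply_mask(mixture, mask):
    """Scale each time-frequency unit of the mixture by its mask value."""
    return [
        [m * x for m, x in zip(m_row, x_row)]
        for m_row, x_row in zip(mask, mixture)
    ]
```

Mask values lie between 0 and 1: units dominated by speech pass almost unchanged, and units dominated by noise are attenuated before the target signal is reconstructed.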