Funding: This project is supported by the Provincial Youth Science Foundation of Shanxi, China (No. 20011020) and the National Natural Science Foundation of China (No. 59975064).
Abstract: A coal-rock interface recognition method based on multi-sensor data fusion is put forward to overcome the limitations of recognition methods that rely on a single type of sensor. The measuring theory based on multi-sensor data fusion is analyzed, and a test platform for the recognition system is built accordingly. The advantages of data fusion with a fuzzy neural network (FNN) are examined. A two-level FNN is constructed and data fusion is carried out. Experiments show that, under various conditions, the method consistently achieves a much higher recognition rate than conventional methods.
Funding: This work was funded by the Deanship of Scientific Research at Jouf University under Grant Number DSR2022-RG-0114.
Abstract: A challenge that visually impaired persons face in their day-to-day lives is interpreting text from documents. To help them, this work develops an efficient text recognition system that isolates, extracts, and recognizes text in documents with a textured background, degraded colors, or poor quality, and synthesizes it into speech. The system consists of three algorithms: a text localization and detection algorithm based on the mathematical morphology method (MMM); a text extraction algorithm based on the gamma correction method (GCM); and an optical character recognition (OCR) algorithm for text recognition. A detailed complexity study of the different blocks of the system was carried out. Following this study, an accelerated version of the GCM algorithm (AGCM) is proposed. The AGCM algorithm reduces the complexity of the text recognition system by 70% while keeping the same recognition quality as the original method. To assist visually impaired persons, a graphical interface for the entire text recognition chain has been developed, allowing the capture of images from a camera, rapid and intuitive visualization of the text recognized in the image, and text-to-speech synthesis. The text recognition system provides an improvement of 6.8% in recognition rate and 7.6% in F-measure relative to the GCM and AGCM algorithms.
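The abstract above does not specify how GCM or AGCM work internally. As a rough illustration only, the generic power-law (gamma) transform that such methods build on can be sketched as follows. The function name `gamma_correct`, the list-of-rows image format, and the lookup-table speed-up are assumptions made for this sketch; the lookup table is a common optimization and is not claimed to be the AGCM acceleration.

```python
def gamma_correct(image, gamma):
    """Apply power-law (gamma) correction to an 8-bit grayscale image.

    `image` is a list of rows of integers in [0, 255] (hypothetical format);
    `gamma` must be positive. gamma < 1 brightens, gamma > 1 darkens.
    """
    if gamma <= 0:
        raise ValueError("gamma must be positive")
    # Precompute all 256 possible outputs once, so each pixel is one lookup.
    lut = [round(255 * (v / 255) ** gamma) for v in range(256)]
    return [[lut[p] for p in row] for row in image]


if __name__ == "__main__":
    tiny = [[0, 64, 128, 255]]
    print(gamma_correct(tiny, 0.5))  # brightened mid-tones
    print(gamma_correct(tiny, 2.0))  # darkened mid-tones
```

A text extraction stage would typically try several gamma values and keep the one that best separates text from background; how that choice is made in the paper is not stated in the abstract.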
Abstract: This paper describes a novel method of online composite shape recognition that uses relevance feedback to capture a user's intentions incrementally and dynamic user modeling to adapt to different users' styles. First, relevance feedback is adapted to refine the recognition results and reduce ambiguity incrementally, based on a feature-based vector model of the user's sketches. Second, dynamic user modeling is introduced to model the user's sketching habits by recording and analyzing historical information incrementally. A model-based matching strategy is also employed to recognize sketches dynamically. Experiments show that the proposed method is both effective and efficient.
Funding: Funded by the National Plan for Science, Technology and Innovation (MAARIFAH), King Abdulaziz City for Science and Technology (KACST), Kingdom of Saudi Arabia, Project Number 10-INF-1406-03.
Abstract: Any natural language may have dozens of accents. Even when a word has the same phonemic form, speakers pronouncing it in different accents produce audio signals that are distinct from one another. Among the most common issues in speech processing are discrepancies in pronunciation, accent, and enunciation. This study examines the problems of detecting, fixing, and summarising accent defects in the English speech of average Arabic speakers. The article then discusses the key approaches and structure used to address both accent flaws and pronunciation issues. The proposed SpeakCorrect computerized interface employs a cutting-edge speech recognition system and analyses pronunciation errors with a speech decoder. Some of the most essential types of pronunciation changes that are significant for speech recognition are handled, and the accent defects defining such differences are presented. Consequently, the suggested technique increases the speaker's accuracy. SpeakCorrect uses 100 h of speech from phonetically prepared individuals to construct a pronunciation instruction repository. These prerecorded sets are used to train Hidden Markov Models (HMM) as well as weighted graph systems. The recorded speech is quite clear and may be considered natural. The proposed interface is optimized for use with an integrated phonetic pronunciation dataset, as well as for analyzing and identifying speech faults in Saudi and Egyptian dialects. It detects and analyses utterance faults and assists English learners in correcting them, overcoming problems, and improving their pronunciation.
Funding: National Natural Science Foundation of China, Grant/Award Number: 62006065; Science and Technology Research Program of Chongqing Municipal Education Commission, Grant/Award Number: KJQN202100634; Natural Science Foundation of Chongqing, Grant/Award Number: CSTB2022NSCQ-MSX1202; Chongqing Municipal Education Commission, Grant/Award Number: KJQN202100634.
Abstract: Transformer tracking always takes paired template and search images as encoder input and conducts feature extraction and target-search feature correlation through self- and/or cross-attention operations, so the model complexity grows quadratically with the number of input images. To alleviate the burden of this tracking paradigm and facilitate practical deployment of Transformer-based trackers, we propose a dual pooling Transformer tracking framework, dubbed DPT, which consists of three components: a simple yet efficient spatiotemporal attention model (SAM), a mutual correlation pooling Transformer (MCPT), and a multiscale aggregation pooling Transformer (MAPT). SAM is designed to gracefully aggregate the temporal dynamics and spatial appearance information of multi-frame templates along the space-time dimensions. MCPT aims to capture multi-scale pooled and correlated contextual features, and is followed by MAPT, which aggregates the multi-scale features into a unified feature representation for tracking prediction. The DPT tracker achieves an AUC score of 69.5 on LaSOT and a precision score of 82.8 on TrackingNet while maintaining a shorter attention token sequence, fewer parameters, and fewer FLOPs than existing state-of-the-art (SOTA) Transformer tracking methods. Extensive experiments demonstrate that the DPT tracker yields a strong real-time tracking baseline with a good trade-off between tracking performance and inference efficiency.
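The quadratic growth mentioned in the abstract above can be made concrete with a back-of-the-envelope count of query-key pairs in full self-attention. The sketch below is illustrative only: the function name `attention_pairs`, the feature-map sizes, and the uniform stride-based pooling are assumptions for this example and do not describe the actual MCPT or MAPT design.

```python
def attention_pairs(num_images, height, width, pool_stride=1):
    """Count query-key pairs for full self-attention over all image tokens.

    Each image contributes (height // pool_stride) * (width // pool_stride)
    tokens; self-attention compares every token with every token, so the
    cost is the square of the total token count.
    """
    if pool_stride < 1:
        raise ValueError("pool_stride must be >= 1")
    tokens_per_image = (height // pool_stride) * (width // pool_stride)
    total_tokens = num_images * tokens_per_image
    return total_tokens * total_tokens


if __name__ == "__main__":
    # Hypothetical 16x16 feature maps for a template and a search image.
    base = attention_pairs(2, 16, 16)
    more_images = attention_pairs(4, 16, 16)
    pooled = attention_pairs(4, 16, 16, pool_stride=2)
    print(base, more_images, pooled)
```

Under these assumptions, doubling the number of input images quadruples the pair count, while pooling each feature map with stride 2 cuts the token count by a factor of 4 and the pair count by a factor of 16, which is the kind of saving a shorter attention token sequence buys.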
Funding: Jointly supported by the 973 Program (No. 2014CB239206), PCSIRT (No. IRT13043), the National Science Foundation of China (Nos. 51374258 and 51474158), and the Open Projects of the State Key Laboratory of Coal Mine Disaster Dynamics and Control (Chongqing University, 2011DA105287-FW201412).
Abstract: In underground coal mines, hydrofracturing can increase the permeability of the fractured coal bed. When a hydrofracture crack propagates to the interface between the coal bed and the roof or floor stratum, the crack may enter the roof or floor rock, which limits the extent of the permeability increase and makes it difficult to support the roof and floor during subsequent coal mining. In this work, a two-dimensional model of a coal-rock bed containing a hydrofracture crack was constructed. An investigation combining fracture mechanics with the flow-solid coupling system of rock failure process analysis (RFPA2D-Flow) was then carried out to study the failure mechanism at the interface between rock and coal and the critical water pressure at which the hydrofracture crack propagates. The results indicate that the main factors affecting the direction of hydrofracture crack propagation are the angle of intersection between the coal-rock interface and the horizontal section, the horizontal crustal stress difference, the tension-shear mixed-mode fracture toughness of the coal-rock interface, and the difference in elastic modulus across the coal-rock bed. The possibility of the crack directly entering the coal-rock interface increases with the angle of intersection or the horizontal crustal stress difference. The tendency of the crack to propagate along the coal-rock interface becomes stronger as the fracture toughness of the interface decreases and as the elastic modulus difference between the coal bed and the roof strata increases. The results provide a theoretical basis for controlling hydrofracture cracks and for optimizing the location of fracturing wells.
Funding: Project supported by the National High-Tech R&D Program (863) of China (No. 2012AA011601), the National Natural Science Foundation of China (No. 91120305), the University High Level Talent Program of Guangdong, China (No. N9120140A), the Foundation and Theoretical Science Project supported by the Jiangmen Research Program (No. 2014(17)), the Fundamental Research Funds for the Central Universities, South China University of Technology (No. 2014ZB0031), and the Science Foundation for Young Teachers of Wuyi University (No. 2013zk08).
Abstract: An intelligent wheelchair is devised that is controlled by a coordinated mechanism based on a brain-computer interface (BCI) and speech recognition. By performing appropriate activities, users can navigate the wheelchair with four steering behaviors (start, stop, turn left, and turn right). Five healthy subjects participated in an indoor experiment. The results demonstrate the efficiency of the coordinated control mechanism, with satisfactory path and time optimality ratios, and show that speech recognition is a fast and accurate supplement for BCI-based control systems. The proposed intelligent wheelchair is especially suitable for patients suffering from paralysis (especially those with aphasia) who can learn to pronounce only a single sound (e.g., 'ah').
Funding: Supported by NSFC (Nos. 21322207 and 21672112), the Fundamental Research Funds for the Central Universities, and the Program of Tianjin Young Talents.
Abstract: A microcalorimetric study of molecular recognition by p-sulfonatocalix[4]arene derivatives at a self-assembled interface, in comparison with bulk water, was performed, inspired by the dramatic change in physicochemical characteristics from bulk water to the interface. A total of six cationic molecules were screened as model guests: ammonium (NH₄⁺), guanidinium (Gdm⁺), N,N'-dimethyl-1,4-diazabicyclo[2.2.2]octane (DMDABCO²⁺), tropylium (Tpm⁺), N-methyl pyridinium (N-mPY⁺), and methyl viologen (MV²⁺). Complexation with NH₄⁺, Gdm⁺, and DMDABCO²⁺ is pronouncedly enhanced when the recognition process moves from bulk water to the interface, whereas the complexation stabilities with Tpm⁺, N-mPY⁺, and MV²⁺ increase only slightly or even decrease to some extent. A more interesting phenomenon arises for the NH₄⁺/Gdm⁺ pair: although their association constants are similar, the thermodynamic origins of their binding at the interface differ markedly. The results are discussed in terms of the different driving forces (electrostatic, hydrogen-bonding, and π-stacking interactions) that originate from the unique physicochemical features of interfaces, mainly polarity and dielectric constant.
Funding: National Natural Science Foundation of China (Nos. 90820305 and 60775040).
Abstract: A two-stage state recognition method is proposed for an asynchronous SSVEP (steady-state visual evoked potential) based brain-computer interface (SBCI) system. The method is composed of an idle state (IS) detection module and a control state (CS) discrimination module. Based on blind source separation and continuous wavelet transform techniques, the proposed method integrates multi-electrode spatial filtering and feature extraction. In the IS detection module, a method using an ensemble IS feature is proposed. In the CS discrimination module, an ensemble CS feature is designed as the feature vector for control intent classification. The performance of the IS detection module is compared with that of existing ones, and the experimental results also validate the satisfactory performance of the CS discrimination module.