Recent advancement in low-cost cameras has facilitated surveillance in various developing towns in India.The video obtained from such surveillance are of low quality.Still counting vehicles from such videos are necess...Recent advancement in low-cost cameras has facilitated surveillance in various developing towns in India.The video obtained from such surveillance are of low quality.Still counting vehicles from such videos are necessity to avoid traf-fic congestion and allows drivers to plan their routes more precisely.On the other hand,detecting vehicles from such low quality videos are highly challenging with vision based methodologies.In this research a meticulous attempt is made to access low-quality videos to describe traffic in Salem town in India,which is mostly an un-attempted entity by most available sources.In this work profound Detection Transformer(DETR)model is used for object(vehicle)detection.Here vehicles are anticipated in a rush-hour traffic video using a set of loss functions that carry out bipartite coordinating among estimated and information acquired on real attributes.Every frame in the traffic footage has its date and time which is detected and retrieved using Tesseract Optical Character Recognition.The date and time extricated and perceived from the input image are incorporated with the length of the recognized objects acquired from the DETR model.This furnishes the vehicles report with timestamp.Transformer Timeseries Prediction Model(TTPM)is proposed to predict the density of the vehicle for future prediction,here the regular NLP layers have been removed and the encoding temporal layer has been modified.The proposed TTPM error rate outperforms the existing models with RMSE of 4.313 and MAE of 3.812.展开更多
This paper focuses on improving the detection performance of spectrum sensing in cognitive radio(CR) networks under complicated electromagnetic environment. Some existing fast spectrum sensing algorithms cannot get sp...This paper focuses on improving the detection performance of spectrum sensing in cognitive radio(CR) networks under complicated electromagnetic environment. Some existing fast spectrum sensing algorithms cannot get specific features of the licensed users'(LUs') signal, thus they cannot be applied in this situation without knowing the power of noise. On the other hand some algorithms that yield specific features are too complicated. In this paper, an algorithm based on the cyclostationary feature detection and theory of Hilbert transformation is proposed. Comparing with the conventional cyclostationary feature detection algorithm, this approach is more flexible i.e. it can flexibly change the computational complexity according to current electromagnetic environment by changing its sampling times and the step size of cyclic frequency. Results of simulation indicate that this approach can flexibly detect the feature of received signal and provide satisfactory detection performance compared to existing approaches in low Signal-to-noise Ratio(SNR) situations.展开更多
Fingerprint authentication system is used to verify users' identification according to the characteristics of their fingerprints.However,this system has some security and privacy problems.For example,some artifici...Fingerprint authentication system is used to verify users' identification according to the characteristics of their fingerprints.However,this system has some security and privacy problems.For example,some artificial fingerprints can trick the fingerprint authentication system and access information using real users' identification.Therefore,a fingerprint liveness detection algorithm needs to be designed to prevent illegal users from accessing privacy information.In this paper,a new software-based liveness detection approach using multi-scale local phase quantity(LPQ) and principal component analysis(PCA) is proposed.The feature vectors of a fingerprint are constructed through multi-scale LPQ.PCA technology is also introduced to reduce the dimensionality of the feature vectors and gain more effective features.Finally,a training model is gained using support vector machine classifier,and the liveness of a fingerprint is detected on the basis of the training model.Experimental results demonstrate that our proposed method can detect the liveness of users' fingerprints and achieve high recognition accuracy.This study also confirms that multi-resolution analysis is a useful method for texture feature extraction during fingerprint liveness detection.展开更多
Since the coal mine in-pit personnel positioning system neither can effectively achieve the function to detect the uniqueness of in-pit coal-mine personnel nor can identify and eliminate violations in attendance manag...Since the coal mine in-pit personnel positioning system neither can effectively achieve the function to detect the uniqueness of in-pit coal-mine personnel nor can identify and eliminate violations in attendance management such as multiple cards for one person, and swiping one's cards by others in China at present. Therefore, the research introduces a uniqueness detection system and method for in-pit coal-mine personnel integrated into the in-pit coal mine personnel positioning system, establishing a system mode based on face recognition + recognition of personnel positioning card + release by automatic detection. Aiming at the facts that the in-pit personnel are wearing helmets and faces are prone to be stained during the face recognition, the study proposes the ideas that pre-process face images using the 2D-wavelet-transformation-based Mallat algorithm and extracts three face features: miner light, eyes and mouths, using the generalized symmetry transformation-based algorithm. This research carried out test with 40 clean face images with no helmets and 40 lightly-stained face images, and then compared with results with the one using the face feature extraction method based on grey-scale transformation and edge detection. The results show that the method described in the paper can detect accurately face features in the above-mentioned two cases, and the accuracy to detect face features is 97.5% in the case of wearing helmets and lightly-stained faces.展开更多
We present in this paper an implementation of a multiscale edges detection algorithm on multiprocessor using SYnDEx which is a programming environment to generate optimized distributed real-time executives. The implem...We present in this paper an implementation of a multiscale edges detection algorithm on multiprocessor using SYnDEx which is a programming environment to generate optimized distributed real-time executives. The implementation has been done on three TMS320C40 and the acceleration in comparison with one processor is 2.2.展开更多
Because the extract of the weak failure information is always the difficulty and focus of fault detection. Aiming for specific statistical properties of complex wavelet coefficients of gearbox vibration signals, a new...Because the extract of the weak failure information is always the difficulty and focus of fault detection. Aiming for specific statistical properties of complex wavelet coefficients of gearbox vibration signals, a new signal-denoising method which uses local adaptive algorithm based on dual-tree complex wavelet transform (DT-CWT) is introduced to extract weak failure information in gear, especially to extract impulse components. By taking into account the non-Gaussian probability distribution and the statistical dependencies among wavelet coefficients of some signals, and by taking the advantage of near shift-invariance of DT-CWT, the higher signal-to-noise ratio (SNR) than common wavelet denoising methods can be obtained. Experiments of extracting periodic impulses in gearbox vibration signals indicate that the method can extract incipient fault feature and hidden information from heavy noise, and it has an excellent effect on identifying weak feature signals in gearbox vibration signals.展开更多
A concise fractional Fourier transform (CFRFT) is proposed to detect the linear frequency-modulated (LFM) signal with low signal to noise ratio (SNR). The frequency axis in time-frequency plane of the CFRFT is r...A concise fractional Fourier transform (CFRFT) is proposed to detect the linear frequency-modulated (LFM) signal with low signal to noise ratio (SNR). The frequency axis in time-frequency plane of the CFRFT is rotated to get the spectrum of the signal in different an- gles using chirp multiplication and Fourier transform (FT). For LFM signal which distributes as a straight line in time-frequency plane, the CFRFT can gather the energy in the corresponding angle as a peak and improve the detection SNR, thus the LFM signal of low SNR can be de- tected. Meanwhile, the location of the peak value relates to the parameters of the LFM signal. Numerical simulations and experimental results show that, the proposed method can be used to efficiently detect the LFM signal masked by noise and to estimate the signal's parameters accurately. Compared with the conventional fractional Fourier transform (FRFT), the CFRFT reduces the transform complexity and improves the real-time detection performance of LFM signal.展开更多
Shape matching plays an important role in various computer vision and graphics applications such as shape retrieval, object detection, image editing,image retrieval, etc. However, detecting shapes in cluttered images ...Shape matching plays an important role in various computer vision and graphics applications such as shape retrieval, object detection, image editing,image retrieval, etc. However, detecting shapes in cluttered images is still quite challenging due to the incomplete edges and changing perspective. In this paper, we propose a novel approach that can efficiently identify a queried shape in a cluttered image. The core idea is to acquire the transformation from the queried shape to the cluttered image by summarising all pointto-point transformations between the queried shape and the image. To do so, we adopt a point-based shape descriptor, the pyramid of arc-length descriptor(PAD),to identify point pairs between the queried shape and the image having similar local shapes. We further calculate the transformations between the identified point pairs based on PAD. Finally, we summarise all transformations in a 4 D transformation histogram and search for the main cluster. Our method can handle both closed shapes and open curves, and is resistant to partial occlusions. Experiments show that our method can robustly detect shapes in images in the presence of partial occlusions, fragile edges, and cluttered backgrounds.展开更多
The current casting surface defect detection algorithms suffer from poor small target defect recognition and imbalance between detection performance and detection time.An improved algorithmic framework for casting def...The current casting surface defect detection algorithms suffer from poor small target defect recognition and imbalance between detection performance and detection time.An improved algorithmic framework for casting defect detection was proposed based on the DEtection TRansformer(DETR)algorithm.The algorithm takes ResNet with an efficient channel attention(ECA)-Net module as the backbone network.In addition,based on the original algorithm architecture,dynamic anchor boxes,improved multi-scale deformable attention module,and SIoU loss function are introduced to improve the sensitivity of transformer structure to input location information and scale size,and the small target defect detection performance is effectively improved.The recognition performance of the algorithm in a self-built casting defect dataset was studied.The improved DETR algorithm has 97.561% accuracy in recognizing two defects,namely sandinclusion and notch,with the detection rate being improved by 65.854% and 17.073% compared with the original DETR and you only look once(Yolo)-V5,respectively.This algorithm verifies the applicability of the transformer architecture target detection algorithm for casting defect detection tasks and provides new ideas for detecting other similar application scenarios.展开更多
Real-time and accurate traffic light status recognition can provide reliable data support for autonomous vehicle decision-making and control systems.To address potential problems such as the minor component of traffic...Real-time and accurate traffic light status recognition can provide reliable data support for autonomous vehicle decision-making and control systems.To address potential problems such as the minor component of traffic lights in the perceptual domain of visual sensors and the complexity of recognition scenarios,we propose an end-to-end traffic light status recognition method,ResNeSt50-CBAM-DINO(RC-DINO).First,we performed data cleaning on the Tsinghua-Tencent traffic lights(TTTL)and fused it with the Shanghai Jiao Tong University’s traffic light dataset(S2TLD)to form a Chinese urban traffic light dataset(CUTLD).Second,we combined residual network with split-attention module-50(ResNeSt50)and the convolutional block attention module(CBAM)to extract more significant traffic light features.Finally,the proposed RC-DINO and mainstream recognition algorithms were trained and analyzed using CUTLD.The experimental results show that,compared to the original DINO,RC-DINO improved the average precision(AP),AP at intersection over union(IOU)=0.5(AP50),AP for small objects(APs),average recall(AR),and balanced F score(F1-Score)by 3.1%,1.6%,3.4%,0.9%,and 0.9%,respectively,and had a certain capability to recognize the partially covered traffic light status.The above results indicate that the proposed RC-DINO improved recognition performance and robustness,making it more suitable for traffic light status recognition tasks.展开更多
文摘Recent advancement in low-cost cameras has facilitated surveillance in various developing towns in India.The video obtained from such surveillance are of low quality.Still counting vehicles from such videos are necessity to avoid traf-fic congestion and allows drivers to plan their routes more precisely.On the other hand,detecting vehicles from such low quality videos are highly challenging with vision based methodologies.In this research a meticulous attempt is made to access low-quality videos to describe traffic in Salem town in India,which is mostly an un-attempted entity by most available sources.In this work profound Detection Transformer(DETR)model is used for object(vehicle)detection.Here vehicles are anticipated in a rush-hour traffic video using a set of loss functions that carry out bipartite coordinating among estimated and information acquired on real attributes.Every frame in the traffic footage has its date and time which is detected and retrieved using Tesseract Optical Character Recognition.The date and time extricated and perceived from the input image are incorporated with the length of the recognized objects acquired from the DETR model.This furnishes the vehicles report with timestamp.Transformer Timeseries Prediction Model(TTPM)is proposed to predict the density of the vehicle for future prediction,here the regular NLP layers have been removed and the encoding temporal layer has been modified.The proposed TTPM error rate outperforms the existing models with RMSE of 4.313 and MAE of 3.812.
基金sponsored by National Basic Research Program of China (973 Program, No. 2013CB329003)National Natural Science Foundation of China (No. 91438205)+1 种基金China Postdoctoral Science Foundation (No. 2011M500664)Open Research fund Program of Key Lab. for Spacecraft TT&C and Communication, Ministry of Education, China (No.CTTC-FX201305)
文摘This paper focuses on improving the detection performance of spectrum sensing in cognitive radio(CR) networks under complicated electromagnetic environment. Some existing fast spectrum sensing algorithms cannot get specific features of the licensed users'(LUs') signal, thus they cannot be applied in this situation without knowing the power of noise. On the other hand some algorithms that yield specific features are too complicated. In this paper, an algorithm based on the cyclostationary feature detection and theory of Hilbert transformation is proposed. Comparing with the conventional cyclostationary feature detection algorithm, this approach is more flexible i.e. it can flexibly change the computational complexity according to current electromagnetic environment by changing its sampling times and the step size of cyclic frequency. Results of simulation indicate that this approach can flexibly detect the feature of received signal and provide satisfactory detection performance compared to existing approaches in low Signal-to-noise Ratio(SNR) situations.
基金supported by the NSFC (U1536206,61232016,U1405254,61373133, 61502242)BK20150925the PAPD fund
文摘Fingerprint authentication system is used to verify users' identification according to the characteristics of their fingerprints.However,this system has some security and privacy problems.For example,some artificial fingerprints can trick the fingerprint authentication system and access information using real users' identification.Therefore,a fingerprint liveness detection algorithm needs to be designed to prevent illegal users from accessing privacy information.In this paper,a new software-based liveness detection approach using multi-scale local phase quantity(LPQ) and principal component analysis(PCA) is proposed.The feature vectors of a fingerprint are constructed through multi-scale LPQ.PCA technology is also introduced to reduce the dimensionality of the feature vectors and gain more effective features.Finally,a training model is gained using support vector machine classifier,and the liveness of a fingerprint is detected on the basis of the training model.Experimental results demonstrate that our proposed method can detect the liveness of users' fingerprints and achieve high recognition accuracy.This study also confirms that multi-resolution analysis is a useful method for texture feature extraction during fingerprint liveness detection.
基金financial supports from the National Natural Science Foundation of China (No. 51134024)the National High Technology Research and Development Program of China (No. 2012AA062203)are gratefully acknowledged
文摘Since the coal mine in-pit personnel positioning system neither can effectively achieve the function to detect the uniqueness of in-pit coal-mine personnel nor can identify and eliminate violations in attendance management such as multiple cards for one person, and swiping one's cards by others in China at present. Therefore, the research introduces a uniqueness detection system and method for in-pit coal-mine personnel integrated into the in-pit coal mine personnel positioning system, establishing a system mode based on face recognition + recognition of personnel positioning card + release by automatic detection. Aiming at the facts that the in-pit personnel are wearing helmets and faces are prone to be stained during the face recognition, the study proposes the ideas that pre-process face images using the 2D-wavelet-transformation-based Mallat algorithm and extracts three face features: miner light, eyes and mouths, using the generalized symmetry transformation-based algorithm. This research carried out test with 40 clean face images with no helmets and 40 lightly-stained face images, and then compared with results with the one using the face feature extraction method based on grey-scale transformation and edge detection. The results show that the method described in the paper can detect accurately face features in the above-mentioned two cases, and the accuracy to detect face features is 97.5% in the case of wearing helmets and lightly-stained faces.
文摘We present in this paper an implementation of a multiscale edges detection algorithm on multiprocessor using SYnDEx which is a programming environment to generate optimized distributed real-time executives. The implementation has been done on three TMS320C40 and the acceleration in comparison with one processor is 2.2.
基金Beijing Municipal Natural Science Foundation of China (No. 3062012).
文摘Because the extract of the weak failure information is always the difficulty and focus of fault detection. Aiming for specific statistical properties of complex wavelet coefficients of gearbox vibration signals, a new signal-denoising method which uses local adaptive algorithm based on dual-tree complex wavelet transform (DT-CWT) is introduced to extract weak failure information in gear, especially to extract impulse components. By taking into account the non-Gaussian probability distribution and the statistical dependencies among wavelet coefficients of some signals, and by taking the advantage of near shift-invariance of DT-CWT, the higher signal-to-noise ratio (SNR) than common wavelet denoising methods can be obtained. Experiments of extracting periodic impulses in gearbox vibration signals indicate that the method can extract incipient fault feature and hidden information from heavy noise, and it has an excellent effect on identifying weak feature signals in gearbox vibration signals.
基金supported by the National Natural Science Foundation of China(11434012)
文摘A concise fractional Fourier transform (CFRFT) is proposed to detect the linear frequency-modulated (LFM) signal with low signal to noise ratio (SNR). The frequency axis in time-frequency plane of the CFRFT is rotated to get the spectrum of the signal in different an- gles using chirp multiplication and Fourier transform (FT). For LFM signal which distributes as a straight line in time-frequency plane, the CFRFT can gather the energy in the corresponding angle as a peak and improve the detection SNR, thus the LFM signal of low SNR can be de- tected. Meanwhile, the location of the peak value relates to the parameters of the LFM signal. Numerical simulations and experimental results show that, the proposed method can be used to efficiently detect the LFM signal masked by noise and to estimate the signal's parameters accurately. Compared with the conventional fractional Fourier transform (FRFT), the CFRFT reduces the transform complexity and improves the real-time detection performance of LFM signal.
基金supported by the Research Grants Council of the Hong Kong Special Administrative Region,under the RGC General Research Fund(Project No.CUHK 14217516)
文摘Shape matching plays an important role in various computer vision and graphics applications such as shape retrieval, object detection, image editing,image retrieval, etc. However, detecting shapes in cluttered images is still quite challenging due to the incomplete edges and changing perspective. In this paper, we propose a novel approach that can efficiently identify a queried shape in a cluttered image. The core idea is to acquire the transformation from the queried shape to the cluttered image by summarising all pointto-point transformations between the queried shape and the image. To do so, we adopt a point-based shape descriptor, the pyramid of arc-length descriptor(PAD),to identify point pairs between the queried shape and the image having similar local shapes. We further calculate the transformations between the identified point pairs based on PAD. Finally, we summarise all transformations in a 4 D transformation histogram and search for the main cluster. Our method can handle both closed shapes and open curves, and is resistant to partial occlusions. Experiments show that our method can robustly detect shapes in images in the presence of partial occlusions, fragile edges, and cluttered backgrounds.
基金the support of National Natural Science Foundation of China(No.51405002)Anhui Provincial Natural Science Foundation(No.2108085ME173)+2 种基金open funds from Anhui Province Key Laboratory of Metallurgical Engineering&Resources Recycling(No.SKF20-05)Opening Project of Engineering Technology Research Center of Anhui Education Department for Energy Saving and Pollutant Control in metallurgical processOpening Project of Anhui Engineering Laboratory for Intelligent Applications and Security of Industrial Internet(Grant No.IASII21-03)for financial support.
文摘The current casting surface defect detection algorithms suffer from poor small target defect recognition and imbalance between detection performance and detection time.An improved algorithmic framework for casting defect detection was proposed based on the DEtection TRansformer(DETR)algorithm.The algorithm takes ResNet with an efficient channel attention(ECA)-Net module as the backbone network.In addition,based on the original algorithm architecture,dynamic anchor boxes,improved multi-scale deformable attention module,and SIoU loss function are introduced to improve the sensitivity of transformer structure to input location information and scale size,and the small target defect detection performance is effectively improved.The recognition performance of the algorithm in a self-built casting defect dataset was studied.The improved DETR algorithm has 97.561% accuracy in recognizing two defects,namely sandinclusion and notch,with the detection rate being improved by 65.854% and 17.073% compared with the original DETR and you only look once(Yolo)-V5,respectively.This algorithm verifies the applicability of the transformer architecture target detection algorithm for casting defect detection tasks and provides new ideas for detecting other similar application scenarios.
基金supported by the National Key R&D Program of China(2021YFB2501200)the Key Program of the National Natural Science Foundation of China(52131204)the Shaanxi Province Key Research and Development Program(2022GY-300).
文摘Real-time and accurate traffic light status recognition can provide reliable data support for autonomous vehicle decision-making and control systems.To address potential problems such as the minor component of traffic lights in the perceptual domain of visual sensors and the complexity of recognition scenarios,we propose an end-to-end traffic light status recognition method,ResNeSt50-CBAM-DINO(RC-DINO).First,we performed data cleaning on the Tsinghua-Tencent traffic lights(TTTL)and fused it with the Shanghai Jiao Tong University’s traffic light dataset(S2TLD)to form a Chinese urban traffic light dataset(CUTLD).Second,we combined residual network with split-attention module-50(ResNeSt50)and the convolutional block attention module(CBAM)to extract more significant traffic light features.Finally,the proposed RC-DINO and mainstream recognition algorithms were trained and analyzed using CUTLD.The experimental results show that,compared to the original DINO,RC-DINO improved the average precision(AP),AP at intersection over union(IOU)=0.5(AP50),AP for small objects(APs),average recall(AR),and balanced F score(F1-Score)by 3.1%,1.6%,3.4%,0.9%,and 0.9%,respectively,and had a certain capability to recognize the partially covered traffic light status.The above results indicate that the proposed RC-DINO improved recognition performance and robustness,making it more suitable for traffic light status recognition tasks.