As an important component of load transfer,various fatigue damages occur in the track as the rail service life and train traffic increase gradually,such as rail corrugation,rail joint damage,uneven thermite welds,rail ...As an important component of load transfer,various fatigue damages occur in the track as the rail service life and train traffic increase gradually,such as rail corrugation,rail joint damage,uneven thermite welds,rail squats fas-tener defects,etc.Real-time recognition of track defects plays a vital role in ensuring the safe and stable operation of rail transit.In this paper,an intelligent and innovative method is proposed to detect the track defects by using axle-box vibration acceleration and deep learning network,and the coexistence of the above-mentioned typical track defects in the track system is considered.Firstly,the dynamic relationship between the track defects(using the example of the fastening defects)and the axle-box vibration acceleration(ABVA)is investigated using the dynamic vehicle-track model.Then,a simulation model for the coupled dynamics of the vehicle and track with different track defects is established,and the wavelet power spectrum(WPS)analysis is performed for the vibra-tion acceleration signals of the axle box to extract the characteristic response.Lastly,using wavelet spectrum photos as input,an automatic detection technique based on the deep convolution neural network(DCNN)is sug-gested to realize the real-time intelligent detection and identification of various track problems.Thefindings demonstrate that the suggested approach achieves a 96.72%classification accuracy.展开更多
Research in the field ofmedical image is an important part of themedical robot to operate human organs.Amedical robot is the intersection ofmulti-disciplinary research fields,in whichmedical image is an important dire...Research in the field ofmedical image is an important part of themedical robot to operate human organs.Amedical robot is the intersection ofmulti-disciplinary research fields,in whichmedical image is an important direction and has achieved fruitful results.In this paper,amethodof soft tissue surface feature tracking basedonadepthmatching network is proposed.This method is described based on the triangular matching algorithm.First,we construct a self-made sample set for training the depth matching network from the first N frames of speckle matching data obtained by the triangle matching algorithm.The depth matching network is pre-trained on the ORL face data set and then trained on the self-made training set.After the training,the speckle matching is carried out in the subsequent frames to obtain the speckle matching matrix between the subsequent frames and the first frame.From this matrix,the inter-frame feature matching results can be obtained.In this way,the inter-frame speckle tracking is completed.On this basis,the results of this method are compared with the matching results based on the convolutional neural network.The experimental results show that the proposed method has higher matching accuracy.In particular,the accuracy of the MNIST handwritten data set has reached more than 90%.展开更多
Autism spectrum disorder(ASD)can be defined as a neurodevelopmental condition or illness that can disturb kids who have heterogeneous characteristics,like changes in behavior,social disabilities,and difficulty communi...Autism spectrum disorder(ASD)can be defined as a neurodevelopmental condition or illness that can disturb kids who have heterogeneous characteristics,like changes in behavior,social disabilities,and difficulty communicating with others.Eye tracking(ET)has become a useful method to detect ASD.One vital aspect of moral erudition is the aptitude to have common visual attention.The eye-tracking approach offers valuable data regarding the visual behavior of children for accurate and early detection.Eye-tracking data can offer insightful information about the behavior and thought processes of people with ASD,but it is important to be aware of its limitations and to combine it with other types of data and assessment techniques to increase the precision of ASD detection.It operates by scanning the paths of eyes for extracting a series of eye projection points on images for examining the behavior of children with autism.The purpose of this research is to use deep learning to identify autistic disorders based on eye tracking.The Chaotic Butterfly Optimization technique is used to identify this specific disturbance.Therefore,this study develops an ET-based Autism Spectrum Disorder Diagnosis using Chaotic Butterfly Optimization with Deep Learning(ETASD-CBODL)technique.The presented ETASDCBODL technique mainly focuses on the recognition of ASD via the ET and DL models.To accomplish this,the ETASD-CBODL technique exploits the U-Net segmentation technique to recognize interested AREASS.In addition,the ETASD-CBODL technique employs Inception v3 feature extraction with CBO algorithm-based hyperparameter optimization.Finally,the long-shorttermmemory(LSTM)model is exploited for the recognition and classification of ASD.To assess the performance of the ETASD-CBODL technique,a series of simulations were performed on datasets from the figure-shared data repository.The experimental values of accuracy(99.29%),precision(98.78%),sensitivity(99.29%)and specificity(99.29%)showed a better performance in the ETASD-CBODL technique over recent approaches.展开更多
Beamforming is significant for millimeter wave multi-user massive multi-input multi-output systems.In the meanwhile,the overhead cost of channel state information and beam training is considerable,especially in dynami...Beamforming is significant for millimeter wave multi-user massive multi-input multi-output systems.In the meanwhile,the overhead cost of channel state information and beam training is considerable,especially in dynamic environments.To reduce the overhead cost,we propose a multi-user beam tracking algorithm using a distributed deep Q-learning method.With online learning of users’moving trajectories,the proposed algorithm learns to scan a beam subspace to maximize the average effective sum rate.Considering practical implementation,we model the continuous beam tracking problem as a non-Markov decision process and thus develop a simplified training scheme of deep Q-learning to reduce the training complexity.Furthermore,we propose a scalable state-action-reward design for scenarios with different users and antenna numbers.Simulation results verify the effectiveness of the designed method.展开更多
At present days,object detection and tracking concepts have gained more importance among researchers and business people.Presently,deep learning(DL)approaches have been used for object tracking as it increases the per...At present days,object detection and tracking concepts have gained more importance among researchers and business people.Presently,deep learning(DL)approaches have been used for object tracking as it increases the perfor-mance and speed of the tracking process.This paper presents a novel robust DL based object detection and tracking algorithm using Automated Image Anno-tation with ResNet based Faster regional convolutional neural network(R-CNN)named(AIA-FRCNN)model.The AIA-RFRCNN method performs image anno-tation using a Discriminative Correlation Filter(DCF)with Channel and Spatial Reliability tracker(CSR)called DCF-CSRT model.The AIA-RFRCNN model makes use of Faster RCNN as an object detector and tracker,which involves region proposal network(RPN)and Fast R-CNN.The RPN is a full convolution network that concurrently predicts the bounding box and score of different objects.The RPN is a trained model used for the generation of the high-quality region proposals,which are utilized by Fast R-CNN for detection process.Besides,Residual Network(ResNet 101)model is used as a shared convolutional neural network(CNN)for the generation of feature maps.The performance of the ResNet 101 model is further improved by the use of Adam optimizer,which tunes the hyperparameters namely learning rate,batch size,momentum,and weight decay.Finally,softmax layer is applied to classify the images.The performance of the AIA-RFRCNN method has been assessed using a benchmark dataset and a detailed comparative analysis of the results takes place.The outcome of the experiments indicated the superior characteristics of the AIA-RFRCNN model under diverse aspects.展开更多
To improve the tracking accuracy of persons in the surveillance video,we proposed an algorithm for multi-target tracking persons based on deep learning.In this paper,we used You Only Look Once v5(YOLOv5)to obtain pers...To improve the tracking accuracy of persons in the surveillance video,we proposed an algorithm for multi-target tracking persons based on deep learning.In this paper,we used You Only Look Once v5(YOLOv5)to obtain person targets of each frame in the video and used Simple Online and Realtime Tracking with a Deep Association Metric(DeepSORT)to do cascade matching and Intersection Over Union(IOU)matching of person targets between different frames.To solve the IDSwitch problem caused by the low feature extraction ability of the Re-Identification(ReID)network in the process of cascade matching,we introduced Spatial Relation-aware Global Attention(RGA-S)and Channel Relation-aware Global Attention(RGA-C)attention mechanisms into the network structure.The pre-training weights are loaded for Transfer Learning training on the dataset CUHK03.To enhance the discrimination performance of the network,we proposed a new loss function design method,which introduces the Hard-Negative-Mining way into the benchmark triplet loss.To improve the classification accuracy of the network,we introduced a Label-Smoothing regularization method to the cross-entropy loss.To facilitate the model’s convergence stability and convergence speed at the early training stage and to prevent the model from oscillating around the global optimum due to excessive learning rate at the later stage of training,this paper proposed a learning rate regulation method combining Linear-Warmup and exponential decay.The experimental results on CUHK03 show that the mean Average Precision(mAP)of the improved ReID network is 76.5%.The Top 1 is 42.5%,the Top 5 is 65.4%,and the Top 10 is 74.3%in Cumulative Matching Characteristics(CMC);Compared with the original algorithm,the tracking accuracy of the optimized DeepSORT tracking algorithm is improved by 2.5%,the tracking precision is improved by 3.8%.The number of identity switching is reduced by 25%.The algorithm effectively alleviates the IDSwitch problem,improves the tracking accuracy of persons,and has a high practical value.展开更多
Collaborative Robotics is one of the high-interest research topics in the area of academia and industry.It has been progressively utilized in numerous applications,particularly in intelligent surveillance systems.It a...Collaborative Robotics is one of the high-interest research topics in the area of academia and industry.It has been progressively utilized in numerous applications,particularly in intelligent surveillance systems.It allows the deployment of smart cameras or optical sensors with computer vision techniques,which may serve in several object detection and tracking tasks.These tasks have been considered challenging and high-level perceptual problems,frequently dominated by relative information about the environment,where main concerns such as occlusion,illumination,background,object deformation,and object class variations are commonplace.In order to show the importance of top view surveillance,a collaborative robotics framework has been presented.It can assist in the detection and tracking of multiple objects in top view surveillance.The framework consists of a smart robotic camera embedded with the visual processing unit.The existing pre-trained deep learning models named SSD and YOLO has been adopted for object detection and localization.The detection models are further combined with different tracking algorithms,including GOTURN,MEDIANFLOW,TLD,KCF,MIL,and BOOSTING.These algorithms,along with detection models,help to track and predict the trajectories of detected objects.The pre-trained models are employed;therefore,the generalization performance is also investigated through testing the models on various sequences of top view data set.The detection models achieved maximum True Detection Rate 93%to 90%with a maximum 0.6%False Detection Rate.The tracking results of different algorithms are nearly identical,with tracking accuracy ranging from 90%to 94%.Furthermore,a discussion has been carried out on output results along with future guidelines.展开更多
Traditional monitoring systems that are used in shopping malls or com-munity management,mostly use a remote control to monitor and track specific objects;therefore,it is often impossible to effectively monitor the enti...Traditional monitoring systems that are used in shopping malls or com-munity management,mostly use a remote control to monitor and track specific objects;therefore,it is often impossible to effectively monitor the entire environ-ment.Whenfinding a suspicious person,the tracked object cannot be locked in time for tracking.This research replaces the traditionalfixed-point monitor with the intelligent drone and combines the image processing technology and automatic judgment for the movements of the monitored person.This intelligent system can effectively improve the shortcomings of low efficiency and high cost of the traditional monitor system.In this article,we proposed a TIMT(The Intel-ligent Monitoring and Tracking)algorithm which can make the drone have smart surveillance and tracking capabilities.It combined with Artificial Intelligent(AI)face recognition technology and the OpenPose which is able to monitor the phy-sical movements of multiple people in real time to analyze the meaning of human body movements and to track the monitored intelligently through the remote con-trol interface of the drone.This system is highly agile and could be adjusted immediately to any angle and screen that we monitor.Therefore,the system couldfind abnormal conditions immediately and track and monitor them automatically.That is the system can immediately detect when someone invades the home or community,and the drone can automatically track the intruder to achieve that the two significant shortcomings of the traditional monitor will be improved.Experimental results show that the intelligent monitoring and tracking drone sys-tem has an excellent performance,which not only dramatically reduces the num-ber of monitors and the required equipment but also achieves perfect monitoring and tracking.展开更多
Object tracking,an important technology in the field of image processing and computer vision,is used to continuously track a specific object or person in an image.This technology may be effective in identifying the sa...Object tracking,an important technology in the field of image processing and computer vision,is used to continuously track a specific object or person in an image.This technology may be effective in identifying the same person within one image,but it has limitations in handling multiple images owing to the difficulty in identifying whether the object appearing in other images is the same.When tracking the same object using two or more images,there must be a way to determine that objects existing in different images are the same object.Therefore,this paper attempts to determine the same object present in different images using color information among the unique information of the object.Thus,this study proposes a multiple-object-tracking method using histogram stamp extraction in closed-circuit television applications.The proposed method determines the presence or absence of a target object in an image by comparing the similarity between the image containing the target object and other images.To this end,a unique color value of the target object is extracted based on its color distribution in the image using three methods:mean,mode,and interquartile range.The Top-N accuracy method is used to analyze the accuracy of each method,and the results show that the mean method had an accuracy of 93.5%(Top-2).Furthermore,the positive prediction value experimental results show that the accuracy of the mean method was 65.7%.As a result of the analysis,it is possible to detect and track the same object present in different images using the unique color of the object.Through the results,it is possible to track the same object that can minimize manpower without using personal information when detecting objects in different images.In the last response speed experiment,it was shown that when the mean was used,the color extraction of the object was possible in real time with 0.016954 s.Through this,it is possible to detect and track the same object in real time when using the proposed method.展开更多
Controlling multiple multi-joint fish-like robots has long captivated the attention of engineers and biologists,for which a fundamental but challenging topic is to robustly track the postures of the individuals in rea...Controlling multiple multi-joint fish-like robots has long captivated the attention of engineers and biologists,for which a fundamental but challenging topic is to robustly track the postures of the individuals in real time.This requires detecting multiple robots,estimating multi-joint postures,and tracking identities,as well as processing fast in real time.To the best of our knowledge,this challenge has not been tackled in the previous studies.In this paper,to precisely track the planar postures of multiple swimming multi-joint fish-like robots in real time,we propose a novel deep neural network-based method,named TAB-IOL.Its TAB part fuses the top-down and bottom-up approaches for vision-based pose estimation,while the IOL part with long short-term memory considers the motion constraints among joints for precise pose tracking.The satisfying performance of our TAB-IOL is verified by testing on a group of freely swimming fish-like robots in various scenarios with strong disturbances and by a deed comparison of accuracy,speed,and robustness with most state-of-the-art algorithms.Further,based on the precise pose estimation and tracking realized by our TAB-IOL,several formation control experiments are conducted for the group of fish-like robots.The results clearly demonstrate that our TAB-IOL lays a solid foundation for the coordination control of multiple fish-like robots in a real working environment.We believe our proposed method will facilitate the growth and development of related fields.展开更多
Presently,video surveillance is commonly employed to ensure security in public places such as traffic signals,malls,railway stations,etc.A major chal-lenge in video surveillance is the identification of anomalies that...Presently,video surveillance is commonly employed to ensure security in public places such as traffic signals,malls,railway stations,etc.A major chal-lenge in video surveillance is the identification of anomalies that exist in it such as crimes,thefts,and so on.Besides,the anomaly detection in pedestrian walkways has gained significant attention among the computer vision communities to enhance pedestrian safety.The recent advances of Deep Learning(DL)models have received considerable attention in different processes such as object detec-tion,image classification,etc.In this aspect,this article designs a new Panoptic Feature Pyramid Network based Anomaly Detection and Tracking(PFPN-ADT)model for pedestrian walkways.The proposed model majorly aims to the recognition and classification of different anomalies present in the pedestrian walkway like vehicles,skaters,etc.The proposed model involves panoptic seg-mentation model,called Panoptic Feature Pyramid Network(PFPN)is employed for the object recognition process.For object classification,Compact Bat Algo-rithm(CBA)with Stacked Auto Encoder(SAE)is applied for the classification of recognized objects.For ensuring the enhanced results better anomaly detection performance of the PFPN-ADT technique,a comparison study is made using Uni-versity of California San Diego(UCSD)Anomaly data and other benchmark data-sets(such as Cityscapes,ADE20K,COCO),and the outcomes are compared with the Mask Recurrent Convolutional Neural Network(RCNN)and Faster Convolu-tional Neural Network(CNN)models.The simulation outcome demonstrated the enhanced performance of the PFPN-ADT technique over the other methods.展开更多
Video object tracking is an important research topic of computer vision, whichfinds a wide range of applications in video surveillance, robotics, human-computerinteraction and so on. Although many moving object tracki...Video object tracking is an important research topic of computer vision, whichfinds a wide range of applications in video surveillance, robotics, human-computerinteraction and so on. Although many moving object tracking algorithms have beenproposed, there are still many difficulties in the actual tracking process, such asillumination change, occlusion, motion blurring, scale change, self-change and so on.Therefore, the development of object tracking technology is still challenging. Theemergence of deep learning theory and method provides a new opportunity for theresearch of object tracking, and it is also the main theoretical framework for the researchof moving object tracking algorithm in this paper. In this paper, the existing deeptracking-based target tracking algorithms are classified and sorted out. Based on theprevious knowledge and my own understanding, several solutions are proposed for theexisting methods. In addition, the existing deep learning target tracking method is stilldifficult to meet the requirements of real-time, how to design the network and trackingprocess to achieve speed and effect improvement, there is still a lot of research space.展开更多
With the rapid development of urban rail transit,the existing track detection has some problems such as low efficiency and insufficient detection coverage,so an intelligent and automatic track detectionmethod based on...With the rapid development of urban rail transit,the existing track detection has some problems such as low efficiency and insufficient detection coverage,so an intelligent and automatic track detectionmethod based onUAV is urgently needed to avoid major safety accidents.At the same time,the geographical distribution of IoT devices results in the inefficient use of the significant computing potential held by a large number of devices.As a result,the Dispersed Computing(DCOMP)architecture enables collaborative computing between devices in the Internet of Everything(IoE),promotes low-latency and efficient cross-wide applications,and meets users’growing needs for computing performance and service quality.This paper focuses on examining the resource allocation challenge within a dispersed computing environment that utilizes UAV inspection tracks.Furthermore,the system takes into account both resource constraints and computational constraints and transforms the optimization problem into an energy minimization problem with computational constraints.The Markov Decision Process(MDP)model is employed to capture the connection between the dispersed computing resource allocation strategy and the system environment.Subsequently,a method based on Double Deep Q-Network(DDQN)is introduced to derive the optimal policy.Simultaneously,an experience replay mechanism is implemented to tackle the issue of increasing dimensionality.The experimental simulations validate the efficacy of the method across various scenarios.展开更多
Railway inspection poses significant challenges due to the extensive use of various components in vast railway networks,especially in the case of high-speed railways.These networks demand high maintenance but offer on...Railway inspection poses significant challenges due to the extensive use of various components in vast railway networks,especially in the case of high-speed railways.These networks demand high maintenance but offer only limited inspection windows.In response,this study focuses on developing a high-performance rail inspection system tailored for high-speed railways and railroads with constrained inspection timeframes.This system leverages the latest artificial intelligence advancements,incorporating YOLOv8 for detection.Our research introduces an efficient model inference pipeline based on a producer-consumer model,effectively utilizing parallel processing and concurrent computing to enhance performance.The deployment of this pipeline,implemented using C++,TensorRT,float16 quantization,and oneTBB,represents a significant departure from traditional sequential processing methods.The results are remarkable,showcasing a substantial increase in processing speed:from 38.93 Frames Per Second(FPS)to 281.06 FPS on a desktop system equipped with an Nvidia RTX A6000 GPU and from 19.50 FPS to 200.26 FPS on the Nvidia Jetson AGX Orin edge computing platform.This proposed framework has the potential to meet the real-time inspection requirements of high-speed railways.展开更多
Fluorescence microscopy has become an essential tool for biologists,to visualize the dynamics of intracellular structures with specific labeling.Quantitatively measuring the dynamics of moving objects inside the cell ...Fluorescence microscopy has become an essential tool for biologists,to visualize the dynamics of intracellular structures with specific labeling.Quantitatively measuring the dynamics of moving objects inside the cell is pivotal for understanding of the underlying regulatory mechanism.Protein-containing vesicles are involved in various biological processes such as material transportation,organelle interaction,and hormonal regulation,whose dynamic characteristics are signi¯cant to disease diagnosis and drug screening.Although some algorithms have been developed for vesicle tracking,most of them have limited performance when dealing with images with low resolution,poor signal-to-noise ratio(SNR)and complicated motion.Here,we proposed a novel deep learning-based method for intracellular vesicle tracking.We trained the U-Net for vesicle localization and motion classification,with demonstrates great performance in both simulated datasets and real biological samples.By combination with fan-shaped tracker(FsT)we have previously developed,this hybrid new algorithm significantly improved the performance of particle tracking with the function of subsequently automated vesicle motion classification.Furthermore,its performance was further demonstrated in analyzing with vesicle dynamics in different temperature,which achieved reasonable outcomes.Thus,we anticipate that this novel method would have vast applications in analyzing the vesicle dynamics in living cells.展开更多
Aiming to the problem of pedestrian tracking with frequent or long-term occlusion in complex scenes,an anti-occlusion pedestrian tracking algorithm based on location prediction and deep feature rematch is proposed.Fir...Aiming to the problem of pedestrian tracking with frequent or long-term occlusion in complex scenes,an anti-occlusion pedestrian tracking algorithm based on location prediction and deep feature rematch is proposed.Firstly,the occlusion judgment is realized by extracting and utilizing deep feature of pedestrian’s appearance,and then the scale adaptive kernelized correlation filter is introduced to implement pedestrian tracking without occlusion.Secondly,Karman filter is introduced to predict the location of occluded pedestrian position.Finally,the deep feature is used to the rematch of pedestrian in the reappearance process.Simulation experiment and analysis show that the proposed algorithm can effectively detect and rematch pedestrian under the condition of frequent or long-term occlusion.展开更多
The latest advances in Deep Learning based methods and computational capabilities provide new opportunities for vehicle tracking. In this study, YOLOv2 (You Only Look Once—version 2) is used as an open source Convolu...The latest advances in Deep Learning based methods and computational capabilities provide new opportunities for vehicle tracking. In this study, YOLOv2 (You Only Look Once—version 2) is used as an open source Convolutional Neural Network (CNN), to process high-resolution satellite images, in order to generate the spatio-temporal GIS (Geographic Information System) tracks of moving vehicles. At first step, YOLOv2 is trained with a set of images of 1024 × 1024 resolution from the VEDAI database. The model showed satisfactory results, with an accuracy of 91%, and then at second step, is used to process aerial images extracted from aerial video. The output vehicle bounding boxes have been processed and fed into the GIS based LinkTheDots algorithm, allowing vehicles identification and spatio-temporal tracks generation in GIS format.展开更多
基金supported by the Doctoral Fund Project(Grant No.X22003Z).
文摘As an important component of load transfer,various fatigue damages occur in the track as the rail service life and train traffic increase gradually,such as rail corrugation,rail joint damage,uneven thermite welds,rail squats fas-tener defects,etc.Real-time recognition of track defects plays a vital role in ensuring the safe and stable operation of rail transit.In this paper,an intelligent and innovative method is proposed to detect the track defects by using axle-box vibration acceleration and deep learning network,and the coexistence of the above-mentioned typical track defects in the track system is considered.Firstly,the dynamic relationship between the track defects(using the example of the fastening defects)and the axle-box vibration acceleration(ABVA)is investigated using the dynamic vehicle-track model.Then,a simulation model for the coupled dynamics of the vehicle and track with different track defects is established,and the wavelet power spectrum(WPS)analysis is performed for the vibra-tion acceleration signals of the axle box to extract the characteristic response.Lastly,using wavelet spectrum photos as input,an automatic detection technique based on the deep convolution neural network(DCNN)is sug-gested to realize the real-time intelligent detection and identification of various track problems.Thefindings demonstrate that the suggested approach achieves a 96.72%classification accuracy.
基金supported by the Sichuan Science and Technology Program (Grant:2021YFQ0003,Acquired by Wenfeng Zheng).
文摘Research in the field ofmedical image is an important part of themedical robot to operate human organs.Amedical robot is the intersection ofmulti-disciplinary research fields,in whichmedical image is an important direction and has achieved fruitful results.In this paper,amethodof soft tissue surface feature tracking basedonadepthmatching network is proposed.This method is described based on the triangular matching algorithm.First,we construct a self-made sample set for training the depth matching network from the first N frames of speckle matching data obtained by the triangle matching algorithm.The depth matching network is pre-trained on the ORL face data set and then trained on the self-made training set.After the training,the speckle matching is carried out in the subsequent frames to obtain the speckle matching matrix between the subsequent frames and the first frame.From this matrix,the inter-frame feature matching results can be obtained.In this way,the inter-frame speckle tracking is completed.On this basis,the results of this method are compared with the matching results based on the convolutional neural network.The experimental results show that the proposed method has higher matching accuracy.In particular,the accuracy of the MNIST handwritten data set has reached more than 90%.
基金funded by the Deanship for Research&Innovation,Ministry of Education in Saudi Arabia,for funding this research work through Project Number:IFP22UQU4281768DSR145.
文摘Autism spectrum disorder(ASD)can be defined as a neurodevelopmental condition or illness that can disturb kids who have heterogeneous characteristics,like changes in behavior,social disabilities,and difficulty communicating with others.Eye tracking(ET)has become a useful method to detect ASD.One vital aspect of moral erudition is the aptitude to have common visual attention.The eye-tracking approach offers valuable data regarding the visual behavior of children for accurate and early detection.Eye-tracking data can offer insightful information about the behavior and thought processes of people with ASD,but it is important to be aware of its limitations and to combine it with other types of data and assessment techniques to increase the precision of ASD detection.It operates by scanning the paths of eyes for extracting a series of eye projection points on images for examining the behavior of children with autism.The purpose of this research is to use deep learning to identify autistic disorders based on eye tracking.The Chaotic Butterfly Optimization technique is used to identify this specific disturbance.Therefore,this study develops an ET-based Autism Spectrum Disorder Diagnosis using Chaotic Butterfly Optimization with Deep Learning(ETASD-CBODL)technique.The presented ETASDCBODL technique mainly focuses on the recognition of ASD via the ET and DL models.To accomplish this,the ETASD-CBODL technique exploits the U-Net segmentation technique to recognize interested AREASS.In addition,the ETASD-CBODL technique employs Inception v3 feature extraction with CBO algorithm-based hyperparameter optimization.Finally,the long-shorttermmemory(LSTM)model is exploited for the recognition and classification of ASD.To assess the performance of the ETASD-CBODL technique,a series of simulations were performed on datasets from the figure-shared data repository.The experimental values of accuracy(99.29%),precision(98.78%),sensitivity(99.29%)and specificity(99.29%)showed a better performance in the ETASD-CBODL technique over recent approaches.
文摘Beamforming is significant for millimeter wave multi-user massive multi-input multi-output systems.In the meanwhile,the overhead cost of channel state information and beam training is considerable,especially in dynamic environments.To reduce the overhead cost,we propose a multi-user beam tracking algorithm using a distributed deep Q-learning method.With online learning of users’moving trajectories,the proposed algorithm learns to scan a beam subspace to maximize the average effective sum rate.Considering practical implementation,we model the continuous beam tracking problem as a non-Markov decision process and thus develop a simplified training scheme of deep Q-learning to reduce the training complexity.Furthermore,we propose a scalable state-action-reward design for scenarios with different users and antenna numbers.Simulation results verify the effectiveness of the designed method.
文摘At present days,object detection and tracking concepts have gained more importance among researchers and business people.Presently,deep learning(DL)approaches have been used for object tracking as it increases the perfor-mance and speed of the tracking process.This paper presents a novel robust DL based object detection and tracking algorithm using Automated Image Anno-tation with ResNet based Faster regional convolutional neural network(R-CNN)named(AIA-FRCNN)model.The AIA-RFRCNN method performs image anno-tation using a Discriminative Correlation Filter(DCF)with Channel and Spatial Reliability tracker(CSR)called DCF-CSRT model.The AIA-RFRCNN model makes use of Faster RCNN as an object detector and tracker,which involves region proposal network(RPN)and Fast R-CNN.The RPN is a full convolution network that concurrently predicts the bounding box and score of different objects.The RPN is a trained model used for the generation of the high-quality region proposals,which are utilized by Fast R-CNN for detection process.Besides,Residual Network(ResNet 101)model is used as a shared convolutional neural network(CNN)for the generation of feature maps.The performance of the ResNet 101 model is further improved by the use of Adam optimizer,which tunes the hyperparameters namely learning rate,batch size,momentum,and weight decay.Finally,softmax layer is applied to classify the images.The performance of the AIA-RFRCNN method has been assessed using a benchmark dataset and a detailed comparative analysis of the results takes place.The outcome of the experiments indicated the superior characteristics of the AIA-RFRCNN model under diverse aspects.
文摘To improve the tracking accuracy of persons in the surveillance video,we proposed an algorithm for multi-target tracking persons based on deep learning.In this paper,we used You Only Look Once v5(YOLOv5)to obtain person targets of each frame in the video and used Simple Online and Realtime Tracking with a Deep Association Metric(DeepSORT)to do cascade matching and Intersection Over Union(IOU)matching of person targets between different frames.To solve the IDSwitch problem caused by the low feature extraction ability of the Re-Identification(ReID)network in the process of cascade matching,we introduced Spatial Relation-aware Global Attention(RGA-S)and Channel Relation-aware Global Attention(RGA-C)attention mechanisms into the network structure.The pre-training weights are loaded for Transfer Learning training on the dataset CUHK03.To enhance the discrimination performance of the network,we proposed a new loss function design method,which introduces the Hard-Negative-Mining way into the benchmark triplet loss.To improve the classification accuracy of the network,we introduced a Label-Smoothing regularization method to the cross-entropy loss.To facilitate the model’s convergence stability and convergence speed at the early training stage and to prevent the model from oscillating around the global optimum due to excessive learning rate at the later stage of training,this paper proposed a learning rate regulation method combining Linear-Warmup and exponential decay.The experimental results on CUHK03 show that the mean Average Precision(mAP)of the improved ReID network is 76.5%.The Top 1 is 42.5%,the Top 5 is 65.4%,and the Top 10 is 74.3%in Cumulative Matching Characteristics(CMC);Compared with the original algorithm,the tracking accuracy of the optimized DeepSORT tracking algorithm is improved by 2.5%,the tracking precision is improved by 3.8%.The number of identity switching is reduced by 25%.The algorithm effectively alleviates the IDSwitch problem,improves the tracking accuracy of persons,and has a high practical value.
基金the Framework of International Cooperation Program managed by the National Research Foundation of Korea(2019K1A3A1A8011295711).
文摘Collaborative Robotics is one of the high-interest research topics in the area of academia and industry.It has been progressively utilized in numerous applications,particularly in intelligent surveillance systems.It allows the deployment of smart cameras or optical sensors with computer vision techniques,which may serve in several object detection and tracking tasks.These tasks have been considered challenging and high-level perceptual problems,frequently dominated by relative information about the environment,where main concerns such as occlusion,illumination,background,object deformation,and object class variations are commonplace.In order to show the importance of top view surveillance,a collaborative robotics framework has been presented.It can assist in the detection and tracking of multiple objects in top view surveillance.The framework consists of a smart robotic camera embedded with the visual processing unit.The existing pre-trained deep learning models named SSD and YOLO has been adopted for object detection and localization.The detection models are further combined with different tracking algorithms,including GOTURN,MEDIANFLOW,TLD,KCF,MIL,and BOOSTING.These algorithms,along with detection models,help to track and predict the trajectories of detected objects.The pre-trained models are employed;therefore,the generalization performance is also investigated through testing the models on various sequences of top view data set.The detection models achieved maximum True Detection Rate 93%to 90%with a maximum 0.6%False Detection Rate.The tracking results of different algorithms are nearly identical,with tracking accuracy ranging from 90%to 94%.Furthermore,a discussion has been carried out on output results along with future guidelines.
文摘Traditional monitoring systems that are used in shopping malls or com-munity management,mostly use a remote control to monitor and track specific objects;therefore,it is often impossible to effectively monitor the entire environ-ment.Whenfinding a suspicious person,the tracked object cannot be locked in time for tracking.This research replaces the traditionalfixed-point monitor with the intelligent drone and combines the image processing technology and automatic judgment for the movements of the monitored person.This intelligent system can effectively improve the shortcomings of low efficiency and high cost of the traditional monitor system.In this article,we proposed a TIMT(The Intel-ligent Monitoring and Tracking)algorithm which can make the drone have smart surveillance and tracking capabilities.It combined with Artificial Intelligent(AI)face recognition technology and the OpenPose which is able to monitor the phy-sical movements of multiple people in real time to analyze the meaning of human body movements and to track the monitored intelligently through the remote con-trol interface of the drone.This system is highly agile and could be adjusted immediately to any angle and screen that we monitor.Therefore,the system couldfind abnormal conditions immediately and track and monitor them automatically.That is the system can immediately detect when someone invades the home or community,and the drone can automatically track the intruder to achieve that the two significant shortcomings of the traditional monitor will be improved.Experimental results show that the intelligent monitoring and tracking drone sys-tem has an excellent performance,which not only dramatically reduces the num-ber of monitors and the required equipment but also achieves perfect monitoring and tracking.
基金supported by the National Research Foundation of Korea(NRF)grant funded by the Korea government(MSIT)(No.2022R1F1A1068828).
文摘Object tracking,an important technology in the field of image processing and computer vision,is used to continuously track a specific object or person in an image.This technology may be effective in identifying the same person within one image,but it has limitations in handling multiple images owing to the difficulty in identifying whether the object appearing in other images is the same.When tracking the same object using two or more images,there must be a way to determine that objects existing in different images are the same object.Therefore,this paper attempts to determine the same object present in different images using color information among the unique information of the object.Thus,this study proposes a multiple-object-tracking method using histogram stamp extraction in closed-circuit television applications.The proposed method determines the presence or absence of a target object in an image by comparing the similarity between the image containing the target object and other images.To this end,a unique color value of the target object is extracted based on its color distribution in the image using three methods:mean,mode,and interquartile range.The Top-N accuracy method is used to analyze the accuracy of each method,and the results show that the mean method had an accuracy of 93.5%(Top-2).Furthermore,the positive prediction value experimental results show that the accuracy of the mean method was 65.7%.As a result of the analysis,it is possible to detect and track the same object present in different images using the unique color of the object.Through the results,it is possible to track the same object that can minimize manpower without using personal information when detecting objects in different images.In the last response speed experiment,it was shown that when the mean was used,the color extraction of the object was possible in real time with 0.016954 s.Through this,it is possible to detect and track the same object in real time when using the proposed method.
基金This work was supported in part by the National Natural Science Foundation of China(61973007,61633002).
文摘Controlling multiple multi-joint fish-like robots has long captivated the attention of engineers and biologists,for which a fundamental but challenging topic is to robustly track the postures of the individuals in real time.This requires detecting multiple robots,estimating multi-joint postures,and tracking identities,as well as processing fast in real time.To the best of our knowledge,this challenge has not been tackled in the previous studies.In this paper,to precisely track the planar postures of multiple swimming multi-joint fish-like robots in real time,we propose a novel deep neural network-based method,named TAB-IOL.Its TAB part fuses the top-down and bottom-up approaches for vision-based pose estimation,while the IOL part with long short-term memory considers the motion constraints among joints for precise pose tracking.The satisfying performance of our TAB-IOL is verified by testing on a group of freely swimming fish-like robots in various scenarios with strong disturbances and by a deed comparison of accuracy,speed,and robustness with most state-of-the-art algorithms.Further,based on the precise pose estimation and tracking realized by our TAB-IOL,several formation control experiments are conducted for the group of fish-like robots.The results clearly demonstrate that our TAB-IOL lays a solid foundation for the coordination control of multiple fish-like robots in a real working environment.We believe our proposed method will facilitate the growth and development of related fields.
文摘Presently,video surveillance is commonly employed to ensure security in public places such as traffic signals,malls,railway stations,etc.A major chal-lenge in video surveillance is the identification of anomalies that exist in it such as crimes,thefts,and so on.Besides,the anomaly detection in pedestrian walkways has gained significant attention among the computer vision communities to enhance pedestrian safety.The recent advances of Deep Learning(DL)models have received considerable attention in different processes such as object detec-tion,image classification,etc.In this aspect,this article designs a new Panoptic Feature Pyramid Network based Anomaly Detection and Tracking(PFPN-ADT)model for pedestrian walkways.The proposed model majorly aims to the recognition and classification of different anomalies present in the pedestrian walkway like vehicles,skaters,etc.The proposed model involves panoptic seg-mentation model,called Panoptic Feature Pyramid Network(PFPN)is employed for the object recognition process.For object classification,Compact Bat Algo-rithm(CBA)with Stacked Auto Encoder(SAE)is applied for the classification of recognized objects.For ensuring the enhanced results better anomaly detection performance of the PFPN-ADT technique,a comparison study is made using Uni-versity of California San Diego(UCSD)Anomaly data and other benchmark data-sets(such as Cityscapes,ADE20K,COCO),and the outcomes are compared with the Mask Recurrent Convolutional Neural Network(RCNN)and Faster Convolu-tional Neural Network(CNN)models.The simulation outcome demonstrated the enhanced performance of the PFPN-ADT technique over the other methods.
基金supported by National Natural Science Foundationof China (Grant No. 51874300)the National Natural Science Foundation of China andShanxi Provincial People’s Government Jointly Funded Project of China for Coal Baseand Low Carbon (Grant No. U1510115)+2 种基金National Natural Science Foundation of China(51104157)the Qing Lan Project, the China Postdoctoral Science Foundation (Grant No.2013T60574)the Scientific Instrument Developing Project of the Chinese Academy ofSciences (Grant No. YJKYYQ20170074).
文摘Video object tracking is an important research topic of computer vision, whichfinds a wide range of applications in video surveillance, robotics, human-computerinteraction and so on. Although many moving object tracking algorithms have beenproposed, there are still many difficulties in the actual tracking process, such asillumination change, occlusion, motion blurring, scale change, self-change and so on.Therefore, the development of object tracking technology is still challenging. Theemergence of deep learning theory and method provides a new opportunity for theresearch of object tracking, and it is also the main theoretical framework for the researchof moving object tracking algorithm in this paper. In this paper, the existing deeptracking-based target tracking algorithms are classified and sorted out. Based on theprevious knowledge and my own understanding, several solutions are proposed for theexisting methods. In addition, the existing deep learning target tracking method is stilldifficult to meet the requirements of real-time, how to design the network and trackingprocess to achieve speed and effect improvement, there is still a lot of research space.
文摘With the rapid development of urban rail transit,the existing track detection has some problems such as low efficiency and insufficient detection coverage,so an intelligent and automatic track detectionmethod based onUAV is urgently needed to avoid major safety accidents.At the same time,the geographical distribution of IoT devices results in the inefficient use of the significant computing potential held by a large number of devices.As a result,the Dispersed Computing(DCOMP)architecture enables collaborative computing between devices in the Internet of Everything(IoE),promotes low-latency and efficient cross-wide applications,and meets users’growing needs for computing performance and service quality.This paper focuses on examining the resource allocation challenge within a dispersed computing environment that utilizes UAV inspection tracks.Furthermore,the system takes into account both resource constraints and computational constraints and transforms the optimization problem into an energy minimization problem with computational constraints.The Markov Decision Process(MDP)model is employed to capture the connection between the dispersed computing resource allocation strategy and the system environment.Subsequently,a method based on Double Deep Q-Network(DDQN)is introduced to derive the optimal policy.Simultaneously,an experience replay mechanism is implemented to tackle the issue of increasing dimensionality.The experimental simulations validate the efficacy of the method across various scenarios.
基金supported by the Federal Railroad Administration (FRA)the National Academy of Science (NAS) IDEA program
文摘Railway inspection poses significant challenges due to the extensive use of various components in vast railway networks,especially in the case of high-speed railways.These networks demand high maintenance but offer only limited inspection windows.In response,this study focuses on developing a high-performance rail inspection system tailored for high-speed railways and railroads with constrained inspection timeframes.This system leverages the latest artificial intelligence advancements,incorporating YOLOv8 for detection.Our research introduces an efficient model inference pipeline based on a producer-consumer model,effectively utilizing parallel processing and concurrent computing to enhance performance.The deployment of this pipeline,implemented using C++,TensorRT,float16 quantization,and oneTBB,represents a significant departure from traditional sequential processing methods.The results are remarkable,showcasing a substantial increase in processing speed:from 38.93 Frames Per Second(FPS)to 281.06 FPS on a desktop system equipped with an Nvidia RTX A6000 GPU and from 19.50 FPS to 200.26 FPS on the Nvidia Jetson AGX Orin edge computing platform.This proposed framework has the potential to meet the real-time inspection requirements of high-speed railways.
基金supported by the National Key Research and Development Program of China(2021YFF0700305 and 2018YFE0119000)the National Natural Science Foundation of China(22104129 and 62105288)+1 种基金Zhejiang Province Science and Technology Research Plan(2022C03014)the Fundamental Research Funds for the Central Universities(2021XZZX022)and Alibaba Cloud.
文摘Fluorescence microscopy has become an essential tool for biologists,to visualize the dynamics of intracellular structures with specific labeling.Quantitatively measuring the dynamics of moving objects inside the cell is pivotal for understanding of the underlying regulatory mechanism.Protein-containing vesicles are involved in various biological processes such as material transportation,organelle interaction,and hormonal regulation,whose dynamic characteristics are signi¯cant to disease diagnosis and drug screening.Although some algorithms have been developed for vesicle tracking,most of them have limited performance when dealing with images with low resolution,poor signal-to-noise ratio(SNR)and complicated motion.Here,we proposed a novel deep learning-based method for intracellular vesicle tracking.We trained the U-Net for vesicle localization and motion classification,with demonstrates great performance in both simulated datasets and real biological samples.By combination with fan-shaped tracker(FsT)we have previously developed,this hybrid new algorithm significantly improved the performance of particle tracking with the function of subsequently automated vesicle motion classification.Furthermore,its performance was further demonstrated in analyzing with vesicle dynamics in different temperature,which achieved reasonable outcomes.Thus,we anticipate that this novel method would have vast applications in analyzing the vesicle dynamics in living cells.
基金the National Natural Science Foundation of China(No.61976080,61771006)the Key Project of Henan Province Education Department(No.19A413006).
文摘Aiming to the problem of pedestrian tracking with frequent or long-term occlusion in complex scenes,an anti-occlusion pedestrian tracking algorithm based on location prediction and deep feature rematch is proposed.Firstly,the occlusion judgment is realized by extracting and utilizing deep feature of pedestrian’s appearance,and then the scale adaptive kernelized correlation filter is introduced to implement pedestrian tracking without occlusion.Secondly,Karman filter is introduced to predict the location of occluded pedestrian position.Finally,the deep feature is used to the rematch of pedestrian in the reappearance process.Simulation experiment and analysis show that the proposed algorithm can effectively detect and rematch pedestrian under the condition of frequent or long-term occlusion.
文摘The latest advances in Deep Learning based methods and computational capabilities provide new opportunities for vehicle tracking. In this study, YOLOv2 (You Only Look Once—version 2) is used as an open source Convolutional Neural Network (CNN), to process high-resolution satellite images, in order to generate the spatio-temporal GIS (Geographic Information System) tracks of moving vehicles. At first step, YOLOv2 is trained with a set of images of 1024 × 1024 resolution from the VEDAI database. The model showed satisfactory results, with an accuracy of 91%, and then at second step, is used to process aerial images extracted from aerial video. The output vehicle bounding boxes have been processed and fed into the GIS based LinkTheDots algorithm, allowing vehicles identification and spatio-temporal tracks generation in GIS format.