Visual object tracking plays a crucial role in computer vision.In recent years,researchers have proposed various methods to achieve high-performance object tracking.Among these,methods based on Transformers have becom...Visual object tracking plays a crucial role in computer vision.In recent years,researchers have proposed various methods to achieve high-performance object tracking.Among these,methods based on Transformers have become a research hotspot due to their ability to globally model and contextualize information.However,current Transformer-based object tracking methods still face challenges such as low tracking accuracy and the presence of redundant feature information.In this paper,we introduce self-calibration multi-head self-attention Transformer(SMSTracker)as a solution to these challenges.It employs a hybrid tensor decomposition self-organizing multihead self-attention transformermechanism,which not only compresses and accelerates Transformer operations but also significantly reduces redundant data,thereby enhancing the accuracy and efficiency of tracking.Additionally,we introduce a self-calibration attention fusion block to resolve common issues of attention ambiguities and inconsistencies found in traditional trackingmethods,ensuring the stability and reliability of tracking performance across various scenarios.By integrating a hybrid tensor decomposition approach with a self-organizingmulti-head self-attentive transformer mechanism,SMSTracker enhances the efficiency and accuracy of the tracking process.Experimental results show that SMSTracker achieves competitive performance in visual object tracking,promising more robust and efficient tracking systems,demonstrating its potential to providemore robust and efficient tracking solutions in real-world applications.展开更多
Significant advancements have beenwitnessed in visual tracking applications leveragingViT in recent years,mainly due to the formidablemodeling capabilities of Vision Transformer(ViT).However,the strong performance of ...Significant advancements have beenwitnessed in visual tracking applications leveragingViT in recent years,mainly due to the formidablemodeling capabilities of Vision Transformer(ViT).However,the strong performance of such trackers heavily relies on ViT models pretrained for long periods,limitingmore flexible model designs for tracking tasks.To address this issue,we propose an efficient unsupervised ViT pretraining method for the tracking task based on masked autoencoders,called TrackMAE.During pretraining,we employ two shared-parameter ViTs,serving as the appearance encoder and motion encoder,respectively.The appearance encoder encodes randomly masked image data,while the motion encoder encodes randomly masked pairs of video frames.Subsequently,an appearance decoder and a motion decoder separately reconstruct the original image data and video frame data at the pixel level.In this way,ViT learns to understand both the appearance of images and the motion between video frames simultaneously.Experimental results demonstrate that ViT-Base and ViT-Large models,pretrained with TrackMAE and combined with a simple tracking head,achieve state-of-the-art(SOTA)performance without additional design.Moreover,compared to the currently popular MAE pretraining methods,TrackMAE consumes only 1/5 of the training time,which will facilitate the customization of diverse models for tracking.For instance,we additionally customize a lightweight ViT-XS,which achieves SOTA efficient tracking performance.展开更多
Visual object-tracking is a fundamental task applied in many applications of computer vision. Particle filter is one of the techniques which has been widely used in object tracking. Due to the virtue of extendability ...Visual object-tracking is a fundamental task applied in many applications of computer vision. Particle filter is one of the techniques which has been widely used in object tracking. Due to the virtue of extendability and flexibility on both linear and non-linear environments, various particle filter-based trackers have been proposed in the literature. However, the conventional approach cannot handle very large videos efficiently in the current data intensive information age. In this work, a parallelized particle filter is provided in a distributed framework provided by the Hadoop/Map-Reduce infrastructure to tackle object-tracking tasks. The experiments indicate that the proposed algorithm has a better convergence and accuracy as compared to the traditional particle filter. The computational power and the scalability of the proposed particle filter in single object tracking have been enhanced as well.展开更多
This paper discusses about the new approach of multiple object track-ing relative to background information.The concept of multiple object tracking through background learning is based upon the theory of relativity,th...This paper discusses about the new approach of multiple object track-ing relative to background information.The concept of multiple object tracking through background learning is based upon the theory of relativity,that involves a frame of reference in spatial domain to localize and/or track any object.Thefield of multiple object tracking has seen a lot of research,but researchers have considered the background as redundant.However,in object tracking,the back-ground plays a vital role and leads to definite improvement in the overall process of tracking.In the present work an algorithm is proposed for the multiple object tracking through background learning.The learning framework is based on graph embedding approach for localizing multiple objects.The graph utilizes the inher-ent capabilities of depth modelling that assist in prior to track occlusion avoidance among multiple objects.The proposed algorithm has been compared with the recent work available in literature on numerous performance evaluation measures.It is observed that our proposed algorithm gives better performance.展开更多
Object tracking,an important technology in the field of image processing and computer vision,is used to continuously track a specific object or person in an image.This technology may be effective in identifying the sa...Object tracking,an important technology in the field of image processing and computer vision,is used to continuously track a specific object or person in an image.This technology may be effective in identifying the same person within one image,but it has limitations in handling multiple images owing to the difficulty in identifying whether the object appearing in other images is the same.When tracking the same object using two or more images,there must be a way to determine that objects existing in different images are the same object.Therefore,this paper attempts to determine the same object present in different images using color information among the unique information of the object.Thus,this study proposes a multiple-object-tracking method using histogram stamp extraction in closed-circuit television applications.The proposed method determines the presence or absence of a target object in an image by comparing the similarity between the image containing the target object and other images.To this end,a unique color value of the target object is extracted based on its color distribution in the image using three methods:mean,mode,and interquartile range.The Top-N accuracy method is used to analyze the accuracy of each method,and the results show that the mean method had an accuracy of 93.5%(Top-2).Furthermore,the positive prediction value experimental results show that the accuracy of the mean method was 65.7%.As a result of the analysis,it is possible to detect and track the same object present in different images using the unique color of the object.Through the results,it is possible to track the same object that can minimize manpower without using personal information when detecting objects in different images.In the last response speed experiment,it was shown that when the mean was used,the color extraction of the object was possible in real time with 0.016954 s.Through this,it is possible to detect and track the same object in real time when using the proposed method.展开更多
This research introduces a challenge in integrating and cleaning the data,which is a crucial task in object matching.While the object is detected and then measured,the vibration at different light intensities may influ...This research introduces a challenge in integrating and cleaning the data,which is a crucial task in object matching.While the object is detected and then measured,the vibration at different light intensities may influence the durability and reliability of mechanical systems or structures and cause problems such as damage,abnormal stopping,and disaster.Recent research failed to improve the accuracy rate and the computation time in tracking an object and in the vibration measurement.To solve all these problems,this proposed research simplifies the scaling factor determination by assigning a known real-world dimension to a predetermined portion of the image.A novel white color sticker of the known dimensions marked with a color dot is pasted on the surface of an object for the best result in the template matching using the Improved Up-Sampled Cross-Correlation(UCC)algorithm.The vibration measurement is calculated using the Finite-Difference Algorithm(FDA),a machine vision systemfitted with a macro lens sensor that is capable of capturing the image at a closer range,which does not affect the quality of displacement measurement from the video frames.Thefield test was conducted on the TAFE(Tractors and Farm Equipment Limited)tractor parts,and the percentage of error was recorded between 30%and 50%at very low vibration values close to zero,whereas it was recorded between 5%and 10%error in most high-accelerations,the essential range for vibration analysis.Finally,the suggested system is more suitable for measuring the vibration of stationary machinery having low frequency ranges.The use of a macro lens enables to capture of image frames at very close-ups.A 30%to 50%error percentage has been reported when the vibration amplitude is very small.Therefore,this study is not suitable for Nano vibration analysis.展开更多
Aiming at the problem that a single correlation filter model is sensitive to complex scenes such as background interference and occlusion,a tracking algorithm based on multi-time-space perception and instance-specific...Aiming at the problem that a single correlation filter model is sensitive to complex scenes such as background interference and occlusion,a tracking algorithm based on multi-time-space perception and instance-specific proposals is proposed to optimize the mathematical model of the correlation filter(CF).Firstly,according to the consistency of the changes between the object frames and the filter frames,the mask matrix is introduced into the objective function of the filter,so as to extract the spatio-temporal information of the object with background awareness.Secondly,the object function of multi-feature fusion is constructed for the object location,which is optimized by the Lagrange method and solved by closed iteration.In the process of filter optimization,the constraints term of time-space perception is designed to enhance the learning ability of the CF to optimize the final track-ing results.Finally,when the tracking results fluctuate,the boundary suppres-sion factor is introduced into the instance-specific proposals to reduce the risk of model drift effectively.The accuracy and success rate of the proposed algorithm are verified by simulation analysis on two popular benchmarks,the object tracking benchmark 2015(OTB2015)and the temple color 128(TC-128).Extensive experimental results illustrate that the optimized appearance model of the proposed algorithm is effective.The distance precision rate and overlap success rate of the proposed algorithm are 0.756 and 0.656 on the OTB2015 benchmark,which are better than the results of other competing algorithms.The results of this study can solve the problem of real-time object tracking in the real traffic environment and provide a specific reference for the detection of traffic abnormalities.展开更多
The field of object tracking has recently made significant progress.Particularly,the performance results in both deep learning and correlation filters,based trackers achieved effective tracking performance.Moreover,th...The field of object tracking has recently made significant progress.Particularly,the performance results in both deep learning and correlation filters,based trackers achieved effective tracking performance.Moreover,there are still some difficulties with object tracking for example illumination and deformation(DEF).The precision and accuracy of tracking algorithms suffer from the effects of such occurrences.For this situation,finding a solution is important.This research proposes a new tracking algorithm to handle this problem.The features are extracted by using Modified LeNet-5,and the precision and accuracy are improved by developing the Real-Time Cross-modality Correlation Filtering method(RCCF).In Modified LeNet-5,the visual tracking performance is improved by adjusting the number and size of the convolution kernels in the pooling and convolution layers.The high-level,middle-level,and handcraft features are extracted from the modified LeNet-5 network.The handcraft features are used to determine the specific location of the target because the handcraft features contain more spatial information regarding the visual object.The LeNet features are more suitable for a target appearance change in object tracking.Extensive experiments were conducted by the Object Tracking Benchmarking(OTB)databases like OTB50 and OTB100.The experimental results reveal that the proposed tracker outperforms other state-of-the-art trackers under different problems.The experimental simulation is carried out in python.The overall success rate and precision of the proposed algorithm are 93.8%and 92.5%.The average running frame rate reaches 42 frames per second,which can meet the real-time requirements.展开更多
An improved estimation of motion vectors of feature points is proposed for tracking moving objects of dynamic image sequence. Feature points are firstly extracted by the improved minimum intensity change (MIC) algor...An improved estimation of motion vectors of feature points is proposed for tracking moving objects of dynamic image sequence. Feature points are firstly extracted by the improved minimum intensity change (MIC) algorithm. The matching points of these feature points are then determined by adaptive rood pattern searching. Based on the random sample consensus (RANSAC) method, the background motion is finally compensated by the parameters of an affine transform of the background motion. With reasonable morphological filtering, the moving objects are completely extracted from the background, and then tracked accurately. Experimental results show that the improved method is successful on the motion background compensation and offers great promise in tracking moving objects of the dynamic image sequence.展开更多
There are two main trends in the development of unmanned aerial vehicle(UAV)technologies:miniaturization and intellectualization,in which realizing object tracking capabilities for a nano-scale UAV is one of the most ...There are two main trends in the development of unmanned aerial vehicle(UAV)technologies:miniaturization and intellectualization,in which realizing object tracking capabilities for a nano-scale UAV is one of the most challenging problems.In this paper,we present a visual object tracking and servoing control system utilizing a tailor-made 38 g nano-scale quadrotor.A lightweight visual module is integrated to enable object tracking capabilities,and a micro positioning deck is mounted to provide accurate pose estimation.In order to be robust against object appearance variations,a novel object tracking algorithm,denoted by RMCTer,is proposed,which integrates a powerful short-term tracking module and an efficient long-term processing module.In particular,the long-term processing module can provide additional object information and modify the short-term tracking model in a timely manner.Furthermore,a positionbased visual servoing control method is proposed for the quadrotor,where an adaptive tracking controller is designed by leveraging backstepping and adaptive techniques.Stable and accurate object tracking is achieved even under disturbances.Experimental results are presented to demonstrate the high accuracy and stability of the whole tracking system.展开更多
Inspired by human behaviors, a robot object tracking model is proposed on the basis of visual attention mechanism, which is fit for the theory of topological perception. The model integrates the image-driven, bottom-u...Inspired by human behaviors, a robot object tracking model is proposed on the basis of visual attention mechanism, which is fit for the theory of topological perception. The model integrates the image-driven, bottom-up attention and the object-driven, top-down attention, whereas the previous attention model has mostly focused on either the bottom-up or top-down attention. By the bottom-up component, the whole scene is segmented into the ground region and the salient regions. Guided by top-down strategy which is achieved by a topological graph, the object regions are separated from the salient regions. The salient regions except the object regions are the barrier regions. In order to estimate the model, a mobile robot platform is developed, on which some experiments are implemented. The experimental results indicate that processing an image with a resolution of 752 × 480 pixels takes less than 200 ms and the object regions are unabridged. The analysis obtained by comparing the proposed model with the existing model demonstrates that the proposed model has some advantages in robot object tracking in terms of speed and efficiency.展开更多
Single object tracking based on deep learning has achieved the advanced performance in many applications of computer vision.However,the existing trackers have certain limitations owing to deformation,occlusion,movemen...Single object tracking based on deep learning has achieved the advanced performance in many applications of computer vision.However,the existing trackers have certain limitations owing to deformation,occlusion,movement and some other conditions.We propose a siamese attentional dense network called SiamADN in an end-to-end offline manner,especially aiming at unmanned aerial vehicle(UAV)tracking.First,it applies a dense network to reduce vanishing-gradient,which strengthens the features transfer.Second,the channel attention mechanism is involved into the Densenet structure,in order to focus on the possible key regions.The advance corner detection network is introduced to improve the following tracking process.Extensive experiments are carried out on four mainly tracking benchmarks as OTB-2015,UAV123,LaSOT and VOT.The accuracy rate on UAV123 is 78.9%,and the running speed is 32 frame per second(FPS),which demonstrates its efficiency in the practical real application.展开更多
Video object tracking is an important research topic of computer vision, whichfinds a wide range of applications in video surveillance, robotics, human-computerinteraction and so on. Although many moving object tracki...Video object tracking is an important research topic of computer vision, whichfinds a wide range of applications in video surveillance, robotics, human-computerinteraction and so on. Although many moving object tracking algorithms have beenproposed, there are still many difficulties in the actual tracking process, such asillumination change, occlusion, motion blurring, scale change, self-change and so on.Therefore, the development of object tracking technology is still challenging. Theemergence of deep learning theory and method provides a new opportunity for theresearch of object tracking, and it is also the main theoretical framework for the researchof moving object tracking algorithm in this paper. In this paper, the existing deeptracking-based target tracking algorithms are classified and sorted out. Based on theprevious knowledge and my own understanding, several solutions are proposed for theexisting methods. In addition, the existing deep learning target tracking method is stilldifficult to meet the requirements of real-time, how to design the network and trackingprocess to achieve speed and effect improvement, there is still a lot of research space.展开更多
On accomplishing an efficacious object tracking,the activity of an object concerned becomes notified in a forthright manner.An accurate form of object tracking task necessitates a robust object tracking procedures irr...On accomplishing an efficacious object tracking,the activity of an object concerned becomes notified in a forthright manner.An accurate form of object tracking task necessitates a robust object tracking procedures irrespective of hardware assistance.Such approaches inferred a vast computational complexity to track an object with high accuracy in a stipulated amount of processing time.On the other hand,the tracking gets affected owing to the existence of varied quality diminishing factors such as occlusion,illumination changes,shadows etc.,In order to rectify all these inadequacies in tracking an object,a novel background normalization procedure articulated on the basis of a textural pattern is proposed in this paper.After preprocessing an acquired image,employment of an Environmental Succession Prediction algorithm for discriminating disparate background environment by background clustering approach have been accomplished.Afterward,abstract textural characterizations through utilization of a Probability based Gradient Pattern(PGP)approach for recognizing the similarity between patterns obtained so far.Comparison between standardized frame obtained in prior and those processed patterns detects the motion exposed by an object and the object concerned gets identified within a blob.Hence,the system is resistant towards illumination variations.These illumination variation was interpreted in object tracking residing within a dynamic background.Devised approach certainly outperforms other object tracking methodologies like Group Target Tracking(GTT),Vi PER-GT,grabcut,snakes in terms of accuracy and average time.Proposed PGP-based pattern texture analysis is compared with Gamifying Video Object(GVO)approach and hence,it evidently outperforms in terms of precision,recall and F1 measure.展开更多
In this paper, a novel object tracking based on a particle filter and speeded up robust feature (SURF) is proposed, which uses both color and SURF features. The SURF feature makes the tracking result more robust. On...In this paper, a novel object tracking based on a particle filter and speeded up robust feature (SURF) is proposed, which uses both color and SURF features. The SURF feature makes the tracking result more robust. On the other hand, the particle selection can lead to save time. In addition, we also consider the matched particle applicable to calculating the SURF weight. Owing to the color, spatial, and SURF features being adopted, this method is more robust than the traditional color-based appearance model. Experimental results demonstrate the robustness and accurate tracking results with challenging sequences. Besides, the proposed method outperforms other methods during the intersection of similar color and object's partial occlusion.展开更多
Object tracking with abrupt motion is an important research topic and has attracted wide attention.To obtain accurate tracking results,an improved particle filter tracking algorithm based on sparse representation and ...Object tracking with abrupt motion is an important research topic and has attracted wide attention.To obtain accurate tracking results,an improved particle filter tracking algorithm based on sparse representation and nonlinear resampling is proposed in this paper. First,the sparse representation is used to compute particle weights by considering the fact that the weights are sparse when the object moves abruptly,so the potential object region can be predicted more precisely. Then,a nonlinear resampling process is proposed by utilizing the nonlinear sorting strategy,which can solve the problem of particle diversity impoverishment caused by traditional resampling methods. Experimental results based on videos containing objects with various abrupt motions have demonstrated the effectiveness of the proposed algorithm.展开更多
This paper presents a robust object tracking approach via a spatially constrained colour model. Local image patches of the object and spatial relation between these patches are informative and stable during object tra...This paper presents a robust object tracking approach via a spatially constrained colour model. Local image patches of the object and spatial relation between these patches are informative and stable during object tracking. So, we propose to partition an object into patches and develop a Spatially Constrained Colour Model (SCCM) by combining the colour distributions and spatial configuration of these patches. The likelihood of the candidate object is given by estimating the confidences of the pixels in the candidate object region. The appearance model is learnt from the first frame and the tracking is carried out by particle filter. The experimental results show that the proposed tracking approach can accurately track the object with scale changes, pose variance and partial occlusion.展开更多
In YCbCr colorspace, a method is proposed to reconstruct the background and extract moving objects based on the Gaussian model of chroma components. Background model is updated according to changes of chroma component...In YCbCr colorspace, a method is proposed to reconstruct the background and extract moving objects based on the Gaussian model of chroma components. Background model is updated according to changes of chroma components. In order to eliminate the disturbance of shadow, a shadow detecting principle is proposed in YCbCr colorspace. A Kalman filter is introduced to estimate objects' positions in the image and then the pedestrian is tracked according to its information of shape. Experiments show that the background reconstruction and updating are successful, object extraction and shadow suppression are satisfactory, and real-time and reliable tracking is realized.展开更多
Recently,Siamese-based trackers have achieved excellent performance in object tracking.However,the high speed and deformation of objects in the movement process make tracking difficult.Therefore,we have incorporated c...Recently,Siamese-based trackers have achieved excellent performance in object tracking.However,the high speed and deformation of objects in the movement process make tracking difficult.Therefore,we have incorporated cascaded region-proposal-network(RPN)fusion and coordinate attention into Siamese trackers.The proposed network framework consists of three parts:a feature-extraction sub-network,coordinate attention block,and cascaded RPN block.We exploit the coordinate attention block,which can embed location information into channel attention,to establish long-term spatial location dependence while maintaining channel associations.Thus,the features of different layers are enhanced by the coordinate attention block.We then send these features separately into the cascaded RPN for classification and regression.According to the two classification and regression results,the final position of the target is obtained.To verify the effectiveness of the proposed method,we conducted comprehensive experiments on the OTB100,VOT2016,UAV123,and GOT-10k datasets.Compared with other state-of-the-art trackers,the proposed tracker achieved good performance and can run at real-time speed.展开更多
Mean shift,an iterative procedure that shifts each data point to the average of data points in its neighborhood,has been applied to object tracker.However,the traditional mean shift tracker by isotropic kernel often l...Mean shift,an iterative procedure that shifts each data point to the average of data points in its neighborhood,has been applied to object tracker.However,the traditional mean shift tracker by isotropic kernel often loses the object with the changing object structure in video sequences,especially when the object structure varies fast.This paper proposes a non-rigid object tracker by anisotropic kernel mean shift in which the shape,scale,and orientation of the kernels adapt to the changing object structure.The experimental results show that the new tracker is self-adaptive and approximately twice faster than the traditional tracker,which ensures the robustness and real time of tracking.展开更多
基金supported by the National Natural Science Foundation of China under Grant 62177029the Postgraduate Research&Practice Innovation Program of Jiangsu Province(KYCX21_0740),China.
文摘Visual object tracking plays a crucial role in computer vision.In recent years,researchers have proposed various methods to achieve high-performance object tracking.Among these,methods based on Transformers have become a research hotspot due to their ability to globally model and contextualize information.However,current Transformer-based object tracking methods still face challenges such as low tracking accuracy and the presence of redundant feature information.In this paper,we introduce self-calibration multi-head self-attention Transformer(SMSTracker)as a solution to these challenges.It employs a hybrid tensor decomposition self-organizing multihead self-attention transformermechanism,which not only compresses and accelerates Transformer operations but also significantly reduces redundant data,thereby enhancing the accuracy and efficiency of tracking.Additionally,we introduce a self-calibration attention fusion block to resolve common issues of attention ambiguities and inconsistencies found in traditional trackingmethods,ensuring the stability and reliability of tracking performance across various scenarios.By integrating a hybrid tensor decomposition approach with a self-organizingmulti-head self-attentive transformer mechanism,SMSTracker enhances the efficiency and accuracy of the tracking process.Experimental results show that SMSTracker achieves competitive performance in visual object tracking,promising more robust and efficient tracking systems,demonstrating its potential to providemore robust and efficient tracking solutions in real-world applications.
基金supported in part by National Natural Science Foundation of China(No.62176041)in part by Excellent Science and Technique Talent Foundation of Dalian(No.2022RY21).
文摘Significant advancements have beenwitnessed in visual tracking applications leveragingViT in recent years,mainly due to the formidablemodeling capabilities of Vision Transformer(ViT).However,the strong performance of such trackers heavily relies on ViT models pretrained for long periods,limitingmore flexible model designs for tracking tasks.To address this issue,we propose an efficient unsupervised ViT pretraining method for the tracking task based on masked autoencoders,called TrackMAE.During pretraining,we employ two shared-parameter ViTs,serving as the appearance encoder and motion encoder,respectively.The appearance encoder encodes randomly masked image data,while the motion encoder encodes randomly masked pairs of video frames.Subsequently,an appearance decoder and a motion decoder separately reconstruct the original image data and video frame data at the pixel level.In this way,ViT learns to understand both the appearance of images and the motion between video frames simultaneously.Experimental results demonstrate that ViT-Base and ViT-Large models,pretrained with TrackMAE and combined with a simple tracking head,achieve state-of-the-art(SOTA)performance without additional design.Moreover,compared to the currently popular MAE pretraining methods,TrackMAE consumes only 1/5 of the training time,which will facilitate the customization of diverse models for tracking.For instance,we additionally customize a lightweight ViT-XS,which achieves SOTA efficient tracking performance.
文摘Visual object-tracking is a fundamental task applied in many applications of computer vision. Particle filter is one of the techniques which has been widely used in object tracking. Due to the virtue of extendability and flexibility on both linear and non-linear environments, various particle filter-based trackers have been proposed in the literature. However, the conventional approach cannot handle very large videos efficiently in the current data intensive information age. In this work, a parallelized particle filter is provided in a distributed framework provided by the Hadoop/Map-Reduce infrastructure to tackle object-tracking tasks. The experiments indicate that the proposed algorithm has a better convergence and accuracy as compared to the traditional particle filter. The computational power and the scalability of the proposed particle filter in single object tracking have been enhanced as well.
文摘This paper discusses about the new approach of multiple object track-ing relative to background information.The concept of multiple object tracking through background learning is based upon the theory of relativity,that involves a frame of reference in spatial domain to localize and/or track any object.Thefield of multiple object tracking has seen a lot of research,but researchers have considered the background as redundant.However,in object tracking,the back-ground plays a vital role and leads to definite improvement in the overall process of tracking.In the present work an algorithm is proposed for the multiple object tracking through background learning.The learning framework is based on graph embedding approach for localizing multiple objects.The graph utilizes the inher-ent capabilities of depth modelling that assist in prior to track occlusion avoidance among multiple objects.The proposed algorithm has been compared with the recent work available in literature on numerous performance evaluation measures.It is observed that our proposed algorithm gives better performance.
基金supported by the National Research Foundation of Korea(NRF)grant funded by the Korea government(MSIT)(No.2022R1F1A1068828).
文摘Object tracking,an important technology in the field of image processing and computer vision,is used to continuously track a specific object or person in an image.This technology may be effective in identifying the same person within one image,but it has limitations in handling multiple images owing to the difficulty in identifying whether the object appearing in other images is the same.When tracking the same object using two or more images,there must be a way to determine that objects existing in different images are the same object.Therefore,this paper attempts to determine the same object present in different images using color information among the unique information of the object.Thus,this study proposes a multiple-object-tracking method using histogram stamp extraction in closed-circuit television applications.The proposed method determines the presence or absence of a target object in an image by comparing the similarity between the image containing the target object and other images.To this end,a unique color value of the target object is extracted based on its color distribution in the image using three methods:mean,mode,and interquartile range.The Top-N accuracy method is used to analyze the accuracy of each method,and the results show that the mean method had an accuracy of 93.5%(Top-2).Furthermore,the positive prediction value experimental results show that the accuracy of the mean method was 65.7%.As a result of the analysis,it is possible to detect and track the same object present in different images using the unique color of the object.Through the results,it is possible to track the same object that can minimize manpower without using personal information when detecting objects in different images.In the last response speed experiment,it was shown that when the mean was used,the color extraction of the object was possible in real time with 0.016954 s.Through this,it is possible to detect and track the same object in real time when using the proposed method.
文摘This research introduces a challenge in integrating and cleaning the data,which is a crucial task in object matching.While the object is detected and then measured,the vibration at different light intensities may influence the durability and reliability of mechanical systems or structures and cause problems such as damage,abnormal stopping,and disaster.Recent research failed to improve the accuracy rate and the computation time in tracking an object and in the vibration measurement.To solve all these problems,this proposed research simplifies the scaling factor determination by assigning a known real-world dimension to a predetermined portion of the image.A novel white color sticker of the known dimensions marked with a color dot is pasted on the surface of an object for the best result in the template matching using the Improved Up-Sampled Cross-Correlation(UCC)algorithm.The vibration measurement is calculated using the Finite-Difference Algorithm(FDA),a machine vision systemfitted with a macro lens sensor that is capable of capturing the image at a closer range,which does not affect the quality of displacement measurement from the video frames.Thefield test was conducted on the TAFE(Tractors and Farm Equipment Limited)tractor parts,and the percentage of error was recorded between 30%and 50%at very low vibration values close to zero,whereas it was recorded between 5%and 10%error in most high-accelerations,the essential range for vibration analysis.Finally,the suggested system is more suitable for measuring the vibration of stationary machinery having low frequency ranges.The use of a macro lens enables to capture of image frames at very close-ups.A 30%to 50%error percentage has been reported when the vibration amplitude is very small.Therefore,this study is not suitable for Nano vibration analysis.
基金funded by the Basic Science Major Foundation(Natural Science)of the Jiangsu Higher Education Institutions of China(Grant:22KJA520012)the Xuzhou Science and Technology Plan Project(Grant:KC21303,KC22305)the sixth“333 project”of Jiangsu Province.
文摘Aiming at the problem that a single correlation filter model is sensitive to complex scenes such as background interference and occlusion,a tracking algorithm based on multi-time-space perception and instance-specific proposals is proposed to optimize the mathematical model of the correlation filter(CF).Firstly,according to the consistency of the changes between the object frames and the filter frames,the mask matrix is introduced into the objective function of the filter,so as to extract the spatio-temporal information of the object with background awareness.Secondly,the object function of multi-feature fusion is constructed for the object location,which is optimized by the Lagrange method and solved by closed iteration.In the process of filter optimization,the constraints term of time-space perception is designed to enhance the learning ability of the CF to optimize the final track-ing results.Finally,when the tracking results fluctuate,the boundary suppres-sion factor is introduced into the instance-specific proposals to reduce the risk of model drift effectively.The accuracy and success rate of the proposed algorithm are verified by simulation analysis on two popular benchmarks,the object tracking benchmark 2015(OTB2015)and the temple color 128(TC-128).Extensive experimental results illustrate that the optimized appearance model of the proposed algorithm is effective.The distance precision rate and overlap success rate of the proposed algorithm are 0.756 and 0.656 on the OTB2015 benchmark,which are better than the results of other competing algorithms.The results of this study can solve the problem of real-time object tracking in the real traffic environment and provide a specific reference for the detection of traffic abnormalities.
文摘The field of object tracking has recently made significant progress.Particularly,the performance results in both deep learning and correlation filters,based trackers achieved effective tracking performance.Moreover,there are still some difficulties with object tracking for example illumination and deformation(DEF).The precision and accuracy of tracking algorithms suffer from the effects of such occurrences.For this situation,finding a solution is important.This research proposes a new tracking algorithm to handle this problem.The features are extracted by using Modified LeNet-5,and the precision and accuracy are improved by developing the Real-Time Cross-modality Correlation Filtering method(RCCF).In Modified LeNet-5,the visual tracking performance is improved by adjusting the number and size of the convolution kernels in the pooling and convolution layers.The high-level,middle-level,and handcraft features are extracted from the modified LeNet-5 network.The handcraft features are used to determine the specific location of the target because the handcraft features contain more spatial information regarding the visual object.The LeNet features are more suitable for a target appearance change in object tracking.Extensive experiments were conducted by the Object Tracking Benchmarking(OTB)databases like OTB50 and OTB100.The experimental results reveal that the proposed tracker outperforms other state-of-the-art trackers under different problems.The experimental simulation is carried out in python.The overall success rate and precision of the proposed algorithm are 93.8%and 92.5%.The average running frame rate reaches 42 frames per second,which can meet the real-time requirements.
文摘An improved estimation of motion vectors of feature points is proposed for tracking moving objects of dynamic image sequence. Feature points are firstly extracted by the improved minimum intensity change (MIC) algorithm. The matching points of these feature points are then determined by adaptive rood pattern searching. Based on the random sample consensus (RANSAC) method, the background motion is finally compensated by the parameters of an affine transform of the background motion. With reasonable morphological filtering, the moving objects are completely extracted from the background, and then tracked accurately. Experimental results show that the improved method is successful on the motion background compensation and offers great promise in tracking moving objects of the dynamic image sequence.
基金supported in part by the Institute for Guo Qiang of Tsinghua University(2019GQG1023)in part by Graduate Education and Teaching Reform Project of Tsinghua University(202007J007)+1 种基金in part by National Natural Science Foundation of China(U19B2029,62073028,61803222)in part by the Independent Research Program of Tsinghua University(2018Z05JDX002)。
文摘There are two main trends in the development of unmanned aerial vehicle(UAV)technologies:miniaturization and intellectualization,in which realizing object tracking capabilities for a nano-scale UAV is one of the most challenging problems.In this paper,we present a visual object tracking and servoing control system utilizing a tailor-made 38 g nano-scale quadrotor.A lightweight visual module is integrated to enable object tracking capabilities,and a micro positioning deck is mounted to provide accurate pose estimation.In order to be robust against object appearance variations,a novel object tracking algorithm,denoted by RMCTer,is proposed,which integrates a powerful short-term tracking module and an efficient long-term processing module.In particular,the long-term processing module can provide additional object information and modify the short-term tracking model in a timely manner.Furthermore,a positionbased visual servoing control method is proposed for the quadrotor,where an adaptive tracking controller is designed by leveraging backstepping and adaptive techniques.Stable and accurate object tracking is achieved even under disturbances.Experimental results are presented to demonstrate the high accuracy and stability of the whole tracking system.
基金supported by National Basic Research Program of China (973 Program) (No. 2006CB300407)National Natural Science Foundation of China (No. 50775017)
文摘Inspired by human behaviors, a robot object tracking model is proposed on the basis of visual attention mechanism, which is fit for the theory of topological perception. The model integrates the image-driven, bottom-up attention and the object-driven, top-down attention, whereas the previous attention model has mostly focused on either the bottom-up or top-down attention. By the bottom-up component, the whole scene is segmented into the ground region and the salient regions. Guided by top-down strategy which is achieved by a topological graph, the object regions are separated from the salient regions. The salient regions except the object regions are the barrier regions. In order to estimate the model, a mobile robot platform is developed, on which some experiments are implemented. The experimental results indicate that processing an image with a resolution of 752 × 480 pixels takes less than 200 ms and the object regions are unabridged. The analysis obtained by comparing the proposed model with the existing model demonstrates that the proposed model has some advantages in robot object tracking in terms of speed and efficiency.
基金supported by the Zhejiang Key Laboratory of General Aviation Operation Technology(No.JDGA2020-7)the National Natural Science Foundation of China(No.62173237)+3 种基金the Natural Science Foundation of Liaoning Province(No.2019-MS-251)the Talent Project of Revitalization Liaoning Province(No.XLYC1907022)the Key R&D Projects of Liaoning Province(No.2020JH2/10100045)the High-Level Innovation Talent Project of Shenyang(No.RC190030).
文摘Single object tracking based on deep learning has achieved the advanced performance in many applications of computer vision.However,the existing trackers have certain limitations owing to deformation,occlusion,movement and some other conditions.We propose a siamese attentional dense network called SiamADN in an end-to-end offline manner,especially aiming at unmanned aerial vehicle(UAV)tracking.First,it applies a dense network to reduce vanishing-gradient,which strengthens the features transfer.Second,the channel attention mechanism is involved into the Densenet structure,in order to focus on the possible key regions.The advance corner detection network is introduced to improve the following tracking process.Extensive experiments are carried out on four mainly tracking benchmarks as OTB-2015,UAV123,LaSOT and VOT.The accuracy rate on UAV123 is 78.9%,and the running speed is 32 frame per second(FPS),which demonstrates its efficiency in the practical real application.
基金supported by National Natural Science Foundationof China (Grant No. 51874300)the National Natural Science Foundation of China andShanxi Provincial People’s Government Jointly Funded Project of China for Coal Baseand Low Carbon (Grant No. U1510115)+2 种基金National Natural Science Foundation of China(51104157)the Qing Lan Project, the China Postdoctoral Science Foundation (Grant No.2013T60574)the Scientific Instrument Developing Project of the Chinese Academy ofSciences (Grant No. YJKYYQ20170074).
文摘Video object tracking is an important research topic of computer vision, whichfinds a wide range of applications in video surveillance, robotics, human-computerinteraction and so on. Although many moving object tracking algorithms have beenproposed, there are still many difficulties in the actual tracking process, such asillumination change, occlusion, motion blurring, scale change, self-change and so on.Therefore, the development of object tracking technology is still challenging. Theemergence of deep learning theory and method provides a new opportunity for theresearch of object tracking, and it is also the main theoretical framework for the researchof moving object tracking algorithm in this paper. In this paper, the existing deeptracking-based target tracking algorithms are classified and sorted out. Based on theprevious knowledge and my own understanding, several solutions are proposed for theexisting methods. In addition, the existing deep learning target tracking method is stilldifficult to meet the requirements of real-time, how to design the network and trackingprocess to achieve speed and effect improvement, there is still a lot of research space.
文摘On accomplishing an efficacious object tracking,the activity of an object concerned becomes notified in a forthright manner.An accurate form of object tracking task necessitates a robust object tracking procedures irrespective of hardware assistance.Such approaches inferred a vast computational complexity to track an object with high accuracy in a stipulated amount of processing time.On the other hand,the tracking gets affected owing to the existence of varied quality diminishing factors such as occlusion,illumination changes,shadows etc.,In order to rectify all these inadequacies in tracking an object,a novel background normalization procedure articulated on the basis of a textural pattern is proposed in this paper.After preprocessing an acquired image,employment of an Environmental Succession Prediction algorithm for discriminating disparate background environment by background clustering approach have been accomplished.Afterward,abstract textural characterizations through utilization of a Probability based Gradient Pattern(PGP)approach for recognizing the similarity between patterns obtained so far.Comparison between standardized frame obtained in prior and those processed patterns detects the motion exposed by an object and the object concerned gets identified within a blob.Hence,the system is resistant towards illumination variations.These illumination variation was interpreted in object tracking residing within a dynamic background.Devised approach certainly outperforms other object tracking methodologies like Group Target Tracking(GTT),Vi PER-GT,grabcut,snakes in terms of accuracy and average time.Proposed PGP-based pattern texture analysis is compared with Gamifying Video Object(GVO)approach and hence,it evidently outperforms in terms of precision,recall and F1 measure.
基金supported by the NSC under Grant No.NSC101-2221-E-259-032-MY3
文摘In this paper, a novel object tracking based on a particle filter and speeded up robust feature (SURF) is proposed, which uses both color and SURF features. The SURF feature makes the tracking result more robust. On the other hand, the particle selection can lead to save time. In addition, we also consider the matched particle applicable to calculating the SURF weight. Owing to the color, spatial, and SURF features being adopted, this method is more robust than the traditional color-based appearance model. Experimental results demonstrate the robustness and accurate tracking results with challenging sequences. Besides, the proposed method outperforms other methods during the intersection of similar color and object's partial occlusion.
基金Supported by the National Natural Science Foundation of China(61701029)
文摘Object tracking with abrupt motion is an important research topic and has attracted wide attention.To obtain accurate tracking results,an improved particle filter tracking algorithm based on sparse representation and nonlinear resampling is proposed in this paper. First,the sparse representation is used to compute particle weights by considering the fact that the weights are sparse when the object moves abruptly,so the potential object region can be predicted more precisely. Then,a nonlinear resampling process is proposed by utilizing the nonlinear sorting strategy,which can solve the problem of particle diversity impoverishment caused by traditional resampling methods. Experimental results based on videos containing objects with various abrupt motions have demonstrated the effectiveness of the proposed algorithm.
基金Supported by the National Natural Science Foundation of China (No. 60677040)
文摘This paper presents a robust object tracking approach via a spatially constrained colour model. Local image patches of the object and spatial relation between these patches are informative and stable during object tracking. So, we propose to partition an object into patches and develop a Spatially Constrained Colour Model (SCCM) by combining the colour distributions and spatial configuration of these patches. The likelihood of the candidate object is given by estimating the confidences of the pixels in the candidate object region. The appearance model is learnt from the first frame and the tracking is carried out by particle filter. The experimental results show that the proposed tracking approach can accurately track the object with scale changes, pose variance and partial occlusion.
基金This project is supported by Program for New Century Excellent Talents in University, China(No.NCET-04-0545)National Defense Science Foundation of China(No.51416070104JW0404)
文摘In YCbCr colorspace, a method is proposed to reconstruct the background and extract moving objects based on the Gaussian model of chroma components. Background model is updated according to changes of chroma components. In order to eliminate the disturbance of shadow, a shadow detecting principle is proposed in YCbCr colorspace. A Kalman filter is introduced to estimate objects' positions in the image and then the pedestrian is tracked according to its information of shape. Experiments show that the background reconstruction and updating are successful, object extraction and shadow suppression are satisfactory, and real-time and reliable tracking is realized.
基金supported in part by the National Natural Science Foundation of China under Grants 61972056 and 61901061the Science Fund for Creative Research Groups of Hunan Province under Grant 2020JJ1006+3 种基金the Natural Science Foundation of Hunan Province under Grant 2020JJ5603the Postgraduate Training Innovation Base Construction Project of Hunan Province under Grant 2019-248-51the Basic Research Fund of Zhongye Changtian International Engineering Co.,Ltd.under Grant 2020JCYJ07the Scientific Research Fund of Hunan Provincial Education Department under Grant 19C0031.
文摘Recently,Siamese-based trackers have achieved excellent performance in object tracking.However,the high speed and deformation of objects in the movement process make tracking difficult.Therefore,we have incorporated cascaded region-proposal-network(RPN)fusion and coordinate attention into Siamese trackers.The proposed network framework consists of three parts:a feature-extraction sub-network,coordinate attention block,and cascaded RPN block.We exploit the coordinate attention block,which can embed location information into channel attention,to establish long-term spatial location dependence while maintaining channel associations.Thus,the features of different layers are enhanced by the coordinate attention block.We then send these features separately into the cascaded RPN for classification and regression.According to the two classification and regression results,the final position of the target is obtained.To verify the effectiveness of the proposed method,we conducted comprehensive experiments on the OTB100,VOT2016,UAV123,and GOT-10k datasets.Compared with other state-of-the-art trackers,the proposed tracker achieved good performance and can run at real-time speed.
基金Supported by National Natural Science Foundation of China(No.30300088).
文摘Mean shift,an iterative procedure that shifts each data point to the average of data points in its neighborhood,has been applied to object tracker.However,the traditional mean shift tracker by isotropic kernel often loses the object with the changing object structure in video sequences,especially when the object structure varies fast.This paper proposes a non-rigid object tracker by anisotropic kernel mean shift in which the shape,scale,and orientation of the kernels adapt to the changing object structure.The experimental results show that the new tracker is self-adaptive and approximately twice faster than the traditional tracker,which ensures the robustness and real time of tracking.