Abstract: A novel Snake model that incorporates region information is proposed to detect and track moving objects. Region-based approaches are generally sensitive to illumination changes and small movements in the background, while edge-based approaches often give incorrect results for ambiguous images. Both types of information are therefore used in computing the image force. Edge-based features make the algorithm fast and robust, and region information lets the active contour energy function obtain correct results for ambiguous images. Furthermore, an automatic contour initialization method using double difference images is given to meet the requirements of video sequence tracking. A simple prediction step is also added to estimate the position of the contour, which improves the convergence speed of the active contour. Experimental results show that the computation time of the algorithm is less than 0.1 s/frame, so it can be applied to a real-time system.
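The abstract does not spell out how the double difference image is built. A minimal sketch, assuming the common formulation in which a pixel counts as moving only if it changed both between the previous and current frames and between the current and next frames, might look like the following. The function names, the threshold, and the use of a bounding box as the initial contour are illustrative assumptions, not details taken from the paper.

```python
def double_difference_mask(prev, curr, nxt, threshold):
    """Mark pixels that changed in BOTH consecutive frame differences.

    Frames are 2D lists of grayscale intensities with identical shape.
    Assumed formulation: |curr - prev| > T and |nxt - curr| > T.
    """
    rows, cols = len(curr), len(curr[0])
    return [
        [
            abs(curr[r][c] - prev[r][c]) > threshold
            and abs(nxt[r][c] - curr[r][c]) > threshold
            for c in range(cols)
        ]
        for r in range(rows)
    ]


def bounding_box(mask):
    """Return (top, left, bottom, right) of the True pixels, or None if empty.

    Hypothetical helper: the box could seed an initial contour around the object.
    """
    coords = [(r, c) for r, row in enumerate(mask) for c, v in enumerate(row) if v]
    if not coords:
        return None
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    return (min(rows), min(cols), max(rows), max(cols))
```

Combining the two differences with a logical AND suppresses the "ghost" left at the object's previous position, so the mask tends to cover the object where it is in the middle frame.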
Funding: Project (No. 2006J0017) supported by the Natural Science Foundation of Fujian Province, China
Abstract: Determining the relative three-dimensional (3D) position, orientation, and motion between two reference frames is an important problem in robotic guidance, manipulation, and assembly, as well as in other fields such as photogrammetry. A solution to the pose and motion estimation problem that uses two-dimensional (2D) intensity images from a single camera is desirable for real-time applications. The difficulty in performing this measurement is that projecting 3D object features onto 2D images is a nonlinear transformation. In this paper, the 3D transformation is modeled as a nonlinear stochastic system whose state estimate provides six-degrees-of-freedom motion and position values, using line features in the image plane as measurement inputs and dual quaternions to represent both rotation and translation in a unified notation. A filtering method called the Gaussian particle filter (GPF), based on the particle filtering concept, is presented for 3D pose and motion estimation of a moving target from monocular image sequences. The method has been implemented with simulated data, and simulation results are provided along with comparisons to the extended Kalman filter (EKF) and the unscented Kalman filter (UKF) to show the relative advantages of the GPF. The simulation results show that the GPF is a superior alternative to the EKF and UKF.
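To make the filtering idea concrete, here is a minimal sketch of one Gaussian particle filter update for a scalar state. It is a toy under stated assumptions: the paper's state is a six-degrees-of-freedom pose expressed with dual quaternions and its measurements are image line features, none of which is modeled here. The names `gpf_step`, `f`, `h`, `q_var`, and `r_var` are hypothetical, and additive Gaussian process and measurement noise is assumed.

```python
import math
import random


def gpf_step(mean, var, z, f, h, q_var, r_var, n=2000, rng=None):
    """One Gaussian particle filter step for a scalar state.

    The posterior is kept as a Gaussian (mean, var); particles are only
    used to propagate and reweight that Gaussian, so no resampling is needed.
    """
    if rng is None:
        rng = random.Random()

    # Time update: sample the current Gaussian, push samples through the
    # dynamics f with process noise, then fit a predictive Gaussian.
    propagated = [
        f(rng.gauss(mean, math.sqrt(var))) + rng.gauss(0.0, math.sqrt(q_var))
        for _ in range(n)
    ]
    pred_mean = sum(propagated) / n
    pred_var = sum((x - pred_mean) ** 2 for x in propagated) / n

    # Measurement update: sample the predictive Gaussian and weight each
    # sample by the Gaussian likelihood of the measurement z.
    samples = [rng.gauss(pred_mean, math.sqrt(pred_var)) for _ in range(n)]
    weights = [math.exp(-0.5 * (z - h(x)) ** 2 / r_var) for x in samples]
    total = sum(weights)
    if total == 0.0:
        # All weights underflowed; fall back to the prediction.
        return pred_mean, pred_var

    post_mean = sum(w * x for w, x in zip(weights, samples)) / total
    post_var = sum(w * (x - post_mean) ** 2 for w, x in zip(weights, samples)) / total
    return post_mean, post_var
```

As a sanity check, with identity dynamics and measurement (`f = h = lambda x: x`), a prior of mean 0 and variance 1, a small process noise, unit measurement noise, and a measurement of 1, the result should land near the Kalman filter answer (mean and variance both about 0.5), up to Monte Carlo error.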
Abstract: MGAC (Motion Geometric Active Contours), a new variational framework of geometric active contours for tracking multiple nonrigid moving objects against a cluttered background in image sequences, is presented. The framework incorporates motion edge information and consists of a motion detection stage and a tracking stage. In the motion detection stage, the motion edge map provides an approximate edge map of the moving objects. The tracking stage, which uses only static edge information, then refines the motion detection result. A force field regularization method is used to extend the capture range of the edge attraction force field in both stages. Experiments demonstrate that the proposed framework is valid for tracking multiple nonrigid objects against a cluttered background.
Abstract: A novel method of Pedestrian Tracking using Support Vector (PTSV), proposed for a video surveillance instrument, integrates a Support Vector Machine (SVM) classifier into an optical-flow-based tracker. The traditional optical flow method tracks objects by minimizing an intensity difference function between successive frames, while PTSV tracks objects by maximizing the SVM classification score. Because the SVM classifier for object and non-object is pre-trained, it is only necessary to classify an image block as object or non-object, without comparing it to the pixel region of the tracked object in the previous frame. To account for large motions between successive frames, pyramids are built from the support vectors and a coarse-to-fine scan is used in the classification stage. To accelerate SVM training, the Sequential Minimal Optimization (SMO) method is adopted. Results of using a kernel-PTSV for pedestrian tracking in real-time video are shown at the end. Comparative experimental results show that PTSV improves tracking reliability compared with the traditional optical flow tracking method.
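The idea of tracking by maximizing a classification score with a coarse-to-fine scan can be sketched as below. This is an illustration only, not the PTSV algorithm: a progressively finer search stride over candidate positions stands in for the paper's pyramids built from support vectors, and `score` is a hypothetical callable that would wrap a pre-trained classifier's decision value for the image block at `(x, y)`.

```python
def coarse_to_fine_search(score, center, radius=8, steps=(4, 2, 1)):
    """Find the position with the highest score near `center`.

    The first pass scans the whole window with the coarsest stride;
    each later pass scans only the immediate neighbourhood of the
    current best position with a finer stride.
    """
    best = center
    best_score = score(*center)
    for i, step in enumerate(steps):
        cx, cy = best
        reach = radius if i == 0 else step  # shrink the window after pass one
        for dx in range(-reach, reach + 1, step):
            for dy in range(-reach, reach + 1, step):
                cand = (cx + dx, cy + dy)
                s = score(*cand)
                if s > best_score:
                    best, best_score = cand, s
    return best, best_score
```

Because each pass evaluates only a small grid, far fewer blocks are classified than in an exhaustive scan of the window, at the cost of possibly missing a narrow peak that falls between coarse grid points.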
Funding: Project supported by the National Natural Science Foundation of China (No. 61571390) and the Fundamental Research Funds for the Central Universities, China (No. 2016QNA5004)
Abstract: Robust object tracking has been an important and challenging research area in computer vision for decades. With the increasing popularity of affordable depth sensors, range data is widely used in visual tracking because it provides robustness to varying illumination and occlusions. In this paper, a novel tracker based on RGBD data and sparse learning is proposed. The range data is integrated into the sparse learning framework in three respects. First, an extra depth view is added to the color-image-based visual features as an independent view for robust appearance modeling. Then, a special occlusion template set is designed to supplement the existing dictionary for handling various occlusion conditions. Finally, a depth-based occlusion detection method is proposed to efficiently determine an accurate time for the template update. Extensive experiments on both the KITTI and Princeton data sets demonstrate that the proposed tracker outperforms state-of-the-art tracking algorithms, including both sparse learning and RGBD based methods.