This paper presents a novel formulation for detecting objects with articulated rigid bodies from highresolution monitoring images, particularly engineering vehicles. There are many pixels in high-resolution monitoring...This paper presents a novel formulation for detecting objects with articulated rigid bodies from highresolution monitoring images, particularly engineering vehicles. There are many pixels in high-resolution monitoring images, and most of them represent the background. Our method first detects ob ject patches from monitoring images using a coarse detection process. In this phase, we build a descriptor based on histograms of oriented gradient, which contain color frequency information. Then we use a linear support vector machine to rapidly detect many image patches that may contain ob ject parts, with a low false negative rate and a high false positive rate. In the second phase, we apply a refinement classification to determine the patches that actually contain ob jects. In this stage, we increase the size of the image patches so that they include the complete ob ject using models of the ob ject parts.Then an accelerated and improved salient mask is used to improve the performance of the dense scale-invariant feature transform descriptor. The detection process returns the absolute position of positive ob jects in the original images. We have applied our methods to three datasets to demonstrate their effectiveness.展开更多
A method was developed to detect generic objects using a single query image. The query image could be a typical real image, a virtual image, or even a hand-drawn sketch of the object. Without a training process, the k...A method was developed to detect generic objects using a single query image. The query image could be a typical real image, a virtual image, or even a hand-drawn sketch of the object. Without a training process, the key problem is how to describe the object class from only one query image with no pre-segmentation or other pre-processing procedures. The method introduces densely computed Scale-lnvariant Feature Transform (SIFT) as the descriptor to extract "gradient distribution" features of the image. The descriptor emphasizes the edge parts and their distribution structures, which are very representative of the object class, so it is very robust and can deal with virtual images or hand-drawn sketches. Tests on car detection, face detection, and generic object detection demonstrate that the method is effective, robust, and widely applicable. The results using queries of real images compare well with other training-free methods and state-of-the-art training-based methods.展开更多
基金supported by the China Knowledge Centre for Engineering Sciences and Technology(No.CKCEST-2014-1-2)the Zhejiang Provincial Natural Science Foundation of China(No.LY14F020027)the National Natural Science Foundation of China(No.61272304)
文摘This paper presents a novel formulation for detecting objects with articulated rigid bodies from highresolution monitoring images, particularly engineering vehicles. There are many pixels in high-resolution monitoring images, and most of them represent the background. Our method first detects ob ject patches from monitoring images using a coarse detection process. In this phase, we build a descriptor based on histograms of oriented gradient, which contain color frequency information. Then we use a linear support vector machine to rapidly detect many image patches that may contain ob ject parts, with a low false negative rate and a high false positive rate. In the second phase, we apply a refinement classification to determine the patches that actually contain ob jects. In this stage, we increase the size of the image patches so that they include the complete ob ject using models of the ob ject parts.Then an accelerated and improved salient mask is used to improve the performance of the dense scale-invariant feature transform descriptor. The detection process returns the absolute position of positive ob jects in the original images. We have applied our methods to three datasets to demonstrate their effectiveness.
基金Supported by the National Key Basic Research and Development (973) Program of China (No.2007CB311004)
文摘A method was developed to detect generic objects using a single query image. The query image could be a typical real image, a virtual image, or even a hand-drawn sketch of the object. Without a training process, the key problem is how to describe the object class from only one query image with no pre-segmentation or other pre-processing procedures. The method introduces densely computed Scale-lnvariant Feature Transform (SIFT) as the descriptor to extract "gradient distribution" features of the image. The descriptor emphasizes the edge parts and their distribution structures, which are very representative of the object class, so it is very robust and can deal with virtual images or hand-drawn sketches. Tests on car detection, face detection, and generic object detection demonstrate that the method is effective, robust, and widely applicable. The results using queries of real images compare well with other training-free methods and state-of-the-art training-based methods.