Funding: Supported by the National Natural Science Foundation of China (Nos. 61271432 and 61333016).
Abstract: In this paper, we focus on low-resolution human detection for outdoor video surveillance and propose a human detection method based on heterogeneous feature fusion using partial least squares-canonical correlation analysis (PLS-CCA). The proposed method not only explores the relation between two individual heterogeneous features as fully as possible, but also robustly describes the visual appearance of humans using complementary information. Experimental results show that, compared with several other methods, the proposed method is effective, achieves high accuracy, precision, recall, and area under the curve (AUC) at the same time, and offers discriminative and stable recognition performance.
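As a rough illustration of the correlation-based fusion idea in this abstract, the sketch below fits plain canonical correlation analysis (CCA) to two feature sets and concatenates the projected features. It is not the paper's PLS-CCA, whose details the abstract does not give; the function names `cca_fit` and `cca_fuse`, the ridge term `reg`, and the synthetic data are all assumptions made for this example.

```python
import numpy as np

def cca_fit(X, Y, n_components, reg=1e-6):
    """Plain CCA via whitening + SVD (an illustrative stand-in, not PLS-CCA).

    X: (n_samples, p) first feature set; Y: (n_samples, q) second feature set.
    reg is a small ridge term (assumed) that keeps the covariances invertible.
    """
    mean_x, mean_y = X.mean(axis=0), Y.mean(axis=0)
    Xc, Yc = X - mean_x, Y - mean_y
    n = X.shape[0]
    Sxx = Xc.T @ Xc / (n - 1) + reg * np.eye(X.shape[1])
    Syy = Yc.T @ Yc / (n - 1) + reg * np.eye(Y.shape[1])
    Sxy = Xc.T @ Yc / (n - 1)

    def inv_sqrt(S):
        # Inverse matrix square root of a symmetric positive-definite matrix.
        w, V = np.linalg.eigh(S)
        return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

    Kx, Ky = inv_sqrt(Sxx), inv_sqrt(Syy)
    # Singular values of the whitened cross-covariance are the canonical correlations.
    U, s, Vt = np.linalg.svd(Kx @ Sxy @ Ky)
    Wx = Kx @ U[:, :n_components]
    Wy = Ky @ Vt.T[:, :n_components]
    return mean_x, mean_y, Wx, Wy, s[:n_components]

def cca_fuse(X, Y, model):
    """Project both feature sets and concatenate them into one fused descriptor."""
    mean_x, mean_y, Wx, Wy, _ = model
    return np.hstack([(X - mean_x) @ Wx, (Y - mean_y) @ Wy])

# Synthetic stand-ins for two heterogeneous descriptors sharing a latent signal.
rng = np.random.default_rng(0)
z = rng.normal(size=(500, 2))
X = z @ rng.normal(size=(2, 6)) + 0.1 * rng.normal(size=(500, 6))
Y = z @ rng.normal(size=(2, 4)) + 0.1 * rng.normal(size=(500, 4))

model = cca_fit(X, Y, n_components=2)
fused = cca_fuse(X, Y, model)
print(fused.shape, model[4])
```

In a detection setting, the fused vectors would then be passed to a classifier; which classifier the authors use is not stated in the abstract.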
Funding: This work was supported in part by the National Basic Research 973 Program of China under Grant No. 2012CB316400, the National Natural Science Foundation of China under Grant Nos. 61322212 and 61450110446, the National High Technology Research and Development 863 Program of China under Grant No. 2014AA015202, and the Chinese Academy of Sciences Fellowships for Young International Scientists under Grant No. 2011Y1GB05. This work was also funded by the Lenovo Outstanding Young Scientists Program (LOYS).
Abstract: Object recognition has many applications in human-machine interaction and multimedia retrieval. However, due to large intra-class variability and inter-class similarity, accurate recognition relying only on RGB data remains a major challenge. With the recent emergence of inexpensive RGB-D devices, this challenge can be better addressed by leveraging additional depth information. A special yet important case of object recognition is hand-held object recognition, as manipulating objects with the hands is common and intuitive in human-human and human-machine interactions. In this paper, we study this problem and introduce an effective framework to address it. The framework first detects and segments the hand-held object by exploiting skeleton information combined with depth information. In the object recognition stage, it extracts heterogeneous features from different modalities and fuses them to improve recognition accuracy. In particular, we incorporate handcrafted and deep-learned features and study several multi-step fusion variants. Experimental evaluations validate the effectiveness of the proposed method.
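The abstract does not specify its fusion variants, so the following is only a minimal sketch of two generic ways to combine handcrafted and deep-learned features: feature-level (early) fusion by concatenation and score-level (late) fusion by weighted averaging. The function names, feature dimensions, class count, and the `weight` parameter are hypothetical and chosen purely for illustration.

```python
import numpy as np

def l2_normalize(F, eps=1e-12):
    """Scale each row to unit L2 norm so neither feature type dominates."""
    norms = np.linalg.norm(F, axis=1, keepdims=True)
    return F / np.maximum(norms, eps)

def early_fusion(handcrafted, deep):
    """Feature-level fusion: normalize each block, then concatenate."""
    return np.hstack([l2_normalize(handcrafted), l2_normalize(deep)])

def late_fusion(scores_a, scores_b, weight=0.5):
    """Score-level fusion: weighted average of per-class scores (weight is assumed)."""
    return weight * scores_a + (1.0 - weight) * scores_b

# Random stand-ins for real descriptors and classifier scores.
rng = np.random.default_rng(0)
handcrafted = rng.random((5, 32))   # hypothetical handcrafted descriptor
deep = rng.random((5, 128))         # hypothetical deep feature vector
fused = early_fusion(handcrafted, deep)

scores = late_fusion(rng.random((5, 10)), rng.random((5, 10)))
labels = scores.argmax(axis=1)      # predicted class index per sample
```

A multi-step scheme could chain these, for example by fusing features within each modality first and then combining scores across modalities, but that ordering is an assumption rather than something the abstract states.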