The number of films is numerous and the film contents are complex over the Internet and multimedia sources. It is time consuming for a viewer to select a favorite film. This paper presents an automatic recognition sys...The number of films is numerous and the film contents are complex over the Internet and multimedia sources. It is time consuming for a viewer to select a favorite film. This paper presents an automatic recognition system of film types. Initially, a film is firstly sampled as frame sequences. The color space, including hue, saturation,and brightness value(HSV), is analyzed for each sampled frame by computing the deviation and mean of HSV for each film. These features are utilized as inputs to a deep-learning neural network(DNN) for the recognition of film types. One hundred films are utilized to train and validate the model parameters of DNN. In the testing phase, a film is recognized as one of the five categories, including action, comedy, horror thriller, romance, and science fiction, by the trained DNN. The experimental results reveal that the film types can be effectively recognized by the proposed approach, enabling the viewer to select an interesting film accurately and quickly.展开更多
Two lines of image representation based on multiple features fusion demonstrate excellent performance in image retrieval.However,there are some problems in both of them:1)the methods defining directly texture in color...Two lines of image representation based on multiple features fusion demonstrate excellent performance in image retrieval.However,there are some problems in both of them:1)the methods defining directly texture in color space put more emphasis on color than texture feature;2)the methods extract several features respectively and combine them into a vector,in which bad features may lead to worse performance after combining directly good and bad features.To address the problems above,a novel hybrid framework for color image retrieval through combination of local and global features achieves higher retrieval precision.The bag-of-visual words(BoW)models and color intensity-based local difference patterns(CILDP)are exploited to capture local and global features of an image.The proposed fusion framework combines the ranking results of BoW and CILDP through graph-based density method.The performance of our proposed framework in terms of average precision on Corel-1K database is86.26%,and it improves the average precision by approximately6.68%and12.53%over CILDP and BoW,respectively.Extensive experiments on different databases demonstrate the effectiveness of the proposed framework for image retrieval.展开更多
This paper presents an adaptive method of objects and shadows detection in video streams. Models of background are firstly set up and adaptively updated in Hue Saturation Intensity (HSI) color space to detect motion r...This paper presents an adaptive method of objects and shadows detection in video streams. Models of background are firstly set up and adaptively updated in Hue Saturation Intensity (HSI) color space to detect motion regions. Then, detection errors are dealt with by motion continuity and velocity consistency. Finally, cast shadows are removed by the generic properties of luminance, chrominance and gradient density. Experimental results and their evaluation are presented to verify the effectiveness of this new method.展开更多
An adaptive background model based on max-imum statistical probability and a shadow suppression scheme for indoor and outdoor people detection by exploiting hue saturation value(HSV)color information is proposed.To ob...An adaptive background model based on max-imum statistical probability and a shadow suppression scheme for indoor and outdoor people detection by exploiting hue saturation value(HSV)color information is proposed.To obtain the initial background scene,the frequency of R,G,and B component values for each pixel at the same position in the learning sequence are respec-tively calculated;the R,G,and B component values with the biggest ratios are incorporated to model the initial background.The background maintenance,or the so-called background re-initiation,is also proposed to adapt to scene changes such as illumination changes and scene geometry changes.Moving cast shadows generally exhibit a challenge for accurate moving target detection.Based on the observation that a shadow cast on a background region lowers its brightness but does not change its chro-maticity significantly,we address this problem in the ar-ticle by exploiting HSV color information.In addition,quantitative metrics is introduced to evaluate the algo-rithm on a benchmark suite of indoor and outdoor video sequences.The experimental results are given to show the performance of the algorithm.展开更多
基金supported by MOST under Grant No.MOST 104-2221-E-468-007。
文摘The number of films is numerous and the film contents are complex over the Internet and multimedia sources. It is time consuming for a viewer to select a favorite film. This paper presents an automatic recognition system of film types. Initially, a film is firstly sampled as frame sequences. The color space, including hue, saturation,and brightness value(HSV), is analyzed for each sampled frame by computing the deviation and mean of HSV for each film. These features are utilized as inputs to a deep-learning neural network(DNN) for the recognition of film types. One hundred films are utilized to train and validate the model parameters of DNN. In the testing phase, a film is recognized as one of the five categories, including action, comedy, horror thriller, romance, and science fiction, by the trained DNN. The experimental results reveal that the film types can be effectively recognized by the proposed approach, enabling the viewer to select an interesting film accurately and quickly.
基金Projects(61370200,61672130,61602082) supported by the National Natural Science Foundation of ChinaProject(1721203049-1) supported by the Science and Technology Research and Development Plan Project of Handan,Hebei Province,China
文摘Two lines of image representation based on multiple features fusion demonstrate excellent performance in image retrieval.However,there are some problems in both of them:1)the methods defining directly texture in color space put more emphasis on color than texture feature;2)the methods extract several features respectively and combine them into a vector,in which bad features may lead to worse performance after combining directly good and bad features.To address the problems above,a novel hybrid framework for color image retrieval through combination of local and global features achieves higher retrieval precision.The bag-of-visual words(BoW)models and color intensity-based local difference patterns(CILDP)are exploited to capture local and global features of an image.The proposed fusion framework combines the ranking results of BoW and CILDP through graph-based density method.The performance of our proposed framework in terms of average precision on Corel-1K database is86.26%,and it improves the average precision by approximately6.68%and12.53%over CILDP and BoW,respectively.Extensive experiments on different databases demonstrate the effectiveness of the proposed framework for image retrieval.
基金the National Natural Science Foundation of China (60472072)the Specialized Research Foundation for the Doctoral Program of Higher Education (20040699034)+1 种基金the Aeronautical Science Foundation of China (04I50370)the Natural Science Foundation of Shaan’xi Province (2004K05-G23).
文摘This paper presents an adaptive method of objects and shadows detection in video streams. Models of background are firstly set up and adaptively updated in Hue Saturation Intensity (HSI) color space to detect motion regions. Then, detection errors are dealt with by motion continuity and velocity consistency. Finally, cast shadows are removed by the generic properties of luminance, chrominance and gradient density. Experimental results and their evaluation are presented to verify the effectiveness of this new method.
文摘An adaptive background model based on max-imum statistical probability and a shadow suppression scheme for indoor and outdoor people detection by exploiting hue saturation value(HSV)color information is proposed.To obtain the initial background scene,the frequency of R,G,and B component values for each pixel at the same position in the learning sequence are respec-tively calculated;the R,G,and B component values with the biggest ratios are incorporated to model the initial background.The background maintenance,or the so-called background re-initiation,is also proposed to adapt to scene changes such as illumination changes and scene geometry changes.Moving cast shadows generally exhibit a challenge for accurate moving target detection.Based on the observation that a shadow cast on a background region lowers its brightness but does not change its chro-maticity significantly,we address this problem in the ar-ticle by exploiting HSV color information.In addition,quantitative metrics is introduced to evaluate the algo-rithm on a benchmark suite of indoor and outdoor video sequences.The experimental results are given to show the performance of the algorithm.