This paper presents a novel system for violent scene detection that uses machine learning to handle visual and audio features. MKL (Multiple Kernel Learning) is applied so that the multimodal nature of videos is exploited as fully as possible. The main feature of our system is mid-level concept clustering, which is proposed and implemented in order to learn mid-level concepts implicitly. With this algorithm, the system does not need manually tagged annotations. The whole system is trained on the dataset from the MediaEval 2013 Affect Task and evaluated with the task's official metric. The obtained results outperform the best score reported for that task.
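The idea of combining visual and audio kernels can be illustrated with a minimal sketch. The abstract does not say which kernels are used or how their weights are learned, so everything below is an assumption: RBF kernels per modality, toy feature values, and a simple kernel-target alignment heuristic standing in for a full MKL optimization. The function names (`rbf_kernel`, `alignment`, `combine_kernels`) are hypothetical and do not come from the paper.

```python
import math

def rbf_kernel(X, gamma):
    """Gram matrix K[i][j] = exp(-gamma * ||x_i - x_j||^2)."""
    n = len(X)
    return [[math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(X[i], X[j])))
             for j in range(n)] for i in range(n)]

def frobenius(A, B):
    """Frobenius inner product of two equally sized matrices."""
    return sum(a * b for row_a, row_b in zip(A, B) for a, b in zip(row_a, row_b))

def alignment(K, y):
    """Kernel-target alignment between K and the ideal kernel y y^T."""
    n = len(y)
    Y = [[y[i] * y[j] for j in range(n)] for i in range(n)]
    return frobenius(K, Y) / math.sqrt(frobenius(K, K) * frobenius(Y, Y))

def combine_kernels(kernels, y):
    """Convex combination of kernels, weighted by alignment clipped at zero.

    Assumption: this heuristic replaces the weight optimization that a
    real MKL solver would perform.
    """
    scores = [max(alignment(K, y), 0.0) for K in kernels]
    total = sum(scores)
    if total == 0.0:
        # No kernel aligns with the labels: fall back to uniform weights.
        weights = [1.0 / len(kernels)] * len(kernels)
    else:
        weights = [s / total for s in scores]
    n = len(y)
    combined = [[sum(w * K[i][j] for w, K in zip(weights, kernels))
                 for j in range(n)] for i in range(n)]
    return weights, combined

# Toy data (made up for illustration): four video segments, each with a
# visual and an audio feature vector; +1 marks violent, -1 non-violent.
visual = [[0.0, 0.1], [0.1, 0.0], [1.0, 0.9], [0.9, 1.0]]
audio = [[0.5], [0.4], [0.5], [0.45]]
labels = [-1, -1, 1, 1]

kernels = [rbf_kernel(visual, gamma=1.0), rbf_kernel(audio, gamma=1.0)]
weights, combined = combine_kernels(kernels, labels)
print("kernel weights (visual, audio):", weights)
```

In this toy setup the visual features separate the two classes while the audio features barely do, so the visual kernel receives almost all of the weight. The combined Gram matrix could then be passed to any classifier that accepts a precomputed kernel.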