摘要
在视觉分析中,人的同一动作在不同场景下会有截然不同的理解。为了判断在不同场景中行为是否为异常,在监控系统中使用双层词包模型来解决这个问题。把视频信息放在第1层包中,把场景动作文本词放在第2层包中。视频由一系列时空兴趣点组成的时空词典表示,动作性质由在指定场景下的动作文本词集合来确定。使用潜在语义分析概率模型(pLSA)不但能自动学习时空词的概率分布,找到与之对应的动作类别,也能在监督情况下学习在规定场景下运动文本词概率分布并区分出对应异常或正常行动结果。经过训练学习后,该算法可以识别新视频在相应场景下行为的异常或正常。
Different understanding results in different scenarios even for the same person to conduct visual analysis. In order to determine whether the behavior is abnormal in different scenarios, a double-layer bag-of-words model is proposed to solve the problem in surveillance system. The video information is processed in the first layer of Bag-of-Words, and the information of scenario-action text words is included in the second one. A video sequence is represented as a collection of spatial-temporal codebook by extracting space-time interest points. A behavior characteristic is represented as a collection of behavior text words in special scenarios. Probabilistic latent semantic analysis (pLSA) model is adopted to automatically learn the probability distributions of spatial-temporal words and the topics correspond to human action categories. PLSA also can learn the probability distributions of the motion text words in a scenario with supervisor and the topics correspond to anomalous or normal actions. The algorithm can categorize the human anomalous or normal action contained in the special occasion to a novel video sequence after being trained.
出处
《中国图象图形学报》
CSCD
北大核心
2009年第10期2097-2101,共5页
Journal of Image and Graphics
基金
广东省教育部产学研结合项目(2006D90704017)
河北省教育厅科研项目(Z2009127)
关键词
异常行为
兴趣点
双层词包
潜在含语义分析概率
可变场所
abnormal action, interest points, double-layer bag-of-words, probabilistic latent semantic analysis, variable scenarios