摘要
提出了一种高效获取词包模型中视觉字典容量的方法,并研究了该方法与隐狄利克雷分配模型(Latent Dirichlet Allocation,LDA)相结合情况下的场景分类性能.在用SIFT特征构建场景图像数据集特征矩阵的基础上,首先采用吸引子传播方法获取场景图像集特征矩阵的合理聚类数目族,并将其中的最小聚类数目作为视觉字典容量,进而生成视觉字典;然后利用所构建视觉字典中的单词描述场景图像训练集和测试集;最后采用LDA模型对场景图像测试集进行场景分类实验.实验结果表明,提出的方法不仅保持了较高场景分类准确率,同时显著提高了场景分类的效率.
An approach is proposed to obtain the dictionary capacity of bag of words(BoW) model efficiently, which is combined with The Latent Dirichlet Allocation (LDA) model to analyze the performance of scene category. Based on the feature matrix of scene image data sets constructed by SIFT feature, the affinity propagation method is firstly employed to obtain the clustering numbers, and to take the minimal clustering number as the visual dictionary capacity before generating a visual dictionary. Secondly, the scene training and testing sets are described by these visual words. Finally, the LDA model is employed to classify the testing data set. The experiments show that the proposed approach in this paper maintains higher accuracy of scene classification and can improve efficiency greatly.
出处
《广东工业大学学报》
CAS
2015年第4期150-154,共5页
Journal of Guangdong University of Technology
基金
广东省科技计划项目(2010A030500006)