摘要
针对图像自动标注中底层视觉特征与高层语义之间的语义鸿沟问题,在传统字典学习的基础上,提出一种基于多标签判别字典学习的图像自动标注方法。首先,为每幅图像提取多种类型特征,将多种特征组合作为字典学习输入特征空间的输入信息;然后,设计一个标签一致性正则化项,将原始样本的标签信息融入到初始的输入特征数据中,结合标签一致性判别字典和标签一致性正则化项进行字典学习;最后,通过得到的字典和稀疏编码矩阵求解标签稀疏编向量,实现未知图像的语义标注。在Corel 5K数据集上测试其标注性能,所提标注方法平均查准率和平均查全率分别可达到35%和48%;与传统的稀疏编码方法(MSC)相比,分别提高了10个百分点和16个百分点;与距离约束稀疏/组稀疏编码方法(DCSC/DCGSC)相比,分别提高了3个百分点和14个百分点。实验结果表明,所提方法能够较好地预测未知图像的语义信息,与当前几种流行的图像标注方法进行比较,所提方法具有较好的标注性能。
Concerning the problem of semantic gap between low-level visual features and high-level semantics in automatic image annotation, based on traditional dictionary learning, a multi-label discriminative dictionary learning method was proposed to automatic image annotation. First of all, multiple types of features for each image were extracted, and a combination of a variety of features was used as input information of the input feature space to the dictionary learning. Then, a label consistency regularization term was designed to integrate the label information of the original samples into the initial input feature data, and the dictionary of label consistency and the label consistency regularization term were combined to learn the dictionary. Finally, the label sparse coding vector was obtained by the dictionary and sparse coding matrix to implement the semantic annotation for an unknown image. The performance of the annotation was tested on the Corel 5 K data set. The average precision and average recall could reach 35% and 48% respectively, compared with the traditional Sparse Coding Method( MSC), which were increased by 10 percentage points and 16 percentage points respectively, and increased by 3 percentage points and 14 percentage points respectively than the method of Distance Constraint Sparse/Group Sparse Coding( DCSC/DCGSC) for automatic image lableing. Compared with the current image annotation methods, the experimental results show the proposed method can predict the semantic information for an unknown image properly, and has better annotation performance.
作者
杨晓玲
李志清
刘雨桐
YANG Xiaoling;LI Zhiqing;LIU Yutong(Key Laboratory of Intelligent Computing & laformation Processing of Ministry of Education,Xiangtan University,Xiangtan Hunan 411100,China)
出处
《计算机应用》
CSCD
北大核心
2018年第5期1294-1298,1303,共6页
journal of Computer Applications
关键词
图像自动标注
字典学习
特征表示
稀疏编码
图像检索
automatic image annotation
dictionary learning
feature representation
sparse coding
image retrieval