摘要
针对大数据图像分类耗时长、实时性差等问题,利用云计算技术的优点,以获得理想的大数据图像分类结果为目标,提出一种基于Hadoop平台的大数据图像分类机制.首先收集大量的图像,构建图像数据库,并提取图像分类的有效特征;然后基于Hadoop平台,采用Map函数对大数据图像分类问题进行细分,用多节点并行、分布式地对子问题进行图像分类求解,得到相应的图像分类结果;最后利用Reduce函数对子问题的图像分类结果进行组合,并用VC++6.0编程实现大数据图像分类的仿真模拟测试.测试结果表明,该分类机制较好地克服了当前图像分类机制存在的弊端,大幅度缩短了图像分类的时间,分类速度可适应大数据图像分类的在线要求,且图像分类的整体效果明显优于当前其他图像分类机制.
Aiming at the problem of long time-consuming and poor real-time of large data images classification,using the advantages of cloud computing technology to obtain the ideal classification results of large data images,we proposed a large data image classification mechanism based on Hadoop platform.Firstly,a large number of images were collected,the image database was constructed,and the effective features of the image classification wereextracted.Secondly,based on the Hadoop platform,the Map function was used tosubdivide the large data image classification problems,and subproblems were classified and solved by multiple nodes parallel and distributed,and the corresponding image classification results were obtained.Finally,the Reduce function wasused to combine image classification results of subproblems,and the simulation test of large data image classification was realizedby using VC++6.0 programming.Test results show that the proposed classification mechanism can overcome the drawbacks of current image classification mechanism,greatly shorten the time of image classification,and the classification speed can adapt to online requirements of large data image classification,and the overall effect of image classification is obviously superior to the other image classification mechanisms.
作者
张睿萍
马宗梅
ZHANG Ruiping;MA Zongmei(Department of Computer Science and Technology,Zhongyuan University of Technology,Zhengzhou 450007,China)
出处
《吉林大学学报(理学版)》
CAS
CSCD
北大核心
2018年第5期1206-1212,共7页
Journal of Jilin University:Science Edition
基金
河南省科技厅项目(批准号:162102210248)
关键词
图像分类机制
特征库
实时性
分类节点
特征匹配
image classification mechanism
feature database
real-time
classification node
feature matching