摘要
Multi-way join is critical for many big data applications such as data mining and knowledge discovery. Even though lots of research have been devoted to processing multi-way joins using MapReduce, there are still several problems in general to be further improved, such as transferring numerous unpromising intermediate data and lacking of better coordination mechanisms. This work proposes an efficient multi-way joins processing model using MapReduce, named Sharing-Coordination-MapReduce (SC-MapReduce), which has the functions of sharing and coordination. Our SC-MapReduce model can filter the unpromising intermediatedata largely by using the sharing mechanism and optimize the multiple tasks coordination of multi-way joins. Extensive experiments show that the proposed model is efficient, robust and scalable.
出处
《国际计算机前沿大会会议论文集》
2015年第1期23-24,共2页
International Conference of Pioneering Computer Scientists, Engineers and Educators(ICPCSEE)
基金
This work was supported by the National Natural Science Foundation of China under Grant No.60873068,61472169
the Program for Excellent Talents in Liaoning Province under Grant No.LR201017.