Online social networks(OSNs)offer people the opportunity to join communities where they share a common interest or objective.This kind of community is useful for studying the human behavior,diffusion of information,an...Online social networks(OSNs)offer people the opportunity to join communities where they share a common interest or objective.This kind of community is useful for studying the human behavior,diffusion of information,and dynamics of groups.As the members of a community are always changing,an efficient solution is needed to query information in real time.This paper introduces the Follow Model to present the basic relationship between users in OSNs,and combines it with the MapReduce solution to develop new algorithms with parallel paradigms for querying.Two models for reverse relation and high-order relation of the users were implemented in the Hadoop system.Based on 75 GB message data and 26 GB relation network data from Twitter,a case study was realized using two dynamic discussion communities:#musicmonday and#beatcancer.The querying performance demonstrates that the new solution with the implementation in Hadoop significantly improves the ability to find useful information from OSNs.展开更多
基金Project supported by the Brazilian National Council for Scientific and Technological Development(CNPq)(No.304058/2010-6)
文摘Online social networks(OSNs)offer people the opportunity to join communities where they share a common interest or objective.This kind of community is useful for studying the human behavior,diffusion of information,and dynamics of groups.As the members of a community are always changing,an efficient solution is needed to query information in real time.This paper introduces the Follow Model to present the basic relationship between users in OSNs,and combines it with the MapReduce solution to develop new algorithms with parallel paradigms for querying.Two models for reverse relation and high-order relation of the users were implemented in the Hadoop system.Based on 75 GB message data and 26 GB relation network data from Twitter,a case study was realized using two dynamic discussion communities:#musicmonday and#beatcancer.The querying performance demonstrates that the new solution with the implementation in Hadoop significantly improves the ability to find useful information from OSNs.