This paper focuses on some key problems in web community discovery and link analysis.Based on the topic-oriented technology,the characteristics of a bipartite graph are studied.An Х bipartite core set is introduced t...This paper focuses on some key problems in web community discovery and link analysis.Based on the topic-oriented technology,the characteristics of a bipartite graph are studied.An Х bipartite core set is introduced to more clearly define extracting ways.By scanning the topic subgraph to construct Х bipartite graph and then prune the graph with i and j ,an Х bipartite core set,which is also the minimum element of a community,can be found.Finally,a hierarchical clustering algorithm is applied to many Х bipartite core sets and the dendrogram of the community inner construction is obtained.The correctness of the constructing and pruning method is proved and the algorithm is designed.The typical datasets in the experiment are prepared according to the way in HITS(hyperlink-induced topic search).Ten topics and four search engines are chosen and the returned results are integrated.The modularity,which is a measure of the strength of the community structure in the social network,is used to validate the efficiency of the proposed method.The experimental results show that the proposed algorithm is effective and efficient.展开更多
基金The National Natural Science Foundation of China(No.60773216)the National High Technology Research and Development Program of China(863Program)(No.2006AA010109)+1 种基金the Natural Science Foundation of Renmin University of China(No.06XNB052)Free Exploration Project(985 Project of Renmin University of China)(No.21361231)
文摘This paper focuses on some key problems in web community discovery and link analysis.Based on the topic-oriented technology,the characteristics of a bipartite graph are studied.An Х bipartite core set is introduced to more clearly define extracting ways.By scanning the topic subgraph to construct Х bipartite graph and then prune the graph with i and j ,an Х bipartite core set,which is also the minimum element of a community,can be found.Finally,a hierarchical clustering algorithm is applied to many Х bipartite core sets and the dendrogram of the community inner construction is obtained.The correctness of the constructing and pruning method is proved and the algorithm is designed.The typical datasets in the experiment are prepared according to the way in HITS(hyperlink-induced topic search).Ten topics and four search engines are chosen and the returned results are integrated.The modularity,which is a measure of the strength of the community structure in the social network,is used to validate the efficiency of the proposed method.The experimental results show that the proposed algorithm is effective and efficient.