Community detection is a fundamental work to analyse the structural and functional properties of complex networks. The label propagation algorithm (LPA) is a near linear time algorithm to find a good community struc...Community detection is a fundamental work to analyse the structural and functional properties of complex networks. The label propagation algorithm (LPA) is a near linear time algorithm to find a good community structure. Despite various subsequent advances, an important issue of this algorithm has not yet been properly addressed. Random update orders within the algorithm severely hamper the stability of the identified community structure. In this paper, we executed the basic label propagation algorithm on networks multiple times, to obtain a set of consensus partitions. Based on these consensus partitions, we created a consensus weighted graph. In this consensus weighted graph, the weight value of the edge was the proportion value that the number of node pairs allocated in the same cluster was divided by the total number of partitions. Then, we introduced consensus weight to indicate the direction of label propagation. In label update steps, by computing the mixing value of consensus weight and label frequency, a node adopted the label which has the maximum mixing value instead of the most frequent one. For extending to different networks, we introduced a proportion parameter to adjust the proportion of consensus weight and label frequency in computing mixing value. Finally, we proposed an approach named the label propagation algorithm with consensus weight (LPAcw), and the experimental results showed that the LPAcw could enhance considerably both the stability and the accuracy of community partitions.展开更多
Purpose:This study focuses on understanding the collaboration relationships among mathematicians,particularly those esteemed as elites,to reveal the structures of their communities and evaluate their impact on the fie...Purpose:This study focuses on understanding the collaboration relationships among mathematicians,particularly those esteemed as elites,to reveal the structures of their communities and evaluate their impact on the field of mathematics.Design/methodology/approach:Two community detection algorithms,namely Greedy Modularity Maximization and Infomap,are utilized to examine collaboration patterns among mathematicians.We conduct a comparative analysis of mathematicians’centrality,emphasizing the influence of award-winning individuals in connecting network roles such as Betweenness,Closeness,and Harmonic centrality.Additionally,we investigate the distribution of elite mathematicians across communities and their relationships within different mathematical sub-fields.Findings:The study identifies the substantial influence exerted by award-winning mathematicians in connecting network roles.The elite distribution across the network is uneven,with a concentration within specific communities rather than being evenly dispersed.Secondly,the research identifies a positive correlation between distinct mathematical sub-fields and the communities,indicating collaborative tendencies among scientists engaged in related domains.Lastly,the study suggests that reduced research diversity within a community might lead to a higher concentration of elite scientists within that specific community.Research limitations:The study’s limitations include its narrow focus on mathematicians,which may limit the applicability of the findings to broader scientific fields.Issues with manually collected data affect the reliability of conclusions about collaborative networks.Practical implications:This study offers valuable insights into how elite mathematicians collaborate and how knowledge is disseminated within mathematical circles.Understanding these collaborative behaviors could aid in fostering better collaboration strategies among mathematicians and institutions,potentially enhancing scientific progress in mathematics.Originality/value:The study adds value to understanding collaborative dynamics within the realm of mathematics,offering a unique angle for further exploration and research.展开更多
Current methodologies for cleaning wind power anomaly data exhibit limited capabilities in identifying abnormal data within extensive datasets and struggle to accommodate the considerable variability and intricacy of ...Current methodologies for cleaning wind power anomaly data exhibit limited capabilities in identifying abnormal data within extensive datasets and struggle to accommodate the considerable variability and intricacy of wind farm data.Consequently,a method for cleaning wind power anomaly data by combining image processing with community detection algorithms(CWPAD-IPCDA)is proposed.To precisely identify and initially clean anomalous data,wind power curve(WPC)images are converted into graph structures,which employ the Louvain community recognition algorithm and graph-theoretic methods for community detection and segmentation.Furthermore,the mathematical morphology operation(MMO)determines the main part of the initially cleaned wind power curve images and maps them back to the normal wind power points to complete the final cleaning.The CWPAD-IPCDA method was applied to clean datasets from 25 wind turbines(WTs)in two wind farms in northwest China to validate its feasibility.A comparison was conducted using density-based spatial clustering of applications with noise(DBSCAN)algorithm,an improved isolation forest algorithm,and an image-based(IB)algorithm.The experimental results demonstrate that the CWPAD-IPCDA method surpasses the other three algorithms,achieving an approximately 7.23%higher average data cleaning rate.The mean value of the sum of the squared errors(SSE)of the dataset after cleaning is approximately 6.887 lower than that of the other algorithms.Moreover,the mean of overall accuracy,as measured by the F1-score,exceeds that of the other methods by approximately 10.49%;this indicates that the CWPAD-IPCDA method is more conducive to improving the accuracy and reliability of wind power curve modeling and wind farm power forecasting.展开更多
Community detection is an important methodology for understanding the intrinsic structure and function of a realworld network. In this paper, we propose an effective and efficient algorithm, called Dominant Label Prop...Community detection is an important methodology for understanding the intrinsic structure and function of a realworld network. In this paper, we propose an effective and efficient algorithm, called Dominant Label Propagation Algorithm(Abbreviated as DLPA), to detect communities in complex networks. The algorithm simulates a special voting process to detect overlapping and non-overlapping community structure in complex networks simultaneously. Our algorithm is very efficient, since its computational complexity is almost linear to the number of edges in the network. Experimental results on both real-world and synthetic networks show that our algorithm also possesses high accuracies on detecting community structure in networks.展开更多
A new genetic algorithm for community detection in complex networks was proposed. It adopts matrix encoding that enables traditional crossover between individuals. Initial populations are generated using nodes similar...A new genetic algorithm for community detection in complex networks was proposed. It adopts matrix encoding that enables traditional crossover between individuals. Initial populations are generated using nodes similarity, which enhances the diversity of initial individuals while retaining an acceptable level of accuracy, and improves the efficiency of optimal solution search. Individual crossover is based on the quality of individuals' genes; all nodes unassigned to any community are grouped into a new community, while ambiguously placed nodes are assigned to the community to which most of their neighbors belong. Individual mutation, which splits a gene into two new genes or randomly fuses it into other genes, is non-uniform. The simplicity and effectiveness of the algorithm are revealed in experimental tests using artificial random networks and real networks. The accuracy of the algorithm is superior to that of some classic algorithms, and is comparable to that of some recent high-precision algorithms.展开更多
This paper aims to effectively solve the problem of the influence maximization in social networks.For this purpose,an influence maximization method that can identify influential nodes via the community structure and t...This paper aims to effectively solve the problem of the influence maximization in social networks.For this purpose,an influence maximization method that can identify influential nodes via the community structure and the influence distribution difference is proposed.Firstly,the network embedding-based community detection approach is developed,by which the social network is divided into several high-quality communities.Secondly,the solution of influence maximization is composed of the candidate stage and the greedy stage.The candidate stage is to select candidate nodes from the interior and the boundary of each community using a heuristic algorithm,and the greedy stage is to determine seed nodes with the largest marginal influence increment from the candidate set through the sub-modular property-based Greedy algorithm.Finally,experimental results demonstrate the superiority of the proposed method compared with existing methods,from which one can further find that our work can achieve a good tradeoff between the influence spread and the running time.展开更多
Motivated by the relationship of the dynamic behaviors and network structure, in this paper, we present two efficient dynamic community detection algorithms. The phases of the nodes in the network can evolve according...Motivated by the relationship of the dynamic behaviors and network structure, in this paper, we present two efficient dynamic community detection algorithms. The phases of the nodes in the network can evolve according to our proposed differential equations. In each iteration, the phases of the nodes are controlled by several parameters. It is found that the phases of the nodes are ultimately clustered into several communities after a short period of evolution. They can be adopted to detect the communities successfully. The second differential equation can dynamically adjust several parameters, so it can obtain satisfactory detection results. Simulations on some test networks have verified the efficiency of the presented algorithms.展开更多
Overlapping community detection in a network is a challenging issue which attracts lots of attention in recent years.A notion of hesitant node(HN) is proposed. An HN contacts with multiple communities while the comm...Overlapping community detection in a network is a challenging issue which attracts lots of attention in recent years.A notion of hesitant node(HN) is proposed. An HN contacts with multiple communities while the communications are not strong or even accidental, thus the HN holds an implicit community structure.However, HNs are not rare in the real world network. It is important to identify them because they can be efficient hubs which form the overlapping portions of communities or simple attached nodes to some communities. Current approaches have difficulties in identifying and clustering HNs. A density-based rough set model(DBRSM) is proposed by combining the virtue of densitybased algorithms and rough set models. It incorporates the macro perspective of the community structure of the whole network and the micro perspective of the local information held by HNs, which would facilitate the further "growth" of HNs in community. We offer a theoretical support for this model from the point of strength of the trust path. The experiments on the real-world and synthetic datasets show the practical significance of analyzing and clustering the HNs based on DBRSM. Besides, the clustering based on DBRSM promotes the modularity optimization.展开更多
Citation network is often used for academic recommendation. However, it is difficult to achieve high recommendation accuracy and low time complexity because it is often very large and sparse and different citations ha...Citation network is often used for academic recommendation. However, it is difficult to achieve high recommendation accuracy and low time complexity because it is often very large and sparse and different citations have different purposes. What's more, some citations include unreasonable information, such as in case of intentional self-citation. To improve the accuracy of citation network-based academic recommendation and reduce the time complexity, we propose an academic recommendation method for recommending authors and papers. In which, an author-paper bilayer citation network is built, then an enhanced topic model, Author Community Topic Time Model(ACTTM) is proposed to detect high quality author communities in the author layer, and a set of attributes are proposed to comprehensively depict the author/paper nodes in the bilayer citation network. Experimental results prove that the proposed ACTTM can detect high quality author communities and facilitate low time complexity, and the proposed academic recommendation method can effectively improve the recommendation accuracy.展开更多
With the fast-growth of mobile social network, people' s interactions are frequently marked with location information, such as longitude and latitude of visited base station. This boom of data has led to considerable...With the fast-growth of mobile social network, people' s interactions are frequently marked with location information, such as longitude and latitude of visited base station. This boom of data has led to considerable interest in research fields such as user behavior mining, trajectory discovery and social demographics. However, there is little research on community discovery in mobile social networks, and this is the problem this work tackles with. In this work, we take advantage of one simple property that people in different locations often belong to different social circles in order to discover communities in these networks. Based on this property, which we referred to as Location-lnteraction Disparity (LID), we proposed a state network and then define a quality function evaluating community detection results. We also propose a hybrid community- detection algorithm using LID tor discovering location-based communities effectively and efficiently. Experiments on synthesis networks show that this algorithm can run effectively in time and discover communities with high precision. In realworld networks, the method reveals people's different social circles in different places with high efficiency.展开更多
Community detection in social networks is a hard problem because of the size,and the need of a deep understanding of network structure and functions.While several methods with significant effort in this direction have...Community detection in social networks is a hard problem because of the size,and the need of a deep understanding of network structure and functions.While several methods with significant effort in this direction have been devised,an outstanding open problem is the unknown number of communities,it is generally believed that the role of influential nodes that are surrounded by neighbors is very important.In addition,the similarity among nodes inside the same cluster is greater than among nodes from other clusters.Lately,the global and local methods of community detection have been getting more attention.Therefore,in this study,we propose an advanced communitydetection model for social networks in order to identify network communities based on global and local information.Our proposed model initially detects the most influential nodes by using an Eigen score then performs local expansion powered by label propagation.This process is conducted with the same color till nodes reach maximum similarity.Finally,the communities are formed,and a clear community graph is displayed to the user.Our proposed model is completely parameter-free,and therefore,no prior information is required,such as the number of communities,etc.We perform simulations and experiments using well-known synthetic and real network benchmarks,and compare them with well-known state-of-the-art models.The results prove that our model is efficient in all aspects,because it quickly identifies communities in the network.Moreover,it can easily be used for friendship recommendations or in business recommendation systems.展开更多
Recommender system (RS) has become a very important factor in many eCommerce sites. In our daily life, we rely on the recommendation from other persons either by word of mouth, recommendation letters, movie, item and ...Recommender system (RS) has become a very important factor in many eCommerce sites. In our daily life, we rely on the recommendation from other persons either by word of mouth, recommendation letters, movie, item and book reviews printed in newspapers, etc. The typical Recommender Systems are software tools and techniques that provide support to people by identifying interesting products and services in online store. It also provides a recommendation for certain users who search for the recommendations. The most important open challenge in Collaborative filtering recommender system is the cold start problem. If the adequate or sufficient information is not available for a new item or users, the recommender system runs into the cold start problem. To increase the usefulness of collaborative recommender systems, it could be desirable to eliminate the challenge such as cold start problem. Revealing the community structures is crucial to understand and more important with the increasing popularity of online social networks. The community detection is a key issue in social network analysis in which nodes of the communities are tightly connected each other and loosely connected between other communities. Many algorithms like Givan-Newman algorithm, modularity maximization, leading eigenvector, walk trap, etc., are used to detect the communities in the networks. To test the community division is meaningful we define a quality function called modularity. Modularity is that the links within a community are higher than the expected links in those communities. In this paper, we try to give a solution to the cold-start problem based on community detection algorithm that extracts the community from the social networks and identifies the similar users on that network. Hence, within the proposed work several intrinsic details are taken as a rule of thumb to boost the results higher. Moreover, the simulation experiment was taken to solve the cold start problem.展开更多
Purpose: This paper aims to provide a method to detect research communities based on research interest in researcher network, which combines the topological structure and vertex attributes in a unified manner.Design/m...Purpose: This paper aims to provide a method to detect research communities based on research interest in researcher network, which combines the topological structure and vertex attributes in a unified manner.Design/methodology/approach: A heterogeneous researcher network has been constructed by combining multiple relations of academic researchers. Vertex attributes and their similarities were considered and calculated. An approach has been proposed and tested to detect research community in research organizations based on this multi-relation researcher network.Findings: Detection of topologically well-connected, semantically coherent and meaningful research community was achieved.Research limitations: The sample size of evaluation experiments was relatively small. In the present study, a limited number of 72 researchers were analyzed for constructing researcher network and detecting research community. Therefore, a large sample size is required to give more information and reliable results.Practical implications: The proposed multi-relation researcher network and approaches for discovering research communities of similar research interests will contribute to collective innovation behavior such as brainstorming and to promote interdisciplinary cooperation.Originality/value: Recent researches on community detection devote most efforts to singlerelation researcher networks and put the main focus on the topological structure of networks.In reality, there exist multi-relation social networks. Vertex attribute also plays an important role in community detection. The present study combined multiple single-relational researcher networks into a multi-relational network and proposed a structure-attribute clustering method for detecting research community in research organizations.展开更多
In recent years,local community detection algorithms have developed rapidly because of their nearly linear computing time and the convenience of obtaining the local information of real-world networks.However,there are...In recent years,local community detection algorithms have developed rapidly because of their nearly linear computing time and the convenience of obtaining the local information of real-world networks.However,there are still some issues that need to be further studied.First,there is no local community detection algorithm dedicated to detecting a seed-oriented local community,that is,the local community with the seed as the core.The second and third issues are that the quality of local communities detected by the previous local community detection algorithms are largely dependent on the position of the seed and predefined parameters,respectively.To solve the existing problems,we propose a seed-oriented local community detection algorithm,named SOLCD,that is based on influence spreading.First,we propose a novel measure of node influence named k-core centrality that is based on the k-core value of adjacent nodes.Second,we obtain the seed-oriented local community,which is composed of the may-members and the must-member chain of the seed,by detecting the influence scope of the seed.The may-members and the must-members of the seed are determined by judging the influence relationship between the node and the seed.Five state-of-art algorithms are compared to SOLCD on six real-world networks and three groups of artificial networks.The experimental results show that SOLCD can achieve a high-quality seed-oriented local community for various real-world networks and artificial networks with different parameters.In addition,when taking nodes with different influence as seeds,SOLCD can stably obtain high-quality seed-oriented local communities.展开更多
Community detection in signed networks has been studied widely in recent years. In this paper, a discrete difference equation is proposed to imitate the consistently changing phases of the nodes. During the interactio...Community detection in signed networks has been studied widely in recent years. In this paper, a discrete difference equation is proposed to imitate the consistently changing phases of the nodes. During the interaction, each node will update its phase based on the difference equation. Each node has many different nodes connected with it, and these neighbors have different influences on it. The similarity between two nodes is applied to describe the influences between them. Nodes with high positive similarities will get together and nodes with negative similarities will be far away from each other.Communities are detected ultimately when the phases of the nodes are stable. Experiments on real world and synthetic signed networks show the efficiency of detection performance. Moreover, the presented method gains better detection performance than two existing good algorithms.展开更多
The statistical model for community detection is a promising research area in network analysis.Most existing statistical models of community detection are designed for networks with a known type of community structure...The statistical model for community detection is a promising research area in network analysis.Most existing statistical models of community detection are designed for networks with a known type of community structure,but in many practical situations,the types of community structures are unknown.To cope with unknown community structures,diverse types should be considered in one model.We propose a model that incorporates the latent interaction pattern,which is regarded as the basis of constructions of diverse community structures by us.The interaction pattern can parameterize various types of community structures in one model.A collapsed Gibbs sampling inference is proposed to estimate the community assignments and other hyper-parameters.With the Pitman-Yor process as a prior,our model can automatically detect the numbers and sizes of communities without a known type of community structure beforehand.Via Bayesian inference,our model can detect some hidden interaction patterns that offer extra information for network analysis.Experiments on networks with diverse community structures demonstrate that our model outperforms four state-of-the-art models.展开更多
to the capability of reflecting social perception on semantic of resources, folksonomy has been proposed to improve the social learning for education and scholar researching. However, its actual impact is significantl...to the capability of reflecting social perception on semantic of resources, folksonomy has been proposed to improve the social learning for education and scholar researching. However, its actual impact is significantly influenced by the semantic ambiguity problem of tags. So, in this paper, we proposed a novel way of detecting homonyms, one of the main sources of tag's semantic ambiguity problem, in noisy folksonomies. The study is based on two hypotheses: 1) Users having different interests tend to have different understanding of the same tag. 2) Users having similar interest tend to have common understanding of the same tag. Therefore, we firstly discover user communities according to users' interests. Then, tag contexts are discovered in subsets of folksonomy on the basis of user communities. The experimental results show that our method is effective and outperform the method finding tag contexts using all tags in folksonomy with overlapping clustering algorithm especially when various users having different interests are contained by the folksonomyo展开更多
With the increasing number of airports and the expansion of their scale,the aviation network has become complex and hierarchical.In order to investigate the complex network characteristics of aviation networks,this pa...With the increasing number of airports and the expansion of their scale,the aviation network has become complex and hierarchical.In order to investigate the complex network characteristics of aviation networks,this paper constructs a Chinese aviation network model and carries out related research based on complex network theory and K-means algorithm.Initially,the P-space model is employed to construct the Chinese aviation network model.Then,complex network indicators such as degree,clustering coefficient,average path length,betweenness and coreness are selected to investigate the complex characteristics and hierarchical features of aviation networks and explore their causes.Secondly,using K-means clustering algorithm,five values are obtained as the initial clustering parameter K values for each of the aviation network hierarchies classified according to five complex network indicators.Meanwhile,clustering simulation experiments are conducted to obtain the visual clustering results of Chinese aviation network nodes under different K values,as well as silhouette coefficients for evaluating the clustering effect of each indicator in order to obtain the hierarchical classification of aviation networks under different indicators.Finally,the silhouette coefficient is optimal when the K value is 4.Thus,the clustering results of the four layers of the aviation network can be obtained.According to the experimental results,the complex network association discovery method combined with K-means algorithm has better applicability and simplicity,while the accuracy is improved.展开更多
The structure and dynamic nature of real-world networks can be revealed by communities that help in promotion of recommendation systems.Social Media platforms were initially developed for effective communication,but n...The structure and dynamic nature of real-world networks can be revealed by communities that help in promotion of recommendation systems.Social Media platforms were initially developed for effective communication,but now it is being used widely for extending and to obtain profit among business community.The numerous data generated through these platforms are utilized by many companies that make a huge profit out of it.A giant network of people in social media is grouped together based on their similar properties to form a community.Commu-nity detection is recent topic among the research community due to the increase usage of online social network.Community is one of a significant property of a net-work that may have many communities which have similarity among them.Community detection technique play a vital role to discover similarities among the nodes and keep them strongly connected.Similar nodes in a network are grouped together in a single community.Communities can be merged together to avoid lot of groups if there exist more edges between them.Machine Learning algorithms use community detection to identify groups with common properties and thus for recommen-dation systems,health care assistance systems and many more.Considering the above,this paper presents alternative method SimEdge-CD(Similarity and Edge between's based Community Detection)for community detection.The two stages of SimEdge-CD initiallyfind the similarity among nodes and group them into one community.During the second stage,it identifies the exact affiliations of boundary nodes using edge betweenness to create well defined communities.Evaluation of proposed method on synthetic and real datasets proved to achieve a better accuracy-efficiency trade-of compared to other existing methods.Our proposed SimEdge-CD achieves ideal value of 1 which is higher than existing sim closure like LPA,Attractor,Leiden and walktrap techniques.展开更多
Performing analytics on the load curve(LC)of customers is the foundation for demand response which requires a better understanding of customers'consumption pattern(CP)by analyzing the load curve.However,the perfor...Performing analytics on the load curve(LC)of customers is the foundation for demand response which requires a better understanding of customers'consumption pattern(CP)by analyzing the load curve.However,the performances of previous widely-used LC clustering methods are poor in two folds:larger number of clusters,huge variances within a cluster(a CP is extracted from a cluster),bringing huge difficulty to understand the electricity consumption pattern of customers.In this paper,to improve the performance of LC clustering,a clustering framework incorporated with community detection is proposed.The framework includes three parts:network construction,community detection,and CP extraction.According to the cluster validity index(CVI),the integrated approach outperforms the previous state-of-the-art method with the same amount of clusters.And the approach needs fewer clusters to achieve the same performance measured by CVI.展开更多
基金supported by the National Natural Science Foundation of China(Grant No.61370073)the China Scholarship Council,China(Grant No.201306070037)
文摘Community detection is a fundamental work to analyse the structural and functional properties of complex networks. The label propagation algorithm (LPA) is a near linear time algorithm to find a good community structure. Despite various subsequent advances, an important issue of this algorithm has not yet been properly addressed. Random update orders within the algorithm severely hamper the stability of the identified community structure. In this paper, we executed the basic label propagation algorithm on networks multiple times, to obtain a set of consensus partitions. Based on these consensus partitions, we created a consensus weighted graph. In this consensus weighted graph, the weight value of the edge was the proportion value that the number of node pairs allocated in the same cluster was divided by the total number of partitions. Then, we introduced consensus weight to indicate the direction of label propagation. In label update steps, by computing the mixing value of consensus weight and label frequency, a node adopted the label which has the maximum mixing value instead of the most frequent one. For extending to different networks, we introduced a proportion parameter to adjust the proportion of consensus weight and label frequency in computing mixing value. Finally, we proposed an approach named the label propagation algorithm with consensus weight (LPAcw), and the experimental results showed that the LPAcw could enhance considerably both the stability and the accuracy of community partitions.
基金supported by grants from the National Natural Science Foundation of China No.NSFC62006109 and NSFC12031005the 13th Five-year plan for Education Science Funding of Guangdong Province No.2021GXJK349,No.2020GXJK457the Stable Support Plan Program of Shenzhen Natural Science Fund No.20220814165010001.
文摘Purpose:This study focuses on understanding the collaboration relationships among mathematicians,particularly those esteemed as elites,to reveal the structures of their communities and evaluate their impact on the field of mathematics.Design/methodology/approach:Two community detection algorithms,namely Greedy Modularity Maximization and Infomap,are utilized to examine collaboration patterns among mathematicians.We conduct a comparative analysis of mathematicians’centrality,emphasizing the influence of award-winning individuals in connecting network roles such as Betweenness,Closeness,and Harmonic centrality.Additionally,we investigate the distribution of elite mathematicians across communities and their relationships within different mathematical sub-fields.Findings:The study identifies the substantial influence exerted by award-winning mathematicians in connecting network roles.The elite distribution across the network is uneven,with a concentration within specific communities rather than being evenly dispersed.Secondly,the research identifies a positive correlation between distinct mathematical sub-fields and the communities,indicating collaborative tendencies among scientists engaged in related domains.Lastly,the study suggests that reduced research diversity within a community might lead to a higher concentration of elite scientists within that specific community.Research limitations:The study’s limitations include its narrow focus on mathematicians,which may limit the applicability of the findings to broader scientific fields.Issues with manually collected data affect the reliability of conclusions about collaborative networks.Practical implications:This study offers valuable insights into how elite mathematicians collaborate and how knowledge is disseminated within mathematical circles.Understanding these collaborative behaviors could aid in fostering better collaboration strategies among mathematicians and institutions,potentially enhancing scientific progress in mathematics.Originality/value:The study adds value to understanding collaborative dynamics within the realm of mathematics,offering a unique angle for further exploration and research.
基金supported by the National Natural Science Foundation of China(Project No.51767018)Natural Science Foundation of Gansu Province(Project No.23JRRA836).
文摘Current methodologies for cleaning wind power anomaly data exhibit limited capabilities in identifying abnormal data within extensive datasets and struggle to accommodate the considerable variability and intricacy of wind farm data.Consequently,a method for cleaning wind power anomaly data by combining image processing with community detection algorithms(CWPAD-IPCDA)is proposed.To precisely identify and initially clean anomalous data,wind power curve(WPC)images are converted into graph structures,which employ the Louvain community recognition algorithm and graph-theoretic methods for community detection and segmentation.Furthermore,the mathematical morphology operation(MMO)determines the main part of the initially cleaned wind power curve images and maps them back to the normal wind power points to complete the final cleaning.The CWPAD-IPCDA method was applied to clean datasets from 25 wind turbines(WTs)in two wind farms in northwest China to validate its feasibility.A comparison was conducted using density-based spatial clustering of applications with noise(DBSCAN)algorithm,an improved isolation forest algorithm,and an image-based(IB)algorithm.The experimental results demonstrate that the CWPAD-IPCDA method surpasses the other three algorithms,achieving an approximately 7.23%higher average data cleaning rate.The mean value of the sum of the squared errors(SSE)of the dataset after cleaning is approximately 6.887 lower than that of the other algorithms.Moreover,the mean of overall accuracy,as measured by the F1-score,exceeds that of the other methods by approximately 10.49%;this indicates that the CWPAD-IPCDA method is more conducive to improving the accuracy and reliability of wind power curve modeling and wind farm power forecasting.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.61173093 and 61202182)the Postdoctoral Science Foundation of China(Grant No.2012 M521776)+2 种基金the Fundamental Research Funds for the Central Universities of Chinathe Postdoctoral Science Foundation of Shannxi Province,Chinathe Natural Science Basic Research Plan of Shaanxi Province,China(Grant Nos.2013JM8019 and 2014JQ8359)
文摘Community detection is an important methodology for understanding the intrinsic structure and function of a realworld network. In this paper, we propose an effective and efficient algorithm, called Dominant Label Propagation Algorithm(Abbreviated as DLPA), to detect communities in complex networks. The algorithm simulates a special voting process to detect overlapping and non-overlapping community structure in complex networks simultaneously. Our algorithm is very efficient, since its computational complexity is almost linear to the number of edges in the network. Experimental results on both real-world and synthetic networks show that our algorithm also possesses high accuracies on detecting community structure in networks.
文摘A new genetic algorithm for community detection in complex networks was proposed. It adopts matrix encoding that enables traditional crossover between individuals. Initial populations are generated using nodes similarity, which enhances the diversity of initial individuals while retaining an acceptable level of accuracy, and improves the efficiency of optimal solution search. Individual crossover is based on the quality of individuals' genes; all nodes unassigned to any community are grouped into a new community, while ambiguously placed nodes are assigned to the community to which most of their neighbors belong. Individual mutation, which splits a gene into two new genes or randomly fuses it into other genes, is non-uniform. The simplicity and effectiveness of the algorithm are revealed in experimental tests using artificial random networks and real networks. The accuracy of the algorithm is superior to that of some classic algorithms, and is comparable to that of some recent high-precision algorithms.
基金The authors are grateful to the anonymous reviewers and the editor for their valuable comments and suggestions.This work is supported by Natural Science Foundation of China(Grant Nos.61702066 and 11747125)Major Project of Science and Technology Research Program of Chongqing Education Commission of China(Grant No.KJZD-M201900601)+3 种基金Chongqing Research Program of Basic Research and Frontier Technology(Grant Nos.cstc2017jcyjAX0256 and cstc2018jcy-jAX0154)Project Supported by Chongqing Municipal Key Laboratory of Institutions of Higher Education(Grant No.cqupt-mct-201901)Tech-nology Foundation of Guizhou Province(QianKeHeJiChu[2020]1Y269)New academic seedling cultivation and exploration innovation project(QianKeHe Platform Talents[2017]5789-21).
文摘This paper aims to effectively solve the problem of the influence maximization in social networks.For this purpose,an influence maximization method that can identify influential nodes via the community structure and the influence distribution difference is proposed.Firstly,the network embedding-based community detection approach is developed,by which the social network is divided into several high-quality communities.Secondly,the solution of influence maximization is composed of the candidate stage and the greedy stage.The candidate stage is to select candidate nodes from the interior and the boundary of each community using a heuristic algorithm,and the greedy stage is to determine seed nodes with the largest marginal influence increment from the candidate set through the sub-modular property-based Greedy algorithm.Finally,experimental results demonstrate the superiority of the proposed method compared with existing methods,from which one can further find that our work can achieve a good tradeoff between the influence spread and the running time.
基金supported by the National Natural Science Foundation of China(Grant No.61272279)the TianYuan Special Funds of the National Natural Science Foundation of China(Grant No.11326239)+1 种基金the Higher School Science and Technology Research Project of Inner Mongolia,China(Grant No.NJZY13119)the Inner Mongolia University of Technology,China(Grant No.ZD201221)
文摘Motivated by the relationship of the dynamic behaviors and network structure, in this paper, we present two efficient dynamic community detection algorithms. The phases of the nodes in the network can evolve according to our proposed differential equations. In each iteration, the phases of the nodes are controlled by several parameters. It is found that the phases of the nodes are ultimately clustered into several communities after a short period of evolution. They can be adopted to detect the communities successfully. The second differential equation can dynamically adjust several parameters, so it can obtain satisfactory detection results. Simulations on some test networks have verified the efficiency of the presented algorithms.
基金supported by the National Natural Science Foundation of China(71271018)
文摘Overlapping community detection in a network is a challenging issue which attracts lots of attention in recent years.A notion of hesitant node(HN) is proposed. An HN contacts with multiple communities while the communications are not strong or even accidental, thus the HN holds an implicit community structure.However, HNs are not rare in the real world network. It is important to identify them because they can be efficient hubs which form the overlapping portions of communities or simple attached nodes to some communities. Current approaches have difficulties in identifying and clustering HNs. A density-based rough set model(DBRSM) is proposed by combining the virtue of densitybased algorithms and rough set models. It incorporates the macro perspective of the community structure of the whole network and the micro perspective of the local information held by HNs, which would facilitate the further "growth" of HNs in community. We offer a theoretical support for this model from the point of strength of the trust path. The experiments on the real-world and synthetic datasets show the practical significance of analyzing and clustering the HNs based on DBRSM. Besides, the clustering based on DBRSM promotes the modularity optimization.
基金supported by the grants from Natural Science Foundation of China (Project No.61471060)
文摘Citation network is often used for academic recommendation. However, it is difficult to achieve high recommendation accuracy and low time complexity because it is often very large and sparse and different citations have different purposes. What's more, some citations include unreasonable information, such as in case of intentional self-citation. To improve the accuracy of citation network-based academic recommendation and reduce the time complexity, we propose an academic recommendation method for recommending authors and papers. In which, an author-paper bilayer citation network is built, then an enhanced topic model, Author Community Topic Time Model(ACTTM) is proposed to detect high quality author communities in the author layer, and a set of attributes are proposed to comprehensively depict the author/paper nodes in the bilayer citation network. Experimental results prove that the proposed ACTTM can detect high quality author communities and facilitate low time complexity, and the proposed academic recommendation method can effectively improve the recommendation accuracy.
基金supported by the National High Technology Research and Development Program of China under Grant No.2014AA015103Beijing Natural Science Foundation under Grant No.4152023+1 种基金the National Natural Science Foundation of China under Grant No.61473006the National Science and Technology Support Plan under Grant No.2014BAG01B02
文摘With the fast-growth of mobile social network, people' s interactions are frequently marked with location information, such as longitude and latitude of visited base station. This boom of data has led to considerable interest in research fields such as user behavior mining, trajectory discovery and social demographics. However, there is little research on community discovery in mobile social networks, and this is the problem this work tackles with. In this work, we take advantage of one simple property that people in different locations often belong to different social circles in order to discover communities in these networks. Based on this property, which we referred to as Location-lnteraction Disparity (LID), we proposed a state network and then define a quality function evaluating community detection results. We also propose a hybrid community- detection algorithm using LID tor discovering location-based communities effectively and efficiently. Experiments on synthesis networks show that this algorithm can run effectively in time and discover communities with high precision. In realworld networks, the method reveals people's different social circles in different places with high efficiency.
基金This research was supported by the Ministry of Trade,Industry&Energy(MOTIE,Korea)under the Industrial Technology Innovation Program,No.10063130by the Basic Science Research Program through the National Research Foundation of Korea(NRF)funded by the Ministry of Education(NRF-2019R1A2C1006159)by the Ministry of Science and ICT(MSIT),Korea,under the Information Technology Research Center(ITRC)support program(IITP-2019-2016-0-00313)supervised by the Institute for Information&communications Technology Promotion(IITP).
文摘Community detection in social networks is a hard problem because of the size,and the need of a deep understanding of network structure and functions.While several methods with significant effort in this direction have been devised,an outstanding open problem is the unknown number of communities,it is generally believed that the role of influential nodes that are surrounded by neighbors is very important.In addition,the similarity among nodes inside the same cluster is greater than among nodes from other clusters.Lately,the global and local methods of community detection have been getting more attention.Therefore,in this study,we propose an advanced communitydetection model for social networks in order to identify network communities based on global and local information.Our proposed model initially detects the most influential nodes by using an Eigen score then performs local expansion powered by label propagation.This process is conducted with the same color till nodes reach maximum similarity.Finally,the communities are formed,and a clear community graph is displayed to the user.Our proposed model is completely parameter-free,and therefore,no prior information is required,such as the number of communities,etc.We perform simulations and experiments using well-known synthetic and real network benchmarks,and compare them with well-known state-of-the-art models.The results prove that our model is efficient in all aspects,because it quickly identifies communities in the network.Moreover,it can easily be used for friendship recommendations or in business recommendation systems.
文摘Recommender system (RS) has become a very important factor in many eCommerce sites. In our daily life, we rely on the recommendation from other persons either by word of mouth, recommendation letters, movie, item and book reviews printed in newspapers, etc. The typical Recommender Systems are software tools and techniques that provide support to people by identifying interesting products and services in online store. It also provides a recommendation for certain users who search for the recommendations. The most important open challenge in Collaborative filtering recommender system is the cold start problem. If the adequate or sufficient information is not available for a new item or users, the recommender system runs into the cold start problem. To increase the usefulness of collaborative recommender systems, it could be desirable to eliminate the challenge such as cold start problem. Revealing the community structures is crucial to understand and more important with the increasing popularity of online social networks. The community detection is a key issue in social network analysis in which nodes of the communities are tightly connected each other and loosely connected between other communities. Many algorithms like Givan-Newman algorithm, modularity maximization, leading eigenvector, walk trap, etc., are used to detect the communities in the networks. To test the community division is meaningful we define a quality function called modularity. Modularity is that the links within a community are higher than the expected links in those communities. In this paper, we try to give a solution to the cold-start problem based on community detection algorithm that extracts the community from the social networks and identifies the similar users on that network. Hence, within the proposed work several intrinsic details are taken as a rule of thumb to boost the results higher. Moreover, the simulation experiment was taken to solve the cold start problem.
基金supported by the National Natural Science Foundation of China(Grant No.:71203164)
文摘Purpose: This paper aims to provide a method to detect research communities based on research interest in researcher network, which combines the topological structure and vertex attributes in a unified manner.Design/methodology/approach: A heterogeneous researcher network has been constructed by combining multiple relations of academic researchers. Vertex attributes and their similarities were considered and calculated. An approach has been proposed and tested to detect research community in research organizations based on this multi-relation researcher network.Findings: Detection of topologically well-connected, semantically coherent and meaningful research community was achieved.Research limitations: The sample size of evaluation experiments was relatively small. In the present study, a limited number of 72 researchers were analyzed for constructing researcher network and detecting research community. Therefore, a large sample size is required to give more information and reliable results.Practical implications: The proposed multi-relation researcher network and approaches for discovering research communities of similar research interests will contribute to collective innovation behavior such as brainstorming and to promote interdisciplinary cooperation.Originality/value: Recent researches on community detection devote most efforts to singlerelation researcher networks and put the main focus on the topological structure of networks.In reality, there exist multi-relation social networks. Vertex attribute also plays an important role in community detection. The present study combined multiple single-relational researcher networks into a multi-relational network and proposed a structure-attribute clustering method for detecting research community in research organizations.
基金National Natural Science Foundation of China(Nos.61672179,61370083,61402126)Heilongjiang Province Natural Science Foundation of China(No.F2015030)+1 种基金Science Fund for Youths in Heilongjiang Province(No.QC2016083)Postdoctoral Fellowship in Heilongjiang Province(No.LBH-Z14071).
文摘In recent years,local community detection algorithms have developed rapidly because of their nearly linear computing time and the convenience of obtaining the local information of real-world networks.However,there are still some issues that need to be further studied.First,there is no local community detection algorithm dedicated to detecting a seed-oriented local community,that is,the local community with the seed as the core.The second and third issues are that the quality of local communities detected by the previous local community detection algorithms are largely dependent on the position of the seed and predefined parameters,respectively.To solve the existing problems,we propose a seed-oriented local community detection algorithm,named SOLCD,that is based on influence spreading.First,we propose a novel measure of node influence named k-core centrality that is based on the k-core value of adjacent nodes.Second,we obtain the seed-oriented local community,which is composed of the may-members and the must-member chain of the seed,by detecting the influence scope of the seed.The may-members and the must-members of the seed are determined by judging the influence relationship between the node and the seed.Five state-of-art algorithms are compared to SOLCD on six real-world networks and three groups of artificial networks.The experimental results show that SOLCD can achieve a high-quality seed-oriented local community for various real-world networks and artificial networks with different parameters.In addition,when taking nodes with different influence as seeds,SOLCD can stably obtain high-quality seed-oriented local communities.
基金supported by the National Natural Science Foundation of China(Grant Nos.11261034,71561020,61503203,and 11326239)the Higher School Science and Technology Research Project of Inner Mongolia,China(Grant No.NJZY13119)the Natural Science Foundation of Inner Mongolia,China(Grant Nos.2015MS0103 and 2014BS0105)
文摘Community detection in signed networks has been studied widely in recent years. In this paper, a discrete difference equation is proposed to imitate the consistently changing phases of the nodes. During the interaction, each node will update its phase based on the difference equation. Each node has many different nodes connected with it, and these neighbors have different influences on it. The similarity between two nodes is applied to describe the influences between them. Nodes with high positive similarities will get together and nodes with negative similarities will be far away from each other.Communities are detected ultimately when the phases of the nodes are stable. Experiments on real world and synthetic signed networks show the efficiency of detection performance. Moreover, the presented method gains better detection performance than two existing good algorithms.
基金Project supported by Beijing Natural Science Foundation,China(Grant Nos.L181010 and 4172054)the National Key R&D Program of China(Grant No.2016YFB0801100)the National Basic Research Program of China(Grant No.2013CB329605)。
文摘The statistical model for community detection is a promising research area in network analysis.Most existing statistical models of community detection are designed for networks with a known type of community structure,but in many practical situations,the types of community structures are unknown.To cope with unknown community structures,diverse types should be considered in one model.We propose a model that incorporates the latent interaction pattern,which is regarded as the basis of constructions of diverse community structures by us.The interaction pattern can parameterize various types of community structures in one model.A collapsed Gibbs sampling inference is proposed to estimate the community assignments and other hyper-parameters.With the Pitman-Yor process as a prior,our model can automatically detect the numbers and sizes of communities without a known type of community structure beforehand.Via Bayesian inference,our model can detect some hidden interaction patterns that offer extra information for network analysis.Experiments on networks with diverse community structures demonstrate that our model outperforms four state-of-the-art models.
基金supported by the National Natural Science Foundation of China under Grant No.61300087National Science and Technology Ministry(ID:2013BAH01B00)
文摘to the capability of reflecting social perception on semantic of resources, folksonomy has been proposed to improve the social learning for education and scholar researching. However, its actual impact is significantly influenced by the semantic ambiguity problem of tags. So, in this paper, we proposed a novel way of detecting homonyms, one of the main sources of tag's semantic ambiguity problem, in noisy folksonomies. The study is based on two hypotheses: 1) Users having different interests tend to have different understanding of the same tag. 2) Users having similar interest tend to have common understanding of the same tag. Therefore, we firstly discover user communities according to users' interests. Then, tag contexts are discovered in subsets of folksonomy on the basis of user communities. The experimental results show that our method is effective and outperform the method finding tag contexts using all tags in folksonomy with overlapping clustering algorithm especially when various users having different interests are contained by the folksonomyo
文摘With the increasing number of airports and the expansion of their scale,the aviation network has become complex and hierarchical.In order to investigate the complex network characteristics of aviation networks,this paper constructs a Chinese aviation network model and carries out related research based on complex network theory and K-means algorithm.Initially,the P-space model is employed to construct the Chinese aviation network model.Then,complex network indicators such as degree,clustering coefficient,average path length,betweenness and coreness are selected to investigate the complex characteristics and hierarchical features of aviation networks and explore their causes.Secondly,using K-means clustering algorithm,five values are obtained as the initial clustering parameter K values for each of the aviation network hierarchies classified according to five complex network indicators.Meanwhile,clustering simulation experiments are conducted to obtain the visual clustering results of Chinese aviation network nodes under different K values,as well as silhouette coefficients for evaluating the clustering effect of each indicator in order to obtain the hierarchical classification of aviation networks under different indicators.Finally,the silhouette coefficient is optimal when the K value is 4.Thus,the clustering results of the four layers of the aviation network can be obtained.According to the experimental results,the complex network association discovery method combined with K-means algorithm has better applicability and simplicity,while the accuracy is improved.
文摘The structure and dynamic nature of real-world networks can be revealed by communities that help in promotion of recommendation systems.Social Media platforms were initially developed for effective communication,but now it is being used widely for extending and to obtain profit among business community.The numerous data generated through these platforms are utilized by many companies that make a huge profit out of it.A giant network of people in social media is grouped together based on their similar properties to form a community.Commu-nity detection is recent topic among the research community due to the increase usage of online social network.Community is one of a significant property of a net-work that may have many communities which have similarity among them.Community detection technique play a vital role to discover similarities among the nodes and keep them strongly connected.Similar nodes in a network are grouped together in a single community.Communities can be merged together to avoid lot of groups if there exist more edges between them.Machine Learning algorithms use community detection to identify groups with common properties and thus for recommen-dation systems,health care assistance systems and many more.Considering the above,this paper presents alternative method SimEdge-CD(Similarity and Edge between's based Community Detection)for community detection.The two stages of SimEdge-CD initiallyfind the similarity among nodes and group them into one community.During the second stage,it identifies the exact affiliations of boundary nodes using edge betweenness to create well defined communities.Evaluation of proposed method on synthetic and real datasets proved to achieve a better accuracy-efficiency trade-of compared to other existing methods.Our proposed SimEdge-CD achieves ideal value of 1 which is higher than existing sim closure like LPA,Attractor,Leiden and walktrap techniques.
基金Supported by the Major Program of National Natural Science Foundation of China(No.61432006)。
文摘Performing analytics on the load curve(LC)of customers is the foundation for demand response which requires a better understanding of customers'consumption pattern(CP)by analyzing the load curve.However,the performances of previous widely-used LC clustering methods are poor in two folds:larger number of clusters,huge variances within a cluster(a CP is extracted from a cluster),bringing huge difficulty to understand the electricity consumption pattern of customers.In this paper,to improve the performance of LC clustering,a clustering framework incorporated with community detection is proposed.The framework includes three parts:network construction,community detection,and CP extraction.According to the cluster validity index(CVI),the integrated approach outperforms the previous state-of-the-art method with the same amount of clusters.And the approach needs fewer clusters to achieve the same performance measured by CVI.