Abstract: Search engines are indispensable tools for finding information among massive collections of web pages and documents. A good search engine needs to retrieve information not only quickly, but also with relevance to the users' queries. Most search engines respond to user queries quickly; however, they provide little guarantee of precision, even for highly detailed queries. In such cases, clustering documents by subject and content might improve search results. This paper presents a novel method of document clustering that uses semantic cliques. First, we extracted features from the documents. Next, we defined the associations between frequently co-occurring terms, which we call semantic cliques. Each connected component in the semantic clique represented a theme. The documents were clustered by theme, for which we designed an aggregation algorithm. We evaluated the effectiveness of the aggregation algorithm on four kinds of datasets. The results showed that the semantic-clique-based document clustering algorithm performed significantly better than traditional clustering algorithms such as Principal Direction Divisive Partitioning (PDDP), k-means, Auto-Class, and Hierarchical Agglomerative Clustering (HAC). We found that Semantic Clique Aggregation is a promising model for representing association rules in text and could be very useful for automatic document clustering.
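The pipeline described in this abstract (co-occurring terms, connected components as themes, documents grouped by theme) can be illustrated with a minimal sketch. The abstract does not specify the feature extraction or the aggregation algorithm, so everything below is an assumption made for illustration: the function names, the `min_support` threshold, and the rule of assigning a document to the theme with the largest term overlap are hypothetical stand-ins, not the authors' method.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence_graph(docs, min_support=2):
    """Link two terms when they occur together in at least min_support documents.

    docs is a list of term lists; min_support is an assumed threshold.
    """
    pair_counts = defaultdict(int)
    for terms in docs:
        for a, b in combinations(sorted(set(terms)), 2):
            pair_counts[(a, b)] += 1
    graph = defaultdict(set)
    for (a, b), count in pair_counts.items():
        if count >= min_support:
            graph[a].add(b)
            graph[b].add(a)
    return dict(graph)


def connected_components(graph):
    """Return the connected components of the term graph; each one plays the role of a theme."""
    seen, themes = set(), []
    for start in graph:
        if start in seen:
            continue
        seen.add(start)
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            component.add(node)
            for nxt in graph[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        themes.append(component)
    return themes


def assign_documents(docs, themes):
    """Assumed aggregation rule: pick the theme sharing the most terms, or None if no overlap."""
    labels = []
    for terms in docs:
        overlaps = [len(set(terms) & theme) for theme in themes]
        best = max(range(len(themes)), key=overlaps.__getitem__, default=None)
        labels.append(best if best is not None and overlaps[best] > 0 else None)
    return labels
```

Calling `assign_documents(docs, connected_components(cooccurrence_graph(docs)))` returns one theme index per document, with `None` for documents that share no term with any theme.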
Funding: Supported by the National Natural Science Foundation of China (No. 11671124).
Abstract: A remarkable connection between the clique number and the Lagrangian of a graph was established by Motzkin and Straus. Later, Rota Bulò and Pelillo extended the Motzkin-Straus theorem to r-uniform hypergraphs by studying the relation between local (global) minimizers of a homogeneous polynomial function of degree r and the maximal (maximum) cliques of an r-uniform hypergraph. In this paper, we study polynomial optimization problems for non-uniform hypergraphs with four different types of edges and apply the results to obtain an upper bound on the Turán densities of complete non-uniform hypergraphs.
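For reference, the classical graph case that this abstract builds on can be written out as follows. The notation is an assumption, since the abstract does not fix any: $G$ is a graph on vertices $1,\dots,n$ with edge set $E(G)$, $\omega(G)$ is its clique number, and $\lambda(G)$ is its Lagrangian. The hypergraph extensions studied in the paper are not reproduced here.

```latex
% Motzkin-Straus theorem: the Lagrangian of a graph is determined by its clique number.
\[
  \lambda(G)
  \;=\;
  \max\Bigl\{ \sum_{\{i,j\}\in E(G)} x_i x_j \;:\; x_i \ge 0,\ \sum_{i=1}^{n} x_i = 1 \Bigr\}
  \;=\;
  \frac{1}{2}\Bigl(1 - \frac{1}{\omega(G)}\Bigr).
\]
% Worked example: for the triangle K_3, put weight 1/3 on each vertex.
\[
  \lambda(K_3) \;=\; 3\cdot\frac{1}{3}\cdot\frac{1}{3} \;=\; \frac{1}{3}
  \;=\; \frac{1}{2}\Bigl(1 - \frac{1}{3}\Bigr).
\]
```

The maximum is attained by spreading the weight uniformly over the vertices of a maximum clique, which is what links the optimization problem to $\omega(G)$.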
Abstract: Keyword search is an alternative to structured languages for querying graph-structured data. A result of a keyword query is a connected structure covering all or part of the queried keywords. Textual coverage and structural compactness are known as the two main properties of a relevant result. Many previous works examined these properties only after retrieving all candidate results, comparing them with a ranking function. However, this requires a time-consuming search process, which is not appropriate for an interactive system in which the user expects results in the least possible time. Recent works have addressed this problem by confining the shape of the results so that coverage and compactness can be examined during the search. However, these methods still suffer from redundant nodes in the retrieved results. In this paper, we introduce the semantics of the minimal covered r-clique (MCCr) for the results of a keyword query as an extended model of existing definitions. We propose efficient algorithms to detect the MCCrs of a given query. These algorithms can retrieve a comprehensive set of non-duplicate MCCrs in response to a keyword query. In addition, they can be executed in a distributed manner, which distinguishes them in the field of keyword search. We also propose approximate versions of these algorithms that retrieve the top-k approximate MCCrs with polynomial delay. It is proved that the approximate algorithms retrieve results within a factor-two approximation. Extensive experiments on two real-world datasets confirm the efficiency and effectiveness of the proposed algorithms.
Abstract: Analyses of spatial relationships and social interactions provide insights into the social structure of animal societies and the ways in which social preferences among and between dyads affect higher-order social relationships. In this paper we describe the patterns of spatial associations and social interactions among adult male northern muriquis in order to evaluate the dynamics of their social networks above the dyadic level. Systematic observations were made on the 17 adult males present in a multi-male/multi-female group from April 2004 through February 2005, and in July 2005. Analyses of their spatial relationships identified two distinct male cliques; some adult males (called "N" males) were more connected to the females and immatures than other adult males ("MU" males), which were more connected to one another. Affiliative interactions were significantly more frequent among dyads belonging to the same clique than among dyads from different cliques. Although frequencies of dyadic agonistic interactions were similarly low among individuals within and between cliques, MU males appeared to be subordinate to N males. Nonetheless, there were no significant differences in the copulation rates estimated for MU males and N males. Mutual benefits of cooperation between MU and N cliques in intergroup encounters might explain their ongoing associations in the same mixed-sex group.
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 11271230, 11671232).
Abstract: A graph is said to be claw-free if it does not contain an induced subgraph isomorphic to $K_{1,3}$. Let $s$ and $k$ be two integers with $0 \le s \le k$ and let $G$ be a claw-free graph of order $n$. In this paper, we investigate clique partition problems in claw-free graphs. It is proved that if $n \ge 3s + 4(k-s)$ and $d(x) + d(y) \ge n - 2s + 2k + 1$ for any pair of non-adjacent vertices $x, y$ of $G$, then $G$ contains $s$ disjoint copies of $K_3$ and $k-s$ disjoint copies of $K_4$ such that all of them are pairwise disjoint. Moreover, the degree condition is sharp in some cases.
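The hypotheses of this result are easy to check mechanically on small graphs. The sketch below assumes an undirected simple graph given as a dictionary mapping each vertex to the set of its neighbours; the function names are hypothetical. It only tests the stated conditions (claw-freeness, the order bound, and the degree-sum bound) and does not construct the disjoint cliques themselves.

```python
from itertools import combinations


def is_claw_free(adj):
    """True if no vertex has three pairwise non-adjacent neighbours (no induced K_{1,3})."""
    for nbrs in adj.values():
        for a, b, c in combinations(nbrs, 3):
            if b not in adj[a] and c not in adj[a] and c not in adj[b]:
                return False
    return True


def satisfies_hypotheses(adj, s, k):
    """Check 0 <= s <= k, n >= 3s + 4(k - s), claw-freeness, and the degree-sum condition."""
    n = len(adj)
    if not (0 <= s <= k) or n < 3 * s + 4 * (k - s):
        return False
    if not is_claw_free(adj):
        return False
    bound = n - 2 * s + 2 * k + 1
    # The degree-sum condition only concerns pairs of non-adjacent vertices.
    return all(
        len(adj[x]) + len(adj[y]) >= bound
        for x, y in combinations(adj, 2)
        if y not in adj[x]
    )
```

For a complete graph the degree-sum condition holds vacuously, because there are no non-adjacent pairs to test.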
Funding: Supported by the National Natural Science Foundation of China (No. 10671081) and by the self-determined research funds CCNU09Y01005 and CCNU09Y01018 from the colleges' basic research and operation of MOE.
Abstract: X. Deng et al. proved Chvátal's conjecture on maximal stable sets and maximal cliques in graphs. G. Ding made a conjecture that generalizes Chvátal's conjecture. The purpose of this paper is to prove this conjecture for planar graphs and for the complements of planar graphs.
Funding: National Natural Science Foundation of China (No. 81760851); Guangxi University Youth Promotion Program (No. 2019KY0348).
Abstract: Objective: To identify module genes that are closely related to clinical features of hepatocellular carcinoma (HCC) by weighted gene co-expression network analysis, and to provide a reference for early clinical diagnosis and treatment. Methods: GSE84598 chip data were downloaded from the GEO database, and module genes closely related to the clinical features of HCC were extracted by comprehensive weighted gene co-expression network analysis. Hub genes were identified through protein interaction network analysis with the maximum clique centrality (MCC) algorithm. Finally, the expression of the hub genes was validated in the TCGA database, and the Kaplan-Meier Plotter online database was used to evaluate the relationship between the hub genes and the prognosis of HCC patients. Results: By comparing the gene expression data of HCC tissue samples and normal liver tissue samples, a total of 6,262 differentially expressed genes were obtained, of which 2,207 were upregulated and 4,055 were downregulated. Weighted gene co-expression network analysis identified 120 genes in key modules. Intersecting these with the differentially expressed genes gave 115 candidate hub genes. Enrichment analysis showed that the candidate hub genes were closely related to mitosis, the p53 signaling pathway, and other processes. Applying the MCC algorithm to the protein interaction network of the 115 candidate hub genes identified five hub genes, namely NUF2, RRM2, UBE2C, CDC20 and MAD2L1. Validation in the TCGA database revealed that all five hub genes were significantly upregulated in HCC tissues compared with normal liver tissues. Moreover, survival analysis revealed that high expression of the hub genes was closely associated with poor prognosis in HCC patients. Conclusions: This study identifies five hub genes by combining multiple databases, which may provide directions for the clinical diagnosis and treatment of HCC.
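To make the hub-gene ranking step more concrete, here is a minimal sketch of a clique-based centrality score. It rests on an assumption that the abstract does not confirm: that "MCC" refers to the measure provided by the cytoHubba plugin, where the score of a node is the sum of (|C| - 1)! over the maximal cliques C containing it. The function names are hypothetical, the graph is a dictionary of neighbour sets, and giving isolated nodes a score of 0 is a further assumption.

```python
from math import factorial


def maximal_cliques(adj):
    """Yield every maximal clique of an undirected simple graph (Bron-Kerbosch with pivoting)."""
    def expand(r, p, x):
        if not p and not x:
            yield r
            return
        # Choose the pivot with the most neighbours among the remaining candidates.
        pivot = max(p | x, key=lambda u: len(adj[u] & p))
        for v in list(p - adj[pivot]):
            yield from expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    yield from expand(set(), set(adj), set())


def mcc_scores(adj):
    """Score each node by summing (|C| - 1)! over the maximal cliques C that contain it.

    Assumed definition; singleton cliques (isolated nodes) are skipped and score 0.
    """
    scores = {v: 0 for v in adj}
    for clique in maximal_cliques(adj):
        if len(clique) < 2:
            continue
        weight = factorial(len(clique) - 1)
        for v in clique:
            scores[v] += weight
    return scores
```

Under this definition, a node whose neighbours are pairwise non-adjacent simply scores its degree, while nodes embedded in large cliques are weighted much more heavily. Ranking the nodes by score and keeping the top few mirrors the hub-gene selection described in the abstract.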
Abstract: In this study, we consider triangulated graphs. Specifically, we give a necessary and sufficient condition for a graph to be triangulated, which yields an alternative characterization of triangulated graphs. Our method is based on so-called perfectly nested sequences.
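The abstract does not define perfectly nested sequences, so the sketch below does not attempt to reproduce the paper's characterization. Instead it illustrates the classical one: a graph is triangulated (chordal) exactly when its vertices can be removed one at a time, each being simplicial (its neighbours form a clique) at the moment of removal. The function name and the dictionary-of-sets graph representation are assumptions.

```python
from itertools import combinations


def is_triangulated(adj):
    """Test chordality by repeatedly deleting a simplicial vertex (classical criterion)."""
    graph = {v: set(nbrs) for v, nbrs in adj.items()}  # work on a copy
    while graph:
        simplicial = next(
            (
                v
                for v, nbrs in graph.items()
                if all(b in graph[a] for a, b in combinations(nbrs, 2))
            ),
            None,
        )
        if simplicial is None:
            # No simplicial vertex is left, so some induced cycle of length >= 4 remains.
            return False
        for u in graph.pop(simplicial):
            graph[u].discard(simplicial)
    return True
```

This naive version favours clarity over speed; linear-time tests based on maximum cardinality search or lexicographic breadth-first search exist for large graphs.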
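Abstract: High-dimensional data are sparse and susceptible to the curse of dimensionality, which makes it difficult to guarantee the accuracy and efficiency of clustering such data. Subspace clustering is therefore adopted to reduce the influence of sparsity and the curse of dimensionality on the clustering results. First, random sampling is used to select dimensions suitable for clustering from the high-dimensional data and to generate subspaces, and the Hoeffding bound is used to guarantee the validity of the sampling results. Second, the adjacency of grids is exploited to generate hybrid grids within the subspaces, which both preserves the integrity of the data and increases the subspace density. Finally, dimensions are pruned according to the similarity and dissimilarity of the subspaces, which further increases the subspace density. The algorithm achieves good results on datasets from the University of California, Irvine (UCI) repository, and it performs well in terms of scalability and noise resistance.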
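The abstract does not say how the Hoeffding bound enters the sampling step, so the following is only a sketch of the standard two-sided bound for the mean of independent samples whose values lie in an interval of width `value_range`. The function names and parameters are hypothetical: `epsilon` is the tolerated deviation of the sample mean from the true mean and `delta` is the allowed failure probability.

```python
import math


def hoeffding_sample_size(epsilon, delta, value_range=1.0):
    """Smallest n with 2 * exp(-2 * n * epsilon**2 / value_range**2) <= delta."""
    return math.ceil(value_range ** 2 * math.log(2.0 / delta) / (2.0 * epsilon ** 2))


def hoeffding_epsilon(n, delta, value_range=1.0):
    """Deviation guaranteed with probability at least 1 - delta after n samples."""
    return value_range * math.sqrt(math.log(2.0 / delta) / (2.0 * n))
```

For example, `hoeffding_sample_size(0.1, 0.05)` gives the number of samples after which the sample mean of values in [0, 1] is within 0.1 of the true mean with probability at least 0.95. The bound does not depend on the dimensionality of the data, which is one reason it suits sampling in high-dimensional settings.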