9 articles found
Discovering Frequent Subtrees from XML Data Using Neural Networks
1
Authors: SUN Wei, LIU Da-xin, WANG Tong. Wuhan University Journal of Natural Sciences, EI, CAS, 2006, Issue 1, pp. 117-121 (5 pages)
With the rapid progress of network and storage technologies, a huge amount of electronic data such as Web pages and XML has become available on the Internet. In this paper, we study the data-mining problem of discovering frequent ordered subtrees in a large collection of XML data, where both the patterns and the data are modeled by labeled ordered trees. We present an efficient algorithm, Ordered Subtree Miner (OSTMiner), based on two-layer neural networks with the Hebb rule, that computes all ordered subtrees appearing in a collection of XML trees with frequency above a user-specified threshold, using a special structure called the EM-tree. In this algorithm, the EM-tree is used as an extended merging tree to supply scheme information for efficient pruning and mining of frequent subtrees. Experimental results show that OSTMiner has good response time and scales well.
Keywords: XML; frequent subtrees; data mining; neural networks
Mining Compressed Frequent Subtrees Set
2
Authors: ZHAO Chuanshen, WANG Xianyong, SUN Zhihui, LI Yuetian. Wuhan University Journal of Natural Sciences, CAS, 2009, Issue 1, pp. 29-34 (6 pages)
The number of frequent subtrees usually grows exponentially with the tree size because of combinatorial explosion. As a result, there are too many frequent subtrees for users to manage and use. To solve this problem, we generalize a compression framework based on δ-cluster to the problem of compressing frequent-subtree sets, and propose an algorithm, RPTlocal, which can mine a compressed set of frequent subtrees directly. This algorithm sacrifices the theoretical bounds but still has good compression quality. By pruning the search space and generating frequent subtrees directly, the algorithm is also efficient. Experimental results show that the set of representative subtrees mined by RPTlocal is almost two orders of magnitude smaller than the whole collection of closed subtrees, and that RPTlocal is more efficient than CMtreeMiner, the algorithm for mining both closed and maximal frequent subtrees.
Keywords: data mining; frequent subtrees; projected branch
Historical relationships of the Mesoamerican highlands, with emphasis on tropical montane cloud forests: a temporal cladistic biogeographical analysis
3
Authors: CASTRO-TORREBLANCA Marisol, ESPINOSA David, BUENO-HERNANDEZ Alfredo, LUNA-VEGA Isolda. Journal of Mountain Science, SCIE, CSCD, 2023, Issue 1, pp. 129-140 (12 pages)
The historical relationships of nine areas of endemism of the tropical montane cloud forests (TMCFs) were analysed based on a temporal cladistic biogeographical approach. Three cladistic biogeographical analyses were conducted based on 29 cladograms of terrestrial taxa by partitioning them into three time slices, namely, Miocene, Pliocene, and Pleistocene. The results showed different area relationships over time. For the Miocene and Pliocene time slices, the Isthmus of Tehuantepec acted as a geographic barrier that fragmented the TMCFs into two portions: west of the Isthmus and east of the Isthmus. In the case of the Pleistocene, the TMCFs were broken into two portions, one related to the Neotropical region and the other to the Nearctic region. Furthermore, the analyses allowed us to detect the influences of different geological and paleoclimatological events on the distribution of the TMCFs over time. Therefore, the current distribution of the TMCFs might have been driven by geological events during the Miocene-Pliocene, whereas climatic fluctuations had the highest impact during the Pleistocene.
Keywords: dispersal; evolutionary biogeography; paralogy-free subtree analysis; vicariance
Mining condensed frequent subtree base
4
Authors: Wang Tao (王涛), Lu Yansheng (卢炎生). Journal of Southeast University (English Edition), EI, CAS, 2006, Issue 1, pp. 48-53 (6 pages)
In frequent tree pattern mining, the number of frequent subtrees generated is often too large. To tackle this problem, the concept of a condensed frequent subtree base is proposed. The base consists of the maximal frequent subtrees for a series of support thresholds. It is a subset of the frequent subtrees, and is used to approximate the support of arbitrary frequent subtrees with a guaranteed maximal error bound. In addition, an algorithm is developed to mine such a condensed subtree base in a database of labeled rooted ordered trees. The algorithm uses rightmost extension to systematically generate all frequent rooted ordered subtrees. Several techniques are proposed to prune the branches that do not correspond to maximal frequent subtrees. Heuristic techniques are used to arrange the order of computation so that relatively expensive computation is avoided as much as possible. Experimental results show that the size of the base is less than 10% of that of the complete set, and that the algorithm outperforms previous algorithms.
Keywords: data mining; tree pattern; condensed subtree base
Subtree, TASS and an Analysis of the Genus Caragana
5
Authors: Zhang Mingli (张明理), Pauline Y. LADIGES, Gareth NELSON. Acta Botanica Sinica, CSCD, 2002, Issue 10, pp. 1213-1218 (6 pages)
Subtree analysis and the three area statements (TASS) procedure are used to deal with area relationships in historical biogeography. On the basis of the taxon cladogram, the procedure can identify and eliminate paralogous nodes and determine the informative subtrees. The area relationships are generally illustrated in several subtrees and a consensus tree. The distribution pattern of the genus Caragana, comprising about 72 species and occurring in 13 areas, was analyzed in this study using the subtree method and the TASS procedure. The results showed seven subtrees representing the area relationships of the sections and series of the genus, and the consensus tree provided the relationships among the 13 areas. These results are congruent with our former result using component analysis for Caragana.
Keywords: subtree; three area statements (TASS); Caragana; distribution pattern
Burning Numbers of Barbells
6
Authors: Hui-qing LIU, Rui-ting ZHANG, Xiao-lan HU. Acta Mathematicae Applicatae Sinica, SCIE, CSCD, 2024, Issue 2, pp. 526-538 (13 pages)
Motivated by a discrete-time process intended to measure the speed of the spread of contagion in a graph, the burning number b(G) of a graph G is defined as the smallest integer k for which there are vertices x_1, …, x_k such that for every vertex u of G there exists i ∈ {1, …, k} with d_G(u, x_i) ≤ k − i, and d_G(x_i, x_j) ≥ j − i for any 1 ≤ i < j ≤ k. The graph burning problem has been shown to be NP-complete even for some acyclic graphs with maximum degree three. In this paper, we determine the burning numbers of all short barbells and long barbells, respectively.
Keywords: burning number; short/long barbell; isometric subtree; cycle
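The definition of the burning number in the abstract above can be checked directly on small graphs. The following is a minimal brute-force sketch, not the method of the paper: the helper names (`path_graph`, `bfs_distances`, `is_burning_sequence`, `burning_number`) and the adjacency-dictionary representation are assumptions made for illustration, and the exhaustive search is only practical for very small graphs.

```python
from collections import deque
from itertools import permutations
from math import inf


def path_graph(n):
    """Hypothetical helper: adjacency dict of the path on vertices 0..n-1."""
    return {i: [j for j in (i - 1, i + 1) if 0 <= j < n] for i in range(n)}


def bfs_distances(adj, source):
    """Shortest-path distances from source in an unweighted graph."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        v = queue.popleft()
        for w in adj[v]:
            if w not in dist:
                dist[w] = dist[v] + 1
                queue.append(w)
    return dist


def is_burning_sequence(adj, seq):
    """Check both conditions of the definition for seq = (x_1, ..., x_k)."""
    k = len(seq)
    dist = {x: bfs_distances(adj, x) for x in seq}
    # Every vertex u must satisfy d(u, x_i) <= k - i for some i (1-based).
    for u in adj:
        if not any(dist[x].get(u, inf) <= k - i
                   for i, x in enumerate(seq, start=1)):
            return False
    # Sources must satisfy d(x_i, x_j) >= j - i for all i < j.
    for i in range(k):
        for j in range(i + 1, k):
            if dist[seq[i]].get(seq[j], inf) < j - i:
                return False
    return True


def burning_number(adj):
    """Smallest k admitting a valid sequence, found by exhaustive search."""
    vertices = list(adj)
    for k in range(1, len(vertices) + 1):
        for seq in permutations(vertices, k):
            if is_burning_sequence(adj, seq):
                return k
    return None  # no valid sequence found


if __name__ == "__main__":
    print(burning_number(path_graph(4)))
    print(burning_number(path_graph(9)))
```

The order of the sequence matters: on the path with four vertices, (1, 3) satisfies the definition while (3, 1) leaves vertex 0 uncovered.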
Structure Polynomials and Subgraphs of Rooted Regular Trees
7
Authors: Daniele D'Angeli, Alfredo Donno. Algebra Colloquium, SCIE, CSCD, 2018, Issue 1, pp. 45-70 (26 pages)
We introduce an algebraic structure allowing us to describe subgraphs of a regular rooted tree. Its elements are called structure polynomials, and they are in one-to-one correspondence with the set of all subgraphs of the tree. We define two operations, the sum and the product of structure polynomials, giving a graph interpretation of them. Then we introduce an equivalence relation between polynomials, using the action of the full automorphism group of the tree, and we count equivalence classes of subgraphs modulo this equivalence. We also prove that this action gives rise to symmetric Gelfand pairs. Finally, when the regularity degree of the tree is a prime p, we regard each level of the tree as a finite-dimensional vector space over the finite field F_p, and we are able to completely characterize structure polynomials corresponding to subgraphs whose leaf set is a vector subspace.
Keywords: rooted tree; structure polynomial; rooted subtree; automorphism group of a tree; Gelfand pair
MTMO: an efficient network-centric algorithm for subtree counting and enumeration
8
Authors: Guanghui Li, Jiawei Luo, Zheng Xiao, Cheng Liang. Frontiers of Electrical and Electronic Engineering in China, CSCD, 2018, Issue 2, pp. 142-154 (13 pages)
Background: The frequency of small subtrees in biological, social, and other types of networks could shed light on the structure, function, and evolution of such networks. However, counting all possible subtrees of a prescribed size can be computationally expensive because of their potentially large number even in small, sparse networks. Moreover, most of the existing algorithms for subtree counting are subtree-centric approaches, which search for a single specific subtree type at a time, potentially taking more time by searching the same network repeatedly. Methods: In this paper, we propose a network-centric algorithm (MTMO) to efficiently count k-size subtrees. Our algorithm is based on the enumeration of all connected sets of k-1 edges, incorporates a labeled rooted tree data structure in the enumeration process to reduce the number of isomorphism tests required, and uses an array-based indexing scheme to simplify the subtree counting method. Results: Experiments on three representative undirected complex networks show that our algorithm is roughly an order of magnitude faster than existing subtree-centric approaches and than the base network-centric algorithm that does not use the rooted tree, allowing for counting larger subtrees in larger networks than previously possible. We also show major differences between unicellular and multicellular organisms. In addition, our algorithm is applied to find network motifs based on a pattern growth approach. Conclusions: A network-centric algorithm which allows for faster counting of non-induced subtrees is proposed. This enables us to count larger motifs in larger networks than previously.
Keywords: complex network; evolutionary systems biology; network motif discovery; subtree counting; subtree isomorphism
Distances Between Phylogenetic Trees: A Survey
9
Authors: Feng Shi, Qilong Feng, Jianer Chen, Lusheng Wang, Jianxin Wang. Tsinghua Science and Technology, SCIE, EI, CAS, 2013, Issue 5, pp. 490-499 (10 pages)
Phylogenetic trees have been widely used in the study of evolutionary biology for representing the tree-like evolution of a collection of species. However, different data sets and different methods often lead to the construction of different phylogenetic trees for the same set of species. Therefore, comparing these trees to determine similarities or, equivalently, dissimilarities becomes the fundamental issue. Typically, Tree Bisection and Reconnection (TBR) and Subtree Prune and Regraft (SPR) distances have been proposed to facilitate the comparison between different phylogenetic trees. In this paper, we give a survey on the aspects of computational complexity, fixed-parameter algorithms, and approximation algorithms for computing the TBR and SPR distances of phylogenetic trees.
Keywords: phylogenetic tree; tree bisection and reconnection; subtree prune and regraft; fixed-parameter algorithm; approximation algorithm