期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
Hierarchical Clustering of Complex Symbolic Data and Application for Emitter Identification 被引量:1
1
作者 Xin Xu jiaheng lu Wei Wang 《Journal of Computer Science & Technology》 SCIE EI CSCD 2018年第4期807-822,共16页
It is well-known that the values of symbolic variables may take various forms such as an interval, a set of stochastic measurements of some underlying patterns or qualitative multi-values and so on. However, the major... It is well-known that the values of symbolic variables may take various forms such as an interval, a set of stochastic measurements of some underlying patterns or qualitative multi-values and so on. However, the majority of existing work in symbolic data analysis still focuses on interval values. Although some pioneering work in stochastic pattern based symbolic data and mixture of symbolic variables has been explored, it still lacks flexibility and computation efficiency to make full use of the distinctive individual symbolic variables. Therefore, we bring forward a novel hierarchical clustering method with weighted general Jaccard distance and effective global pruning strategy for complex symbolic data and apply it to emitter identification. Extensive experiments indicate that our method has outperformed its peers in both computational efficiency and emitter identification accuracy. 展开更多
关键词 symbolic data analysis stochastic pattern fuzzy interval hierarchical clustering emitter identification
原文传递
Improving performance by creating a native join-index for OLAP 被引量:3
2
作者 Yansong ZHANG Shan WANG jiaheng lu 《Frontiers of Computer Science》 SCIE EI CSCD 2011年第2期236-249,共14页
The performance of online analytical processing (OLAP) is critical for meeting the increasing requirements of massive volume analytical applications. Typical techniques, such as in-memory processing, column-storage,... The performance of online analytical processing (OLAP) is critical for meeting the increasing requirements of massive volume analytical applications. Typical techniques, such as in-memory processing, column-storage, and join indexes focus on high perfor- mance storage media, efficient storage models, and reduced query processing. While they effectively perform OLAP applications, there is a vital limitation: main- memory database based OLAP (MMOLAP) cannot provide high performance for a large size data set. In this paper, we propose a novel memory dimension table model, in which the primary keys of the dimension table can be directly mapped to dimensional tuple addresses. To achieve higher performance of dimensional tuple access, we optimize our storage model for dimension tables based on OLAP query workload features. We present directly dimensional tuple accessing (DDTA) based join (DDTA- JOIN), a technique to optimize query processing on the memory dimension table by direct dimensional tuple access. We also contribute by proposing an optimization of the predicate tree to shorten predicate operation length by pruning useless predicate processing. Our experimental results show that the DDTA-JOIN algorithm is superior to both simulated row-store main memory query processing and the open-source column-store main memory database MonetDB, thanks to the reduced join cost and simple yet efficient query processing. 展开更多
关键词 directly dimensional tuple accessing (DDTA) DDTA JOIN native join index predicate tree
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部