期刊文献+
共找到8篇文章
< 1 >
每页显示 20 50 100
ISTC: A New Method for Clustering Search Results 被引量:2
1
作者 ZHANG Wei XU Baowen +1 位作者 ZHANG Weifeng XU Junling 《Wuhan University Journal of Natural Sciences》 CAS 2008年第4期501-504,共4页
A new common phrase scoring method is proposed according to term frequency-inverse document frequency (TFIDF) and independence of the phrase. Combining the two properties can help identify more reasonable common phr... A new common phrase scoring method is proposed according to term frequency-inverse document frequency (TFIDF) and independence of the phrase. Combining the two properties can help identify more reasonable common phrases, which improve the accuracy of clustering. Also, the equation to measure the in-dependence of a phrase is proposed in this paper. The new algorithm which improves suffix tree clustering algorithm (STC) is named as improved suffix tree clustering (ISTC). To validate the proposed algorithm, a prototype system is implemented and used to cluster several groups of web search results obtained from Google search engine. Experimental results show that the improved algorithm offers higher accuracy than traditional suffix tree clustering. 展开更多
关键词 Web search results clustering suffix tree term frequency-inverse document frequency (TFIDF) independence of phrases
下载PDF
Optimizing combination of aircraft maintenance tasks by adaptive genetic algorithm based on cluster search 被引量:5
2
作者 Huaiyuan Li Hongfu Zuo +3 位作者 Kun Liang Juan Xu Jing Cai Junqiang Liu 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2016年第1期140-156,共17页
It is significant to combine multiple tasks into an optimal work package in decision-making of aircraft maintenance to reduce cost,so a cost rate model of combinatorial maintenance is an urgent need.However,the optima... It is significant to combine multiple tasks into an optimal work package in decision-making of aircraft maintenance to reduce cost,so a cost rate model of combinatorial maintenance is an urgent need.However,the optimal combination under various constraints not only involves numerical calculations but also is an NP-hard combinatorial problem.To solve the problem,an adaptive genetic algorithm based on cluster search,which is divided into two phases,is put forward.In the first phase,according to the density,all individuals can be homogeneously scattered over the whole solution space through crossover and mutation and better individuals are collected as candidate cluster centres.In the second phase,the search is confined to the neighbourhood of some selected possible solutions to accurately solve with cluster radius decreasing slowly,meanwhile all clusters continuously move to better regions until all the peaks in the question space is searched.This algorithm can efficiently solve the combination problem.Taking the optimization on decision-making of aircraft maintenance by the algorithm for an example,maintenance which combines multiple parts or tasks can significantly enhance economic benefit when the halt cost is rather high. 展开更多
关键词 cluster search genetic algorithm combinatorial optimization multi-part maintenance grouping maintenance.
下载PDF
Effective approach for outdoor obstacle detection by clustering LIDAR data context 被引量:1
3
作者 王军政 乔佳楠 李静 《Journal of Beijing Institute of Technology》 EI CAS 2016年第4期483-490,共8页
A method of environment mapping using laser-based light detection and ranging (LIDAR) is proposed in this paper. This method not only has a good detection performance in a wide range of detection angles, but also fa... A method of environment mapping using laser-based light detection and ranging (LIDAR) is proposed in this paper. This method not only has a good detection performance in a wide range of detection angles, but also facilitates the detection of dynamic and hollowed-out obstacles. Essentially using this method, an improved clustering algorithm based on fast search and discovery of density peaks (CBFD) is presented to extract various obstacles in the environment map. By comparing with other cluster algorithms, CBFD can obtain a favorable number of clusterings automatically. Furthermore, the experiments show that CBFD is better and more robust in functionality and performance than the K-means and iterative self-organizing data analysis techniques algorithm (ISODATA). 展开更多
关键词 context modeling clustering algorithm based on fast search and discovery of densitypeaks(CBFD) Hull algorithm obstacle detection obstacle fusion
下载PDF
Optimization of satellite searching strategy of the non-stationary antenna
4
作者 曹海青 王渝 姚志英 《Journal of Beijing Institute of Technology》 EI CAS 2015年第3期398-404,共7页
The tiny searching step length and the satellite distribution density are the major factors to influence the efficiency of the satellite finder,so a scientific and reasonable method to calculate the tiny searching ste... The tiny searching step length and the satellite distribution density are the major factors to influence the efficiency of the satellite finder,so a scientific and reasonable method to calculate the tiny searching step length is proposed to optimize the satellite searching strategy. The pattern clustering and BP neural network are applied to optimize the tiny searching step length. The calculated tiny searching step length is approximately equal to the theoretic value for each satellite. In application,the satellite searching results will be dynamically added to the training samples to re-train the network to improve the generalizability and the precision. Experiments validate that the optimization of the tiny searching step length can avoid the error of locating target satellite and improve the searching efficiency. 展开更多
关键词 tiny searching step length satellite finder patter clustering neural network
下载PDF
Accelerated steered response power method for sound source localization via clustering search 被引量:5
5
作者 ZHAO XiaoYan TANG Jie +1 位作者 ZHOU Lin WU ZhenYang 《Science China(Physics,Mechanics & Astronomy)》 SCIE EI CAS 2013年第7期1329-1338,共10页
The steered response power-phase transform (SRP-PHAT) sound source localization algorithm is robust in a real environment. However, the large computation complexity limits the practical application of SRP-PHAT. For a ... The steered response power-phase transform (SRP-PHAT) sound source localization algorithm is robust in a real environment. However, the large computation complexity limits the practical application of SRP-PHAT. For a microphone array, each location corresponds to a set of time differences of arrival (TDOAs), and this paper collects them into a TDOA vector. Since the TDOA vectors in the adjacent regions are similar, we present a fast algorithm based on clustering search to reduce the computation complexity of SRP-PHAT. In the training stage, the K-means or Iterative Self-Organizing Data Analysis Technique (ISODATA) clustering algorithm is used to find the centroid in each cluster with similar TDOA vectors. In the procedure of sound localization, the optimal cluster is found by comparing the steered response powers (SRPs) of all centroids. The SRPs of all candidate locations in the optimal cluster are compared to localize the sound source. Experiments both in simulation environments and real environments have been performed to compare the localization accuracy and computational load of the proposed method with those of the conventional SRP-PHAT algorithm. The results show that the proposed method is able to reduce the computational load drastically and maintains almost the same localization accuracy and robustness as those of the conventional SRP-PHAT algorithm. The difference in localization performance brought by different clustering algorithms used in the training stage is trivial. 展开更多
关键词 sound source localization microphone array steered response power clustering search
原文传递
基于Elastic的分布式大数据联机分析处理体系
6
作者 陈志敏 《通讯世界》 2020年第10期68-69,74,共3页
本文介绍了在电信运营商环境下,如何集中管理非结构化文本数据。提出一种轻便的架构,可以降低处理数据的时延,减少中心节点的成本。电信运营商中存在大量的日志数据,如用户的行为数据、话单数据、CDN访问日志、互联网电视的播放数据等等... 本文介绍了在电信运营商环境下,如何集中管理非结构化文本数据。提出一种轻便的架构,可以降低处理数据的时延,减少中心节点的成本。电信运营商中存在大量的日志数据,如用户的行为数据、话单数据、CDN访问日志、互联网电视的播放数据等等,这些数据存在数据量大、分析需求多样化的特征。本文基于开源软件,通过两年多的实践经验总结一种方法,在电信运营商中的日志场景下的快速搭建,快速构建查询报表,跨省跨节点快速度查询的技术实践。 展开更多
关键词 Cross cluster search ELK Kafka Flink Elasticsearch
下载PDF
Meta-Path-Based Search and Mining in Heterogeneous Information Networks 被引量:17
7
作者 Yizhou Sun Jiawei Han 《Tsinghua Science and Technology》 SCIE EI CAS 2013年第4期329-338,共10页
Information networks that can be extracted from many domains are widely studied recently. Different functions for mining these networks are proposed and developed, such as ranking, community detection, and link predic... Information networks that can be extracted from many domains are widely studied recently. Different functions for mining these networks are proposed and developed, such as ranking, community detection, and link prediction. Most existing network studies are on homogeneous networks, where nodes and links are assumed from one single type. In reality, however, heterogeneous information networks can better model the real-world systems, which are typically semi-structured and typed, following a network schema. In order to mine these heterogeneous information networks directly, we propose to explore the meta structure of the information network, i.e., the network schema. The concepts of meta-paths are proposed to systematically capture numerous semantic relationships across multiple types of objects, which are defined as a path over the graph of network schema. Meta-paths can provide guidance for search and mining of the network and help analyze and understand the semantic meaning of the objects and relations in the network. Under this framework, similarity search and other mining tasks such as relationship prediction and clustering can be addressed by systematic exploration of the network meta structure. Moreover, with user's guidance or feedback, we can select the best meta-path or their weighted combination for a specific mining task. 展开更多
关键词 heterogeneous information network meta-path similarity search relationship prediction user-guided clustering
原文传递
Similarity Search Algorithm over Data Supply Chain Based on Key Points 被引量:1
8
作者 Peng Li Hong Luo Yan Sun 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2017年第2期174-184,共11页
In this paper, we target a similarity search among data supply chains, which plays an essential role in optimizing the supply chain and extending its value. This problem is very challenging for application-oriented da... In this paper, we target a similarity search among data supply chains, which plays an essential role in optimizing the supply chain and extending its value. This problem is very challenging for application-oriented data supply chains because the high complexity of the data supply chain makes the computation of similarity extremely complex and inefficient. In this paper, we propose a feature space representation model based on key points,which can extract the key features from the subsequences of the original data supply chain and simplify it into a feature vector form. Then, we formulate the similarity computation of the subsequences based on the multiscale features. Further, we propose an improved hierarchical clustering algorithm for a similarity search over the data supply chains. The main idea is to separate the subsequences into disjoint groups such that each group meets one specific clustering criteria; thus, the cluster containing the query object is the similarity search result. The experimental results show that the proposed approach is both effective and efficient for data supply chain retrieval. 展开更多
关键词 data supply chain similarity search feature space hierarchical clustering
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部