The idea of positional inverted index is exploited for indexing of graph database. The main idea is the use of hashing tables in order to prune a considerable portion of graph database that cannot contain the answer s...The idea of positional inverted index is exploited for indexing of graph database. The main idea is the use of hashing tables in order to prune a considerable portion of graph database that cannot contain the answer set. These tables are implemented using column-based techniques and are used to store graphs of database, frequent sub-graphs and the neighborhood of nodes. In order to exact checking of remaining graphs, the vertex invariant is used for isomorphism test which can be parallel implemented. The results of evaluation indicate that proposed method outperforms existing methods.展开更多
For those refineries which have to deal with different types of crude oil, blending is an attractive solution to obtain a quality feedstock. In this paper, a novel scheduling strategy is proposed for a practical crude...For those refineries which have to deal with different types of crude oil, blending is an attractive solution to obtain a quality feedstock. In this paper, a novel scheduling strategy is proposed for a practical crude oil blending process. The objective is to keep the property of feedstock, mainly described by the true boiling point (TBP) data, consistent and suitable. Firstly, the mathematical model is established. Then, a heuristically initialized hybrid iterative (HIHI) algorithm based on a two-level optimization structure, in which tabu search (TS) and differential evolution (DE) are used for upper-level and lower-level optimization, respectively, is proposed to get the model solution. Finally, the effectiveness and efficiency of the scheduling strategy is validated via real data from a certain refinery.展开更多
Image registration is the overlaying of two images of the same scene taken at different times or by different sensors. It is one of the essential steps in information processing in remote sensing. To attain a highly a...Image registration is the overlaying of two images of the same scene taken at different times or by different sensors. It is one of the essential steps in information processing in remote sensing. To attain a highly accurate, reliable and low computation cost in image registration a suitable and similarity metric and reduction in search data and search space is required. In this paper, the author shows that if the right bin size is chosen, mutual information can be more robust than correlation in the registration of multi-temporal images. The author also compares the sensitivity of mutual information and correlation to Gaussian and multiplicative speckle noise. The author investigates automatic subimage selection as a reduction in search data strategy. The author proposes a measure, called alienability, which shows the ability ofa subimage to provide reliable registration. Alternate subimage selection methods such as using gradient, entropy and variance are also investigated. The author furthermore looks into a search space strategy using a gradient approach to maximize mutual information and show our first results.展开更多
Breadth-first search(BFS) is an important kernel for graph traversal and has been used by many graph processing applications. Extensive studies have been devoted in boosting the performance of BFS. As the most effecti...Breadth-first search(BFS) is an important kernel for graph traversal and has been used by many graph processing applications. Extensive studies have been devoted in boosting the performance of BFS. As the most effective solution, GPU-acceleration achieves the state-of-the-art result of 3.3×109 traversed edges per second on a NVIDIA Tesla C2050 GPU. A novel vertex frontier based GPU BFS algorithm is proposed, and its main features are three-fold. Firstly, to obtain a better workload balance for irregular graphs, a virtual-queue task decomposition and mapping strategy is introduced for vertex frontier expanding. Secondly, a global deduplicate detection scheme is proposed to remove reduplicative vertices from vertex frontier effectively. Finally, a GPU-based bottom-up BFS approach is employed to process large frontier. The experimental results demonstrate that the algorithm can achieve 10% improvement over the state-of-the-art method on diverse graphs. Especially, it exhibits 2-3 times speedup on low-diameter and scale-free graphs over the state-of-the-art on a NVIDIA Tesla K20 c GPU, reaching a peak traversal rate of 11.2×109 edges/s.展开更多
Aiming at the shortcoming that certain existing blockingmatching algorithrns, such as full search, three-step search, and dia- mond search algorithms, usually can not keep a good balance between high acoaracy and low ...Aiming at the shortcoming that certain existing blockingmatching algorithrns, such as full search, three-step search, and dia- mond search algorithms, usually can not keep a good balance between high acoaracy and low computational complexity, a block-maching motion estimation algorithm based on two-step search is proposed in this paper. According to the fact that the gray values of adjacent pixels will not vary fast, the algorithm employs an interlaced search pattem in the search window to estimate the motion vector of the objectblock. Simulation and actual experiments demanstrate that the proposed algmithm greatly outperforms the well-known three-step search and dianond search algoritlam, no matter the motion vector is large or small. Comparedc with the full search algorithm, the proposed one achieves similar peffomance but requires much less computation, therefore, the algorithm is well qualified for real-time video image processing.展开更多
To improve efficiency of search engines, the query result cache has drawn much attention re- cently. According to the query processing and user's query logs locality, a new hybrid result cache strategy which associat...To improve efficiency of search engines, the query result cache has drawn much attention re- cently. According to the query processing and user's query logs locality, a new hybrid result cache strategy which associates with caching heat and worth is proposed to compute cache score in accord- ance with cost-aware strategies. Exactly, query repeated distance and query length factor are utilized to improve the static result policy, and the dynamic policy is adjusted by the caching worth. The hy- brid result cache is implemented in term of the document content and document ids (docIds) se- quence. Based on a score format and the new hybrid structure, an initial algorithm and a new rou- ting algorithm are designed for result cache. Experiments' results show that the improved caching policies decrease the average response time effectively, and increase the system throughput signifi- cantly. By choosing comfortable combination of page cache and docIds cache, the new hybrid cac- hing strategy almost reduces more than 20% of the only cache and docId-only cache. average query time compared with the basic page-展开更多
This paper introduces a parallel search system for dynamic multi-objective traveling salesman problem. We design a multi-objective TSP in a stochastic dynamic environment. This dynamic setting of the problem is very u...This paper introduces a parallel search system for dynamic multi-objective traveling salesman problem. We design a multi-objective TSP in a stochastic dynamic environment. This dynamic setting of the problem is very useful for routing in ad-hoc networks. The proposed search system first uses parallel processors to identify the extreme solutions of the search space for each ofk objectives individually at the same time. These solutions are merged into the so-called hit-frequency matrix E. The solutions in E are then searched by parallel processors and evaluated for dominance relationship. The search system is implemented in two different ways master-worker architecture and pipeline architecture.展开更多
This paper analyzes multiple structural changes by GMDH (Group Meth- ods of Data Handling), which have obvious advantages. Our method extends the model of Lumsdaine & Papell[1] (1997), and it could be applied to ...This paper analyzes multiple structural changes by GMDH (Group Meth- ods of Data Handling), which have obvious advantages. Our method extends the model of Lumsdaine & Papell[1] (1997), and it could be applied to the case of more than two structural changes. Because of simultaneously considering every structural change of the hypothesis, it is likely to be of particular relevance in practice. And it can decrease large investigation costs by MATLAB programming. What is more, we can select the criterion value of F incremental statistic to control the significance of the breaks, based on kinds of investigation intentions. And the empirical evidences on Shenzhen Composite Index are presented to illustrate the usefulness of our method.展开更多
文摘The idea of positional inverted index is exploited for indexing of graph database. The main idea is the use of hashing tables in order to prune a considerable portion of graph database that cannot contain the answer set. These tables are implemented using column-based techniques and are used to store graphs of database, frequent sub-graphs and the neighborhood of nodes. In order to exact checking of remaining graphs, the vertex invariant is used for isomorphism test which can be parallel implemented. The results of evaluation indicate that proposed method outperforms existing methods.
基金Supported by the National High Technology Research and Development Program of China (2007AA04Z193) the National Natural Science Foundation of China (60974008 60704032)
文摘For those refineries which have to deal with different types of crude oil, blending is an attractive solution to obtain a quality feedstock. In this paper, a novel scheduling strategy is proposed for a practical crude oil blending process. The objective is to keep the property of feedstock, mainly described by the true boiling point (TBP) data, consistent and suitable. Firstly, the mathematical model is established. Then, a heuristically initialized hybrid iterative (HIHI) algorithm based on a two-level optimization structure, in which tabu search (TS) and differential evolution (DE) are used for upper-level and lower-level optimization, respectively, is proposed to get the model solution. Finally, the effectiveness and efficiency of the scheduling strategy is validated via real data from a certain refinery.
文摘Image registration is the overlaying of two images of the same scene taken at different times or by different sensors. It is one of the essential steps in information processing in remote sensing. To attain a highly accurate, reliable and low computation cost in image registration a suitable and similarity metric and reduction in search data and search space is required. In this paper, the author shows that if the right bin size is chosen, mutual information can be more robust than correlation in the registration of multi-temporal images. The author also compares the sensitivity of mutual information and correlation to Gaussian and multiplicative speckle noise. The author investigates automatic subimage selection as a reduction in search data strategy. The author proposes a measure, called alienability, which shows the ability ofa subimage to provide reliable registration. Alternate subimage selection methods such as using gradient, entropy and variance are also investigated. The author furthermore looks into a search space strategy using a gradient approach to maximize mutual information and show our first results.
基金Projects(61272142,61103082,61003075,61170261,61103193)supported by the National Natural Science Foundation of ChinaProject supported by the Program for New Century Excellent Talents in University of ChinaProjects(2012AA01A301,2012AA010901)supported by the National High Technology Research and Development Program of China
文摘Breadth-first search(BFS) is an important kernel for graph traversal and has been used by many graph processing applications. Extensive studies have been devoted in boosting the performance of BFS. As the most effective solution, GPU-acceleration achieves the state-of-the-art result of 3.3×109 traversed edges per second on a NVIDIA Tesla C2050 GPU. A novel vertex frontier based GPU BFS algorithm is proposed, and its main features are three-fold. Firstly, to obtain a better workload balance for irregular graphs, a virtual-queue task decomposition and mapping strategy is introduced for vertex frontier expanding. Secondly, a global deduplicate detection scheme is proposed to remove reduplicative vertices from vertex frontier effectively. Finally, a GPU-based bottom-up BFS approach is employed to process large frontier. The experimental results demonstrate that the algorithm can achieve 10% improvement over the state-of-the-art method on diverse graphs. Especially, it exhibits 2-3 times speedup on low-diameter and scale-free graphs over the state-of-the-art on a NVIDIA Tesla K20 c GPU, reaching a peak traversal rate of 11.2×109 edges/s.
基金supported by the Lab Open Fund of Beijing Microchemical Research Institute(P2008026EB)
文摘Aiming at the shortcoming that certain existing blockingmatching algorithrns, such as full search, three-step search, and dia- mond search algorithms, usually can not keep a good balance between high acoaracy and low computational complexity, a block-maching motion estimation algorithm based on two-step search is proposed in this paper. According to the fact that the gray values of adjacent pixels will not vary fast, the algorithm employs an interlaced search pattem in the search window to estimate the motion vector of the objectblock. Simulation and actual experiments demanstrate that the proposed algmithm greatly outperforms the well-known three-step search and dianond search algoritlam, no matter the motion vector is large or small. Comparedc with the full search algorithm, the proposed one achieves similar peffomance but requires much less computation, therefore, the algorithm is well qualified for real-time video image processing.
基金Supported by the National Natural Science Foundation of China(No.61173024)
文摘To improve efficiency of search engines, the query result cache has drawn much attention re- cently. According to the query processing and user's query logs locality, a new hybrid result cache strategy which associates with caching heat and worth is proposed to compute cache score in accord- ance with cost-aware strategies. Exactly, query repeated distance and query length factor are utilized to improve the static result policy, and the dynamic policy is adjusted by the caching worth. The hy- brid result cache is implemented in term of the document content and document ids (docIds) se- quence. Based on a score format and the new hybrid structure, an initial algorithm and a new rou- ting algorithm are designed for result cache. Experiments' results show that the improved caching policies decrease the average response time effectively, and increase the system throughput signifi- cantly. By choosing comfortable combination of page cache and docIds cache, the new hybrid cac- hing strategy almost reduces more than 20% of the only cache and docId-only cache. average query time compared with the basic page-
文摘This paper introduces a parallel search system for dynamic multi-objective traveling salesman problem. We design a multi-objective TSP in a stochastic dynamic environment. This dynamic setting of the problem is very useful for routing in ad-hoc networks. The proposed search system first uses parallel processors to identify the extreme solutions of the search space for each ofk objectives individually at the same time. These solutions are merged into the so-called hit-frequency matrix E. The solutions in E are then searched by parallel processors and evaluated for dominance relationship. The search system is implemented in two different ways master-worker architecture and pipeline architecture.
文摘This paper analyzes multiple structural changes by GMDH (Group Meth- ods of Data Handling), which have obvious advantages. Our method extends the model of Lumsdaine & Papell[1] (1997), and it could be applied to the case of more than two structural changes. Because of simultaneously considering every structural change of the hypothesis, it is likely to be of particular relevance in practice. And it can decrease large investigation costs by MATLAB programming. What is more, we can select the criterion value of F incremental statistic to control the significance of the breaks, based on kinds of investigation intentions. And the empirical evidences on Shenzhen Composite Index are presented to illustrate the usefulness of our method.