Abstract: Modern applications require large databases to be searched for regions that are similar to a given pattern. DNA sequence analysis, speech and text recognition, artificial intelligence, the Internet of Things, and many other applications depend heavily on pattern matching or similarity searches. In this paper, we discuss some of the string matching solutions developed in the past. Then, we present a novel mathematical model to search for a given pattern and its near approximations in the text.
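To make the idea of searching a text for a pattern and its near approximations concrete, here is a minimal sketch of one classic technique, dynamic-programming approximate matching under edit distance. It is not the mathematical model proposed in the paper, which the abstract does not describe; the function name `approximate_matches` and the parameter `max_edits` are illustrative assumptions.

```python
def approximate_matches(pattern: str, text: str, max_edits: int) -> list:
    """Return (end_index, edits) for each text position where some substring
    ending there is within max_edits edits (insert/delete/substitute) of pattern.

    Illustrative classic technique, not the model from the abstract above.
    """
    m = len(pattern)
    # prev[i] = fewest edits turning pattern[:i] into a substring of the text
    # that ends just before the current character.
    prev = list(range(m + 1))
    hits = []
    for j, ch in enumerate(text):
        # curr[0] stays 0 because a match is allowed to start at any position.
        curr = [0] * (m + 1)
        for i in range(1, m + 1):
            cost = 0 if pattern[i - 1] == ch else 1
            curr[i] = min(
                prev[i - 1] + cost,  # match or substitute
                prev[i] + 1,         # extra character in the text
                curr[i - 1] + 1,     # pattern character missing from the text
            )
        if curr[m] <= max_edits:
            hits.append((j, curr[m]))
        prev = curr
    return hits
```

With `max_edits=0` the function reports only exact occurrences; raising the bound admits progressively looser matches, at a cost proportional to the product of the pattern and text lengths.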
Abstract: Pattern matching is one of the classic categories of online portfolio selection strategies. This article studies the two key aspects of the method, the measurement of similarity and the selection of similarity sets, and proposes a Portfolio Selection Method based on Pattern Matching with Dual Information of Direction and Distance (PMDI). By studying different combinations of indicators such as Euclidean distance, Chebyshev distance, and the correlation coefficient, the method extracts direction and distance information from historical stock prices and uses it to filter out the similarity set required by pattern-matching-based portfolio selection algorithms. Extensive experiments on two datasets from real stock markets show that PMDI outperforms other algorithms in balancing income and risk, which makes it suitable for real-world financial environments.
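The three indicators named in the abstract can be sketched as follows for two equal-length price windows. This only illustrates the standard definitions; how PMDI combines them into a similarity set is not stated in the abstract, so no combination rule is shown. The function names are illustrative assumptions.

```python
import math

def euclidean(a, b):
    """Euclidean distance: overall magnitude of the difference between windows."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def chebyshev(a, b):
    """Chebyshev distance: the largest single-coordinate difference."""
    return max(abs(x - y) for x, y in zip(a, b))

def pearson(a, b):
    """Pearson correlation coefficient: agreement in direction of movement.

    Raises ZeroDivisionError if either window is constant (zero variance).
    """
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)
```

The two distances capture how far apart two windows are, while the correlation coefficient captures whether they move in the same direction, which is presumably why the abstract speaks of dual information of direction and distance.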
Funding: The National Natural Science Foundation of China (No. 60273075)
Abstract: This paper studies pattern-based subspace clustering, a special type of subspace clustering that uses pattern similarity as its similarity measure. Unlike most traditional clustering algorithms, which group objects whose values are close in all dimensions or in a set of dimensions, clustering by pattern similarity groups objects that exhibit a coherent pattern of rise and fall in subspaces. A novel approach named EMaPle is designed to mine the maximal pattern-based subspace clusters. EMaPle searches for clusters only in the attribute enumeration spaces, which are relatively few compared to the large number of row combinations in typical datasets, and it exploits novel pruning techniques. EMaPle can find clusters satisfying coherence constraints, size constraints, and sign constraints neglected in MaPle. Both synthetic and real data sets are used to evaluate EMaPle, and the results demonstrate that it is more effective and scalable than MaPle.
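As a rough illustration of what a coherent pattern of rise and fall means, the sketch below uses the pScore test commonly associated with pattern-based clustering: two objects are coherent on two attributes when the difference of their differences is at most a threshold. That EMaPle uses exactly this test is an assumption, since the abstract does not give its definitions; `p_score`, `is_pattern_cluster`, and `delta` are illustrative names, and the sketch performs only a brute-force check, not the mining or pruning the abstract describes.

```python
from itertools import combinations

def p_score(x, y, a, b):
    """pScore of objects x, y on attributes a, b: |(x_a - x_b) - (y_a - y_b)|."""
    return abs((x[a] - x[b]) - (y[a] - y[b]))

def is_pattern_cluster(rows, attrs, delta):
    """Check that every pair of rows is coherent on every pair of attributes.

    Brute-force verification of a candidate cluster (assumed criterion).
    """
    return all(
        p_score(x, y, a, b) <= delta
        for x, y in combinations(rows, 2)
        for a, b in combinations(attrs, 2)
    )
```

For example, the rows `[1, 5, 3]` and `[11, 15, 13]` are far apart in value but rise and fall identically, so they pass the check even with `delta = 0`.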
Funding: The Fujian Provincial Natural Science Foundation of China (Z051049, 2006J0391)
Abstract: Based on the rough similarity degree of rough sets and the close degree of fuzzy sets, definitions of the rough similarity degree and rough close degree of rough fuzzy sets are given, which can be used to measure the degree of similarity between two rough fuzzy sets. Their properties and theorems are listed. Using the two new measures, a method of clustering in the rough fuzzy system is obtained. After clustering, a new fuzzy sample can be recognized by the principle of maximal similarity degree.
Funding: The Fujian Provincial Natural Science Foundation of China (Z051049, 2006J0391)
Abstract: The definition of rough similarity degree is given based on the axiomatic similarity degree, and its properties are listed. Using these properties, a method of clustering in rough systems is obtained. After clustering, a new sample can be recognized by the principle of maximal rough similarity degree.
Funding: Supported by the International Science & Technology Cooperation Program of China (2010DFA92720) and the National Natural Science Foundation of China (31260511, 30360014)
Abstract: The larger zoogeographic divisions of the arid areas of Central Asia remain controversial; the region is characterized by complex ecological environments, complex faunal origins, and unique patterns of animal distribution. The aim of this study was to determine, using quantitative analysis, the distribution patterns of amphibians and reptiles in the arid areas of Central Asia, whose various physiographical regions were divided into 17 Operative Geographical Units (OGUs). Based on the presence or absence of 52 amphibian and reptile genera in the 17 OGUs, and using the Czekanowski Similarity Index, the Baroni-Urbani and Buser Similarity Index, and the strong and weak boundary test, we studied the biotic boundaries within these contested regions. In our results, the classification dendrogram divides into two main branches. One branch is composed of the northern OGUs of the Altai Mountains, which are part of the Euro-Siberian Subrealm. The other branch includes all of the OGUs south of the Altai Mountains, which belong to the Central Asia Subrealm. There is a significantly weak biotic boundary (DW > 0, GW > GS, P < 0.01) between the Euro-Siberian Subrealm and the Central Asia Subrealm that corresponds to the transitional zones. The boundary between the two subrealms runs along the Altai Mountains, the Sayan Mountains, the Hangai Mountains and the Mongolian Dagurr Mountains. The boundaries between the main branches in the Central Asia Subrealm are weak, reflecting the widespread existence of transitional zones in the arid areas of Central Asia. The Tianshan Mountains should be elevated to form a separate region, "the Middle Asian Mountain Region", which, due to its special fauna and environment, would be classified at the same level as the Mongolia-Xinjiang Region. By building a cluster analysis dendrogram based on the genera of amphibians and reptiles, the relationship of these higher-level zoogeographical divisions was successfully resolved and the error of long-branch attraction was avoided. With the clustering dendrogram as the foundation, the independence test was applied to strong and weak boundaries; this resolved the problem of where to attribute the transition areas and also revealed the barrier effect that physical geographic boundaries have on amphibians and reptiles. The approach of combining genera clustering analysis with a statistical boundary test should be applicable not only to the distribution patterns of other animal groups, but also to delineating large-scale zoogeographical divisions.
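The two similarity indices named in the abstract can be sketched for a pair of presence/absence vectors, one entry per genus. The formulas are the standard binary forms of the Czekanowski index and the Baroni-Urbani and Buser index; the helper name `contingency` and the toy vectors in the usage note are illustrative assumptions, not data from the study.

```python
import math

def contingency(u, v):
    """Count genera present in both units (a), only in u (b), only in v (c), in neither (d)."""
    a = sum(1 for x, y in zip(u, v) if x and y)
    b = sum(1 for x, y in zip(u, v) if x and not y)
    c = sum(1 for x, y in zip(u, v) if not x and y)
    d = sum(1 for x, y in zip(u, v) if not x and not y)
    return a, b, c, d

def czekanowski(u, v):
    """Czekanowski index for binary data: 2a / (2a + b + c). Ignores joint absences."""
    a, b, c, _ = contingency(u, v)
    return 2 * a / (2 * a + b + c)  # undefined when both units are empty

def baroni_urbani_buser(u, v):
    """Baroni-Urbani and Buser index: (sqrt(a*d) + a) / (sqrt(a*d) + a + b + c)."""
    a, b, c, d = contingency(u, v)
    root = math.sqrt(a * d)
    return (root + a) / (root + a + b + c)  # undefined when a = b = c = 0
```

For two units with vectors `[1, 1, 0, 0, 0, 0, 0]` and `[1, 0, 1, 0, 0, 0, 0]`, the counts are a = 1, b = 1, c = 1, d = 4, giving a Czekanowski value of 0.5 and a Baroni-Urbani and Buser value of 0.6. The second index is higher because it also credits genera that are absent from both units.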
Abstract: Cement paste with a low water/cement ratio of 0.3 was observed using AFM. C-S-H was found to be self-similar across scanning scales from 20 μm × 20 μm down to 300 nm × 300 nm: its features at large scales closely resemble those at smaller scales. It can be concluded that C-S-H is composed of fundamental spherical globules, which agglomerate into bigger globules that in turn agglomerate into even bigger ones. A C-S-H globule fractal model is put forward to describe this self-similarity, and it can be used to reveal how C-S-H globules contact each other.
Funding: This paper is supported by the National High Technology Research and Development Program ("863" Program) of China under Grant No. 2006AA04Z437
Abstract: This paper presents a new method of damage condition assessment that accommodates types of uncertainty, such as ambiguity, vagueness, and fuzziness, that cannot be described statistically. In this method, healthy observations are used to construct a fuzzy set representing sound performance characteristics. Additionally, the bounds on the similarities among the structural damage states are prescribed by using the state similarity matrix. Thus, an optimal group of fuzzy sets representing damage states such as little, moderate, and severe damage can be inferred as an inverse problem from healthy observations only. The optimal group of damage fuzzy sets is used to classify a set of observations at any unknown state of damage using the principles of fuzzy pattern recognition based on an approximate principle. This method can be embedded into a Structural Health Monitoring (SHM) system to give advice about structural maintenance and life prediction. Finally, a case study for damage pattern recognition, which comes from Reference [9], is presented and discussed. The comparison shows that our method is more effective and general, so it is very practical in engineering.
Funding: Supported by the China National Science and Technology Major Project (2016ZX05014-003-004)
Abstract: Based on the similarity criterion, volcanic rock samples were taken from outcrops to make experimental models. Water flooding experiments were carried out in dissolution vug-cave reservoirs with and without fractures, covering the five-spot well pattern, the nine-spot well pattern, the five-spot to nine-spot well pattern, the relative positions of wells and fractures, and the injection rate. The experiments were used to analyze how the development indexes vary, to characterize water flooding under different well patterns, and to identify the optimal water flooding development mode. For dissolution vug-cave reservoirs without fractures, five-spot well pattern water flooding has a very small sweep area, serious water channeling, and low oil recovery. When the well pattern was adjusted from five-spot to nine-spot, oil recovery could be largely improved, but the corner well far from the injector was little affected. In dissolution vug-cave reservoirs with fractures, when the injector and producer are not connected by fractures, the fractures could effectively connect the poorly linked vugs and improve the development effect of water flooding. Whether or not there are fractures in dissolution vug-cave reservoirs, the development effect of the nine-spot well pattern is much better than that of the five-spot well pattern and the five-spot to nine-spot well pattern; this is more evident when there are fractures, and the edge well has better development indexes than the corner well. At the high water-cut stage of water flooding with the nine-spot well pattern, oil recovery can be further improved with a staggered line-drive pattern by converting the corner well into an injection well. Increasing the injection rate helps to raise the oil production of the corner well in the nine-spot well pattern and to improve ultimate oil recovery, but the water-free production period would be greatly shortened and water-free recovery would decrease.