The knowledge graph with relational abundant information has been widely used as the basic data support for the retrieval platforms.Image and text descriptions added to the knowledge graph enrich the node information,...The knowledge graph with relational abundant information has been widely used as the basic data support for the retrieval platforms.Image and text descriptions added to the knowledge graph enrich the node information,which accounts for the advantage of the multi-modal knowledge graph.In the field of cross-modal retrieval platforms,multi-modal knowledge graphs can help to improve retrieval accuracy and efficiency because of the abundant relational infor-mation provided by knowledge graphs.The representation learning method is sig-nificant to the application of multi-modal knowledge graphs.This paper proposes a distributed collaborative vector retrieval platform(DCRL-KG)using the multi-modal knowledge graph VisualSem as the foundation to achieve efficient and high-precision multimodal data retrieval.Firstly,use distributed technology to classify and store the data in the knowledge graph to improve retrieval efficiency.Secondly,this paper uses BabelNet to expand the knowledge graph through multi-ple filtering processes and increase the diversification of information.Finally,this paper builds a variety of retrieval models to achieve the fusion of retrieval results through linear combination methods to achieve high-precision language retrieval and image retrieval.The paper uses sentence retrieval and image retrieval experi-ments to prove that the platform can optimize the storage structure of the multi-modal knowledge graph and have good performance in multi-modal space.展开更多
Aiming at the in situ and mobile observation of urban environmental air pollution,a portable instrument using ultraviolet spectrum retrieval algorithm was developed based on the basis of Differential Optical Absorptio...Aiming at the in situ and mobile observation of urban environmental air pollution,a portable instrument using ultraviolet spectrum retrieval algorithm was developed based on the basis of Differential Optical Absorption Spectroscopy(DOAS)and multiple-pass cell technique.Typical trace gas pollutants,NH3,SO2,and NO2,were explored using their optical spectral characteristics in deep ultraviolet wavelength range from 210 to 215 nm.The gas concentration was retrieved by Lambert-Beer’s law and nonlinear least square method.With an optimized optical alignment,the detection limits of NH3,SO2,NO2 were estimated to be 2.2,2.3,and 36.2 ppb,respectively.The system was used in carrying out some cruise observations in Chengdu,China.During the entire period,the polluted gases showed varied distribution and typical daily average concentrations ofNH3,SO2,NO2 were 23.2,3.5,and 106.0 ppb,respectively.The contributions from different sources were analyzed combined with the HYSPLIT model.Results show that the portable DOAS system is a convenient and effective tool for regional distribution measurement and pollution source monitoring.展开更多
Operation control of power systems has become challenging with an increase in the scale and complexity of power distribution systems and extensive access to renewable energy.Therefore,improvement of the ability of dat...Operation control of power systems has become challenging with an increase in the scale and complexity of power distribution systems and extensive access to renewable energy.Therefore,improvement of the ability of data-driven operation management,intelligent analysis,and mining is urgently required.To investigate and explore similar regularities of the historical operating section of the power distribution system and assist the power grid in obtaining high-value historical operation,maintenance experience,and knowledge by rule and line,a neural information retrieval model with an attention mechanism is proposed based on graph data computing technology.Based on the processing flow of the operating data of the power distribution system,a technical framework of neural information retrieval is established.Combined with the natural graph characteristics of the power distribution system,a unified graph data structure and a data fusion method of data access,data complement,and multi-source data are constructed.Further,a graph node feature-embedding representation learning algorithm and a neural information retrieval algorithm model are constructed.The neural information retrieval algorithm model is trained and tested using the generated graph node feature representation vector set.The model is verified on the operating section of the power distribution system of a provincial grid area.The results show that the proposed method demonstrates high accuracy in the similarity matching of historical operation characteristics and effectively supports intelligent fault diagnosis and elimination in power distribution systems.展开更多
In this paper, we employ genetic algorithms to solve the migration problem (MP). We propose a new encoding scheme to represent trees, which is composed of two parts: the pre-ordered traversal sequence of tree vertices...In this paper, we employ genetic algorithms to solve the migration problem (MP). We propose a new encoding scheme to represent trees, which is composed of two parts: the pre-ordered traversal sequence of tree vertices and the children number sequence of corresponding tree vertices. The proposed encoding scheme has the advantages of simplicity for encoding and decoding, ease for GA operations, and better equilibrium between exploration and exploitation. It is also adaptive in that, with few restrictions on the length of code, it can be freely lengthened or shortened according to the characteristics of the problem space. Furthermore, the encoding scheme is highly applicable to the degree-constrained minimum spanning tree problem because it also contains the degree information of each node. The simulation results demonstrate the higher performance of our algorithm, with fast convergence to the optima or sub-optima on various problem sizes. Comparing with the binary string encoding of vertices, when the problem size is large, our algorithm runs remarkably faster with comparable search capability. Key words distributed information retrieval - mobile agents - migration problem - genetic algorithms CLC number TP 301. 6 Foundation item: Supported by the National Natural Science Foundation of China (90104005), the Natural Science Foundation of Hubei Province and the Hong Kong Polytechnic University under the grant G-YD63Biography: He Yan-xiang (1952-), male, Professor, research direction: distributed and parallel processing, multi-agent systems, data mining and e-business.展开更多
The traditional method first classifies the user information and combines the query method to retrieve the interest information, but neglects to calculate the weight of the user interest information, which leads to th...The traditional method first classifies the user information and combines the query method to retrieve the interest information, but neglects to calculate the weight of the user interest information, which leads to the low retrieval accuracy. A retrieval method based on the fuzzy proximity classification technology is proposed. Approximation between the fuzzy sets is used to represent the consistency between the user interest information features, and the consistency calculation formula and the skewness confidence matrix between the user interest information features are given. The fuzzy classification of the user interest information can obtain the most consistent confidence data and eliminate the redundant approximation interference data. The probabilistic model of the information word frequency and the user interest information length calculates the weight of the user interest information, and adjusts the weight formula constantly.展开更多
Big data analytics in business intelligence do not provide effective data retrieval methods and job scheduling that will cause execution inefficiency and low system throughput.This paper aims to enhance the capability...Big data analytics in business intelligence do not provide effective data retrieval methods and job scheduling that will cause execution inefficiency and low system throughput.This paper aims to enhance the capability of data retrieval and job scheduling to speed up the operation of big data analytics to overcome inefficiency and low throughput problems.First,integrating stacked sparse autoencoder and Elasticsearch indexing explored fast data searching and distributed indexing,which reduces the search scope of the database and dramatically speeds up data searching.Next,exploiting a deep neural network to predict the approximate execution time of a job gives prioritized job scheduling based on the shortest job first,which reduces the average waiting time of job execution.As a result,the proposed data retrieval approach outperforms the previous method using a deep autoencoder and Solr indexing,significantly improving the speed of data retrieval up to 53%and increasing system throughput by 53%.On the other hand,the proposed job scheduling algorithmdefeats both first-in-first-out andmemory-sensitive heterogeneous early finish time scheduling algorithms,effectively shortening the average waiting time up to 5%and average weighted turnaround time by 19%,respectively.展开更多
To reuse and share the valuable knowledge embedded in repositories of engineering models for accelerating the design process, improving product quality, and reducing costs, it is crucial to devise search engines capab...To reuse and share the valuable knowledge embedded in repositories of engineering models for accelerating the design process, improving product quality, and reducing costs, it is crucial to devise search engines capable of matching 3D models efficiently and effectively. In this paper, an enhanced shape distributions-based technique of using geometrical and topological information to search 3D engineering models represented by polygonal meshes was presented. A simplification method of polygonal meshes was used to simplify engineering model as the pretreatment for generation of sample points. The method of sampling points was improved and a pair of functions that was more sensitive to shape was employed to construct a 2D shape distribution. Experiments were conducted to evaluate the proposed algorithm utilizing the Engineering Shape Benchmark (ESB) database. The experiential results suggest that the search effectiveness is significantly improved by enforcing the simplification and enhanced shape distributions to engineering model retrieval.展开更多
In this paper, without recourse to the nonlinear dynamical equations of the waves, the nonlinear random waves are retrieved from the non-Gaussian characteristic of the sea surface elevation distribution. The question ...In this paper, without recourse to the nonlinear dynamical equations of the waves, the nonlinear random waves are retrieved from the non-Gaussian characteristic of the sea surface elevation distribution. The question of coincidence of the nonlinear wave profile, spectrum and its distributions of maximum (or minimum) values of the sea surface elevation with results derived from some existing nonlinear theories is expounded under the narrow-band spectrum condition. Taking the shoaling sea wave as an example, the nonlinear random wave process and its spectrum in shallow water are retrieved from both the non-Gaussian characteristics of the sea surface elevation distribution in shallow water and the normal sea waves in deep water and compared with the values actually measured. Results show that they can coincide with the actually measured values quite well, thus, this can confirm that the method proposed in this paper is feasible.展开更多
The objective of this research was to acquire a raindrop size distribution(DSDs)retrieved from C-band polarimetric radar observations scheme for the first time in south China.An observation period of the precipitation...The objective of this research was to acquire a raindrop size distribution(DSDs)retrieved from C-band polarimetric radar observations scheme for the first time in south China.An observation period of the precipitation process was selected,and the shape-slope(μ-Λ)relationship of this region was statistically analyzed using the raindrop sample observations from the two-dimensional video disdrometer(2DVD)at Xinfeng Station,Guangdong Province.Simulated data of the C-band polarimetric radar reflectivity ZHHand differential reflectivity ZDRwere obtained through scattering simulation.The simulation data were combined with DSD fitting to determine the ZDR-Λand log10(ZHH/N0)-Λrelationships.Using Xinfeng C-band polarimetric radar observations ZDRand ZHH,the raindrop Gamma size distribution parametersμ,Λ,and N0were retrieved.A scheme for using C-band polarimetric radar to retrieve the DSDs was developed.This research revealed that during precipitation process,the DSDs obtained using the C-band polarimetric radar retrieval scheme are similar to the 2DVD observations,the precipitation characteristics of rainfall intensity(R),mass-weighted mean diameter(Dm)and intercept parameter(Nw)with time obtained by radar retrieval are basically consistent with the observational results of the 2DVD.This scheme establishes the relationship between the observations of the C-band polarimetric radar and the physical quantities of the numerical model.This method not only can test the prediction of the model data assimilation system on the convective scale and determine error sources,but also can improve the microphysical precipitation processes analysis and radar quantitative precipitation estimation.The present research will facilitate radar data assimilation in the future.展开更多
In this paper, we present a novel and efficient scheme for extracting, indexing and retrieving color images. Our motivation was to reduce the space overhead of partition-based approaches taking advantage of the fact t...In this paper, we present a novel and efficient scheme for extracting, indexing and retrieving color images. Our motivation was to reduce the space overhead of partition-based approaches taking advantage of the fact that only a relatively low number of distinct values of a particular visual feature is present in most images. To extract color feature and build indices into our image database we take into consideration factors such as human color perception and perceptual range, and the image is partitioned into a set of regions by using a simple classifying scheme. The compact color feature vector and the spatial color histogram, which are extracted from the seqmented image region, are used for representing the color and spatial information in the image. We have also developed the region-based distance measures to compare the similarity of two images. Extensive tests on a large image collection were conducted to demonstrate the effectiveness of the proposed approach.展开更多
The rotating fan-beam scatterometer (RFSCAT) is a new type of satellite scatterometer that is proposed approximately 10 a ago. However, similar to other rotating scatterometers, relatively larger wind retrieval erro...The rotating fan-beam scatterometer (RFSCAT) is a new type of satellite scatterometer that is proposed approximately 10 a ago. However, similar to other rotating scatterometers, relatively larger wind retrieval errors occur in the nadir and outer regions compared with the middle regions of the swath. For the RFSCAT with the given parameters, a wind direction retrieval accuracy decreases by approximately 9 in the outer regions compared with the middle region. To address this problem, an advanced wind vector retrieval algorithm for the RFSCAT is presented. The new algorithm features an adaptive extension of the range of wind direction for each wind vector cell position across the whole swath according to the distribution histogram of a retrieved wind direction bias. One hundred orbits of Level 2A data are simulated to validate and evaluate the new algorithm. Retrieval experiments demonstrate that the new advanced algorithm can effectively improve the wind direction retrieval accuracy in the nadir and outer regions of the RFSCAT swath. Approximately 1.6 and 9 improvements in the wind direction retrieval are achieved for the wind vector cells located at the nadir and the edge point of the swath, respectively.展开更多
The matching and retrieval of the 2D shapes are challenging issues in object recognition and computer vision. In this paper, we propose a new object contour descriptor termed ECPDH (Elliptic Contour Points Distributio...The matching and retrieval of the 2D shapes are challenging issues in object recognition and computer vision. In this paper, we propose a new object contour descriptor termed ECPDH (Elliptic Contour Points Distribution Histogram), which is based on the distribution of the points on an object contour under the polar coordinates. ECPDH has the essential merits of invariance to scale and translation. Dynamic Programming (DP) algorithm is used to measure the distance between the ECPDHs. The effectiveness of the proposed method is demonstrated using some standard tests on MPEG-7 shape database. The results show the precision and recall of our method over other recent methods in the literature.展开更多
高效的在线字符串模式匹配算法对云数据库检索至关重要,然而搜索内容的泄露会威胁用户隐私。现有的字符串模式匹配算法没有考虑用户搜索内容的保护,可搜索加密方案虽然可以保护用户的搜索内容,但存在索引构建代价大、检索效率低等问题...高效的在线字符串模式匹配算法对云数据库检索至关重要,然而搜索内容的泄露会威胁用户隐私。现有的字符串模式匹配算法没有考虑用户搜索内容的保护,可搜索加密方案虽然可以保护用户的搜索内容,但存在索引构建代价大、检索效率低等问题。因此,提出了两种保护用户搜索内容的模式匹配算法:基于分布式点函数的模式匹配(pattern matching based on distributed point function,PMDPF)算法和基于分布式点函数的跳跃式模式匹配(jumping pattern matching based on distributed point function,JPMDPF)算法。PMDPF算法利用指纹函数以及分布式点函数构造模式串真值表,并分发给两台独立的服务器,把搜索中字符对比操作转换为查表操作,从而保护搜索内容。为了提升搜索效率,提出了JPMDPF算法。通过字符跳转,JPMDPF算法以泄露更多信息为代价,其搜索效率比PMDPF算法平均提高了约m倍,其中m为搜索内容长度,同时显著降低了因指纹函数碰撞而导致的误判的概率。实验结果表明,PMDPF算法的搜索效率比基于指纹函数的经典算法提高约5%,并优于现有的可搜索加密方案,PMDPF算法的搜索耗时在搜索内容长度为4时是JPMDPF算法的4.2倍。展开更多
基金This work is supported by the Fundamental Research Funds for the Central Universities(Grant No.HIT.NSRIF.201714)Weihai Science and Technology Development Program(2016DX GJMS15)+1 种基金Weihai Scientific Research and Innovation Fund(2020)Key Research and Development Program in Shandong Provincial(2017GGX90103).
文摘The knowledge graph with relational abundant information has been widely used as the basic data support for the retrieval platforms.Image and text descriptions added to the knowledge graph enrich the node information,which accounts for the advantage of the multi-modal knowledge graph.In the field of cross-modal retrieval platforms,multi-modal knowledge graphs can help to improve retrieval accuracy and efficiency because of the abundant relational infor-mation provided by knowledge graphs.The representation learning method is sig-nificant to the application of multi-modal knowledge graphs.This paper proposes a distributed collaborative vector retrieval platform(DCRL-KG)using the multi-modal knowledge graph VisualSem as the foundation to achieve efficient and high-precision multimodal data retrieval.Firstly,use distributed technology to classify and store the data in the knowledge graph to improve retrieval efficiency.Secondly,this paper uses BabelNet to expand the knowledge graph through multi-ple filtering processes and increase the diversification of information.Finally,this paper builds a variety of retrieval models to achieve the fusion of retrieval results through linear combination methods to achieve high-precision language retrieval and image retrieval.The paper uses sentence retrieval and image retrieval experi-ments to prove that the platform can optimize the storage structure of the multi-modal knowledge graph and have good performance in multi-modal space.
基金supported by the National Natural Science Foundation of China(Nos.61805257,41905130)in part by the China Postdoctoral Science Foundation(Nos.2020M671383,2020M681517)in part by the Science and Technology Development Plan Foundation of Suzhou(No.SS202148).
文摘Aiming at the in situ and mobile observation of urban environmental air pollution,a portable instrument using ultraviolet spectrum retrieval algorithm was developed based on the basis of Differential Optical Absorption Spectroscopy(DOAS)and multiple-pass cell technique.Typical trace gas pollutants,NH3,SO2,and NO2,were explored using their optical spectral characteristics in deep ultraviolet wavelength range from 210 to 215 nm.The gas concentration was retrieved by Lambert-Beer’s law and nonlinear least square method.With an optimized optical alignment,the detection limits of NH3,SO2,NO2 were estimated to be 2.2,2.3,and 36.2 ppb,respectively.The system was used in carrying out some cruise observations in Chengdu,China.During the entire period,the polluted gases showed varied distribution and typical daily average concentrations ofNH3,SO2,NO2 were 23.2,3.5,and 106.0 ppb,respectively.The contributions from different sources were analyzed combined with the HYSPLIT model.Results show that the portable DOAS system is a convenient and effective tool for regional distribution measurement and pollution source monitoring.
基金supported by the National Key R&D Program of China(2020YFB0905900).
文摘Operation control of power systems has become challenging with an increase in the scale and complexity of power distribution systems and extensive access to renewable energy.Therefore,improvement of the ability of data-driven operation management,intelligent analysis,and mining is urgently required.To investigate and explore similar regularities of the historical operating section of the power distribution system and assist the power grid in obtaining high-value historical operation,maintenance experience,and knowledge by rule and line,a neural information retrieval model with an attention mechanism is proposed based on graph data computing technology.Based on the processing flow of the operating data of the power distribution system,a technical framework of neural information retrieval is established.Combined with the natural graph characteristics of the power distribution system,a unified graph data structure and a data fusion method of data access,data complement,and multi-source data are constructed.Further,a graph node feature-embedding representation learning algorithm and a neural information retrieval algorithm model are constructed.The neural information retrieval algorithm model is trained and tested using the generated graph node feature representation vector set.The model is verified on the operating section of the power distribution system of a provincial grid area.The results show that the proposed method demonstrates high accuracy in the similarity matching of historical operation characteristics and effectively supports intelligent fault diagnosis and elimination in power distribution systems.
文摘In this paper, we employ genetic algorithms to solve the migration problem (MP). We propose a new encoding scheme to represent trees, which is composed of two parts: the pre-ordered traversal sequence of tree vertices and the children number sequence of corresponding tree vertices. The proposed encoding scheme has the advantages of simplicity for encoding and decoding, ease for GA operations, and better equilibrium between exploration and exploitation. It is also adaptive in that, with few restrictions on the length of code, it can be freely lengthened or shortened according to the characteristics of the problem space. Furthermore, the encoding scheme is highly applicable to the degree-constrained minimum spanning tree problem because it also contains the degree information of each node. The simulation results demonstrate the higher performance of our algorithm, with fast convergence to the optima or sub-optima on various problem sizes. Comparing with the binary string encoding of vertices, when the problem size is large, our algorithm runs remarkably faster with comparable search capability. Key words distributed information retrieval - mobile agents - migration problem - genetic algorithms CLC number TP 301. 6 Foundation item: Supported by the National Natural Science Foundation of China (90104005), the Natural Science Foundation of Hubei Province and the Hong Kong Polytechnic University under the grant G-YD63Biography: He Yan-xiang (1952-), male, Professor, research direction: distributed and parallel processing, multi-agent systems, data mining and e-business.
文摘The traditional method first classifies the user information and combines the query method to retrieve the interest information, but neglects to calculate the weight of the user interest information, which leads to the low retrieval accuracy. A retrieval method based on the fuzzy proximity classification technology is proposed. Approximation between the fuzzy sets is used to represent the consistency between the user interest information features, and the consistency calculation formula and the skewness confidence matrix between the user interest information features are given. The fuzzy classification of the user interest information can obtain the most consistent confidence data and eliminate the redundant approximation interference data. The probabilistic model of the information word frequency and the user interest information length calculates the weight of the user interest information, and adjusts the weight formula constantly.
基金supported and granted by the Ministry of Science and Technology,Taiwan(MOST110-2622-E-390-001 and MOST109-2622-E-390-002-CC3).
文摘Big data analytics in business intelligence do not provide effective data retrieval methods and job scheduling that will cause execution inefficiency and low system throughput.This paper aims to enhance the capability of data retrieval and job scheduling to speed up the operation of big data analytics to overcome inefficiency and low throughput problems.First,integrating stacked sparse autoencoder and Elasticsearch indexing explored fast data searching and distributed indexing,which reduces the search scope of the database and dramatically speeds up data searching.Next,exploiting a deep neural network to predict the approximate execution time of a job gives prioritized job scheduling based on the shortest job first,which reduces the average waiting time of job execution.As a result,the proposed data retrieval approach outperforms the previous method using a deep autoencoder and Solr indexing,significantly improving the speed of data retrieval up to 53%and increasing system throughput by 53%.On the other hand,the proposed job scheduling algorithmdefeats both first-in-first-out andmemory-sensitive heterogeneous early finish time scheduling algorithms,effectively shortening the average waiting time up to 5%and average weighted turnaround time by 19%,respectively.
基金The Basic Research of COSTIND,China (No.D0420060521)
文摘To reuse and share the valuable knowledge embedded in repositories of engineering models for accelerating the design process, improving product quality, and reducing costs, it is crucial to devise search engines capable of matching 3D models efficiently and effectively. In this paper, an enhanced shape distributions-based technique of using geometrical and topological information to search 3D engineering models represented by polygonal meshes was presented. A simplification method of polygonal meshes was used to simplify engineering model as the pretreatment for generation of sample points. The method of sampling points was improved and a pair of functions that was more sensitive to shape was employed to construct a 2D shape distribution. Experiments were conducted to evaluate the proposed algorithm utilizing the Engineering Shape Benchmark (ESB) database. The experiential results suggest that the search effectiveness is significantly improved by enforcing the simplification and enhanced shape distributions to engineering model retrieval.
基金This work is funded by National Natural Science Foundation of China
文摘In this paper, without recourse to the nonlinear dynamical equations of the waves, the nonlinear random waves are retrieved from the non-Gaussian characteristic of the sea surface elevation distribution. The question of coincidence of the nonlinear wave profile, spectrum and its distributions of maximum (or minimum) values of the sea surface elevation with results derived from some existing nonlinear theories is expounded under the narrow-band spectrum condition. Taking the shoaling sea wave as an example, the nonlinear random wave process and its spectrum in shallow water are retrieved from both the non-Gaussian characteristics of the sea surface elevation distribution in shallow water and the normal sea waves in deep water and compared with the values actually measured. Results show that they can coincide with the actually measured values quite well, thus, this can confirm that the method proposed in this paper is feasible.
基金National Key R&D Program of China(2018YFC1507401)Science and Technology Planning Project of Guangdong Province(2017B020244002)+1 种基金National Natural Science Foundation of China(41975138,41705020)Natural Science Foundation of Guangdong Province(2019A1515010814)。
文摘The objective of this research was to acquire a raindrop size distribution(DSDs)retrieved from C-band polarimetric radar observations scheme for the first time in south China.An observation period of the precipitation process was selected,and the shape-slope(μ-Λ)relationship of this region was statistically analyzed using the raindrop sample observations from the two-dimensional video disdrometer(2DVD)at Xinfeng Station,Guangdong Province.Simulated data of the C-band polarimetric radar reflectivity ZHHand differential reflectivity ZDRwere obtained through scattering simulation.The simulation data were combined with DSD fitting to determine the ZDR-Λand log10(ZHH/N0)-Λrelationships.Using Xinfeng C-band polarimetric radar observations ZDRand ZHH,the raindrop Gamma size distribution parametersμ,Λ,and N0were retrieved.A scheme for using C-band polarimetric radar to retrieve the DSDs was developed.This research revealed that during precipitation process,the DSDs obtained using the C-band polarimetric radar retrieval scheme are similar to the 2DVD observations,the precipitation characteristics of rainfall intensity(R),mass-weighted mean diameter(Dm)and intercept parameter(Nw)with time obtained by radar retrieval are basically consistent with the observational results of the 2DVD.This scheme establishes the relationship between the observations of the C-band polarimetric radar and the physical quantities of the numerical model.This method not only can test the prediction of the model data assimilation system on the convective scale and determine error sources,but also can improve the microphysical precipitation processes analysis and radar quantitative precipitation estimation.The present research will facilitate radar data assimilation in the future.
文摘In this paper, we present a novel and efficient scheme for extracting, indexing and retrieving color images. Our motivation was to reduce the space overhead of partition-based approaches taking advantage of the fact that only a relatively low number of distinct values of a particular visual feature is present in most images. To extract color feature and build indices into our image database we take into consideration factors such as human color perception and perceptual range, and the image is partitioned into a set of regions by using a simple classifying scheme. The compact color feature vector and the spatial color histogram, which are extracted from the seqmented image region, are used for representing the color and spatial information in the image. We have also developed the region-based distance measures to compare the similarity of two images. Extensive tests on a large image collection were conducted to demonstrate the effectiveness of the proposed approach.
基金The National Natural Science Foundation of China under contract Nos 41476152 and 41506206the National High Technology Research and Development Program(863 Program) of China under contract No.2013AA09A505the Major Project on the Integration of Industry,Education,and Research of Guangzhou City of China under contract No.201508020109
文摘The rotating fan-beam scatterometer (RFSCAT) is a new type of satellite scatterometer that is proposed approximately 10 a ago. However, similar to other rotating scatterometers, relatively larger wind retrieval errors occur in the nadir and outer regions compared with the middle regions of the swath. For the RFSCAT with the given parameters, a wind direction retrieval accuracy decreases by approximately 9 in the outer regions compared with the middle region. To address this problem, an advanced wind vector retrieval algorithm for the RFSCAT is presented. The new algorithm features an adaptive extension of the range of wind direction for each wind vector cell position across the whole swath according to the distribution histogram of a retrieved wind direction bias. One hundred orbits of Level 2A data are simulated to validate and evaluate the new algorithm. Retrieval experiments demonstrate that the new advanced algorithm can effectively improve the wind direction retrieval accuracy in the nadir and outer regions of the RFSCAT swath. Approximately 1.6 and 9 improvements in the wind direction retrieval are achieved for the wind vector cells located at the nadir and the edge point of the swath, respectively.
文摘The matching and retrieval of the 2D shapes are challenging issues in object recognition and computer vision. In this paper, we propose a new object contour descriptor termed ECPDH (Elliptic Contour Points Distribution Histogram), which is based on the distribution of the points on an object contour under the polar coordinates. ECPDH has the essential merits of invariance to scale and translation. Dynamic Programming (DP) algorithm is used to measure the distance between the ECPDHs. The effectiveness of the proposed method is demonstrated using some standard tests on MPEG-7 shape database. The results show the precision and recall of our method over other recent methods in the literature.
文摘高效的在线字符串模式匹配算法对云数据库检索至关重要,然而搜索内容的泄露会威胁用户隐私。现有的字符串模式匹配算法没有考虑用户搜索内容的保护,可搜索加密方案虽然可以保护用户的搜索内容,但存在索引构建代价大、检索效率低等问题。因此,提出了两种保护用户搜索内容的模式匹配算法:基于分布式点函数的模式匹配(pattern matching based on distributed point function,PMDPF)算法和基于分布式点函数的跳跃式模式匹配(jumping pattern matching based on distributed point function,JPMDPF)算法。PMDPF算法利用指纹函数以及分布式点函数构造模式串真值表,并分发给两台独立的服务器,把搜索中字符对比操作转换为查表操作,从而保护搜索内容。为了提升搜索效率,提出了JPMDPF算法。通过字符跳转,JPMDPF算法以泄露更多信息为代价,其搜索效率比PMDPF算法平均提高了约m倍,其中m为搜索内容长度,同时显著降低了因指纹函数碰撞而导致的误判的概率。实验结果表明,PMDPF算法的搜索效率比基于指纹函数的经典算法提高约5%,并优于现有的可搜索加密方案,PMDPF算法的搜索耗时在搜索内容长度为4时是JPMDPF算法的4.2倍。