This study introduces the Orbit Weighting Scheme(OWS),a novel approach aimed at enhancing the precision and efficiency of Vector Space information retrieval(IR)models,which have traditionally relied on weighting schem...This study introduces the Orbit Weighting Scheme(OWS),a novel approach aimed at enhancing the precision and efficiency of Vector Space information retrieval(IR)models,which have traditionally relied on weighting schemes like tf-idf and BM25.These conventional methods often struggle with accurately capturing document relevance,leading to inefficiencies in both retrieval performance and index size management.OWS proposes a dynamic weighting mechanism that evaluates the significance of terms based on their orbital position within the vector space,emphasizing term relationships and distribution patterns overlooked by existing models.Our research focuses on evaluating OWS’s impact on model accuracy using Information Retrieval metrics like Recall,Precision,InterpolatedAverage Precision(IAP),andMeanAverage Precision(MAP).Additionally,we assessOWS’s effectiveness in reducing the inverted index size,crucial for model efficiency.We compare OWS-based retrieval models against others using different schemes,including tf-idf variations and BM25Delta.Results reveal OWS’s superiority,achieving a 54%Recall and 81%MAP,and a notable 38%reduction in the inverted index size.This highlights OWS’s potential in optimizing retrieval processes and underscores the need for further research in this underrepresented area to fully leverage OWS’s capabilities in information retrieval methodologies.展开更多
The retrieval of the biomass parameters from active/passive microwave remote sensing data (10.2 GHz) is performed based on an iterative inversion of BP neural network model with fuzzy optimization. The BP neural net...The retrieval of the biomass parameters from active/passive microwave remote sensing data (10.2 GHz) is performed based on an iterative inversion of BP neural network model with fuzzy optimization. The BP neural network is trained by a set of the measurements of active and passive remote sensing and the ground truth data versus Day of Year during growth. Once the network training is complete, the model can be used to retrieve the temporal variations of the biomass parameters from another set of observation data. The model was used in weights and microware observation data of wheat growth in 1989 to retrieve biomass parameters change of wheat growth this year. The retrieved biomass parameters correspond well with the real data of the growth, which shows that the BP model is scientific and sound.展开更多
Content-based 3D model retrieval is of great help to facilitate the reuse of existing designs and to inspire designers during conceptual design. However, there is still a gap to apply it in industry due to the low tim...Content-based 3D model retrieval is of great help to facilitate the reuse of existing designs and to inspire designers during conceptual design. However, there is still a gap to apply it in industry due to the low time efficiency. This paper presents two new methods with high efficiency to build a Content-based 3D model retrieval system. First, an improvement is made on the "Shape Distribution (D2)" algorithm, and a new algorithm named "Quick D2" is proposed. Four sample 3D mechanical models are used in an experiment to compare the time cost of the two algorithms. The result indicates that the time cost of Quick D2 is much lower than that of D2, while the descriptors extracted by the two algorithms are almost the same. Second, an expandable 3D model repository index method with high performance, namely, RBK index, is presented. On the basis of RBK index, the search space is pruned effectively during the search process, leading to a speed up of the whole system. The factors that influence the values of the key parameters of RBK index are discussed and an experimental method to find the optimal values of the key parameters is given. Finally, "3D Searcher", a content-based 3D model retrieval system is developed. By using the methods proposed, the time cost for the system to respond one query online is reduced by 75% on average. The system has been implemented in a manufacturing enterprise, and practical query examples during a case of the automobile rear axle design are also shown. The research method presented shows a new research perspective and can effectively improve the content-based 3D model retrieval efficiency.展开更多
An algorithm for retrieving polarimetric variables from numerical model fields is developed. By using this technique, radar reflectivity at horizontal polarization~ differential reflectivity, specific differential pha...An algorithm for retrieving polarimetric variables from numerical model fields is developed. By using this technique, radar reflectivity at horizontal polarization~ differential reflectivity, specific differential phase shift and correlation coefficients between the horizontal and vertical polarization signals at zero lag can be derived from rain, snow and hail contents of numerical model outputs. Effects of environmental temperature and the melting process on polarimetric variables are considered in the algorithm. The algorithm is applied to the Advanced Regional Prediction System (ARPS) model simulation results for a hail storm. The spatial distributions of the derived parameters are reasonable when compared with observational knowledge. This work provides a forward model for assimilation of dual linear polarization radar data into a mesoscale model.展开更多
A hybrid model that is based on the Combination of keywords and concept was put forward. The hybrid model is built on vector space model and probabilistic reasoning network. It not only can exert the advantages of key...A hybrid model that is based on the Combination of keywords and concept was put forward. The hybrid model is built on vector space model and probabilistic reasoning network. It not only can exert the advantages of keywords retrieval and concept retrieval but also can compensate for their shortcomings. Their parameters can be adjusted according to different usage in order to accept the best information retrieval result, and it has been proved by our experiments.展开更多
In this paper, we present a novel Support Vector Machine active learning algorithm for effective 3D model retrieval using the concept of relevance feedback. The proposed method learns from the most informative objects...In this paper, we present a novel Support Vector Machine active learning algorithm for effective 3D model retrieval using the concept of relevance feedback. The proposed method learns from the most informative objects which are marked by the user, and then creates a boundary separating the relevant models from irrelevant ones. What it needs is only a small number of 3D models labelled by the user. It can grasp the user's semantic knowledge rapidly and accurately. Experimental results showed that the proposed algorithm significantly improves the retrieval effectiveness. Compared with four state-of-the-art query refinement schemes for 3D model retrieval, it provides superior retrieval performance after no more than two rounds of relevance feedback.展开更多
A large number of 3D models are created on computers and available for networks. Some content-based retrieval technologies are indispensable to find out particular data from such anonymous datasets. Though several sha...A large number of 3D models are created on computers and available for networks. Some content-based retrieval technologies are indispensable to find out particular data from such anonymous datasets. Though several shape retrieval technologies have been developed, little attention has been given to the points on human's sense and impression (as known as Kansei) in the conventional techniques, In this paper, the authors propose a novel method of shape retrieval based on shape impression of human's Kansei. The key to the method is the Gaussian curvature distribution from 3D models as features for shape retrieval. Then it classifies the 3D models by extracted feature and measures similarity among models in storage.展开更多
In order to improve the accuracy and efficiency of 3D model retrieval,the method based on affinity propagation clustering algorithm is proposed. Firstly,projection ray-based method is proposed to improve the feature e...In order to improve the accuracy and efficiency of 3D model retrieval,the method based on affinity propagation clustering algorithm is proposed. Firstly,projection ray-based method is proposed to improve the feature extraction efficiency of 3D models. Based on the relationship between model and its projection,the intersection in 3D space is transformed into intersection in 2D space,which reduces the number of intersection and improves the efficiency of the extraction algorithm. In feature extraction,multi-layer spheres method is analyzed. The two-layer spheres method makes the feature vector more accurate and improves retrieval precision. Secondly,Semi-supervised Affinity Propagation ( S-AP) clustering is utilized because it can be applied to different cluster structures. The S-AP algorithm is adopted to find the center models and then the center model collection is built. During retrieval process,the collection is utilized to classify the query model into corresponding model base and then the most similar model is retrieved in the model base. Finally,75 sample models from Princeton library are selected to do the experiment and then 36 models are used for retrieval test. The results validate that the proposed method outperforms the original method and the retrieval precision and recall ratios are improved effectively.展开更多
Currently,the video captioning models based on an encoder-decoder mainly rely on a single video input source.The contents of video captioning are limited since few studies employed external corpus information to guide...Currently,the video captioning models based on an encoder-decoder mainly rely on a single video input source.The contents of video captioning are limited since few studies employed external corpus information to guide the generation of video captioning,which is not conducive to the accurate descrip-tion and understanding of video content.To address this issue,a novel video captioning method guided by a sentence retrieval generation network(ED-SRG)is proposed in this paper.First,a ResNeXt network model,an efficient convolutional network for online video understanding(ECO)model,and a long short-term memory(LSTM)network model are integrated to construct an encoder-decoder,which is utilized to extract the 2D features,3D features,and object features of video data respectively.These features are decoded to generate textual sentences that conform to video content for sentence retrieval.Then,a sentence-transformer network model is employed to retrieve different sentences in an external corpus that are semantically similar to the above textual sentences.The candidate sentences are screened out through similarity measurement.Finally,a novel GPT-2 network model is constructed based on GPT-2 network structure.The model introduces a designed random selector to randomly select predicted words with a high probability in the corpus,which is used to guide and generate textual sentences that are more in line with human natural language expressions.The proposed method in this paper is compared with several existing works by experiments.The results show that the indicators BLEU-4,CIDEr,ROUGE_L,and METEOR are improved by 3.1%,1.3%,0.3%,and 1.5%on a public dataset MSVD and 1.3%,0.5%,0.2%,1.9%on a public dataset MSR-VTT respectively.It can be seen that the proposed method in this paper can generate video captioning with richer semantics than several state-of-the-art approaches.展开更多
The geophysical model function (GMF) describes the relationship between a backscattering and a sea surface wind, and enables a wind vector retrieval from backscattering measurements. It is clear that the GMF plays a...The geophysical model function (GMF) describes the relationship between a backscattering and a sea surface wind, and enables a wind vector retrieval from backscattering measurements. It is clear that the GMF plays an important role in an ocean wind vector retrieval. The performance of the existing Ku-band model function QSCAT-1 is considered to be effective at low and moderate wind speed ranges. However, in the conditions of higher wind speeds, the existing algorithms diverge alarmingly, owing to the lack of in situ data required for developing the GMF for the high wind conditions, the QSCAT-1 appears to overestimate the a0, which results in underestimating the wind speeds. Several match-up QuikSCAT and special sensor microwave/imager (SSM/I) wind speed measurements of the typhoons occurring in the west Pacific Ocean are analyzed. The results show that the SSM/I wind exhibits better agreement with the "best track" analysis wind speed than the QuikSCAT wind retrieved using QSCAT-1. On the basis of this evaluation, a correction of the QSCAT-1 model function for wind speed above 16 m/s is proposed, which uses the collocated SSM/I and QuikSCAT measurements as a training set, and a neural network approach as a multiple nonlinear regression technologytechnology.In order to validate the revised GMF for high winds, the modified GMF was applied to the QuikSCAT observations of Hurricane IOKE. The wind estimated by the QuikSCAT for Typhoon IOKE in 2006 was improved with the maximum wind speed reaching 55 m/s. An error analysis was performed using the wind fields from the Holland model as the surface truth. The results show an improved agreement with the Holland model wind when compared with the wind estimated using the QSCAT-1. However, large bias still existed, indicating that the effects of rain must be considered for further improvement.展开更多
CAD model retrieval based on functional semantics is more significant than content-based 3D model retrieval during the mechanical conceptual design phase. However, relevant research is still not fully discussed. There...CAD model retrieval based on functional semantics is more significant than content-based 3D model retrieval during the mechanical conceptual design phase. However, relevant research is still not fully discussed. Therefore, a functional semantic-based CAD model annotation and retrieval method is proposed to support mechanical conceptual design and design reuse, inspire designer creativity through existing CAD models, shorten design cycle, and reduce costs. Firstly, the CAD model functional semantic ontology is constructed to formally represent the functional semantics of CAD models and describe the mechanical conceptual design space comprehensively and consistently. Secondly, an approach to represent CAD models as attributed adjacency graphs(AAG) is proposed. In this method, the geometry and topology data are extracted from STEP models. On the basis of AAG, the functional semantics of CAD models are annotated semi-automatically by matching CAD models that contain the partial features of which functional semantics have been annotated manually, thereby constructing CAD Model Repository that supports model retrieval based on functional semantics. Thirdly, a CAD model retrieval algorithm that supports multi-function extended retrieval is proposed to explore more potential creative design knowledge in the semantic level. Finally, a prototype system, called Functional Semantic-based CAD Model Annotation and Retrieval System(FSMARS), is implemented. A case demonstrates that FSMARS can successfully botain multiple potential CAD models that conform to the desired function. The proposed research addresses actual needs and presents a new way to acquire CAD models in the mechanical conceptual design phase.展开更多
This paper explores the application of term dependency in information retrieval (IR) and proposes a novel dependency retrieval model. This retrieval model suggests an extension to the existing language modeling (LM) a...This paper explores the application of term dependency in information retrieval (IR) and proposes a novel dependency retrieval model. This retrieval model suggests an extension to the existing language modeling (LM) approach to IR by introducing dependency models for both query and document. Relevance between document and query is then evaluated by reference to the Kullback-Leibler divergence between their dependency models. This paper introduces a novel hybrid dependency structure, which allows integration of various forms of dependency within a single framework. A pseudo relevance feedback based method is also introduced for constructing query dependency model. The basic idea is to use query-relevant top-ranking sentences extracted from the top documents at retrieval time as the augmented representation of query, from which the relationships between query terms are identified. A Markov Random Field (MRF) based approach is presented to ensure the relevance of the extracted sentences, which utilizes the association features between query terms within a sentence to evaluate the relevance of each sentence. This dependency retrieval model was compared with other traditional retrieval models. Experiments indicated that it produces significant improvements in retrieval effectiveness.展开更多
This paper aims to develop Machine Learning algorithms to classify electronic articles related to this phenomenon by retrieving information and topic modelling.The Methodology of this study is categorized into three p...This paper aims to develop Machine Learning algorithms to classify electronic articles related to this phenomenon by retrieving information and topic modelling.The Methodology of this study is categorized into three phases:the Text Classification Approach(TCA),the Proposed Algorithms Interpretation(PAI),andfinally,Information Retrieval Approach(IRA).The TCA reflects the text preprocessing pipeline called a clean corpus.The Global Vec-tors for Word Representation(Glove)pre-trained model,FastText,Term Frequency-Inverse Document Fre-quency(TF-IDF),and Bag-of-Words(BOW)for extracting the features have been interpreted in this research.The PAI manifests the Bidirectional Long Short-Term Memory(Bi-LSTM)and Convolutional Neural Network(CNN)to classify the COVID-19 news.Again,the IRA explains the mathematical interpretation of Latent Dirich-let Allocation(LDA),obtained for modelling the topic of Information Retrieval(IR).In this study,99%accuracy was obtained by performing K-fold cross-validation on Bi-LSTM with Glove.A comparative analysis between Deep Learning and Machine Learning based on feature extraction and computational complexity exploration has been performed in this research.Furthermore,some text analyses and the most influential aspects of each document have been explored in this study.We have utilized Bidirectional Encoder Representations from Trans-formers(BERT)as a Deep Learning mechanism in our model training,but the result has not been uncovered satisfactory.However,the proposed system can be adjustable in the real-time news classification of COVID-19.展开更多
To reuse and share the valuable knowledge embedded in repositories of engineering models for accelerating the design process, improving product quality, and reducing costs, it is crucial to devise search engines capab...To reuse and share the valuable knowledge embedded in repositories of engineering models for accelerating the design process, improving product quality, and reducing costs, it is crucial to devise search engines capable of matching 3D models efficiently and effectively. In this paper, an enhanced shape distributions-based technique of using geometrical and topological information to search 3D engineering models represented by polygonal meshes was presented. A simplification method of polygonal meshes was used to simplify engineering model as the pretreatment for generation of sample points. The method of sampling points was improved and a pair of functions that was more sensitive to shape was employed to construct a 2D shape distribution. Experiments were conducted to evaluate the proposed algorithm utilizing the Engineering Shape Benchmark (ESB) database. The experiential results suggest that the search effectiveness is significantly improved by enforcing the simplification and enhanced shape distributions to engineering model retrieval.展开更多
Blog opinion retrieval aims to find blogs with opinionated information related to a given topic.Its main problem is to compute the opinion score,which balances topic relevance and opinion relevance.To deal with this p...Blog opinion retrieval aims to find blogs with opinionated information related to a given topic.Its main problem is to compute the opinion score,which balances topic relevance and opinion relevance.To deal with this problem a generative model deduced by a Bayesian approach is pro-posed,and an improved mixture model is proposed to estimate the opinion relevance between a blog and a given topic in our retrieval framework.Moreover,pointwise mutual information is used to expand sentiment words for different topics based on a general sentimental lexicon.The correlation between topic and candidate words is applied in the process of both expanding sentiment words and estimating sentence opinion scores.Experimental results show that the proposed approaches improve upon the state-of-the-art opinion retrieval method on TREC2010 dataset.展开更多
Aiming at the difficulty of accurately constructing the dynamic model of subtropical high, based on the potential height field time series over 500 hPa layer of T106 numerical forecast products, by using EOF(empirica...Aiming at the difficulty of accurately constructing the dynamic model of subtropical high, based on the potential height field time series over 500 hPa layer of T106 numerical forecast products, by using EOF(empirical orthogonal function) temporal-spatial separation technique, the disassembled EOF time coefficients series were regarded as dynamical model variables, and dynamic system retrieval idea as well as genetic algorithm were introduced to make dynamical model parameters optimization search, then, a reasonable non-linear dynamic model of EOF time-coefficients was established. By dynamic model integral and EOF temporal-spatial components assembly, a mid-/long-term forecast of subtropical high was carried out. The experimental results show that the forecast results of dynamic model are superior to that of general numerical model forecast results. A new modeling idea and forecast technique is presented for diagnosing and forecasting such complicated weathers as subtropical high.展开更多
In this paper, we propose a dynamic multi-descriptor fusion (DMDF) approach to improving the retrieval accuracy of 3-dimensional (3D) model retrieval systems. First, an independent retrieval list is generated by u...In this paper, we propose a dynamic multi-descriptor fusion (DMDF) approach to improving the retrieval accuracy of 3-dimensional (3D) model retrieval systems. First, an independent retrieval list is generated by using each individual descriptor. Second, we propose an automatic relevant/irrelevant models selection (ARMS) approach to selecting the relevant and irrelevant 3D models automatically without any user interaction. A weighted distance, in which the weight associated with each individual descriptor is learnt by using the selected relevant and irrelevant models, is used to measure the similarity between two 3D models. Furthermore, a descriptor-dependent adaptive query point movement (AQPM) approach is employed to update every feature vector. This set of new feature vectors is used to index 3D models in the next search process. Four 3D model databases are used to compare the retrieval accuracy of our proposed DMDF approach with several descriptors as well as some well-known information fusion methods. Experimental results have shown that our proposed DMDF approach provides a promising retrieval result and always yields the best retrieval accuracy.展开更多
A language model for information retrieval is built by using a query language model to generate queries and a document language model to generate documents. The documents are ranked according to the relative entropies...A language model for information retrieval is built by using a query language model to generate queries and a document language model to generate documents. The documents are ranked according to the relative entropies of estimated document language models with respect to the estimated query language model. Two popular and relatively efficient smoothing methods, the Jelinek- Mercer method and the absolute discounting method, are used to smooth the document language model in estimation of the document language, A combined model composed of the feedback document language model and the collection language model is used to estimate the query model. A performacne comparison between the new retrieval method and the existing method with feedback is made, and the retrieval performances of the proposed method with the two different smoothing techniques are evaluated on three Text Retrieval Conference (TREC) data sets. Experimental results show that the method is effective and performs better than the basic language modeling approach; moreover, the method using the Jelinek-Mercer technique performs better than that using the absolute discounting technique, and the perfomance is sensitive to the smoothing peramters.展开更多
Ontology as an important representation model of semantic web has valuable application. A new ontology model on the basis of Computer Graphics (CG) knowledge is proposed, called CG ontology model. The protégé...Ontology as an important representation model of semantic web has valuable application. A new ontology model on the basis of Computer Graphics (CG) knowledge is proposed, called CG ontology model. The protégé is used to build this ontology model conveniently. The Jena API is applied to store CG owl documents in MySQL, set inference rule and achieve search queries on the ontology database. Finally, the Jena-based ontology model retrieval system is developed.展开更多
In this paper a novel 3D model retrieval method that employs multi-level spherical moment analysis and relies on voxelization and spherical mapping of the 3D models is proposed. For a given polygon-soup 3D model, firs...In this paper a novel 3D model retrieval method that employs multi-level spherical moment analysis and relies on voxelization and spherical mapping of the 3D models is proposed. For a given polygon-soup 3D model, first a pose normalization step is done to align the model into a canonical coordinate frame so as to define the shape representation with respect to this orientation. Afterward we rasterize its exterior surface into cubical voxel grids, then a series of homocentric spheres with their center superposing the center of the voxel grids cut the voxel grids into several spherical images. Finally moments belonging to each sphere are computed and the moments of all spheres constitute the descriptor of the model. Experiments showed that Euclidean distance based on this kind of feature vector can distinguish different 3D models well and that the 3D model retrieval system based on this arithmetic yields satisfactory performance.展开更多
文摘This study introduces the Orbit Weighting Scheme(OWS),a novel approach aimed at enhancing the precision and efficiency of Vector Space information retrieval(IR)models,which have traditionally relied on weighting schemes like tf-idf and BM25.These conventional methods often struggle with accurately capturing document relevance,leading to inefficiencies in both retrieval performance and index size management.OWS proposes a dynamic weighting mechanism that evaluates the significance of terms based on their orbital position within the vector space,emphasizing term relationships and distribution patterns overlooked by existing models.Our research focuses on evaluating OWS’s impact on model accuracy using Information Retrieval metrics like Recall,Precision,InterpolatedAverage Precision(IAP),andMeanAverage Precision(MAP).Additionally,we assessOWS’s effectiveness in reducing the inverted index size,crucial for model efficiency.We compare OWS-based retrieval models against others using different schemes,including tf-idf variations and BM25Delta.Results reveal OWS’s superiority,achieving a 54%Recall and 81%MAP,and a notable 38%reduction in the inverted index size.This highlights OWS’s potential in optimizing retrieval processes and underscores the need for further research in this underrepresented area to fully leverage OWS’s capabilities in information retrieval methodologies.
文摘The retrieval of the biomass parameters from active/passive microwave remote sensing data (10.2 GHz) is performed based on an iterative inversion of BP neural network model with fuzzy optimization. The BP neural network is trained by a set of the measurements of active and passive remote sensing and the ground truth data versus Day of Year during growth. Once the network training is complete, the model can be used to retrieve the temporal variations of the biomass parameters from another set of observation data. The model was used in weights and microware observation data of wheat growth in 1989 to retrieve biomass parameters change of wheat growth this year. The retrieved biomass parameters correspond well with the real data of the growth, which shows that the BP model is scientific and sound.
基金supported by National Natural Science Foundation of China(Grant No. 51175287)National Science and Technology Major Project(Grant No. 2011ZX02403)
文摘Content-based 3D model retrieval is of great help to facilitate the reuse of existing designs and to inspire designers during conceptual design. However, there is still a gap to apply it in industry due to the low time efficiency. This paper presents two new methods with high efficiency to build a Content-based 3D model retrieval system. First, an improvement is made on the "Shape Distribution (D2)" algorithm, and a new algorithm named "Quick D2" is proposed. Four sample 3D mechanical models are used in an experiment to compare the time cost of the two algorithms. The result indicates that the time cost of Quick D2 is much lower than that of D2, while the descriptors extracted by the two algorithms are almost the same. Second, an expandable 3D model repository index method with high performance, namely, RBK index, is presented. On the basis of RBK index, the search space is pruned effectively during the search process, leading to a speed up of the whole system. The factors that influence the values of the key parameters of RBK index are discussed and an experimental method to find the optimal values of the key parameters is given. Finally, "3D Searcher", a content-based 3D model retrieval system is developed. By using the methods proposed, the time cost for the system to respond one query online is reduced by 75% on average. The system has been implemented in a manufacturing enterprise, and practical query examples during a case of the automobile rear axle design are also shown. The research method presented shows a new research perspective and can effectively improve the content-based 3D model retrieval efficiency.
文摘An algorithm for retrieving polarimetric variables from numerical model fields is developed. By using this technique, radar reflectivity at horizontal polarization~ differential reflectivity, specific differential phase shift and correlation coefficients between the horizontal and vertical polarization signals at zero lag can be derived from rain, snow and hail contents of numerical model outputs. Effects of environmental temperature and the melting process on polarimetric variables are considered in the algorithm. The algorithm is applied to the Advanced Regional Prediction System (ARPS) model simulation results for a hail storm. The spatial distributions of the derived parameters are reasonable when compared with observational knowledge. This work provides a forward model for assimilation of dual linear polarization radar data into a mesoscale model.
文摘A hybrid model that is based on the Combination of keywords and concept was put forward. The hybrid model is built on vector space model and probabilistic reasoning network. It not only can exert the advantages of keywords retrieval and concept retrieval but also can compensate for their shortcomings. Their parameters can be adjusted according to different usage in order to accept the best information retrieval result, and it has been proved by our experiments.
基金the National Basic Research Program (973) of China (No. 2004CB719401)the National Research Foundation for the Doctoral Program of Higher Education of China (No.20060003060)
文摘In this paper, we present a novel Support Vector Machine active learning algorithm for effective 3D model retrieval using the concept of relevance feedback. The proposed method learns from the most informative objects which are marked by the user, and then creates a boundary separating the relevant models from irrelevant ones. What it needs is only a small number of 3D models labelled by the user. It can grasp the user's semantic knowledge rapidly and accurately. Experimental results showed that the proposed algorithm significantly improves the retrieval effectiveness. Compared with four state-of-the-art query refinement schemes for 3D model retrieval, it provides superior retrieval performance after no more than two rounds of relevance feedback.
文摘A large number of 3D models are created on computers and available for networks. Some content-based retrieval technologies are indispensable to find out particular data from such anonymous datasets. Though several shape retrieval technologies have been developed, little attention has been given to the points on human's sense and impression (as known as Kansei) in the conventional techniques, In this paper, the authors propose a novel method of shape retrieval based on shape impression of human's Kansei. The key to the method is the Gaussian curvature distribution from 3D models as features for shape retrieval. Then it classifies the 3D models by extracted feature and measures similarity among models in storage.
基金Sponsored by the National Natural Science Foundation of China (Grant No. 51075083)
文摘In order to improve the accuracy and efficiency of 3D model retrieval,the method based on affinity propagation clustering algorithm is proposed. Firstly,projection ray-based method is proposed to improve the feature extraction efficiency of 3D models. Based on the relationship between model and its projection,the intersection in 3D space is transformed into intersection in 2D space,which reduces the number of intersection and improves the efficiency of the extraction algorithm. In feature extraction,multi-layer spheres method is analyzed. The two-layer spheres method makes the feature vector more accurate and improves retrieval precision. Secondly,Semi-supervised Affinity Propagation ( S-AP) clustering is utilized because it can be applied to different cluster structures. The S-AP algorithm is adopted to find the center models and then the center model collection is built. During retrieval process,the collection is utilized to classify the query model into corresponding model base and then the most similar model is retrieved in the model base. Finally,75 sample models from Princeton library are selected to do the experiment and then 36 models are used for retrieval test. The results validate that the proposed method outperforms the original method and the retrieval precision and recall ratios are improved effectively.
基金supported in part by the National Natural Science Foundation of China under Grants 62273272 and 61873277in part by the Chinese Postdoctoral Science Foundation under Grant 2020M673446+1 种基金in part by the Key Research and Development Program of Shaanxi Province under Grant 2023-YBGY-243in part by the Youth Innovation Team of Shaanxi Universities.
文摘Currently,the video captioning models based on an encoder-decoder mainly rely on a single video input source.The contents of video captioning are limited since few studies employed external corpus information to guide the generation of video captioning,which is not conducive to the accurate descrip-tion and understanding of video content.To address this issue,a novel video captioning method guided by a sentence retrieval generation network(ED-SRG)is proposed in this paper.First,a ResNeXt network model,an efficient convolutional network for online video understanding(ECO)model,and a long short-term memory(LSTM)network model are integrated to construct an encoder-decoder,which is utilized to extract the 2D features,3D features,and object features of video data respectively.These features are decoded to generate textual sentences that conform to video content for sentence retrieval.Then,a sentence-transformer network model is employed to retrieve different sentences in an external corpus that are semantically similar to the above textual sentences.The candidate sentences are screened out through similarity measurement.Finally,a novel GPT-2 network model is constructed based on GPT-2 network structure.The model introduces a designed random selector to randomly select predicted words with a high probability in the corpus,which is used to guide and generate textual sentences that are more in line with human natural language expressions.The proposed method in this paper is compared with several existing works by experiments.The results show that the indicators BLEU-4,CIDEr,ROUGE_L,and METEOR are improved by 3.1%,1.3%,0.3%,and 1.5%on a public dataset MSVD and 1.3%,0.5%,0.2%,1.9%on a public dataset MSR-VTT respectively.It can be seen that the proposed method in this paper can generate video captioning with richer semantics than several state-of-the-art approaches.
基金The National Natural Science Foundation of China under contract No.41106152the National Science and Technology Support Program under contract No.2013BAD13B01+3 种基金the National High Technology Research and Development Program(863 Program)of China under contract No.2013AA09A505the International Science and Technology Cooperation Program of China under contract No.2011DFA22260the National High Technology Industrialization Project under contract No.[2012]2083the Marine Public Projects of China under contract Nos 201105032,201305032 and 201105002-07
文摘The geophysical model function (GMF) describes the relationship between a backscattering and a sea surface wind, and enables a wind vector retrieval from backscattering measurements. It is clear that the GMF plays an important role in an ocean wind vector retrieval. The performance of the existing Ku-band model function QSCAT-1 is considered to be effective at low and moderate wind speed ranges. However, in the conditions of higher wind speeds, the existing algorithms diverge alarmingly, owing to the lack of in situ data required for developing the GMF for the high wind conditions, the QSCAT-1 appears to overestimate the a0, which results in underestimating the wind speeds. Several match-up QuikSCAT and special sensor microwave/imager (SSM/I) wind speed measurements of the typhoons occurring in the west Pacific Ocean are analyzed. The results show that the SSM/I wind exhibits better agreement with the "best track" analysis wind speed than the QuikSCAT wind retrieved using QSCAT-1. On the basis of this evaluation, a correction of the QSCAT-1 model function for wind speed above 16 m/s is proposed, which uses the collocated SSM/I and QuikSCAT measurements as a training set, and a neural network approach as a multiple nonlinear regression technologytechnology.In order to validate the revised GMF for high winds, the modified GMF was applied to the QuikSCAT observations of Hurricane IOKE. The wind estimated by the QuikSCAT for Typhoon IOKE in 2006 was improved with the maximum wind speed reaching 55 m/s. An error analysis was performed using the wind fields from the Holland model as the surface truth. The results show an improved agreement with the Holland model wind when compared with the wind estimated using the QSCAT-1. However, large bias still existed, indicating that the effects of rain must be considered for further improvement.
基金Supported by National Natural Science Foundation of China (Grant No.51175287)National Science and Technology Major Project of China (Grant No.2011ZX02403)
文摘CAD model retrieval based on functional semantics is more significant than content-based 3D model retrieval during the mechanical conceptual design phase. However, relevant research is still not fully discussed. Therefore, a functional semantic-based CAD model annotation and retrieval method is proposed to support mechanical conceptual design and design reuse, inspire designer creativity through existing CAD models, shorten design cycle, and reduce costs. Firstly, the CAD model functional semantic ontology is constructed to formally represent the functional semantics of CAD models and describe the mechanical conceptual design space comprehensively and consistently. Secondly, an approach to represent CAD models as attributed adjacency graphs(AAG) is proposed. In this method, the geometry and topology data are extracted from STEP models. On the basis of AAG, the functional semantics of CAD models are annotated semi-automatically by matching CAD models that contain the partial features of which functional semantics have been annotated manually, thereby constructing CAD Model Repository that supports model retrieval based on functional semantics. Thirdly, a CAD model retrieval algorithm that supports multi-function extended retrieval is proposed to explore more potential creative design knowledge in the semantic level. Finally, a prototype system, called Functional Semantic-based CAD Model Annotation and Retrieval System(FSMARS), is implemented. A case demonstrates that FSMARS can successfully botain multiple potential CAD models that conform to the desired function. The proposed research addresses actual needs and presents a new way to acquire CAD models in the mechanical conceptual design phase.
基金Project (No. 2006CB303000) supported in part by the National Basic Research Program (973) of China
文摘This paper explores the application of term dependency in information retrieval (IR) and proposes a novel dependency retrieval model. This retrieval model suggests an extension to the existing language modeling (LM) approach to IR by introducing dependency models for both query and document. Relevance between document and query is then evaluated by reference to the Kullback-Leibler divergence between their dependency models. This paper introduces a novel hybrid dependency structure, which allows integration of various forms of dependency within a single framework. A pseudo relevance feedback based method is also introduced for constructing query dependency model. The basic idea is to use query-relevant top-ranking sentences extracted from the top documents at retrieval time as the augmented representation of query, from which the relationships between query terms are identified. A Markov Random Field (MRF) based approach is presented to ensure the relevance of the extracted sentences, which utilizes the association features between query terms within a sentence to evaluate the relevance of each sentence. This dependency retrieval model was compared with other traditional retrieval models. Experiments indicated that it produces significant improvements in retrieval effectiveness.
文摘This paper aims to develop Machine Learning algorithms to classify electronic articles related to this phenomenon by retrieving information and topic modelling.The Methodology of this study is categorized into three phases:the Text Classification Approach(TCA),the Proposed Algorithms Interpretation(PAI),andfinally,Information Retrieval Approach(IRA).The TCA reflects the text preprocessing pipeline called a clean corpus.The Global Vec-tors for Word Representation(Glove)pre-trained model,FastText,Term Frequency-Inverse Document Fre-quency(TF-IDF),and Bag-of-Words(BOW)for extracting the features have been interpreted in this research.The PAI manifests the Bidirectional Long Short-Term Memory(Bi-LSTM)and Convolutional Neural Network(CNN)to classify the COVID-19 news.Again,the IRA explains the mathematical interpretation of Latent Dirich-let Allocation(LDA),obtained for modelling the topic of Information Retrieval(IR).In this study,99%accuracy was obtained by performing K-fold cross-validation on Bi-LSTM with Glove.A comparative analysis between Deep Learning and Machine Learning based on feature extraction and computational complexity exploration has been performed in this research.Furthermore,some text analyses and the most influential aspects of each document have been explored in this study.We have utilized Bidirectional Encoder Representations from Trans-formers(BERT)as a Deep Learning mechanism in our model training,but the result has not been uncovered satisfactory.However,the proposed system can be adjustable in the real-time news classification of COVID-19.
基金The Basic Research of COSTIND,China (No.D0420060521)
文摘To reuse and share the valuable knowledge embedded in repositories of engineering models for accelerating the design process, improving product quality, and reducing costs, it is crucial to devise search engines capable of matching 3D models efficiently and effectively. In this paper, an enhanced shape distributions-based technique of using geometrical and topological information to search 3D engineering models represented by polygonal meshes was presented. A simplification method of polygonal meshes was used to simplify engineering model as the pretreatment for generation of sample points. The method of sampling points was improved and a pair of functions that was more sensitive to shape was employed to construct a 2D shape distribution. Experiments were conducted to evaluate the proposed algorithm utilizing the Engineering Shape Benchmark (ESB) database. The experiential results suggest that the search effectiveness is significantly improved by enforcing the simplification and enhanced shape distributions to engineering model retrieval.
基金Supported by the National Natural Science Foundation of China(61370137,61672098,61272361)the Ministry of Education-China Mobile Research Foundation Project(2015/5-9,2016/2-7)
文摘Blog opinion retrieval aims to find blogs with opinionated information related to a given topic.Its main problem is to compute the opinion score,which balances topic relevance and opinion relevance.To deal with this problem a generative model deduced by a Bayesian approach is pro-posed,and an improved mixture model is proposed to estimate the opinion relevance between a blog and a given topic in our retrieval framework.Moreover,pointwise mutual information is used to expand sentiment words for different topics based on a general sentimental lexicon.The correlation between topic and candidate words is applied in the process of both expanding sentiment words and estimating sentence opinion scores.Experimental results show that the proposed approaches improve upon the state-of-the-art opinion retrieval method on TREC2010 dataset.
基金Project supported by the National Natural Science Foundation of China (No.40375019) the Tropical Marine and Meteorology Science Foundation (No.200609) the Jiangsu Key Laboratory of Meteorological Disaster Foundation (No.KLME0507)
文摘Aiming at the difficulty of accurately constructing the dynamic model of subtropical high, based on the potential height field time series over 500 hPa layer of T106 numerical forecast products, by using EOF(empirical orthogonal function) temporal-spatial separation technique, the disassembled EOF time coefficients series were regarded as dynamical model variables, and dynamic system retrieval idea as well as genetic algorithm were introduced to make dynamical model parameters optimization search, then, a reasonable non-linear dynamic model of EOF time-coefficients was established. By dynamic model integral and EOF temporal-spatial components assembly, a mid-/long-term forecast of subtropical high was carried out. The experimental results show that the forecast results of dynamic model are superior to that of general numerical model forecast results. A new modeling idea and forecast technique is presented for diagnosing and forecasting such complicated weathers as subtropical high.
基金supported in part by“MOST”under Grants No.102-2632-E-216-001-MY3 and No.104-2221-E-216-010-MY2
文摘In this paper, we propose a dynamic multi-descriptor fusion (DMDF) approach to improving the retrieval accuracy of 3-dimensional (3D) model retrieval systems. First, an independent retrieval list is generated by using each individual descriptor. Second, we propose an automatic relevant/irrelevant models selection (ARMS) approach to selecting the relevant and irrelevant 3D models automatically without any user interaction. A weighted distance, in which the weight associated with each individual descriptor is learnt by using the selected relevant and irrelevant models, is used to measure the similarity between two 3D models. Furthermore, a descriptor-dependent adaptive query point movement (AQPM) approach is employed to update every feature vector. This set of new feature vectors is used to index 3D models in the next search process. Four 3D model databases are used to compare the retrieval accuracy of our proposed DMDF approach with several descriptors as well as some well-known information fusion methods. Experimental results have shown that our proposed DMDF approach provides a promising retrieval result and always yields the best retrieval accuracy.
基金The National Natural Science Founda-tion of China ( No. 60473004)the Science and ResearchFoundation Program of Henan University of Science and Tech-nology (No.2004ZY041)the Natural and Science FoundationProgram of the Education Department of Henan Province (No.200410464004)
文摘A language model for information retrieval is built by using a query language model to generate queries and a document language model to generate documents. The documents are ranked according to the relative entropies of estimated document language models with respect to the estimated query language model. Two popular and relatively efficient smoothing methods, the Jelinek- Mercer method and the absolute discounting method, are used to smooth the document language model in estimation of the document language, A combined model composed of the feedback document language model and the collection language model is used to estimate the query model. A performacne comparison between the new retrieval method and the existing method with feedback is made, and the retrieval performances of the proposed method with the two different smoothing techniques are evaluated on three Text Retrieval Conference (TREC) data sets. Experimental results show that the method is effective and performs better than the basic language modeling approach; moreover, the method using the Jelinek-Mercer technique performs better than that using the absolute discounting technique, and the perfomance is sensitive to the smoothing peramters.
文摘Ontology as an important representation model of semantic web has valuable application. A new ontology model on the basis of Computer Graphics (CG) knowledge is proposed, called CG ontology model. The protégé is used to build this ontology model conveniently. The Jena API is applied to store CG owl documents in MySQL, set inference rule and achieve search queries on the ontology database. Finally, the Jena-based ontology model retrieval system is developed.
基金Project (No. 60573146) supported by the National Natural Science Foundation of China
文摘In this paper a novel 3D model retrieval method that employs multi-level spherical moment analysis and relies on voxelization and spherical mapping of the 3D models is proposed. For a given polygon-soup 3D model, first a pose normalization step is done to align the model into a canonical coordinate frame so as to define the shape representation with respect to this orientation. Afterward we rasterize its exterior surface into cubical voxel grids, then a series of homocentric spheres with their center superposing the center of the voxel grids cut the voxel grids into several spherical images. Finally moments belonging to each sphere are computed and the moments of all spheres constitute the descriptor of the model. Experiments showed that Euclidean distance based on this kind of feature vector can distinguish different 3D models well and that the 3D model retrieval system based on this arithmetic yields satisfactory performance.