To bridge the semantic gap that exists in image retrieval, this paper proposes an approach combining generative and discriminative learning to accomplish the task of automatic image annotation and retrieval. We first present continuous probabilistic latent semantic analysis (PLSA) to model continuous quantities. Furthermore, we propose a hybrid framework which employs continuous PLSA to model the visual features of images in the generative learning stage and uses ensembles of classifier chains to classify the multi-label data in the discriminative learning stage. Since the framework combines the advantages of generative and discriminative learning, it can predict semantic annotations precisely for unseen images. Finally, we conduct a series of experiments on a standard Corel dataset. The experimental results show that our approach outperforms many state-of-the-art approaches.
Purpose: The objective of this paper is to test the effect of ontology-based semantic annotation on the performance of document retrieval. Design/methodology/approach: An integrated document retrieval method is put forward in this paper, in which the entities of documents are annotated by the upper ontology and domain ontology, and the documents are then further indexed by the entity annotations as well as traditional keywords. Findings: The research result shows that structured entity retrieval and relation retrieval can be realized by the ontology-based entity index, which is beyond the ability of traditional keyword-based retrieval. Meanwhile, the experiment shows that the recall and precision of document retrieval are improved effectively. Research limitations: Due to the small size of our current tourism domain ontology, document retrieval with the ontology-based semantic index is limited by the size of the ontology and the precision of semantic annotation. Meanwhile, the semantic annotation algorithm mainly relies on the current information extraction strategy of the KIM Platform. Therefore, the performance of the disambiguation and relation extraction algorithms needs to be further improved. Practical implications: Our method can improve the efficiency of document retrieval systems, which facilitates knowledge and document management in corporations, governments and other organizations. Originality/value: The integrated document retrieval method proposed in the paper can combine the entity index based on the general ontology and domain ontology with the keyword index. Our result verified the effectiveness of the combined index strategy.
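The combined index strategy described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `CombinedIndex` class, its method names, and the ontology class labels such as `ScenicSpot` are hypothetical, and the entity annotations are assumed to be supplied by an external annotator rather than computed here.

```python
from collections import defaultdict


class CombinedIndex:
    """Toy index that stores keywords and ontology-class annotations side by side."""

    def __init__(self):
        self.keyword_index = defaultdict(set)  # token -> document ids
        self.entity_index = defaultdict(set)   # ontology class -> document ids

    def add_document(self, doc_id, text, entity_annotations):
        # Keyword side: plain whitespace tokenization (an assumption for brevity).
        for token in text.lower().split():
            self.keyword_index[token].add(doc_id)
        # Entity side: each annotation is an (entity mention, ontology class) pair.
        for _mention, ontology_class in entity_annotations:
            self.entity_index[ontology_class].add(doc_id)

    def search(self, keywords=(), ontology_classes=()):
        # A document matches only if it satisfies every keyword and class constraint.
        result_sets = [self.keyword_index.get(k.lower(), set()) for k in keywords]
        result_sets += [self.entity_index.get(c, set()) for c in ontology_classes]
        if not result_sets:
            return set()
        return set.intersection(*result_sets)


index = CombinedIndex()
index.add_document("d1", "Hotels near the West Lake", [("West Lake", "ScenicSpot")])
index.add_document("d2", "Hotels in the city center", [("Hilton", "Hotel")])
matches = index.search(keywords=["hotels"], ontology_classes=["ScenicSpot"])
```

Keeping the two indexes separate lets a query mix free-text terms with class-level constraints, which is the kind of structured entity retrieval that a keyword index alone cannot express.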
In this proposal, a construction project memory process based on semantic annotation is presented. A project memory concerns the representation and the identification of the experience acquired during project realization. The main feature of this approach is that semantic annotation is used to build a continuous semantic design rationale process. We propose in this paper a semantic traceability in four stages (identifying, structuring, annotating and integrating). The identification and structuring phases use a model called the Extended Marguerite model, which fully considers the objectives of the project memory. The annotation phase exploits the results of the preceding phase to prepare the final phase. Examples are presented from a case study in an Algerian firm called ENMTP. The architecture supporting the modelling engine is presented. Finally, an evaluation of the degree of semantic annotation brought by the proposed process is given.
In this paper we propose a novel model, the "recursive directed graph", based on feature structure, and apply it to represent the semantic relations of postpositive attributive structures in biomedical texts. The usages of postpositive attributives are complex and variable, especially in three categories: present participle phrases, past participle phrases, and prepositional phrases used as postpositive attributives, which always cause difficulties for automatic parsing. We summarize these categories and annotate the semantic information. Compared with dependency structure, feature structure, being a recursive directed graph, enhances semantic information extraction in the biomedical field. The annotation results show that the recursive directed graph is more suitable for extracting complex semantic relations in biomedical text mining.
Semantic annotation of Web objects is a key problem for Web information extraction. The Web contains an abundance of useful semi-structured information about real-world objects, and empirical study shows that strong two-dimensional sequence characteristics and correlative characteristics exist for Web information about objects of the same type across different Web sites. Conditional Random Fields (CRFs) are the state-of-the-art approaches that exploit the sequence characteristics to achieve better labeling. However, because of the correlative characteristics between Web object elements, previous CRFs have limitations for semantic annotation of Web objects and cannot deal with the long-distance dependencies between Web object elements efficiently. To better incorporate the long-distance dependencies, on one hand, this paper describes long-distance dependencies by correlative edges, which are built by making good use of structured information and the characteristics of records from external databases; on the other hand, this paper presents two-dimensional Correlative-Chain Conditional Random Fields (2DCC-CRFs) to perform semantic annotation of Web objects. This approach extends a classic model, two-dimensional Conditional Random Fields (2DCRFs), by adding correlative edges. Experimental results using a large amount of real-world data collected from diverse domains show that the proposed approach can significantly improve the semantic annotation accuracy of Web objects.
Abstract: Representing the relationships between ontologies is the key problem of semantic annotation based on multiple ontologies. Traditional approaches could only denote simple concept subsumption relations between ontologies. Through analyzing and classifying the relationships between ontologies, the idea of the bridge ontology was proposed, which has a powerful capability for expressing the complex relationships between concepts and the relationships between relations in multi-ontologies. Meanwhile, a new approach employing the bridge ontology was proposed to deal with the multi-ontologies-based semantic annotation problem. The bridge ontology is a peculiar ontology, which can be created and maintained conveniently, and is effective in multi-ontologies-based semantic annotation. The approach using the bridge ontology has the advantages of being low-cost, scalable, and robust in the Web environment, and of avoiding unnecessary ontology extension and integration.
Key words: semantic web; bridge ontology; multi-ontologies; semantic annotation
CLC number: TP 391
Foundation item: Supported by the National Natural Science Foundation of China (60373066, 60303024), the National Grand Fundamental Research 973 Program of China (2002CB312000), and the National Research Foundation for the Doctoral Program of Higher Education of China (20020286004).
Biography: WANG Peng (1977-), male, Ph.D. candidate, research direction: semantic web, ontology, and knowledge representation on the Web.
Funding: Supported by the National Natural Science Foundation of China (60273051).
Abstract: Aiming at the difficulty of getting semantic information from each problem in problem set archives, we propose a new method of ontology-based semantic annotation for problem set archives, which utilizes a programming knowledge domain ontology to add semantic annotations to problems on the Web. The system we developed adds a semantic annotation for each problem in the form of Extensible Markup Language. Our method overcomes the difficulty of extracting semantics from problem set archives, and the efficiency of this method is demonstrated through a case study. Having semantic annotations of problems, a student can efficiently locate the problems that logically correspond to his knowledge.
Funding: Supported by the National Program on Key Basic Research Project (No. 2013CB329502), the National Natural Science Foundation of China (No. 61202212), the Special Research Project of the Educational Department of Shaanxi Province of China (No. 15JK1038), and the Key Research Project of Baoji University of Arts and Sciences (No. ZK16047).
Abstract: In recent years, the multimedia annotation problem has been attracting significant research attention in the multimedia and computer vision areas, especially automatic image annotation, whose purpose is to provide an efficient and effective searching environment for users to query their images more easily. In this paper, a semi-supervised learning based probabilistic latent semantic analysis (PLSA) model for automatic image annotation is presented. Since it is often hard to obtain or create labeled images in large quantities while unlabeled ones are easier to collect, a transductive support vector machine (TSVM) is exploited to enhance the quality of the training image data. In addition, image features with different magnitudes lead to different performance in automatic image annotation. To this end, a Gaussian normalization method is utilized to normalize the different features extracted from effective image regions segmented by the normalized cuts algorithm, so as to preserve the intrinsic content of images as completely as possible. Finally, a PLSA model with asymmetric modalities is constructed based on the expectation maximization (EM) algorithm to predict a candidate set of annotations with confidence scores. Extensive experiments on the general-purpose Corel5k dataset demonstrate that the proposed model can significantly improve the performance of traditional PLSA for the task of automatic image annotation.
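The Gaussian normalization step can be sketched as follows. The abstract does not give the exact formula, so this assumes one common formulation: each feature dimension is centered on its mean, scaled by three standard deviations, clipped, and mapped to [0, 1]. The function name `gaussian_normalize` and the 3-sigma constant are assumptions, not details from the paper.

```python
import math


def gaussian_normalize(features):
    """Normalize each feature dimension to [0, 1] using its mean and 3 * std."""
    n = len(features)
    dims = len(features[0])
    normalized = [[0.0] * dims for _ in range(n)]
    for d in range(dims):
        column = [row[d] for row in features]
        mean = sum(column) / n
        std = math.sqrt(sum((x - mean) ** 2 for x in column) / n)
        for i, x in enumerate(column):
            if std == 0.0:
                # A constant dimension carries no information; place it mid-range.
                normalized[i][d] = 0.5
                continue
            z = (x - mean) / (3.0 * std)   # within [-1, 1] for most values
            z = max(-1.0, min(1.0, z))     # clip the remaining outliers
            normalized[i][d] = (z + 1.0) / 2.0
    return normalized


# Two dimensions with very different magnitudes end up on the same scale.
region_features = [[1.0, 100.0], [2.0, 200.0], [3.0, 300.0]]
scaled = gaussian_normalize(region_features)
```

After this step a dimension measured in hundreds no longer dominates one measured in units, which is the motivation the abstract gives for normalizing features of different magnitudes.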
Funding: Supported by the National Basic Research Program of China (No. 2013CB329502), the National Natural Science Foundation of China (No. 61202212), the Special Research Project of the Educational Department of Shaanxi Province of China (No. 15JK1038), and the Key Research Project of Baoji University of Arts and Sciences (No. ZK16047).
Abstract: Automatic image annotation has been an active topic of research in computer vision and pattern recognition for decades. A two-stage automatic image annotation method based on the Gaussian mixture model (GMM) and the random walk model (abbreviated as GMM-RW) is presented. To start with, a GMM fitted by the rival penalized expectation maximization (RPEM) algorithm is employed to estimate the posterior probabilities of each annotation keyword. Subsequently, a random walk process over the constructed label similarity graph is implemented to further mine the potential correlations of the candidate annotations so as to obtain the refined results, which plays a crucial role in semantic-based image retrieval. The contributions of this work are multifold. First, GMM is exploited to capture the initial semantic annotations; in particular, the RPEM algorithm is utilized to train the model, and it can determine the number of components in the GMM automatically. Second, a label similarity graph is constructed by a weighted linear combination of label similarity and the visual similarity of images associated with the corresponding labels, which efficiently avoids the phenomena of polysemy and synonymy during the image annotation process. Third, the random walk is implemented over the constructed label graph to further refine the candidate set of annotations generated by GMM. Experiments conducted on the standard Corel5k dataset demonstrate that GMM-RW is significantly more effective and efficient than several state-of-the-art methods in the task of automatic image annotation.
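The refinement stage can be sketched as a random walk over a label graph. This is a minimal illustration, not the GMM-RW algorithm itself: it assumes a random walk with restart, takes the combined label similarities as given, and uses made-up labels and scores. The function name `random_walk_refine` and the parameter `alpha` are hypothetical.

```python
def random_walk_refine(initial_scores, similarity, alpha=0.5, iterations=50):
    """Refine candidate label scores by a random walk with restart.

    initial_scores: label -> confidence from the first stage (assumed to sum to 1).
    similarity: label -> label -> non-negative similarity (assumed precomputed).
    alpha: weight of the propagated score versus the initial score.
    """
    labels = list(initial_scores)
    # Row-normalize similarities into transition probabilities.
    transition = {}
    for a in labels:
        total = sum(similarity[a][b] for b in labels if b != a)
        transition[a] = {
            b: (similarity[a][b] / total if total > 0 else 0.0)
            for b in labels if b != a
        }
    scores = dict(initial_scores)
    for _ in range(iterations):
        scores = {
            b: alpha * sum(scores[a] * transition[a][b] for a in labels if a != b)
            + (1.0 - alpha) * initial_scores[b]
            for b in labels
        }
    return scores


# Illustrative values only: "sky" and "cloud" are strongly related, "car" is not.
initial = {"sky": 0.5, "cloud": 0.3, "car": 0.2}
sim = {
    "sky": {"cloud": 0.9, "car": 0.1},
    "cloud": {"sky": 0.9, "car": 0.1},
    "car": {"sky": 0.1, "cloud": 0.1},
}
refined = random_walk_refine(initial, sim)
```

In this toy graph the walk raises the score of "cloud", which is well connected to the top candidate, and lowers the score of the weakly connected "car", which is the kind of correlation mining the abstract attributes to the second stage.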
Abstract: A large semantic gap exists between content-based image retrieval (CBIR) and high-level semantics, so additional semantic information should be attached to the images. This involves three aspects: the semantic representation model, semantic information building, and semantic retrieval techniques. In this paper, we introduce an associated semantic network and an automatic semantic annotation system. In the system, a semantic network model is employed as the semantic representation model; it uses semantic keywords, linguistic ontology and low-level features in calculating semantic similarity. Through several rounds of users' relevance feedback, the semantic network is enriched automatically. To speed up the growth of the semantic network and obtain a balanced annotation, semantic seeds and semantic loners are employed.
Funding: Supported by the National Social Science Foundation of China (Grant No. 11BTQ024) and the Foundation for Humanities and Social Sciences of the Chinese Ministry of Education (Grant No. 10YJC87004).
Abstract: Purpose: To design an efficient high-performance algorithm for semantic annotation of biodiversity documents in Chinese. Design/methodology/approach: The data set consists of 1,000 randomly selected documents from Flora of China. The proposed approach was evaluated comparatively against a Naïve Bayes algorithm that had been developed earlier for the same purpose. Findings: Experimental results show that the heuristics-based algorithm outperformed the Naïve Bayes algorithm. The use of leading words helped improve the annotation performance, while prioritizing rule application based on rule weights had no significant impact on algorithm performance. Research limitations: ICTCLAS was used off the shelf to identify word boundaries, without optimization for the biodiversity domain. This may not have made the best use of the tool. Practical implications & Originality/value: The performance of the heuristics-based approach, enhanced by leading words analysis, reached an F value of 0.9216, which is sufficiently accurate for practical use.
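For readers unfamiliar with the F value reported above, the standard F-measure combines precision and recall as sketched below. The abstract does not state which variant was used, so the balanced form (beta = 1) is an assumption, and the precision and recall values in the example are illustrative rather than taken from the paper.

```python
def f_measure(precision, recall, beta=1.0):
    """Weighted harmonic mean of precision and recall (beta = 1 gives F1)."""
    if precision == 0.0 and recall == 0.0:
        return 0.0
    beta_sq = beta * beta
    return (1.0 + beta_sq) * precision * recall / (beta_sq * precision + recall)


# Illustrative inputs only; the paper reports the F value, not these components.
example = f_measure(0.9, 0.9)
```

Because the F-measure is a harmonic mean, a high value such as the one reported requires both precision and recall to be high; one cannot compensate for the other.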
Funding: King Saud University through Researchers Supporting Project number (RSP-2021/387), King Saud University, Riyadh, Saudi Arabia.
Abstract: Daily newspapers publish a tremendous amount of information that is disseminated through the Internet. Freely available and easily accessible large online repositories are not indexed and are in an unprocessable format. The major hindrance in developing and evaluating existing/new monolingual text in an image is that it is not linked and indexed. There is no method to reuse online news images because of the unavailability of standardized benchmark corpora, especially for South Asian languages. A corpus is a vital resource for developing and evaluating text-in-image systems that reuse local news, in general and specifically for the Urdu language. The lack of indexing, primarily semantic indexing, of daily news items makes them impracticable for any querying. Moreover, even the most straightforward search facility does not support these unindexed news resources. Our study addresses this gap by associating and marking newspaper images in one of the widely spoken but under-resourced languages, i.e., Urdu. The present work proposes a method to build a benchmark corpus of news in image form by introducing a web crawler. The corpus is then semantically linked and annotated with daily news items. Two techniques are proposed for image annotation: free annotation and fixed cross-examination annotation. The second technique achieved higher accuracy. A news ontology was built in Protégé using the Web Ontology Language (OWL), and the annotations were indexed under it. An application was also built and linked with Protégé so that readers and journalists have an interface to query the news items directly. Similarly, news items linked together will provide complete coverage and bring together different opinions at a single location for readers to do the analysis themselves.
Funding: Supported by the National Basic Research Priorities Program (No. 2013CB329502), the National High-tech R&D Program of China (No. 2012AA011003), the National Natural Science Foundation of China (No. 61035003, 61072085, 60933004, 60903141), and the National Science and Technology Support Program of China (No. 2012BA107B02).
Abstract: A novel image auto-annotation method is presented based on the probabilistic latent semantic analysis (PLSA) model and multiple Markov random fields (MRF). A PLSA model with asymmetric modalities is first constructed to estimate the joint probability between images and semantic concepts. Then a subgraph is extracted to serve as the corresponding structure of the Markov random fields, and inference over it is performed by iterated conditional modes so as to obtain the final annotation for the image. The novelty of our method mainly lies in two aspects: exploiting PLSA to estimate the joint probability between images and semantic concepts, and exploiting multiple MRF to further explore the semantic context among keywords for accurate image annotation. To demonstrate the effectiveness of this approach, an experiment on the Corel5k dataset is conducted, and its results compare favorably with the current state-of-the-art approaches.
基金Supported by the National Basic Research Priorities Programme(No.2013CB329502)the National High Technology Research and Development Programme of China(No.2012AA011003)+1 种基金the Natural Science Basic Research Plan in Shanxi Province of China(No.2014JQ2-6036)the Science and Technology R&D Program of Baoji City(No.203020013,2013R2-2)
Abstract: This paper presents a new method for refining image annotation by integrating probabilistic latent semantic analysis (PLSA) with a conditional random field (CRF). First, a PLSA model with asymmetric modalities is constructed to predict a candidate set of annotations with confidence scores; the semantic relationships among the candidate annotations are then modeled with a conditional random field. In the CRF, the confidence scores generated by the PLSA model and the Flickr distance between pairs of candidate annotations are used as local evidence and contextual potentials, respectively. The novelty of the method lies mainly in two aspects: exploiting PLSA to predict a candidate set of annotations with confidence scores, and using the CRF to further explore the semantic context among candidate annotations for precise image annotation. To demonstrate the effectiveness of the proposed method, an experiment is conducted on the standard Corel dataset, and the results compare favorably with several state-of-the-art approaches.
Funding: Project (60801053) supported by the National Natural Science Foundation of China; Project (4082025) supported by the Beijing Natural Science Foundation, China; Project (20070004037) supported by the Doctoral Foundation of China; Projects (2009JBM135, 2011JBM023) supported by the Fundamental Research Funds for the Central Universities of China; Project (151139522) supported by the Hongguoyuan Innovative Talent Program of Beijing Jiaotong University, China; Project (YB20081000401) supported by the Beijing Excellent Doctoral Thesis Program, China; Project (2006CB303105) supported by the National Basic Research Program of China.
Abstract: To improve motion-graph-based motion synthesis, semantic control was introduced. Hybrid motion features, including both numerical features and user-defined semantic relational features, were extracted to encode the characteristic aspects of the character's poses in the given motion sequences. Motion templates were then automatically derived from the training motions to capture the spatio-temporal characteristics of an entire class of semantically related motions. The data streams of motion documents were automatically annotated with semantic motion class labels by matching them against their respective motion class templates. Finally, semantic control was introduced into motion-graph-based human motion synthesis. Motion synthesis experiments demonstrate the effectiveness of the approach, which gives users a higher level of semantically intuitive control and high-quality human motion synthesis from a motion capture database.
Abstract: To bridge the semantic gap that exists in image retrieval, this paper proposes an approach that combines generative and discriminative learning for automatic image annotation and retrieval. We first present continuous probabilistic latent semantic analysis (PLSA) to model continuous quantities. We then propose a hybrid framework that employs continuous PLSA to model the visual features of images in the generative learning stage and uses ensembles of classifier chains to classify the multi-label data in the discriminative learning stage. Because the framework combines the advantages of generative and discriminative learning, it can predict semantic annotations precisely for unseen images. Finally, we conduct a series of experiments on a standard Corel dataset. The experimental results show that our approach outperforms many state-of-the-art approaches.
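The discriminative stage mentioned here, ensembles of classifier chains, can be sketched as follows. This is an illustrative sketch under stated assumptions rather than the paper's system: the nearest-centroid base learner is a swapped-in stand-in for whatever classifier the authors used, the raw feature vectors stand in for continuous-PLSA outputs, and every class and function name is hypothetical.

```python
import random


class NearestCentroid:
    """Tiny binary base learner (an assumption, chosen to keep the sketch short)."""

    def fit(self, X, y):
        self.centroids = {}
        for label in (0, 1):
            rows = [x for x, t in zip(X, y) if t == label]
            if rows:
                self.centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
        return self

    def predict_one(self, x):
        def sq_dist(c):
            return sum((a - b) ** 2 for a, b in zip(x, c))
        return min(self.centroids, key=lambda lab: sq_dist(self.centroids[lab]))


class ClassifierChain:
    """One chain: each label's model also sees the labels earlier in the order."""

    def __init__(self, order):
        self.order = list(order)

    def fit(self, X, Y):
        self.models = []
        for pos, label in enumerate(self.order):
            # Augment the features with the true values of the preceding labels.
            Xa = [list(x) + [y[j] for j in self.order[:pos]] for x, y in zip(X, Y)]
            self.models.append(NearestCentroid().fit(Xa, [y[label] for y in Y]))
        return self

    def predict_one(self, x):
        feats, out = list(x), {}
        for label, model in zip(self.order, self.models):
            out[label] = model.predict_one(feats)
            feats.append(out[label])  # feed the prediction to later links
        return [out[j] for j in range(len(self.order))]


def fit_ensemble(X, Y, n_chains=5, seed=0):
    """Train several chains, each with a random label order."""
    rng = random.Random(seed)
    chains = []
    for _ in range(n_chains):
        order = list(range(len(Y[0])))
        rng.shuffle(order)
        chains.append(ClassifierChain(order).fit(X, Y))
    return chains


def predict_ensemble(chains, x, threshold=0.5):
    """Average the chains' votes per label and threshold the result."""
    votes = [chain.predict_one(x) for chain in chains]
    return [
        1 if sum(v[j] for v in votes) / len(votes) >= threshold else 0
        for j in range(len(votes[0]))
    ]


# Toy data: two features, two labels (illustrative values only).
X = [[0.0, 0.0], [0.1, 0.9], [0.9, 0.1], [1.0, 1.0]]
Y = [[0, 0], [0, 1], [1, 0], [1, 1]]
chains = fit_ensemble(X, Y)
print(predict_ensemble(chains, [0.95, 0.05]))
```

Randomising the label order per chain is what makes the ensemble useful: a single chain's predictions depend on its order, and averaging over several orders reduces that sensitivity.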
Funding: Supported by the National Social Science Foundation of China (Grant No. 11CTQ003).
Abstract: Purpose: The objective of this paper is to test the effect of ontology-based semantic annotation on the performance of document retrieval. Design/methodology/approach: An integrated document retrieval method is put forward in which the entities in documents are annotated using an upper ontology and a domain ontology, and the documents are then indexed by the entity annotations as well as by traditional keywords. Findings: The results show that structured entity retrieval and relation retrieval can be realized with the ontology-based entity index, which is beyond the ability of traditional keyword-based retrieval. The experiment also shows that the recall and precision of document retrieval are improved effectively. Research limitations: Because our current tourism domain ontology is small, document retrieval with the ontology-based semantic index is limited by the size of the ontology and the precision of the semantic annotation. In addition, the semantic annotation algorithm relies mainly on the current information extraction strategy of the KIM Platform, so the performance of the disambiguation and relation extraction algorithms needs further improvement. Practical implications: Our method can improve the efficiency of document retrieval systems, which facilitates knowledge and document management in corporations, governments, and other organizations. Originality/value: The integrated document retrieval method proposed in the paper combines the entity index, based on the general ontology and the domain ontology, with the keyword index. Our results verify the effectiveness of the combined index strategy.
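The combined index strategy described in this abstract can be pictured with a small sketch. It assumes that entity annotation has already been done by some external tool; the document structure, the ontology class names ("Lake", "City"), and the function names are all hypothetical and are not taken from the paper or from the KIM Platform.

```python
from collections import defaultdict


def build_indexes(docs):
    """Build a keyword index and an entity-class index over the same documents.

    docs: {doc_id: {"text": str, "entities": [(surface_form, ontology_class), ...]}}
    The entity annotations are assumed to come from an upstream annotator.
    """
    keyword_index = defaultdict(set)
    entity_index = defaultdict(set)
    for doc_id, doc in docs.items():
        for token in doc["text"].lower().split():
            keyword_index[token].add(doc_id)
        for _surface, ontology_class in doc["entities"]:
            entity_index[ontology_class].add(doc_id)
    return keyword_index, entity_index


def search(keyword_index, entity_index, keywords=(), entity_classes=()):
    """Return documents matching every keyword AND every entity class."""
    result = None
    for kw in keywords:
        hits = keyword_index.get(kw.lower(), set())
        result = hits if result is None else result & hits
    for cls in entity_classes:
        hits = entity_index.get(cls, set())
        result = hits if result is None else result & hits
    return sorted(result or [])


# Illustrative tourism-style documents (made up for this sketch).
docs = {
    "d1": {"text": "Hotel near the West Lake", "entities": [("West Lake", "Lake")]},
    "d2": {"text": "Lake cruise tickets", "entities": []},
    "d3": {"text": "Hotel booking in Beijing", "entities": [("Beijing", "City")]},
}
kw_idx, ent_idx = build_indexes(docs)
print(search(kw_idx, ent_idx, keywords=["lake"]))                           # keyword only
print(search(kw_idx, ent_idx, keywords=["hotel"], entity_classes=["Lake"]))  # combined
```

The keyword query for "lake" matches any document containing the word, whereas adding the entity class restricts results to documents in which an annotated entity of that class appears. This is the kind of structured entity retrieval the abstract contrasts with keyword-only search.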
Abstract: This paper presents a process for constructing a project memory based on semantic annotation. A project memory concerns the representation and identification of the experience acquired while projects are carried out. The main feature of the approach is that semantic annotation is used to build a continuous semantic design rationale process. We propose semantic traceability in four stages: identifying, structuring, annotating, and integrating. The identification and structuring phases use a model called the Extended Marguerite model, which fully considers the objectives of the project memory. The annotation phase exploits the results of the preceding phase to prepare the final phase. Examples are presented from a case study in an Algerian firm called ENMTP. The architecture supporting the modelling engine is presented. Finally, an evaluation of the degree of semantic annotation provided by the proposed process is given.
Funding: Supported by the National Natural Science Foundation of China (61202193, 61202304), the Major Projects of Chinese National Social Science Foundation (11&ZD189), and the Chinese Postdoctoral Science Foundation (2013M540593, 2014T70722).
Abstract: In this paper we propose a novel model, the "recursive directed graph", based on feature structure, and apply it to represent the semantic relations of postpositive attributive structures in biomedical texts. The usage of postpositive attributives is complex and variable, especially in three categories: present participle phrases, past participle phrases, and prepositional phrases used as postpositive attributives, which consistently make automatic parsing difficult. We summarize these categories and annotate their semantic information. Compared with dependency structure, feature structure, being a recursive directed graph, enhances semantic information extraction in the biomedical field. The annotation results show that the recursive directed graph is more suitable for extracting complex semantic relations in biomedical text mining.
Funding: Supported by the National Natural Science Foundation of China under Grant No. 90818001 and the Natural Science Foundation of Shandong Province of China under Grant No. Y2007G24.
Abstract: Semantic annotation of Web objects is a key problem for Web information extraction. The Web contains an abundance of useful semi-structured information about real-world objects, and empirical study shows that Web information about objects of the same type across different Web sites exhibits strong two-dimensional sequence characteristics and correlative characteristics. Conditional Random Fields (CRFs) are the state-of-the-art approach for exploiting sequence characteristics to improve labeling. However, because of the correlative characteristics between Web object elements, previous CRFs are limited for semantic annotation of Web objects and cannot deal efficiently with the long-distance dependencies between Web object elements. To better incorporate long-distance dependencies, this paper first describes them by correlative edges, which are built by making use of structured information and the characteristics of records from external databases. It then presents two-dimensional Correlative-Chain Conditional Random Fields (2DCC-CRFs) for semantic annotation of Web objects. This approach extends a classic model, two-dimensional Conditional Random Fields (2DCRFs), by adding correlative edges. Experimental results on a large amount of real-world data collected from diverse domains show that the proposed approach can significantly improve the accuracy of semantic annotation of Web objects.