Distributed data sources which employ taxonomy hierarchy to describe the contents of their objects are considered, and a super-peer-based semantic overlay network (SSON) is proposed for sharing and searching their d...Distributed data sources which employ taxonomy hierarchy to describe the contents of their objects are considered, and a super-peer-based semantic overlay network (SSON) is proposed for sharing and searching their data objects. In SSON, peers are dynamically clustered into many semantic clusters based on the semantics of their data objects and organized in the semantic clusters into a semantic overlay network. Each semantic cluster consists of a super-peer and more peers, and is only responsible for answering queries in its semantic subspace. A query is first routed to the appropriate semantic clusters by an efficient searching algorithm, and then it is forwarded to the specific peers that hold the relevant data objects. Experimental results indicate that SSON has good scalability and achieves a competitive trade-off between search efficiency and costs.展开更多
In order to mine production and security information from security supervising data and to ensure security and safety involved in production and decision-making,a clustering analysis algorithm for security supervising...In order to mine production and security information from security supervising data and to ensure security and safety involved in production and decision-making,a clustering analysis algorithm for security supervising data based on a semantic description in coal mines is studied.First,the semantic and numerical-based hybrid description method of security supervising data in coal mines is described.Secondly,the similarity measurement method of semantic and numerical data are separately given and a weight-based hybrid similarity measurement method for the security supervising data based on a semantic description in coal mines is presented.Thirdly,taking the hybrid similarity measurement method as the distance criteria and using a grid methodology for reference,an improved CURE clustering algorithm based on the grid is presented.Finally,the simulation results of a security supervising data set in coal mines validate the efficiency of the algorithm.展开更多
Data aggregation from various web sources is very significant for web data analysis domain. In ad- dition, the recognition of coherence micro cluster is one of the most interesting issues in the field of data aggregat...Data aggregation from various web sources is very significant for web data analysis domain. In ad- dition, the recognition of coherence micro cluster is one of the most interesting issues in the field of data aggregation. Until now, many algorithms have been proposed to work on this issue. However, the deficiency of these solutions is that they cannot recognize the micro-cluster data stream accurately. A semantic-based coherent micro-cluster recognition algorithm for hybrid web data stream is nronosed.Firstly, an objective function is proposed to recognize the coherence micro-cluster and then the coher- ence micro-cluster recognition algorithm for hybrid web data stream based on semantic is raised. Fi-展开更多
As the popularity of digital images is rapidly increasing on the Internet, research on technologies for semantic image classification has become an important research topic. However, the well-known content-based image...As the popularity of digital images is rapidly increasing on the Internet, research on technologies for semantic image classification has become an important research topic. However, the well-known content-based image classification methods do not overcome the so-called semantic gap problem in which low-level visual features cannot represent the high-level semantic content of images. Image classification using visual and textual information often performs poorly since the extracted textual features are often too limited to accurately represent the images. In this paper, we propose a semantic image classification ap- proach using multi-context analysis. For a given image, we model the relevant textual information as its multi-modal context, and regard the related images connected by hyperlinks as its link context. Two kinds of context analysis models, i.e., cross-modal correlation analysis and link-based correlation model, are used to capture the correlation among different modals of features and the topical dependency among images induced by the link structure. We propose a new collective classification model called relational support vector classifier (RSVC) based on the well-known Support Vector Machines (SVMs) and the link-based cor- relation model. Experiments showed that the proposed approach significantly improved classification accuracy over that of SVM classifiers using visual and/or textual features.展开更多
We introduce a novel Sermntic-Category- Tree (SCT) model to present the sen-antic structure of a sentence for Chinese-English Machine Translation (MT). We use the SCT model to handle the reordering in a hierarchic...We introduce a novel Sermntic-Category- Tree (SCT) model to present the sen-antic structure of a sentence for Chinese-English Machine Translation (MT). We use the SCT model to handle the reordering in a hierarchical structure in which one reordering is dependent on the others. Different from other reordering approaches, we handle the reordering at three levels: sentence level, chunk level, and word level. The chunk-level reordering is dependent on the sentence-level reordering, and the word-level reordering is dependent on the chunk-level reordering. In this paper, we formally describe the SCT model and discuss the translation strategy based on the SCT model. Further, we present an algorithm for analyzing the source language in SCT and transforming the source SCT into the target SCT. We apply the SCT model to a role-based patent text MT to evaluate the ability of the SCT model. The experimental results show that SCT is efficient in handling the hierarehical reordering operation in MT.展开更多
This research paper mainly discusses gender of English nouns and its corresponding issues. Gender in other Indo-European languages is a grammatical abstract notion, but English gender is a semantic concrete conception...This research paper mainly discusses gender of English nouns and its corresponding issues. Gender in other Indo-European languages is a grammatical abstract notion, but English gender is a semantic concrete conception. English nouns can be divided into four categories: masculine, feminine, common and neuter. Gender genre of an English noun involves the choice of a pronoun that is employed to substitute it. Gender of the pronoun should be identical with its referent. However, the rule may be broken under special conditions. English has lost most word-ending inflectional changes, including grammatical gender of nouns.展开更多
On the semantic web, data interoperability and ontology heterogeneity are becoming ever more important issues. To resolve these problems, multiple classification methods can be used to learn the matching between ontol...On the semantic web, data interoperability and ontology heterogeneity are becoming ever more important issues. To resolve these problems, multiple classification methods can be used to learn the matching between ontologies. The paper uses the general statistic classification method to discover category features in data instances and use the first-order learning algorithm FOIL to exploit the semantic relations among data instances. When using multistrategy learning approach, a central problem is the evaluation of multistrategy classifiers. The goal and the conditions of using multistrategy classifiers within ontology matching are different from the ones for general text classification. This paper describes the combination rule of multiple classifiers called the Best Outstanding Champion, which is suitable for heterogeneous ontology mapping. On the prediction results of individual methods, the method can well accumulate the correct matching of alone classifier. The experiments show that the approach achieves high accuracy on real-world domain.展开更多
Cognitive linguistics is a new approach to language study, which can offer better explanation for language phenomena, and provide more theoretical instructions for English learning. In recent years, the prototype theo...Cognitive linguistics is a new approach to language study, which can offer better explanation for language phenomena, and provide more theoretical instructions for English learning. In recent years, the prototype theory, as the cornerstone of cognitive linguistics, has been employed in different research fields, such as phonology, syntax, semantics, English teaching, etc..This thesis attempts to review and discuss the researches done so far on semantics and categorization based on a cognitive theory -- the prototype theory.展开更多
The ability of achieving a semantic understanding of workspaces is an important capability for mobile robot. A method is proposed to categorize different places in a typical indoor environment by using a Kinect sensor...The ability of achieving a semantic understanding of workspaces is an important capability for mobile robot. A method is proposed to categorize different places in a typical indoor environment by using a Kinect sensors for mobile robot exploration. At first, the invariant feature based images stitching approach is adopted to form a panoramic image according to Kinect visual information, and the translation between Kinect depth information and obstacle distance information is performed to obtain virtual LIDAR data. Then, the semantic classifier is designed by using convolutional neural networks (CNN) for indoor place eategorization based on Kinect visual observations with panoramic view. At last, a frontier-based exploration method is applied to carry out indoor autonomous exploration of mo- bile robots, which integrates the CNN-based categorization approach. The proposed method has been implemented and tested on a real robot, and experiment results demonstrate the approach effective- ness on solving the semantic categorization problem for mobile robot exploration.展开更多
The paper considers the problem of semantic processing of web documents by designing an approach, which combines extracted semantic document model and domain- related knowledge base. The knowledge base is populated wi...The paper considers the problem of semantic processing of web documents by designing an approach, which combines extracted semantic document model and domain- related knowledge base. The knowledge base is populated with learnt classification rules categorizing documents into topics. Classification provides for the reduction of the dimensio0ality of the document feature space. The semantic model of retrieved web documents is semantically labeled by querying domain ontology and processed with content-based classification method. The model obtained is mapped to the existing knowledge base by implementing inference algorithm. It enables models of the same semantic type to be recognized and integrated into the knowledge base. The approach provides for the domain knowledge integration and assists the extraction and modeling web documents semantics. Implementation results of the proposed approach are presented.展开更多
基金The National Natural Science Foundation of China(No60573089)the Natural Science Foundation of Liaoning Province(No20052031)the National High Technology Research and Develop-ment Program of China (863Program)(No2006AA09Z139)
文摘Distributed data sources which employ taxonomy hierarchy to describe the contents of their objects are considered, and a super-peer-based semantic overlay network (SSON) is proposed for sharing and searching their data objects. In SSON, peers are dynamically clustered into many semantic clusters based on the semantics of their data objects and organized in the semantic clusters into a semantic overlay network. Each semantic cluster consists of a super-peer and more peers, and is only responsible for answering queries in its semantic subspace. A query is first routed to the appropriate semantic clusters by an efficient searching algorithm, and then it is forwarded to the specific peers that hold the relevant data objects. Experimental results indicate that SSON has good scalability and achieves a competitive trade-off between search efficiency and costs.
基金The National Natural Science Foundation of China(No.50674086)Specialized Research Fund for the Doctoral Program of Higher Education(No.20060290508)the Postdoctoral Scientific Program of Jiangsu Province(No.0701045B)
文摘In order to mine production and security information from security supervising data and to ensure security and safety involved in production and decision-making,a clustering analysis algorithm for security supervising data based on a semantic description in coal mines is studied.First,the semantic and numerical-based hybrid description method of security supervising data in coal mines is described.Secondly,the similarity measurement method of semantic and numerical data are separately given and a weight-based hybrid similarity measurement method for the security supervising data based on a semantic description in coal mines is presented.Thirdly,taking the hybrid similarity measurement method as the distance criteria and using a grid methodology for reference,an improved CURE clustering algorithm based on the grid is presented.Finally,the simulation results of a security supervising data set in coal mines validate the efficiency of the algorithm.
基金Supported by the National High Technology Research and Development Programme of China(No.2011AA120300,2011AA120302)the National Key Technology Support Program of China(No.2013BAH66F02)
文摘Data aggregation from various web sources is very significant for web data analysis domain. In ad- dition, the recognition of coherence micro cluster is one of the most interesting issues in the field of data aggregation. Until now, many algorithms have been proposed to work on this issue. However, the deficiency of these solutions is that they cannot recognize the micro-cluster data stream accurately. A semantic-based coherent micro-cluster recognition algorithm for hybrid web data stream is nronosed.Firstly, an objective function is proposed to recognize the coherence micro-cluster and then the coher- ence micro-cluster recognition algorithm for hybrid web data stream based on semantic is raised. Fi-
基金Project supported by the Hi-Tech Research and Development Pro-gram (863) of China (No. 2003AA119010), and China-American Digital Academic Library (CADAL) Project (No. CADAL2004002)
文摘As the popularity of digital images is rapidly increasing on the Internet, research on technologies for semantic image classification has become an important research topic. However, the well-known content-based image classification methods do not overcome the so-called semantic gap problem in which low-level visual features cannot represent the high-level semantic content of images. Image classification using visual and textual information often performs poorly since the extracted textual features are often too limited to accurately represent the images. In this paper, we propose a semantic image classification ap- proach using multi-context analysis. For a given image, we model the relevant textual information as its multi-modal context, and regard the related images connected by hyperlinks as its link context. Two kinds of context analysis models, i.e., cross-modal correlation analysis and link-based correlation model, are used to capture the correlation among different modals of features and the topical dependency among images induced by the link structure. We propose a new collective classification model called relational support vector classifier (RSVC) based on the well-known Support Vector Machines (SVMs) and the link-based cor- relation model. Experiments showed that the proposed approach significantly improved classification accuracy over that of SVM classifiers using visual and/or textual features.
基金supported by the National High Technology Research and Development Program of China under Grant No.2012AA011104the Fundamental Research Funds for the Center Universities
文摘We introduce a novel Sermntic-Category- Tree (SCT) model to present the sen-antic structure of a sentence for Chinese-English Machine Translation (MT). We use the SCT model to handle the reordering in a hierarchical structure in which one reordering is dependent on the others. Different from other reordering approaches, we handle the reordering at three levels: sentence level, chunk level, and word level. The chunk-level reordering is dependent on the sentence-level reordering, and the word-level reordering is dependent on the chunk-level reordering. In this paper, we formally describe the SCT model and discuss the translation strategy based on the SCT model. Further, we present an algorithm for analyzing the source language in SCT and transforming the source SCT into the target SCT. We apply the SCT model to a role-based patent text MT to evaluate the ability of the SCT model. The experimental results show that SCT is efficient in handling the hierarehical reordering operation in MT.
文摘This research paper mainly discusses gender of English nouns and its corresponding issues. Gender in other Indo-European languages is a grammatical abstract notion, but English gender is a semantic concrete conception. English nouns can be divided into four categories: masculine, feminine, common and neuter. Gender genre of an English noun involves the choice of a pronoun that is employed to substitute it. Gender of the pronoun should be identical with its referent. However, the rule may be broken under special conditions. English has lost most word-ending inflectional changes, including grammatical gender of nouns.
文摘On the semantic web, data interoperability and ontology heterogeneity are becoming ever more important issues. To resolve these problems, multiple classification methods can be used to learn the matching between ontologies. The paper uses the general statistic classification method to discover category features in data instances and use the first-order learning algorithm FOIL to exploit the semantic relations among data instances. When using multistrategy learning approach, a central problem is the evaluation of multistrategy classifiers. The goal and the conditions of using multistrategy classifiers within ontology matching are different from the ones for general text classification. This paper describes the combination rule of multiple classifiers called the Best Outstanding Champion, which is suitable for heterogeneous ontology mapping. On the prediction results of individual methods, the method can well accumulate the correct matching of alone classifier. The experiments show that the approach achieves high accuracy on real-world domain.
文摘Cognitive linguistics is a new approach to language study, which can offer better explanation for language phenomena, and provide more theoretical instructions for English learning. In recent years, the prototype theory, as the cornerstone of cognitive linguistics, has been employed in different research fields, such as phonology, syntax, semantics, English teaching, etc..This thesis attempts to review and discuss the researches done so far on semantics and categorization based on a cognitive theory -- the prototype theory.
基金Supported by the National Key Basic Research Program of China(No.2013CB035503)
文摘The ability of achieving a semantic understanding of workspaces is an important capability for mobile robot. A method is proposed to categorize different places in a typical indoor environment by using a Kinect sensors for mobile robot exploration. At first, the invariant feature based images stitching approach is adopted to form a panoramic image according to Kinect visual information, and the translation between Kinect depth information and obstacle distance information is performed to obtain virtual LIDAR data. Then, the semantic classifier is designed by using convolutional neural networks (CNN) for indoor place eategorization based on Kinect visual observations with panoramic view. At last, a frontier-based exploration method is applied to carry out indoor autonomous exploration of mo- bile robots, which integrates the CNN-based categorization approach. The proposed method has been implemented and tested on a real robot, and experiment results demonstrate the approach effective- ness on solving the semantic categorization problem for mobile robot exploration.
文摘The paper considers the problem of semantic processing of web documents by designing an approach, which combines extracted semantic document model and domain- related knowledge base. The knowledge base is populated with learnt classification rules categorizing documents into topics. Classification provides for the reduction of the dimensio0ality of the document feature space. The semantic model of retrieved web documents is semantically labeled by querying domain ontology and processed with content-based classification method. The model obtained is mapped to the existing knowledge base by implementing inference algorithm. It enables models of the same semantic type to be recognized and integrated into the knowledge base. The approach provides for the domain knowledge integration and assists the extraction and modeling web documents semantics. Implementation results of the proposed approach are presented.