This paper gives a semantic fuzzy retrieval method of multimedia object, discusses the principle of fuzzy semantic retrieval technique, presents a fuzzy reasoning mechanism based on the knowledge base, and designs the...This paper gives a semantic fuzzy retrieval method of multimedia object, discusses the principle of fuzzy semantic retrieval technique, presents a fuzzy reasoning mechanism based on the knowledge base, and designs the relevant reasoning algorithms. Researchful results have innovative significance.展开更多
Long-document semantic measurement has great significance in many applications such as semantic searchs, plagiarism detection, and automatic technical surveys. However, research efforts have mainly focused on the sema...Long-document semantic measurement has great significance in many applications such as semantic searchs, plagiarism detection, and automatic technical surveys. However, research efforts have mainly focused on the semantic similarity of short texts. Document-level semantic measurement remains an open issue due to problems such as the omission of background knowledge and topic transition. In this paper, we propose a novel semantic matching method for long documents in the academic domain. To accurately represent the general meaning of an academic article, we construct a semantic profile in which key semantic elements such as the research purpose, methodology, and domain are included and enriched. As such, we can obtain the overall semantic similarity of two papers by computing the distance between their profiles. The distances between the concepts of two different semantic profiles are measured by word vectors. To improve the semantic representation quality of word vectors, we propose a joint word-embedding model for incorporating a domain-specific semantic relation constraint into the traditional context constraint. Our experimental results demonstrate that, in the measurement of document semantic similarity, our approach achieves substantial improvement over state-of-the-art methods, and our joint word-embedding model produces significantly better word representations than traditional word-embedding models.展开更多
A new scheme is presented to detect a large number ofKeywordsin voice controlled switchboard tasks. The new scheme is based on two stages. In the first stage, N best syllable candidates with their corresponding acous...A new scheme is presented to detect a large number ofKeywordsin voice controlled switchboard tasks. The new scheme is based on two stages. In the first stage, N best syllable candidates with their corresponding acoustic scores are generated by an acoustic recognizer. In the second stage, a semantic model based parser is applied to determine the optimum keywords by searching through the lattice of N best candidates. The experimental results show that when the spoken input deviates from the predefined syntactic constraints, the parser can also demonstrate high performance. For comparison purposes, the most common way to incorporate the syntactic knowledge of the task directly into the acoustic recognizer in the form of a finite state network is also investigated. Furthermore, to address the sparse data problems, out of domain data in the form of newspaper text are used to obtain a more robust combined semantic model. The experiments show that the combined semantic model can improve the keywords detection rate from 90.07% to 92.91% when 80 ungrammatical sentences which do not conform to the task grammar are used as testing material.展开更多
Emotional space refers to a multi-dimensional emotional model that describes a group of subjective feelings or emotions. Since the existing discrete emotional space is mainly aimed at human’s primary emotions, it can...Emotional space refers to a multi-dimensional emotional model that describes a group of subjective feelings or emotions. Since the existing discrete emotional space is mainly aimed at human’s primary emotions, it cannot describe the complex emotions evoked when watching movies. In order to solve this problem, an emotional fusion space for videos was constructed by selecting movies and TV dramas with rich emotional semantics as the research objects. Firstly, emotional words based on movie and TV drama videos are acquired and analyzed by using subjective evaluation and semantic analysis methods. Then, the emotional word vectors obtained from the above analysis are fused, reduced dimension by t-distributed stochastic neighbor embedding(t-SNE) algorithm, and clustered by bisecting K-means clustering algorithm to get a discrete emotional space for movie and TV drama videos. This emotional fusion space can obtain different categories by changing the value of the emotion classification number without re-labeling and calculation.展开更多
文摘This paper gives a semantic fuzzy retrieval method of multimedia object, discusses the principle of fuzzy semantic retrieval technique, presents a fuzzy reasoning mechanism based on the knowledge base, and designs the relevant reasoning algorithms. Researchful results have innovative significance.
基金supported by the Foundation of the State Key Laboratory of Software Development Environment(No.SKLSDE-2015ZX-04)
文摘Long-document semantic measurement has great significance in many applications such as semantic searchs, plagiarism detection, and automatic technical surveys. However, research efforts have mainly focused on the semantic similarity of short texts. Document-level semantic measurement remains an open issue due to problems such as the omission of background knowledge and topic transition. In this paper, we propose a novel semantic matching method for long documents in the academic domain. To accurately represent the general meaning of an academic article, we construct a semantic profile in which key semantic elements such as the research purpose, methodology, and domain are included and enriched. As such, we can obtain the overall semantic similarity of two papers by computing the distance between their profiles. The distances between the concepts of two different semantic profiles are measured by word vectors. To improve the semantic representation quality of word vectors, we propose a joint word-embedding model for incorporating a domain-specific semantic relation constraint into the traditional context constraint. Our experimental results demonstrate that, in the measurement of document semantic similarity, our approach achieves substantial improvement over state-of-the-art methods, and our joint word-embedding model produces significantly better word representations than traditional word-embedding models.
基金the State High-Tech Developments Planof China !( No. 863 -3 0 6-0 2 -1) Chinese211Engineering Project!( No.9610 3 -2 )
文摘A new scheme is presented to detect a large number ofKeywordsin voice controlled switchboard tasks. The new scheme is based on two stages. In the first stage, N best syllable candidates with their corresponding acoustic scores are generated by an acoustic recognizer. In the second stage, a semantic model based parser is applied to determine the optimum keywords by searching through the lattice of N best candidates. The experimental results show that when the spoken input deviates from the predefined syntactic constraints, the parser can also demonstrate high performance. For comparison purposes, the most common way to incorporate the syntactic knowledge of the task directly into the acoustic recognizer in the form of a finite state network is also investigated. Furthermore, to address the sparse data problems, out of domain data in the form of newspaper text are used to obtain a more robust combined semantic model. The experiments show that the combined semantic model can improve the keywords detection rate from 90.07% to 92.91% when 80 ungrammatical sentences which do not conform to the task grammar are used as testing material.
基金supported by the Key Laboratory of the Ministry of Culture and Tourism (WLBSYS2005)the Fundamental Research Funds for the Central Universities (CUC19ZD005)。
文摘Emotional space refers to a multi-dimensional emotional model that describes a group of subjective feelings or emotions. Since the existing discrete emotional space is mainly aimed at human’s primary emotions, it cannot describe the complex emotions evoked when watching movies. In order to solve this problem, an emotional fusion space for videos was constructed by selecting movies and TV dramas with rich emotional semantics as the research objects. Firstly, emotional words based on movie and TV drama videos are acquired and analyzed by using subjective evaluation and semantic analysis methods. Then, the emotional word vectors obtained from the above analysis are fused, reduced dimension by t-distributed stochastic neighbor embedding(t-SNE) algorithm, and clustered by bisecting K-means clustering algorithm to get a discrete emotional space for movie and TV drama videos. This emotional fusion space can obtain different categories by changing the value of the emotion classification number without re-labeling and calculation.