In order to solve the problem that current search engines provide query-oriented searches rather than user-oriented ones, and that this improper orientation leads to the search engines' inability to meet the personal...In order to solve the problem that current search engines provide query-oriented searches rather than user-oriented ones, and that this improper orientation leads to the search engines' inability to meet the personalized requirements of users, a novel method based on probabilistic latent semantic analysis (PLSA) is proposed to convert query-oriented web search to user-oriented web search. First, a user profile represented as a user' s topics of interest vector is created by analyzing the user' s click through data based on PLSA, then the user' s queries are mapped into categories based on the user' s preferences, and finally the result list is re-ranked according to the user' s interests based on the new proposed method named user-oriented PageRank (UOPR). Experiments on real life datasets show that the user-oriented search system that adopts PLSA takes considerable consideration of user preferences and better satisfies a user' s personalized information needs.展开更多
To further enhance the efficiencies of search engines,achieving capabilities of searching,indexing and locating the information in the deep web,latent semantic analysis is a simple and effective way.Through the latent...To further enhance the efficiencies of search engines,achieving capabilities of searching,indexing and locating the information in the deep web,latent semantic analysis is a simple and effective way.Through the latent semantic analysis of the attributes in the query interfaces and the unique entrances of the deep web sites,the hidden semantic structure information can be retrieved and dimension reduction can be achieved to a certain extent.Using this semantic structure information,the contents in the site can be inferred and the similarity measures among sites in deep web can be revised.Experimental results show that latent semantic analysis revises and improves the semantic understanding of the query form in the deep web,which overcomes the shortcomings of the keyword-based methods.This approach can be used to effectively search the most similar site for any given site and to obtain a site list which conforms to the restrictions one specifies.展开更多
A novel image auto-annotation method is presented based on probabilistic latent semantic analysis(PLSA) model and multiple Markov random fields(MRF).A PLSA model with asymmetric modalities is first constructed to esti...A novel image auto-annotation method is presented based on probabilistic latent semantic analysis(PLSA) model and multiple Markov random fields(MRF).A PLSA model with asymmetric modalities is first constructed to estimate the joint probability between images and semantic concepts,then a subgraph is extracted served as the corresponding structure of Markov random fields and inference over it is performed by the iterative conditional modes so as to capture the final annotation for the image.The novelty of our method mainly lies in two aspects:exploiting PLSA to estimate the joint probability between images and semantic concepts as well as multiple MRF to further explore the semantic context among keywords for accurate image annotation.To demonstrate the effectiveness of this approach,an experiment on the Corel5 k dataset is conducted and its results are compared favorably with the current state-of-the-art approaches.展开更多
In this study, methods to classify advertising reviews from shopping mall reviews are suggested. Advertising reviews are mostly written by companies and contain advertising contents. There are a few studies regarding ...In this study, methods to classify advertising reviews from shopping mall reviews are suggested. Advertising reviews are mostly written by companies and contain advertising contents. There are a few studies regarding the classification of opinion spam documents, which is very rare in foreign studies; however, there are no studies that classify advertising reviews from Korean reviews. In this study, the Naive Bayes Classifier was used to classify review documents and the POS (Part-of-Speech)-Tagging and bigram methods were used to extract specific words. The frequency calculation methods for the probability value of specific words were: (1) The general number of appearances of words (2) the frequency calculation of specific words through the suggested Latent Semantic Analysis (LSA), and by recalculating the result from (1) in (2), the performances of each method were compared. As a result, the methods from (2) showed 88.43% accuracy which is 8.89% higher than 79.54% which was the previous result from using the POS-Tagging + Bigram method. Therefore, it was proved that the method suggested in this study is effective at classifying or extracting advertising reviews from Korean product review documents.展开更多
Although English proverbs and quotations are quite popular with language teachers and learners, and although there are sizeable body of research addressing the various aspects of English proverbs and quotations, their...Although English proverbs and quotations are quite popular with language teachers and learners, and although there are sizeable body of research addressing the various aspects of English proverbs and quotations, their potential motivational values lacks theoretical support to date. This paper intends to explore the theoretical rationales for English proverbs and quotations to be a valuable tool for inspiring teachers and textbooks in Chinese EFL teaching. Based on QIN and WEN's conceptual model of motivation (2002) as well as teacher-specific and group-specific motivational theories, it analyzes the potential uses of English proverbs and quotations, with the conclusion that many English proverbs and quotations may serve to be an impetus as well as affective/cognitive mediators in the pre-actional and actional stages of learning, and may help to promote teacher-specific and group-specific motivational dynamics, which in turn benefit student achievement.展开更多
With the spread of globalization and information technology, the status of English as a lingua franca worldwide is undisputable. Consequently, the impact of globalization on ELT (English language teaching) is phenom...With the spread of globalization and information technology, the status of English as a lingua franca worldwide is undisputable. Consequently, the impact of globalization on ELT (English language teaching) is phenomenal. In responding to the impact of global competitors and internationalization, English teaching is often regarded as a principal issue in education. English education is treated as a tool to keep up with the rapid globalization of the world economy. Discussing ELT in Taiwan as an example, this paper aims to raise caution among the periphery educators that under the impact of globalization, the imposition of western culture and language via ELT could be potentially hegemonic and harmful to the local culture and language. This paper will be organized into three parts. Firstly, the impact of globalization on the dominance of English will be discussed. Secondly, the potential threats of linguistic hegemony via ELT will be analyzed. Thirdly, proactive solutions to the hegemonic threats of ELT are proposed as the key to ride out the wave of globalization.展开更多
基金The National Natural Science Foundation of China(No60573090,60673139)
文摘In order to solve the problem that current search engines provide query-oriented searches rather than user-oriented ones, and that this improper orientation leads to the search engines' inability to meet the personalized requirements of users, a novel method based on probabilistic latent semantic analysis (PLSA) is proposed to convert query-oriented web search to user-oriented web search. First, a user profile represented as a user' s topics of interest vector is created by analyzing the user' s click through data based on PLSA, then the user' s queries are mapped into categories based on the user' s preferences, and finally the result list is re-ranked according to the user' s interests based on the new proposed method named user-oriented PageRank (UOPR). Experiments on real life datasets show that the user-oriented search system that adopts PLSA takes considerable consideration of user preferences and better satisfies a user' s personalized information needs.
文摘To further enhance the efficiencies of search engines,achieving capabilities of searching,indexing and locating the information in the deep web,latent semantic analysis is a simple and effective way.Through the latent semantic analysis of the attributes in the query interfaces and the unique entrances of the deep web sites,the hidden semantic structure information can be retrieved and dimension reduction can be achieved to a certain extent.Using this semantic structure information,the contents in the site can be inferred and the similarity measures among sites in deep web can be revised.Experimental results show that latent semantic analysis revises and improves the semantic understanding of the query form in the deep web,which overcomes the shortcomings of the keyword-based methods.This approach can be used to effectively search the most similar site for any given site and to obtain a site list which conforms to the restrictions one specifies.
基金Supported by the National Basic Research Priorities Program(No.2013CB329502)the National High-tech R&D Program of China(No.2012AA011003)+1 种基金National Natural Science Foundation of China(No.61035003,61072085,60933004,60903141)the National Scienceand Technology Support Program of China(No.2012BA107B02)
文摘A novel image auto-annotation method is presented based on probabilistic latent semantic analysis(PLSA) model and multiple Markov random fields(MRF).A PLSA model with asymmetric modalities is first constructed to estimate the joint probability between images and semantic concepts,then a subgraph is extracted served as the corresponding structure of Markov random fields and inference over it is performed by the iterative conditional modes so as to capture the final annotation for the image.The novelty of our method mainly lies in two aspects:exploiting PLSA to estimate the joint probability between images and semantic concepts as well as multiple MRF to further explore the semantic context among keywords for accurate image annotation.To demonstrate the effectiveness of this approach,an experiment on the Corel5 k dataset is conducted and its results are compared favorably with the current state-of-the-art approaches.
文摘In this study, methods to classify advertising reviews from shopping mall reviews are suggested. Advertising reviews are mostly written by companies and contain advertising contents. There are a few studies regarding the classification of opinion spam documents, which is very rare in foreign studies; however, there are no studies that classify advertising reviews from Korean reviews. In this study, the Naive Bayes Classifier was used to classify review documents and the POS (Part-of-Speech)-Tagging and bigram methods were used to extract specific words. The frequency calculation methods for the probability value of specific words were: (1) The general number of appearances of words (2) the frequency calculation of specific words through the suggested Latent Semantic Analysis (LSA), and by recalculating the result from (1) in (2), the performances of each method were compared. As a result, the methods from (2) showed 88.43% accuracy which is 8.89% higher than 79.54% which was the previous result from using the POS-Tagging + Bigram method. Therefore, it was proved that the method suggested in this study is effective at classifying or extracting advertising reviews from Korean product review documents.
文摘Although English proverbs and quotations are quite popular with language teachers and learners, and although there are sizeable body of research addressing the various aspects of English proverbs and quotations, their potential motivational values lacks theoretical support to date. This paper intends to explore the theoretical rationales for English proverbs and quotations to be a valuable tool for inspiring teachers and textbooks in Chinese EFL teaching. Based on QIN and WEN's conceptual model of motivation (2002) as well as teacher-specific and group-specific motivational theories, it analyzes the potential uses of English proverbs and quotations, with the conclusion that many English proverbs and quotations may serve to be an impetus as well as affective/cognitive mediators in the pre-actional and actional stages of learning, and may help to promote teacher-specific and group-specific motivational dynamics, which in turn benefit student achievement.
文摘With the spread of globalization and information technology, the status of English as a lingua franca worldwide is undisputable. Consequently, the impact of globalization on ELT (English language teaching) is phenomenal. In responding to the impact of global competitors and internationalization, English teaching is often regarded as a principal issue in education. English education is treated as a tool to keep up with the rapid globalization of the world economy. Discussing ELT in Taiwan as an example, this paper aims to raise caution among the periphery educators that under the impact of globalization, the imposition of western culture and language via ELT could be potentially hegemonic and harmful to the local culture and language. This paper will be organized into three parts. Firstly, the impact of globalization on the dominance of English will be discussed. Secondly, the potential threats of linguistic hegemony via ELT will be analyzed. Thirdly, proactive solutions to the hegemonic threats of ELT are proposed as the key to ride out the wave of globalization.