期刊文献+

Sentiment Analysis of Japanese Tourism Online Reviews 被引量:2

Sentiment Analysis of Japanese Tourism Online Reviews
下载PDF
导出
摘要 Purpose: Online reviews on tourism attractions provide important references for potential tourists to choose tourism spots. The main goal of this study is conducting sentiment analysis to facilitate users comprehending the large scale of the reviews, based on the comments about Chinese attractions from Japanese tourism website 4 Travel.Design/methodology/approach: Different statistics-and rule-based methods are used to analyze the sentiment of the reviews. Three groups of novel statistics-based methods combining feature selection functions and the traditional term frequency-inverse document frequency(TF-IDF) method are proposed. We also make seven groups of different rulesbased methods. The macro-average and micro-average values for the best classification results of the methods are calculated respectively and the performance of the methods are shown.Findings: We compare the statistics-based and rule-based methods separately and compare the overall performance of the two method. According to the results, it is concluded that the combination of feature selection functions and weightings can strongly improve the overall performance. The emotional vocabulary in the field of tourism(EVT), kaomojis, negative and transitional words can notably improve the performance in all of three categories. The rule-based methods outperform the statistics-based ones with a narrow advantage.Research limitation: Two limitations can be addressed: 1) the empirical studies to verify the validity of the proposed methods are only conducted on Japanese languages; and 2) the deep learning technology is not been incorporated in the methods.Practical implications: The results help to elucidate the intrinsic characteristics of the Japanese language and the influence on sentiment analysis. These findings also provide practical usage guidelines within the field of sentiment analysis of Japanese online tourism reviews.Originality/value: Our research is of practicability. Currently, there are no studies that focus on the sentiment analysis of Japanese reviews about Chinese attractions. Purpose: Online reviews on tourism attractions provide important references for potential tourists to choose tourism spots. The main goal of this study is conducting sentiment analysis to facilitate users comprehending the large scale of the reviews, based on the comments about Chinese attractions from Japanese tourism website 4 Travel.Design/methodology/approach: Different statistics-and rule-based methods are used to analyze the sentiment of the reviews. Three groups of novel statistics-based methods combining feature selection functions and the traditional term frequency-inverse document frequency(TF-IDF) method are proposed. We also make seven groups of different rulesbased methods. The macro-average and micro-average values for the best classification results of the methods are calculated respectively and the performance of the methods are shown.Findings: We compare the statistics-based and rule-based methods separately and compare the overall performance of the two method. According to the results, it is concluded that the combination of feature selection functions and weightings can strongly improve the overall performance. The emotional vocabulary in the field of tourism(EVT), kaomojis, negative and transitional words can notably improve the performance in all of three categories. The rule-based methods outperform the statistics-based ones with a narrow advantage.Research limitation: Two limitations can be addressed: 1) the empirical studies to verify the validity of the proposed methods are only conducted on Japanese languages; and 2) the deep learning technology is not been incorporated in the methods.Practical implications: The results help to elucidate the intrinsic characteristics of the Japanese language and the influence on sentiment analysis. These findings also provide practical usage guidelines within the field of sentiment analysis of Japanese online tourism reviews.Originality/value: Our research is of practicability. Currently, there are no studies that focus on the sentiment analysis of Japanese reviews about Chinese attractions.
出处 《Journal of Data and Information Science》 CSCD 2019年第1期89-113,共25页 数据与情报科学学报(英文版)
基金 supported by the National Natural Science Foundation of China under the grant #71373286 and # 71603189 the Major Project of the Ministry of Education of China (Grant No. 17JZD034)
关键词 SENTIMENT analysis JAPANESE REVIEWS RULE-BASED METHODS Statistics-based METHODS TOURISM REVIEWS Sentiment analysis Japanese reviews Rule-based methods Statistics-based methods Tourism reviews
  • 相关文献

参考文献2

二级参考文献19

  • 1王细薇,樊兴华,赵军.一种基于特征扩展的中文短文本分类方法[J].计算机应用,2009,29(3):843-845. 被引量:36
  • 2James Auen.Natural Language Understandin[M].The Benjamin/Cummings Publishing Company, 1991-05.
  • 3Apte C,Damerau F J,Weiss S M.Automated Learning of Decision Rules for Text Categorization[J].ACM Trans On Inform Syst,12(3): 233-251.
  • 4Salton G,Buckley B.Term-weighting Approaches in Automatic Text Retrieval[J].Information Processing and Management, 1998 ; 24(5 ) :513 -523.
  • 5Larkey L S.A Patent Search and Classification System[C].In:proceedings of DL-99,4th ACM Conference on Digital Libraries Berkeley,CA,1999:179-187.
  • 6Salton G,Lesk M E.Computer Evaluation of Indexing and Text Processing[J].Association for Computing Machinery, 1968 ; 15 ( 1 ) : 8-36.
  • 7DONG H H, HUI S C, HE Y T. Structural. analysis of chat messa- ges for topic detection [ J]. Online Information Review, 2006, 5 (30) : 496 - 516.
  • 8DABBLE F, SEBASTIAN F. Supervised term weighting for automa- ted text categorization [ C ]// SAC '03: Proceedings of the 2003 ACM Symposium on Applied Computing. New York: ACM, 2003. 784 - 788.
  • 9ZHANG C, FAN X H, CHEN X N. Hot topic detection on Chinese short text [ C]//Advaneed Research on Computer Education, Simu- lation and Modeling: Communications in Computer and Information Science, CCIS 176. Berlin: Springer-Verlag, 2011:207 -212.
  • 10胡佳妮,郭军,徐蔚然.一种基于短文本的独立语义特征抽取算法[C]//2007年全国网络与信息安全技术研讨会论文集.[出版地不详]:中国通信学会,2007:332-335.

共引文献62

同被引文献19

引证文献2

二级引证文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部