期刊文献+

在线评论信息挖掘分析的数据来源可靠性研究 被引量:6

Analysis of Reliability Data Source on Online Reviews' Information Mining
下载PDF
导出
摘要 通过将研究分解成三个子任务,对网络数据从运用PageRank与TrustRank剔除作弊网页开始;借助结合网页间主题相关度、时间差以及在线评论比例的权重的TC-PageRank算法,提炼与产品主题高度相关并包含大量在线评论数据的网页集;最后考虑了网页与产品主题的相似度以及网页的链接增幅对网页权威性的影响,运用改进的HITS算法,确定在线评论分析数据来源的权威网页集;而基于MapReduce的矩阵分块运算,降低了算法时空的复杂度。并通过仿真实验验证了该方法的可行性与准确性。 Through resolve the research into three subtasks,starting from operation PageRank and Trust Rank eliminate cheating page of network. Refining web page of high topic relevance by TC-PageRank combined topic relevancy between web pages and weight of time difference and reviews on web page. Finally,thought of similarity between page and topic of product and amplification of page have the influence on the web authority,determine the authority of the web page of online review analysis data source by the improved HITS. The partitioning of matrix operation based on Map Reduce,reduces the time and space complexity of the algorithm. And through the simulation experiments it verifies the feasibility and accuracy of the method.
出处 《软科学》 CSSCI 北大核心 2015年第4期94-99,共6页 Soft Science
基金 国家自然科学基金项目(71302087) 江苏省普通高校研究生科研创新计划项目(KYZZ_0287)
关键词 在线评论 PAGERANK 主题漂移 链接增幅 online reviews PageRank topic drift amplification of page
  • 相关文献

参考文献11

二级参考文献70

  • 1杨思洛.搜索引擎的排序技术研究[J].现代图书情报技术,2005(1):43-47. 被引量:23
  • 2温忠麟,侯杰泰,张雷.调节效应与中介效应的比较和应用[J].心理学报,2005,37(2):268-274. 被引量:3076
  • 3戚华春,黄德才,郑月锋.具有时间反馈的PageRank改进算法[J].浙江工业大学学报,2005,33(3):272-275. 被引量:27
  • 4朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:326
  • 5关辉,董大海.品牌形象对消费者行为倾向影响的实证研究[J].中国流通经济,2007,21(7):42-45. 被引量:41
  • 6POPESCU A M,YATES A,ETZIONI Q.Class extraction from the World Wide Web[C] //Proc of AAAI-04 Workshop on Adaptive Text Extraction and Mining.San Jose,CA:American Association for Artificial Intelligence,2004:1-6.
  • 7HU Ming-qing,LIU Bing.Mining and summarizing customer reviews[C] //Proc of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.New York:ACM Press,2004:168-177.
  • 8LIU Bing,HU Ming-qing,CHENG Jun-sheng.Opinion observer:analyzing and comparing opinions on the Web[C] //Proc of the 14th International Conference on World Wide Web.New York:ACM Press,2005:342-351.
  • 9KOBAYASHI N,INUI K,MATSUMOTO Y,et al.Collecting evalua-tive expressions for opinion extraction[C] //Proc of the 1st International Joint Conference on Natural Language Processing.Berlin:Springer,2005:596-605.
  • 10POPESCU A M,ETZIONI Q.Extracting product features and opi-nions from reviews[C] //Proc of HLT-EMNLP.Morristown,NJ:Association for Compatational Linguistics,2005:339-346.

共引文献235

同被引文献78

引证文献6

二级引证文献26

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部