期刊文献+

一种基于动机感知的用户识别实时算法 被引量:2

A real-time motivation aware user identification algorithm
下载PDF
导出
摘要 用户识别是电商大数据行为挖掘的基础,本文提出了一种电商用户识别的新算法,该算法引入用户行为动机感知技术,采用初次匹配和精确识别二阶段模式来识别用户。初次匹配阶段算法利用启发式规则划分用户数据,在精确识别阶段通过实时分析用户的访问动机,依据用户行为相异数矩阵来识别用户。在Spark上的优化使算法在分布式场景中具备实时处理大规模数据的能力。实验结果表明该算法的准确率达97.89%,并具有良好的识别效率。 User identification is the basis of electronic commerce big data behavior mining.A new algorithm for electronic commerce user identification is proposed.This algorithm introduces the technology of user behavior motivation perception,and identifies the users by using the rough match and the accurate identification of two phases.User data is divided by heuristic rules in the stage of rough matching,and the user’s motivation is analyzed in real time during the precise identification phase,and the user is identified according to the dissimilarity matrix of user behaviors.Finally,the Spark computing framework is used to deal with large-scale data in distributed scenarios.Experiment results show that the accuracy of the proposed algorithm reaches 97.89%,and it has good identification efficiency.
作者 张梦菲 邱强 肖茁建 姚晓 方金云 Zhang Mengfei;Qiu Qiang;Xiao Zhuojian;Yao Xiao;Fang Jinyun(Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190;University of Chinese Academy of Sciences,Beijing 100190)
出处 《高技术通讯》 EI CAS 北大核心 2020年第3期259-267,共9页 Chinese High Technology Letters
基金 国家重点研发计划(2016YFB0502300,2016YFB0502302)资助项目。
关键词 用户识别 电子商务 SPARK 用户动机 分布式计算 user identification electronic commerce Spark user’s intention distributed computing
  • 相关文献

参考文献3

二级参考文献82

  • 1方成效,袁可风.Web日志挖掘的数据预处理研究[J].计算机与现代化,2006(4):79-81. 被引量:12
  • 2熊忠阳,周亚峰.Web访问挖掘的预处理技术的研究[J].计算机技术与发展,2007,17(8):11-14. 被引量:19
  • 3Pirolli P, Pitkow J, Rao R. Silk from a Sow's Ear: Extracting Usable Structures from the Web, CHI-96, Human Factors in Computing Systems, 1996.
  • 4Big data. Nature, 2008, 455(7209): 1-136.
  • 5Dealing with data. Science,2011,331(6018): 639-806.
  • 6Holland J. Emergence: From Chaos to Order. RedwoodCity,California: Addison-Wesley? 1997.
  • 7Anthony J G Hey. The Fourth Paradigm: Data-intensiveScientific Discovery. Microsoft Research, 2009.
  • 8Phan X H, Nguyen L M,Horiguchi S. Learning to classifyshort and sparse text Web with hidden topics from large-scale data collections//Proceedings of the 17th InternationalConference on World Wide Web. Beijing, China,2008:91-100.
  • 9Sahami M, Heilman T D. A web-based kernel function formeasuring the similarity of short text snippets//Proceedingsof the 15th International Conference on World Wide Web.Edinburgh, Scotland, 2006: 377-386.
  • 10Efron M, Organisciak P,Fenlon K. Improving retrieval ofshort texts through document expansion//Proceedings of the35th International ACM SIGIR Conference on Research andDevelopment in Information Retrieval. Portland, OR, USA,2012: 911-920.

共引文献1005

同被引文献28

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部