期刊文献+

基于记忆曲线的数据密集型动态用户行为建模 被引量:1

Data Intensive Modeling of Dynamic User Behaviors Based on Forgetting Curve
下载PDF
导出
摘要 分析用户行为的历史数据,使用特定方法建立用户的偏好模型,是目前研究的热点和关键。考虑了数据产生的时序特征,以及具有时间特征的变量在用户行为模型中的影响,以心理学中的记忆曲线模型为依据,从用户的行为数据出发,给出了用户偏好的表示,并为用户的每个偏好建立一个记忆曲线模型,实时地表示用户的偏好。针对海量的用户行为数据,提出了基于MapReduce的模型参数增量更新算法和动态用户偏好计算方法,从而使得模型能反映动态变化的用户偏好。建立在真实数据上的实验结果表明,提出的模型和算法具有高效性、正确性和可用性。 Analyzing historical user behavior data and establishing user preference model by some certain method is the critical subject with great attention. This paper considers the time series characteristics of data generation and the influence of the variables with temporal characteristics on user behavior models. Based on the forgetting curve in psychology,this paper starts from user behavior data and gives the representation of user preferences. Thus, a forgetting curve model can be established for each preference and user preferences can be represented by real time manner.Aiming at massive user behavior data, this paper proposes the MapReduce- based algorithms for the incremental update of model parameters and the computation of dynamic user preferences. Thus, inherently dynamic user preferences can be reflected by the constructed user behavior model. The experimental results conducted on real data show that the proposed model and algorithms are efficient, correct and applicable.
出处 《计算机科学与探索》 CSCD 北大核心 2016年第10期1376-1386,共11页 Journal of Frontiers of Computer Science and Technology
基金 国家自然科学基金,Nos.61472345,61163003,61462056,61562090 云南省应用基础研究计划,Nos.2014FA023,2014FA028 云南省中青年学术和技术带头人后备人才培养计划.No.2012HB004 云南大学创新团队培育计划,No.XT412011 云南大学青年英才培养计划,No.XT412003~~
关键词 动态用户行为模型 用户偏好 记忆曲线 增量更新 MAPREDUCE dynamic user behavior model user preference forgetting curve incremental update MapReduce
  • 相关文献

参考文献4

二级参考文献56

  • 1倪巍伟,陆介平,孙志挥.基于向量内积不等式的分布式k均值聚类算法[J].计算机研究与发展,2005,42(9):1493-1497. 被引量:15
  • 2[OL].<http://hadoop.apache.org.>.
  • 3WinterCorp: 2005 TopTen Program Summary. http:// www. wintercorp, com/WhitePapers/WC TopTenWP. pdf.
  • 4TDWI Checklist Report: Big Data Analytics. http://tdwi. org/research/2010/08/Big-Data-Analytics, aspx.
  • 5Chaudhuri S, Dayal U. An overview of data warehousing and OLAP technology. SIGMOD Rec, 1997,26(1): 65-74.
  • 6Madden S, DeWitt D J, Stonebraker M. Database parallelism choices greatly impact scalability. DatabaseColumn Blog. http://www, databasecolumn, com/2007/10/database-parallelism-choices, html.
  • 7Dean J, Ghemawat S. MapReduce: Simplified data processing on large clusters//Proceedings of the 6th Symposium on Operating System Design and Implementation (OSDI ' 04). San Francisco, California, USA, 2004: 137-150.
  • 8DeWitt D J, Gerber R H, Graefe G, Heytens M L, Kumar K B, Muralikrishna M. GAMMA--A high performance dataflow database machine//Proceedings of the 12th International Conference on Very Large Data Bases (VLDB' 86). Kyoto, Japan, 1986:228-237.
  • 9Fushimi S, Kitsuregawa M, Tanaka H. An overview of the system software of a parallel relational database machine// Proceedings of the 12th International Conference on Very Large DataBases(VLDB'86). Kyoto, Japan, 1986:209-219.
  • 10Brewer E A. Towards robust distributed systems//Proceedings of the 19th Annual ACM Symposium on Principles of Distributed Computing (PODC' 00). Portland, Oregon, USA, 2000:7.

共引文献830

同被引文献5

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部