摘要
针对现有基于差分隐私的频繁轨迹模式挖掘算法全局敏感度过高、挖掘结果可用性较低的问题,提出一种基于前缀序列格和轨迹截断的差分隐私下频繁轨迹模式挖掘算法——LTPM。该算法首先利用自适应的方法获得最优截断长度,然后采用一种动态规划的策略对原始数据库进行截断处理,在此基础上,利用等价关系构建前缀序列格,并挖掘频繁轨迹模式。理论分析表明LTPM算法满足ε-差分隐私;实验结果表明,LTPM算法的准确率(TPR)和平均相对误差(ARE)明显优于N-gram和Prefix算法,能有效提高挖掘结果的可用性。
To address the problems of high global query sensitivity and low utility of mining results in the existing works, a Lattice-Trajectory Pattern Mining (LTPM) algorithm based on prefix sequence lattice and trajectory truncation was proposed for mining sequential patterns with differential privacy. An adaptive method was employed to obtain the optimal truncation length, and a dynamic programming strategy was used to truncate the original database. Based on the truncated database, the equivalent relation was used to construct the prefix sequence lattice for mining trajectory patterns. Theoretical analysis shows that LTPM satisfies c-differential privacy. The experimental results show that the True Postive Rate (TPR) and Average Relative Error (ARE) of LTPM are better than those of N-gram and Prefix algorithms, which verifies that LTPM can effectively improve the utility of the mining results.
出处
《计算机应用》
CSCD
北大核心
2017年第10期2938-2945,2951,共9页
journal of Computer Applications
基金
国家自然科学基金资助项目(61502146
91646203)
河南省自然科学基金资助项目(162300410006)
河南省科技攻关项目(162102310411)
河南省教育厅高等学校重点科研项目(16A520002)
河南财经政法大学青年拔尖人才项目~~
关键词
差分隐私
隐私保护
频繁模式挖掘
轨迹截断
前缀序列格
differential privacy
privacy protection
frequent pattern mining
trajectory truncation
prefix sequential lattice