摘要
对于移动计算领域的移动对象轨迹数据流的管理,最普遍采用的技术手段是采样技术,而传统的均匀采样易丢失一些关键的变化数据,造成信息丢失现象。针对这一问题,提出一种基于概率密度聚类的数据流偏倚采样算法。该算法在滑动窗口模型下,充分利用了轨迹数据流自身的分布特性,结合偏倚采样算法思想克服了均匀采样的数据丢失问题。算法首先采用基于数据存在密度的聚类技术将滑动窗口划分为强簇、弱簇和过度簇,然后针对不同的簇给予不同的采样率,进行偏倚采样,进而得到最终的数据流摘要。经过实际数据集的实验检测,证明算法较好地保证了采样质量,并具有较快的数据处理能力。
In management of the mobile object trajectory data stream in the field of mobile computing, the most com- monly used technical means is sampling techniques, but the traditional uniform sampling is easy to lose some of the key changes in data, resulting in the phenomenon of loss of information. To solve this problem, we proposed a data stream based on the probability density clustering bias sampling algorithm. The algorithm in a sliding window model, makes full use of the distribution of characteristics of the the trajectory data stream itself, combines a bias sampling algorithm ideo- logy to overcome uniformly sampled data loss problems. Firstly the sliding window is divided into a strong cluster clus- tering techniques based on density data exists, weak clusters and excessive cluster, and then different sampling rates for different clusters biased sampling are given, thereby to obtain a final summary of the data stream. The experimental tes- ting results of the set of actual data show that the algorithm ensures the sampling quality and has faster data processing capability.
出处
《计算机科学》
CSCD
北大核心
2013年第9期254-256,269,共4页
Computer Science
基金
辽宁省计划项目基金(2012232001)
辽宁省自然科学基金(201202119)资助
关键词
轨迹数据流
滑动窗口
密度聚类
偏倚采样
Trajectory data stream, Sliding window, Density clustering, Bias sampling