Abstract
On microblog platforms, user messages arrive at the system as a stream in chronological order, and managing this stream effectively allows user queries to be answered promptly. Drawing on the notion of data provenance from database research, this paper proposes a provenance-based information management method for microblog messages. First, a provenance is defined as the set of messages belonging to the same social event, following the event's emergence, development, and change. Second, the microblog message stream is partitioned into different provenances, and the set of provenances is maintained dynamically as new messages arrive. Finally, the text messages in the provenances are used to answer user queries. Experiments show that the proposed method uses little memory and runs efficiently, and that it can be used for real-time processing of microblog message streams and for query response.
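The three steps in the abstract (define a provenance per event, partition and maintain the stream, answer queries from the provenances) can be illustrated with a minimal sketch. The abstract does not say how messages are matched to events, so everything specific here is an assumption made for illustration: the whitespace tokenizer, the Jaccard word-overlap score, the 0.2 threshold, and the names `Provenance`, `ProvenanceStore`, `insert`, and `query` are hypothetical and are not taken from the paper.

```python
from __future__ import annotations

from dataclasses import dataclass, field


def tokenize(text: str) -> set[str]:
    """Lowercase whitespace tokenizer (assumption; not the paper's preprocessing)."""
    return set(text.lower().split())


def jaccard(a: set[str], b: set[str]) -> float:
    """Word-overlap score used here as a stand-in similarity measure."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


@dataclass
class Provenance:
    """Messages assumed to concern one event, kept in arrival order."""
    messages: list[str] = field(default_factory=list)
    vocabulary: set[str] = field(default_factory=set)

    def add(self, text: str) -> None:
        self.messages.append(text)
        self.vocabulary |= tokenize(text)


class ProvenanceStore:
    """Maintains the set of provenances as messages stream in."""

    def __init__(self, threshold: float = 0.2) -> None:
        self.threshold = threshold  # hypothetical cut-off for joining an event
        self.provenances: list[Provenance] = []

    def insert(self, text: str) -> Provenance:
        """Attach a new message to the best-matching provenance, or open a new one."""
        tokens = tokenize(text)
        best, best_score = None, 0.0
        for prov in self.provenances:
            score = jaccard(tokens, prov.vocabulary)
            if score > best_score:
                best, best_score = prov, score
        if best is None or best_score < self.threshold:
            best = Provenance()
            self.provenances.append(best)
        best.add(text)
        return best

    def query(self, text: str) -> list[str]:
        """Return the messages of the provenance that best matches the query."""
        tokens = tokenize(text)
        best = max(
            self.provenances,
            key=lambda p: jaccard(tokens, p.vocabulary),
            default=None,
        )
        if best is None or jaccard(tokens, best.vocabulary) == 0.0:
            return []
        return list(best.messages)


store = ProvenanceStore(threshold=0.2)
store.insert("earthquake hits city center")
store.insert("earthquake city rescue teams arrive")
store.insert("football final tonight stadium")
related = store.query("earthquake rescue")
```

In this toy run the two earthquake messages share one provenance and the football message opens a second, so the query returns only the earthquake messages. A real system would also need to expire old provenances to bound memory, which this sketch does not attempt.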
Source
《计算机科学》
CSCD
PKU Core Journal (北大核心)
2015, Issue 10, pp. 198-201 (4 pages)
Computer Science
Funding
National Natural Science Foundation of China (61170306)
Hubei Province Key Science and Technology Research Fund Project (2003AA101B05)
Keywords
Provenance (数据世系), streaming data (数据流), microblog (微博), information retrieval (信息检索)