How to quickly and accurately detect new topics from massive data online becomes a main problem of public opinion monitoring in cyberspace. This paper presents a new event detection method for the current new event de...How to quickly and accurately detect new topics from massive data online becomes a main problem of public opinion monitoring in cyberspace. This paper presents a new event detection method for the current new event detection system,based on sorted subtopic matching algorithm and constructs the entire design framework. In this paper,the subtopics contained in old topics(or news stories) are sorted in descending order according to their importance to the topic(or news stories),and form a sorted subtopic sequence. In the process of subtopic matching,subtopic scoring matrix is used to determine whether a new story is reporting a new event. Experimental results show that the sorted subtopic matching model improved the accuracy and effectiveness of the new event detection system in cyberspace.展开更多
为了解决传统事件相似度计算方法在TDT(topic detection and tracking)领域计算同一话题下事件相似度时存在不够精确的问题,根据模板知识提出了一种新的基于话题的事件相似度计算方法。该方法综合考虑了事件的内容相似度、事件和话题的...为了解决传统事件相似度计算方法在TDT(topic detection and tracking)领域计算同一话题下事件相似度时存在不够精确的问题,根据模板知识提出了一种新的基于话题的事件相似度计算方法。该方法综合考虑了事件的内容相似度、事件和话题的相似度、事件的时间相似度。实验结果表明,与传统方法相比,该方法能更准确地判断出同一话题下的事件相似性。展开更多
基金Funded by the Planning Project of National Language Committee in the "12th 5-year Plan"(No.YB125-49)the Foundation for Key Program of Ministry of Education,China(No.212167)the Fundamental Research Funds for the Central Universities(No.SWJTU12CX096)
文摘How to quickly and accurately detect new topics from massive data online becomes a main problem of public opinion monitoring in cyberspace. This paper presents a new event detection method for the current new event detection system,based on sorted subtopic matching algorithm and constructs the entire design framework. In this paper,the subtopics contained in old topics(or news stories) are sorted in descending order according to their importance to the topic(or news stories),and form a sorted subtopic sequence. In the process of subtopic matching,subtopic scoring matrix is used to determine whether a new story is reporting a new event. Experimental results show that the sorted subtopic matching model improved the accuracy and effectiveness of the new event detection system in cyberspace.
文摘为了解决传统事件相似度计算方法在TDT(topic detection and tracking)领域计算同一话题下事件相似度时存在不够精确的问题,根据模板知识提出了一种新的基于话题的事件相似度计算方法。该方法综合考虑了事件的内容相似度、事件和话题的相似度、事件的时间相似度。实验结果表明,与传统方法相比,该方法能更准确地判断出同一话题下的事件相似性。