期刊文献+

基于词对向量空间模型的新事件检测方法 被引量:4

New event detection method based on word pairs vector space model
下载PDF
导出
摘要 新事件检测(NED)的目标是从一个或多个新闻源中检测出报道一个新闻话题的第一个新闻。传统向量空间模型采用单个词来表示文本特征,考虑到词的位置信息以及其他的表示内容的信息,提出了词对表示文本的方法,并结合HowNet资源对所抽取的词对进行归一化处理,最后对不同类别新闻中不同词性对的权重参数进行优化。通过在已有的突发性新闻语料上进行实验,表明这种改进方法的效果比较明显,性能也有一定的提高。 New Event Detection(NED) aims at detecting the first news item on one topic from one or more news reports.The traditional vector space model adopts single word to represent the text features,considering the information of word position and other information of expressing content,this paper proposes an approach using word pairs to express text content.Combined with the HowNet,the extracted word pairs are normalized.Then the different weight parameters of different part of speech pairs are given according to different types of news reports.Experiments on emergency news corpus show that the word-pair method can significantly improve the representation results.
出处 《计算机工程与应用》 CSCD 北大核心 2010年第12期123-125,共3页 Computer Engineering and Applications
基金 国家自然科学基金No.60475022 山西省自然科学基金No.20041041 山西省回国留学人员基金(No.2002004)~~
关键词 向量空间模型 词对特征 新事件检测 vector space model word pair feature new event detection
  • 相关文献

参考文献6

  • 1李保利,俞士汶.话题识别与跟踪研究[J].计算机工程与应用,2003,39(17):7-10. 被引量:61
  • 2Nicola S,Joe C.Combirung semantic and syntactic document classi-fiers to improve first story detection[C]//Proceedings of the 24th An-nual International ACM SIGIR Conference.New York,NY,USA:ACM Press,2001:424-425.
  • 3Yang Y,Pierce T,Carbonell J.A study on retrospective and on-line event detection[C]//Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.CMU,USA:ACM,1998:28-36.
  • 4Juha M,Helena A M,Marko S.Simple semantics in topic detection and tracking[J].Information Retrieval,2004,7(3/4):347-368.
  • 5洪宇,张宇,范基礼,刘挺,李生.基于子话题分治匹配的新事件检测[J].计算机学报,2008,31(4):687-695. 被引量:26
  • 6张阔,李涓子,吴刚,王克宏.基于词元再评估的新事件检测模型[J].软件学报,2008,19(4):817-828. 被引量:17

二级参考文献29

  • 1贾自艳,何清,张海俊,李嘉佑,史忠植.一种基于动态进化模型的事件探测和追踪算法[J].计算机研究与发展,2004,41(7):1273-1280. 被引量:58
  • 2吴平博,陈群秀,马亮.基于时空分析的线索性事件的抽取与集成系统研究[J].中文信息学报,2006,20(1):21-28. 被引量:21
  • 3雷震,吴玲达,雷蕾,黄炎焱.初始化类中心的增量K均值法及其在新闻事件探测中的应用[J].情报学报,2006,25(3):289-295. 被引量:25
  • 4James Allan,Jaime Carbonell,George Doddington et al.Topic Detection and Tracking Pilot Study:Final Report[C].In:Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop,San Francisco ,CA,Morgan Kaufmann Publishers ,Inc, 1998:194-218.
  • 5Yiming Yang,Jaime Carbonell,Ralf Brown et al.Learning Approaches for Detecting and Tracking News Events[J].IEEE Intelligent Systems:.Special Issue on Applications of Intelligent Information Retrieval,1999;14(4) :32-43.
  • 6Wayne C.Multilingual Topic Detection and Tracking:Successful Research Enabled by Corpora and Evaluation[C].In:Language Resources and Evaluation Conference (LREC),2000 : 1487-1494.
  • 7James Allan (ed.).Topic Detection and Tracking : Event-based Information Organization[M].Kluwer Academic Publishers,2002.
  • 8James Allan,Victor Lavrenko,Hubert Jin.First Story Detection in TDT is Hard[C].In:Proceedings of 9th Conference on Information Knowledge Management CIKM ,2000: 374---381.
  • 9Yiming Yang,Tom Ault,Thomas Pierce et al.Improving Text Categorization Methods for Event Tracking[C].In:Proeeedings of the 23rd International Conference on Research and Development in Information Retrieval ( SIGIR-2000),2000: 65-72.
  • 10Alvin Martin,George Doddington,Terri Kamm et al.The DET Curve in Assessment of Detection Task Performance[C].In:Proceedings of Eurospeech 1997,1997:1895-1898.

共引文献89

同被引文献37

  • 1何俊.计算机公共机房管理资源整合[J].实验室研究与探索,2010,29(2):65-67. 被引量:25
  • 2王海春,邱寄帆,邱敦国.一种基于Word文档的数字密写设计与实现[J].微计算机信息,2006(10X):47-48. 被引量:10
  • 3何靖,陈种,闫宏飞.开放域问答系统研究综述[C].见:第六属全国信息检索学术会议论史集,2010:114-121.
  • 4张刚,王斌,吴丽辉.基于链接划分的分布式WEB信息检索[J].模式识别与人工智能,2007,20(4):519-524. 被引量:1
  • 5Ponte J M,Croft W.B.A language modeling approach to information retrieval[C]//the Proceedings of21st An-nual Int’l ACM SIGIR Conf Research and Develop-ment in Information Retrieval,1998.
  • 6Jeon J,Croft W B,Lee J H.Finding similar questions in large question and answer archives[C]//Proceedings of the14th ACM International Conference on Informa-tion and Knowledge Management,2005:84-90.
  • 7Xue X,Jeon J,Croft W B.Retrieval Models for Ques-tion and Answer Archives[C]//The31st Annual Int’l ACM SIGIR Conf on Research and Development in In-formation Retrieval,2008.
  • 8David L N,Jon K.The link-prediction problem for so-cial networks[J].Journal of American Society for Infor-mation Science and Technology,2007,58(7):1019-1031.
  • 9国家中长期教育改革和发展规划纲要(2010-2020)[M].北京:人民出版社,20lO.
  • 10付兵.基于Word字符RGB值的信息隐藏技术[J],电脑知识与术,2007(2):78-80.

引证文献4

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部