期刊文献+

基于SVM的中文报道关系识别方法研究 被引量:3

Research on Chinese story link detection based on SVM
下载PDF
导出
摘要 针对网络新闻的特点,从人名、时间名、地点名、组织机构名、内容五个方面抽取特征词形成特征向量。在此基础上,分别进行了相似度计算,其中,人名、组织机构名、内容采用余弦夹角的方法,时间和地点向量,相似度计算采用了引入报道时间和关联度计算。最后,使用这5个相似度作为特征,使用SVM进行训练,并在测试集上进行了测试。测试结果表明,这种方法可以有效地改善系统的性能。 Via analyzing the characteristic of news in the Web,construct the feature vector using features from five entity categories:persons,time,location,organizations,and content.Using story time and entity relatedness for temporal or place vector when calculating their similarity and cosine similarity for others.All the features together with the entity relatedness are integrated by Support Vector Machine(SVM).Experimental results show that this method can improve system performance effectively.
作者 王强 张永奎
出处 《计算机工程与应用》 CSCD 北大核心 2008年第33期141-143,共3页 Computer Engineering and Applications
基金 国家自然科学基金No.60475022 山西省自然科学基金No.20041041 山西省回国留学人员基金(No.2002004)。~~
关键词 报道关系识别 话题检测与跟踪 多向量表示模型 story link detection topic detection and tracking multi-vector mode
  • 相关文献

参考文献6

二级参考文献25

  • 1金珠,林鸿飞,赵晶.基于HowNet的话题跟踪及倾向性分类研究[J].情报学报,2005,24(5):555-561. 被引量:21
  • 2James Allan,Jaime Carbonell,George Doddington et al.Topic Detection and Tracking Pilot Study:Final Report[C].In:Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop,San Francisco ,CA,Morgan Kaufmann Publishers ,Inc, 1998:194-218.
  • 3Yiming Yang,Jaime Carbonell,Ralf Brown et al.Learning Approaches for Detecting and Tracking News Events[J].IEEE Intelligent Systems:.Special Issue on Applications of Intelligent Information Retrieval,1999;14(4) :32-43.
  • 4Wayne C.Multilingual Topic Detection and Tracking:Successful Research Enabled by Corpora and Evaluation[C].In:Language Resources and Evaluation Conference (LREC),2000 : 1487-1494.
  • 5James Allan (ed.).Topic Detection and Tracking : Event-based Information Organization[M].Kluwer Academic Publishers,2002.
  • 6James Allan,Victor Lavrenko,Hubert Jin.First Story Detection in TDT is Hard[C].In:Proceedings of 9th Conference on Information Knowledge Management CIKM ,2000: 374---381.
  • 7Yiming Yang,Tom Ault,Thomas Pierce et al.Improving Text Categorization Methods for Event Tracking[C].In:Proeeedings of the 23rd International Conference on Research and Development in Information Retrieval ( SIGIR-2000),2000: 65-72.
  • 8Alvin Martin,George Doddington,Terri Kamm et al.The DET Curve in Assessment of Detection Task Performance[C].In:Proceedings of Eurospeech 1997,1997:1895-1898.
  • 9Ying-Ju Chen,Hsin-His Chen.NLP and IR Approaches to Monolingual and Multilingual Link Detection[C].In:Proceedings of the 19^th International Conference on Computational Linguistics(COLING 2002).
  • 10李晓明,阎宏飞,王继民.搜索引擎[M].北京:科学出版社,2005.

共引文献78

同被引文献44

引证文献3

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部