期刊文献+

基于图的特征词权重算法及其在文档排序中的应用 被引量:1

Graph-Based Term Weighting for Document Ranking
下载PDF
导出
摘要 信息检索的核心工作包括文档的分类和排序等操作,如何对文档中的特征词权重进行有效度量是其中的一项关键技术。利用词的共现等关系为每个文档建立文本图,基于邻接词间重要性相互影响的思路,结合文档中特征词的词频特性,迭代计算每个词的权重,进一步结合文本图的密度等全局特性,对信息检索的结果进行排序。实验证实,算法在标准数据集上具有良好的效果。 The core work of information retrieval including document classification and ranking operations, how to effectively compute the term weight of every document is one of a key technology. Use of the word relationship to create a text graph for each document, based on the idea of the importance of interaction between adjacent words, combining the characteristics of the word document word frequency characteristics, we iteratively compute weighting of each word. Further combining the global properties of text graph, such as density, we could rank the results of information retrieval. Experiments confirmed that the algorithm in standard data sets with good results.
出处 《计算机系统应用》 2012年第6期216-218,194,共4页 Computer Systems & Applications
基金 湖南省教育厅自然科学基金(06C658)
关键词 文本图 共现关系 文档排序 特征词权重 text graph co-occurrence relation document ranking term weight
  • 相关文献

参考文献7

二级参考文献24

共引文献33

同被引文献15

  • 1毕鹏.Web信息检索结果个性化排序模型[J].计算机科学,2004,31(B09):35-37. 被引量:1
  • 2米切尔 T M.机器学习[M].北京:机械工业出版社,2003:68-96.
  • 3Li Xian,Meng Wei-yi,Yu C.T-verifier:Verifying truthfulness of fact statements[C]//27th International Conference on Data Engineering(ICDE) IEEE.IEEE,2011.
  • 4Li Zhi-xu,et al.WebPut:efficient Web-based data imputation[C]//Web Information Systems Engineering-WISE 2012.Berlin Heidelberg:Springer,2012:243-256.
  • 5Kahng,Minsuk,Lee S,et al.Ranking objects by following paths in entity-relationship graphs[C]//Proceedings of the 4th workshop on Workshop for Ph.D.students in information & knowledge management.ACM,2011.
  • 6Lovász,László.Random walks on graphs:A survey[M].//Comhinatorics,Paul erdos is eighty(volume 2).Janor Bolyai Mathematical Society,1993:1-46.
  • 7Sergey B,Page L.The anatomy of a large-scale hypertextual Web search engine[J].Computer Networks and ISDN Systems,1998,30(1):107-117.
  • 8Kleinberg Jon M.Authoritative sources in a hyperlinked environment[J].Journal of the ACM(JACM),1999,46(5):604-632.
  • 9Goldberg David E.Genetic algorithms in search,optimization,and machine learning[M].Addision-Wesley Professional,1989.
  • 10NER[OL].http://nlp.stanford.edu/software/CRF-NER.shtml.

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部