期刊文献+

面向垂直搜索基于本体的可读性计算模型

An ontology-based readability model for vertical search
原文传递
导出
摘要 作为一项新兴的信息检索评价指标,可读性在文档相关性、实用性以及质量评估中占据重要地位。其中,如何为用户提供相关可读的文档已成为垂直搜索领域一个亟待解决的问题。为了有效解决这个问题,提出了一种基于本体结构的可读性计算模型。该模型以用户的阅读抽象过程为背景,分别从语篇表面层次和概念层次对文本进行可读性计算,从而引入了3个可读性指标,即概念势、概念域和文档连贯性。具体地是将单个指标或者指标组合计算所得可读性得分融入传统垂直检索模型中,对文档初次检索结果进行重排。在医学领域中,用户实验结果表明基于本体概念序列信息的可读性指标相对于传统的非序列化指标可以更加有效地预测文档的真实可读性水平。系统实验结果进一步说明了基于可读性的重排序模型可以兼顾文档的相关性和可读性,提升垂直领域信息检索性能。 As an emerging evaluation criteria of information retrieval( IR),readability plays an important role in accessing document's relevance,utility and quality. Howto provide different users with relevant and readable documents has been an urgent problem in vertical search. In order to solve this problem,we propose a newontology-based readability method. Based on users' reading process,we measure document's readability from surface and conceptual levels.In this model,three readability indicator shave been introduced,i. e.,Concept Topography,Concept Scope and Document Coherence. Specifically,the readability of a document that computed by individual or combined indicators can be used to re-rank the initial lists of documents which are returned by a conventional search engine. In medical domain,the user-oriented evaluations showthat our model has good correlation with humans' judgments in readability prediction.And our model is also competitive compared with one of the state-of-the-artreadability models in system-orient edevaluation.
出处 《山东大学学报(理学版)》 CAS CSCD 北大核心 2016年第7期23-29,共7页 Journal of Shandong University(Natural Science)
基金 国家重点基础研究发展计划"973计划"项目(2013CB329304 2014CB744604) 国家自然科学基金资助项目(61402324 61272265) 教育部博士点基金资助项目(20130032120044)
关键词 特定领域信息检索 可读性 文档重排 vertical search readability documents re-ranking
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部