期刊文献+

基于Web的中文陈述句正误验证

Verifying truthfulness of Chinese fact statements based on Web
下载PDF
导出
摘要 针对Web页中存在不少不真实信息的问题,提出了一个两步的方法来鉴别一个中文陈述句是否是事实。第一步根据陈述句中的不确定单元对陈述句进行分类扩展,找到一些和待验证陈述句主题匹配的候选陈述句。第二步把候选陈述句代入现有搜索引擎,确定出最有可能的候选。这两步过程都需要从主流的搜索引擎的搜索结果中抽取各种特性。实验结果表明,准确率可以达到85%以上。经过改进,该技术可以用来评测网页的可信度。 The Web contains a significant amount of untruthful information. This paper proposes a two-step method that aims to determine whether a given Chinese fact statement is truthful. In the first step it classifies the given state-ment and extends to alternative statement which has the same topic with the given statement based on doubt unit. In the second step, it sends every alternative statement including the given statement as a query to a search engine and analyzes various features extracted from the search results returned from the search engine. The experimental results show this method can achieve a precision of about 85%. After improvement, the technique can be used to evaluate the reliability of webpage.
出处 《计算机工程与应用》 CSCD 2014年第15期75-81,共7页 Computer Engineering and Applications
基金 国家自然科学基金(No.61170039) 河北省自然科学基金(No.F2012201006)
关键词 陈述句 正误 验证 WEB页面 可信度 fact statement truthfulness verifying Web page reliability
  • 相关文献

参考文献10

  • 1Montague M,Aslam J A.Condorcet fusion for improved retrieval[C]//Proceedings of ACM CIKM Conference, McLean, VA, USA, 2002 : 538-548.
  • 2Yamamoto Y,Tezuka T O,Jatowt A,et al.Honto?Search: estimating trustworthiness of Web information by search results aggregation and temporal analysis[C]//APWeb/WAIM, 2007 : 253-264.
  • 3Ntoulas A,Najork M, Manasse M, et al.Detecting spam web pages through content analysis[C]//Proceedings of the 15th International Conference on World Wide Web, 2006 : 83-92.
  • 4Gyangyi Z, Garcia-Molina H, Pedersen J.Combating Web spam with TrustRank[C]//Proc of the 30th VLDB Conf, 2004 : 576-587.
  • 5Li Xian,Meng Weiyi,Yu Clement.T-verifier:verifying truth- fulness of fact statements[C]//Data Engineering(ICDE), 2011.
  • 6刘群,李素建.基于《知网》的词汇语义相似度计算[EB/OL].http://www.keenage.com/papers.
  • 7Silverstein C,Henzinger M R, Marais H,et al.Analysis of a very large Web search engine query log[J].SIGIR Forum, 1999,33( 1 ) :6-12.
  • 8Goldberg D.Genetic algorithms in search, optimization and machine learning[M].[S.1.] : Addison Wesley, 1989.
  • 9Aslam J A,Montague M H.Models for metasearch[C]// SIGIR, 2001 : 275-284.
  • 10de Borda J C.Memoiresur les Histoire del'Academie Royale 1781. elections au scrutin[M]// des Sciences.Paris: [s.n.],.

共引文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部