Rough sets information retrieval model based on mutual information (cited by: 2)
Abstract: In information retrieval, polysemy and synonymy are pervasive in documents and introduce uncertainty that degrades retrieval performance. To handle this uncertainty, the paper applies rough set theory driven by mutual information. The mutual information between words in a training document collection is computed first, and fuzzy clustering over these values is used to construct an equivalence relation on words. Building on this equivalence relation, an information retrieval model based on the upper and lower approximations of rough sets is proposed and implemented. Experimental tests show that the model can improve information retrieval performance.
Source: Journal of Shandong University (Natural Science), 2006, No. 3, pp. 17-19, 138 (4 pages in total). Indexed in CAS, CSCD, and the Peking University Core Journals list.
Keywords: mutual information; fuzzy clustering; rough sets; information retrieval
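
The pipeline outlined in the abstract (word-to-word mutual information, an equivalence relation on words, then rough-set lower and upper approximations) can be illustrated with a minimal sketch. It is not the authors' implementation: the record does not give their formulas, so document-level pointwise mutual information is assumed here, the fuzzy clustering step is replaced by simple threshold linking (the classes are the connected components of the graph whose edges are word pairs with PMI at or above a cutoff), and the matching score `rough_match` is a hypothetical choice. All function names, the toy corpus, and the threshold value are invented for illustration.

```python
import math
from collections import Counter
from itertools import combinations


def pmi_table(docs):
    """Document-level pointwise mutual information for each co-occurring word pair."""
    n = len(docs)
    word_df = Counter()  # number of documents containing each word
    pair_df = Counter()  # number of documents containing both words of a pair
    for doc in docs:
        words = sorted(set(doc))
        word_df.update(words)
        pair_df.update(combinations(words, 2))
    return {
        (a, b): math.log((c * n) / (word_df[a] * word_df[b]))
        for (a, b), c in pair_df.items()
    }


def equivalence_classes(vocab, pmi, threshold):
    """Partition vocab by linking pairs with PMI >= threshold (union-find).

    Assumption: this stands in for the fuzzy clustering step of the paper.
    """
    parent = {w: w for w in vocab}

    def find(w):
        while parent[w] != w:
            parent[w] = parent[parent[w]]  # path halving
            w = parent[w]
        return w

    for (a, b), score in pmi.items():
        if score >= threshold:
            parent[find(a)] = find(b)

    classes = {}
    for w in vocab:
        classes.setdefault(find(w), set()).add(w)
    return list(classes.values())


def lower_approx(terms, classes):
    """Union of the equivalence classes fully contained in `terms`."""
    return set().union(*(c for c in classes if c <= terms))


def upper_approx(terms, classes):
    """Union of the equivalence classes that intersect `terms`."""
    return set().union(*(c for c in classes if c & terms))


def rough_match(query, doc, classes):
    """Hypothetical score: share of the query's upper approximation found in doc."""
    expanded = upper_approx(set(query), classes)
    return len(expanded & set(doc)) / len(expanded) if expanded else 0.0


# Toy corpus (invented): two topics plus one document that mixes them.
docs = [
    ["car", "automobile", "engine"],
    ["car", "automobile", "road"],
    ["bank", "loan", "money"],
    ["bank", "loan", "river"],
    ["car", "bank"],
]
vocab = sorted({w for d in docs for w in d})
pmi = pmi_table(docs)
classes = equivalence_classes(vocab, pmi, threshold=0.3)

for doc in docs:
    print(doc, rough_match({"automobile"}, doc, classes))
```

In this sketch a query for "automobile" also reaches the mixed document, which contains "car" but not "automobile", because both words fall into the same equivalence class and therefore into the upper approximation of the query. The lower approximation of the single-word query is empty, since no whole class fits inside it.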