基于概念的信息检索模型研究被引量：33

Research on the Concept-based Information Retrieval Model

下载PDF

导出

摘要随着Internet的迅速发展 ,WWW已经成为世界上最大的信息库 ,它正日益改变着人类的生活方式 .然而 ,由于WWW信息资源庞大 ,结构复杂 ,如何高效地从中找到需要的信息 ,已经成为困扰网络用户的一大难题 .许多著名的站点 ,如Yahoo ,AltaVista ,Infoseek均使用基于关键字的搜索引擎 ,存在明显的缺陷 ,当查询用的关键字与目标文档尽管语义相同 ,但用词不一致时 ,将检索失败 ,导致召回率很低 .提出一个基于概念的信息检索模型 ,它不是以关键字为核心 ,而是以概念为核心来实现信息检索 .着重介绍了基于概念的信息检索模型的设施。 With the rapid development of Internet, World Wide Web has become a large information resource of the world. It changes the life mode of human being. However, because the resource is very big, and the structure is very complex, how to search and retrieve information efficiently and effectively becomes an important problem. The traditional search engines, such as Yahoo, AltaVista, InfoSeek are keyword-based search engine. They have an obvious default in common. When the word or phrase in the query is different from those used in the material you needs, searching with failed though these have a common sense. This leads to low recall. In this paper, we'll present a concept-based searching engine model. It uses concept instead of keyword as the kernel to complete the information search. This paper briefly introduces the facilities, methods and tools of the Concept-based Information Retrieval Model. The main contents of this paper are (1) to design and build the concept lexicon that supports the mapping between term and concept. At last these concepts can be found in the concept-tree;(2)to design and build the concept-tree that expresses the hierarchy of the knowledge and the relation among concepts. The concept lexicon and the concept-tree constitute the meta-knowledge of the model. The comprehension of concept will be based on it. We also discuss the semi-automatic algorithm to adjust the concept-tree and the concept lexicon;(3)to design the classification and search algorithm based on concepts.

作者李振东费翔林

机构地区南京大学计算机软件新技术国家重点实验室

出处《南京大学学报（自然科学版）》 CAS CSCD 北大核心 2002年第1期99-109,共11页 Journal of Nanjing University（Natural Science）

基金国家杰出青年基金 (6 15 2 5 2 0 4)

关键词信息检索搜索引擎概念树概念词典概念抽取概念匹配模型计算机网络 information retrieval,search engine,concept tree,concept lexicon,concept extraction,concept match

分类号 TP393 [自动化与计算机技术—计算机应用技术] TP391.3 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献14

1Gudivada V N,Raghavan V V, Grosky W I. Information retrieval on the World Wide Web. IEEE: Internet Computing, 1997:58-68.
2Li Zhendong, Fei Xianglin. A Concept-based information retrieval model. Proceedings of the International Symposium on Future ,Software Technology(ISFST-99). 1999:296-300.
3Woods W A, Conceptual indexing:a better way to organize knowledge, Forthcoming technical report. Sun.Microsystems Laboratories, 1997.
4Miller G A. Word-Net: A lexical database for English. ACM Communication, 1995(11):39-41.
5Mandala R,Tokunaga T. Query expansion using heterogeneous thesauri. Information Processing and Management,2000(36) : 361-378.
6Chen H, Schatz B, Yim T, et al. Automatic thesaurus generation for an electronic communitysystem.Journal of the American Society for Information Science, 1995, 46(3):175-193.
7Lee J, Dubin D. Context-senistive vocabulary mapping with a spreading activation network. Proceedings on the 22nd annual international ACM SIGIR conference on Research and development in information retrieval. 1999.
8Ambroziak J R. Conceptually assisted web browsing. 6th International World Wide Web Conference,1997.
9Mark C, Freltag D D. Learning to constract knowledge bases from the World Wide Web.Artificial Intelligence, 2000, 118: 69-113.
10Tauritz D R, Kok J N, Sprinkhuizen-kuyper I G. Adaptive information filtering using evolutionary computation. Information Science,2000(122) :121-140.

同被引文献229

1周竞涛,张树生,孙宏伟,王明微.关系数据共享与交换过程中一种基于XML Schema的模式转化方法[J].计算机集成制造系统-CIMS,2003,9(z1):127-129. 被引量：5
2胡鹤,刘大有,王生生.Web本体语言OWL[J].计算机工程,2004,30(12):1-2. 被引量：42
3邓爱林,左子叶,朱扬勇.基于项目聚类的协同过滤推荐算法[J].小型微型计算机系统,2004,25(9):1665-1670. 被引量：147
4刘红泉,张亮峰.布尔逻辑检索模型的分析探讨[J].现代情报,2004,24(9):4-6. 被引量：15
5张敏,宋睿华,马少平.基于语义关系查询扩展的文档重构方法[J].计算机学报,2004,27(10):1395-1401. 被引量：55
6钱晓东,王正欧.基于神经网络文本检索词的语义扩充[J].计算机工程,2004,30(20):22-24. 被引量：3
7邱树雄,李志蜀,王娣.语义网络及其Web信息检索机制研究[J].计算机工程,2004,30(23):118-120. 被引量：13
8戴新宇,尹存燕,陈家骏,郑国梁.机器翻译研究现状与展望[J].计算机科学,2004,31(11):176-179. 被引量：27
9宋丽哲,牛振东,宋瀚涛,余正涛,师雪霖.数字图书馆个性化服务用户模型研究[J].北京理工大学学报,2005,25(1):58-62. 被引量：45
10胡兆芹,张士靖.概念检索在检索网络信息中的应用[J].中华医学图书情报杂志,2005,14(2):13-15. 被引量：4

引证文献33

1金燕,张玉峰.基于本体论的知识检索研究[J].图书情报工作,2004,48(7):41-43. 被引量：9
2焦玉英,刘伟成.网络环境下情报检索模型理论发展及评价体系研究[J].情报理论与实践,2004,27(5):523-527. 被引量：6
3周竞涛,张树生,董小锋,王克飞,赵寒,张超.面向服务的企业数据语义导航[J].计算机集成制造系统,2005,11(9):1333-1339. 被引量：2
4陈颖明,许欢庆.基于模糊概念网络的信息检索模型研究[J].计算机工程,2005,31(21):146-147. 被引量：2
5孙明欣,尹存燕,戴新宇,陈家骏.一种基于元规则的自然语言生成规则解释技术[J].南京大学学报（自然科学版）,2006,42(1):69-75. 被引量：1
6殷亚玲,张蕾,李海军.基于概念图的相关反馈技术研究[J].计算机工程与应用,2006,42(3):164-167. 被引量：2
7周竞涛,张树生,赵寒,王明微,张超,王克飞,董小锋.基于语义模型的总线式企业信息集成框架[J].计算机集成制造系统,2006,12(3):407-412. 被引量：4
8许春漫.数字图书馆个性化信息检索模型研究[J].现代图书情报技术,2006(3):15-19. 被引量：5
9赵鹏,耿焕同,王清毅,蔡庆生.基于聚类和分类的个性化文章自动推荐系统的研究[J].南京大学学报（自然科学版）,2006,42(5):512-518. 被引量：13
10曹青.概念检索在因特网中的应用及比较研究[J].中山大学研究生学刊（社会科学版）,2006,27(4):117-123.

二级引证文献159

1杨梦月,何洪波,王闰强.基于反事实学习及混淆因子建模的文章个性化推荐[J].计算机系统应用,2020(10):53-60. 被引量：1
2张琳,胡杰,应力,浦丽娜.汉语问答系统概念查询扩展研究[J].郑州大学学报（理学版）,2009,41(1):69-72. 被引量：1
3张耕畅,黄晓禹,卢世尧,王晓萍,侯超钧.基于云计算的大学生兴趣社交平台[J].仲恺农业工程学院学报,2013,26(4):38-42. 被引量：1
4金燕,李敏,张玉峰.基于Ontology的语义导航研究[J].现代图书情报技术,2005(5):37-40. 被引量：7
5杜小勇,马文峰.学科领域知识本体建设方法研究[J].图书情报工作,2005,49(8):74-78. 被引量：33
6魏玖长,赵定涛.基于元搜索引擎的危机信息监控系统的研究与实现[J].管理科学,2005,18(5):36-42. 被引量：13
7叶建华.基于本体面向图书馆的知识管理关键技术研究[J].情报杂志,2006,25(1):85-87. 被引量：15
8马文峰,杜小勇.知识检索研究[J].情报理论与实践,2006,29(2):157-160. 被引量：36
9夏立新,饶洋辉.本体论在公共部门知识管理体系中的应用探析[J].情报杂志,2006,25(6):10-12. 被引量：6
10王弼佐,王茜,李鹏.基于Ontology的多主体知识检索模型[J].情报杂志,2006,25(6):76-77. 被引量：5

1许鑫,曹昉,袁翀.利用移动Agent技术改进基于概念的信息检索[J].图书情报工作,2003,47(1):86-90. 被引量：3
2江红,吴立德,沙新时.机器翻译系统中概念词典的设计与实现[J].计算机研究与发展,1995,32(3):13-18. 被引量：4
3邵波,袁勤波,罗剑,陈嘉凯.对CERNET华东(南)地区网络WWW信息资源建设中存在问题的思考──基于CERNET的信息资源调查与研究报告之一[J].现代图书情报技术,1999(1):17-20. 被引量：1
4王玉红,王东.查询请求的语义扩展研究[J].福建电脑,2009,25(9):36-37.
5赵小谦,郑彦,储海庆.概念树在短文本语义相似度上的应用[J].计算机技术与发展,2012,22(6):159-162. 被引量：4
6廖荣福,李彦,李文强.面向产品创新设计的知识库研究[J].机械设计,2008,25(7):5-10. 被引量：8
7朱凡微,吴明晖,金苍宏,吕嘉,应晶.基于关键字的数据库搜索研究综述[J].计算机应用研究,2008,25(11):3238-3242. 被引量：9
8韩立新,恽爽,陈道蓄,谢立.一个面向Internet数据管理的系统模型[J].计算机科学,2002,29(1):109-112. 被引量：2
9王丹.如何查找隐形网页资源[J].中国信息导报,2005(4):55-57. 被引量：3
10孙爱玲.InfoSeek搜索功能研究[J].内蒙古科技与经济,2008(6):222-223.

南京大学学报（自然科学版）

2002年第1期

浏览历史

内容加载中请稍等...

基于概念的信息检索模型研究被引量：33

参考文献14

同被引文献229

引证文献33

二级引证文献159

相关作者

相关机构

相关主题

浏览历史

基于概念的信息检索模型研究 被引量：33

参考文献14

同被引文献229

引证文献33

二级引证文献159

相关作者

相关机构

相关主题

浏览历史

基于概念的信息检索模型研究被引量：33