期刊文献+
共找到5篇文章
< 1 >
每页显示 20 50 100
多格式文档搜索引擎索引系统设计与实现 被引量:1
1
作者 方跃胜 董辉 姚宏亮 《长江大学学报(自科版)(上旬)》 CAS 2012年第7期111-113,8,共3页
随着Internet和计算机的迅猛发展,搜索引擎应需而生,越来越多的企业利用计算机处理运营过程中产生的大量电子文档。如何从这些网络和多格式文档资源中迅速、方便而准确地检索出企业用户所需的信息已成为越来越重要的问题。索引系统是搜... 随着Internet和计算机的迅猛发展,搜索引擎应需而生,越来越多的企业利用计算机处理运营过程中产生的大量电子文档。如何从这些网络和多格式文档资源中迅速、方便而准确地检索出企业用户所需的信息已成为越来越重要的问题。索引系统是搜索引擎的核心,为提高系统的查全率和查准率,设计了一种适用于文档检索的数据库存储的索引结构并建立索引库来降低索引组织的复杂度,通过布尔逻辑和向量空间的组合模型实现对检索结果排序,以返回最优文档列表。该系统在Windows环境下采用PHP开发组件实现,能够提高检索文档的查全率和查准率。 展开更多
关键词 文档搜索引擎 索引同步 检索模型
下载PDF
文档搜索引擎的解决方案及其检索功能比较分析 被引量:1
2
作者 孙良红 张玉祥 《图书馆界》 2013年第5期82-85,共4页
总结了文档搜索引擎发展过程中存在的两种主要解决方案,并分析这两种解决方案在检索功能上的差异,最后提出了文档搜索引擎的发展前景。
关键词 文档搜索引擎 信息检索 检索功能
下载PDF
Document classification approach by rough-set-based corner classification neural network 被引量:1
3
作者 张卫丰 徐宝文 +1 位作者 崔自峰 徐峻岭 《Journal of Southeast University(English Edition)》 EI CAS 2006年第3期439-444,共6页
A rough set based corner classification neural network, the Rough-CC4, is presented to solve document classification problems such as document representation of different document sizes, document feature selection and... A rough set based corner classification neural network, the Rough-CC4, is presented to solve document classification problems such as document representation of different document sizes, document feature selection and document feature encoding. In the Rough-CC4, the documents are described by the equivalent classes of the approximate words. By this method, the dimensions representing the documents can be reduced, which can solve the precision problems caused by the different document sizes and also blur the differences caused by the approximate words. In the Rough-CC4, a binary encoding method is introduced, through which the importance of documents relative to each equivalent class is encoded. By this encoding method, the precision of the Rough-CC4 is improved greatly and the space complexity of the Rough-CC4 is reduced. The Rough-CC4 can be used in automatic classification of documents. 展开更多
关键词 document classification neural network rough set meta search engine
下载PDF
智慧档案服务在核电设计行业的应用研究
4
作者 龚莉燕 《办公室业务》 2018年第16期141-142,共2页
文件档案部门作为核电设计行业单位的核心业务支撑部门,通过对智慧档案的理论和技术的研究,借助档案服务平台进一步提升文档服务水平,提高核电工程档案的归档、利用和管理效率,从而更好地服务于工程项目。当前,在保障档案管理质量和安... 文件档案部门作为核电设计行业单位的核心业务支撑部门,通过对智慧档案的理论和技术的研究,借助档案服务平台进一步提升文档服务水平,提高核电工程档案的归档、利用和管理效率,从而更好地服务于工程项目。当前,在保障档案管理质量和安全的双重前提下,以先进的信息技术为基础,通过数字档案资源利用、智慧档案馆管理等多维度的服务手段,进一步完善工程文档管理的业务支持特性,提升专业设计人员的使用体验。 展开更多
关键词 智慧档案 档案信息化 文档服务 数字档案利用 文档智能搜索引擎
下载PDF
Improved caching policies and hybrid strategy for query result cache
5
作者 钱立兵 Ji Zhenzhou Bai Jun 《High Technology Letters》 EI CAS 2015年第3期339-346,共8页
To improve efficiency of search engines, the query result cache has drawn much attention re- cently. According to the query processing and user's query logs locality, a new hybrid result cache strategy which associat... To improve efficiency of search engines, the query result cache has drawn much attention re- cently. According to the query processing and user's query logs locality, a new hybrid result cache strategy which associates with caching heat and worth is proposed to compute cache score in accord- ance with cost-aware strategies. Exactly, query repeated distance and query length factor are utilized to improve the static result policy, and the dynamic policy is adjusted by the caching worth. The hy- brid result cache is implemented in term of the document content and document ids (docIds) se- quence. Based on a score format and the new hybrid structure, an initial algorithm and a new rou- ting algorithm are designed for result cache. Experiments' results show that the improved caching policies decrease the average response time effectively, and increase the system throughput signifi- cantly. By choosing comfortable combination of page cache and docIds cache, the new hybrid cac- hing strategy almost reduces more than 20% of the only cache and docId-only cache. average query time compared with the basic page- 展开更多
关键词 query result cache hybrid caching query repeated distance caching policy
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部