期刊文献+

面向教育视频资源的垂直搜索引擎设计与实现 被引量:4

Design and implementation of vertical search engine for education video resources
下载PDF
导出
摘要 在移动学习项目的开发过程中,结合我国教育资源利用率低的问题,通过扩展Heritrix和Lucene,整合教育资源,设计并实现了面向教育视频资源的垂直搜索引擎。针对Heritrix与Lucene串行组合方案难以实现信息抓取、分析过程与索引过程同时进行的问题,提出一种紧耦合的流程优化组合方案,使网页抓取、网页内容分析筛选和建立索引同时进行,降低了系统IO开销和磁盘空间的占用率。实验测试表明,在Heritrix运行过程中嵌入索引建立操作,对系统的运行效率影响较小,满足实际应用的需要。 This paper combines with the question of the lower utilization rate of education resources in our country during the development of M-Learning project, and then integrates education video resources, designs and implements a vertical search engine through the extension of Heritrix and Lucene, which is relevant to the subject of education video resources. In addition, this paper proposes a combination of tightly coupled for Heritrix and Lucene in order to achieve process optimi-zation and solve the problems of serial combination. The new combinational solution makes webpages crawling, web analysis and index building synchronously so as to reduce the cost of system input and output and the occupancy rate of disk. The experiment indicates that there is smaller difference between the combinational solution of tightly coupled and serial in the running efficiency of system. The result meets the need of practical application.
出处 《计算机工程与应用》 CSCD 2014年第15期113-116,135,共5页 Computer Engineering and Applications
基金 国家自然科学基金面上项目(No.61173190)
关键词 视频搜索 HERITRIX LUCENE 垂直搜索引擎 video search Heritrix Lucene vertical search engine
  • 相关文献

参考文献5

  • 1刘大龙.2012Q1中国搜索引擎市场规模54.9亿市场集中度进一步提高[EB/OL].(2012-04-26)[2012-06-20].http://search.iresearch.cn/14/20120426/170800.shtml.
  • 2中华人民共和国教育部.教育信息化十年发展规划(2011m2020年)[EB/OL].(2012-03-29).http://www.moe.edu.cn/ewebeditor/uploadfile/2012/03/29/20120329140800968.doc.
  • 3李开灿,程平,张祖伟.关于精品课程网络资源利用率的统计分析[J].湖北师范学院学报(自然科学版),2010,30(3):10-14. 被引量:15
  • 4郭艳芬利用Heritrix构建特定站点爬虫[EB/OL].(2010-11-29)[2012-03-10].http://www.ibm.com/developerworks/cn/open-source/os-cn-heritrix/#major2.
  • 5Heritrix源码分析[EB/OL].(2011-03-16)[2012-03-01].http://www.docin.com/p-150167879.html.

二级参考文献3

共引文献14

同被引文献25

引证文献4

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部