摘要
宁波广播电视大学各个部门的网站基本上都是在十年前独自开发的。近年来,随着学校转型发展和机构改革等变化,这种重复建站、技术架构混乱的开发模式呈现出来的弊端越来越明显:每个部门网站都是一个信息孤岛,业务数据无法在部门网站之间流通;由于缺少统一规划,网站的维护和改版变得越来越麻烦。为了改变这种状况,设计开发了一个统一设计、统一管理、信息共享的网站群来代替一群网站。利用开源软件lucene.net实现了对网站群的全文搜索,并且比较了几种中文分词器的优缺点,最终选择了盘古分词器,结果表明可以快速的进行全文搜索。
ach d epartment of Ningbo Radio & TV University website was developed independently basically in ten years ago. In recent years, with the development of school restructuring and institutional reform and other changes, this duplication of the site, technical architecture chaotic development model presents more and more obvious drawbacks. Each department website is an information island,business data can’t be circulated between the department website. And due to the lack of unified planning, website maintenance and revision become more and more trouble. To change this situation, a unified design, unified management, information sharing website group rather than a group of web sites is designed and developed. Open source software lucene.net is adopted to realize the full text search of the website group,and the advantages and disadvantages of several Chinese tokenizer is explained, Pangu tokenizer is chosen finally. The results show that you can quickly perform full-text search..
出处
《信息技术与信息化》
2016年第9期27-30,共4页
Information Technology and Informatization
基金
宁波市社会科学院网络社会研究所课题(WL2016-Y01)