摘要
电力营销经过多年的信息化发展,企业内部已经建立大量异构型应用系统,产生了大量分散的结构化、半结构化、非结构化数据。基于云计算及大数据技术的电力"求索"搜索引擎技术,构建集中式数据中心全文检索的索引,实现对大规模不同业务功能和业务数据的统一检索,满足用户从大量异构业务系统和海量数据中进行快速检索的要求。电力"求索"搜索引擎技术基于开源的Elastic Search,利用分布式索引、分布式检索、分布式缓存技术,实现分布式全文检索平台,提供对大规模索引数据的高效管理与快速、灵活的访问能力。通过搜索引擎技术,综合利用文本挖掘、自然语言处理、信息检索等领域的技术,进一步提高全文检索的查准率、查全率。该技术的应用,可以满足大规模不同业务数据的统一检索的需求;并同时满足大规模数据检索请求的快速响应要求。
After years of information technology develop-ment,the enterprise has established a large number of heterogeneous applications,resulting in a large number of distributed structured,semi-structured,unstructured data. Based on the cloud computing and large data technology,the"Search"search engine technology builds a centralized data center full-text index,achieving large-scale business functions and business data of the unified search to meet the user from a large number of heterogeneous business systems and mass data for rapid retrieval requirements. The search engine technology is based on the open source Elastic Search,with the use of distributed index, distributed search, distributed cache technology to achieve distributed full-text search platform to provide large-scale index data, efficient management and fast, flexible access. With the help of search engine technology,text search,natural language processing and information retrieval are com-bined to improve the precision and recall of full-text retrieval.The application of this technology can meet the needs of largescale unified search of different business data,and can meet simultaneously the rapid response request of large-scale data retrieval request.
作者
楼凤丹
裴旭斌
王志强
纪德良
LOU Fengdan PEI Xubin WANG Zhiqiang JI Deliang(State Grid Zhejiang Information & Telecommunication Company, Hangzhou 310007, Zhejiang, China Zhejiang Huayun Information Technology Co., Ltd., Hangzhou 310008, Zhejiang, China)
出处
《电网与清洁能源》
北大核心
2016年第12期86-92,99,共8页
Power System and Clean Energy
基金
国网浙江省电力公司信息化建设项目(7111XT150015)~~