Abstract
Given the massive volume of standards information, information retrieval has become a concern for researchers and practitioners, especially the storage, integration, and indexing of unstructured data. This paper applies the Solr search engine to the retrieval of standards content, enabling keyword searches on standardization objects, specific indicators, and similar terms, and improves search quality through tokenizer selection and refinement of the word list.
Authors
赵东海
张文华
ZHAO Dong-hai; ZHANG Wen-hua (Beijing Huijin Digital Technology Co., Ltd.)
Source
China Standardization (《中国标准化》)
2021, No. 5, pp. 153-158 (6 pages)
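Funding
Research output of the National Key R&D Program of China project "Research on the Applicability and Differences of Relevant Chinese and Foreign Standards" (project No. 2017YFF0209406)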
Keywords
standard literature
content search
Solr optimization