摘要
通过构建版式电子文档库及配套的文档自动转换功能,为实现公文全文检索提供了结构化数据源,同时基于成熟的自然语言处理技术结合政务办公的业务需求特点实现了全文检索、相似文件查询等功能。成功的探索出了一套传统办公系统升级全文检索功能的解决方案。
We built a fixed-layout document library system and deployed automatic format transformation software, they provided data source that support full-text retrieval. Then we implemented official document full-text searching and similar document searching functions with mature NLP technology under special requirements of government affair scenario. It proved to be a feasible solution to improve a traditional system with full-text searching technology.
作者
李正
咸容禹
余前佳
陈卉
吴玉龙
LI Zheng;XIAN Rongyu;YU Qianjia;CHEN Hui;WU Yulong(Information Center of the Ministry of Natural Resources,Beijing 100036)
出处
《国土资源信息化》
2019年第2期22-26,共5页
Land and Resources Informatization
关键词
政务办公系统
版式文档
全文检索
相似文件检索
Official affair system
Fixed-layout Document
Full-text retrieval
Similar doc u me nt searching