Abstract
For a multilingual intelligent translation system to deliver standardized, fast translation, a high-quality multilingual terminology database must be built and the system's back-end vocabulary continuously enriched. Proofreading is the key step in ensuring the quality of such a database during its construction. However, when the terminology databases awaiting proofreading contain tens of thousands of entries or more, traditional manual proofreading alone can no longer expand the database quickly enough to meet the needs of the translation system. To address this problem, this paper proposes a method for applying web crawler technology to the proofreading of multilingual terminology databases. It introduces the concept, principles, classification, and characteristics of web crawler technology, describes in detail how the technology is applied in proofreading practice, and concludes with an outlook on its use in the fields of translation and information science.
Author
刘雯
LIU Wen (Beijing Institute of Aerospace Information, Beijing 100854, China)
Source
《科技资讯》
2023, No. 8, pp. 37-43 (7 pages)
Science & Technology Information
Keywords
Web crawler
Multilingual terminology database
Proofreading
Multilingual intelligent translation system