期刊文献+

基于知识迁移和蒸馏的轻量级领域信息表示模型研究

A Study on the Lightweight Domain Information Representation Model Based on Knowledge Transferring and Knowledge Distillation
下载PDF
导出
摘要 数据精细化处理是电网企业在进行数字化转型过程中的重大难题。由于电网企业的封闭性特点,数据精细化处理过程中专业语言与知识难以与外界通用的知识进行融会贯通。为解决上述问题,本文在teacher-student框架基础上结合fine-tuning技术设计了一种有效的信息表示模型——基于知识迁移与蒸馏的轻量级领域信息表示模型。该模型将通用知识框架作为基座,将专业知识与通用知识提炼成一个统一的向量空间。本文设计的模型比通用大模型更快、更轻量、更有效,仅需对百级别的专业小样本进行增量学习,便可将通用知识与专业知识进行了高效地融合。为了验证该模型的有效性,我们在文本相似度计算任务上进行了实验,实验结果表明技术指标NDCG@5提升5.76%。此外,该模型有效地降低了资源消耗,提升了搜索效率。 Refinement of data processing is a major problem for power grid companies in the process of digital transformation. Due to the closed nature of power grid enterprises, it is difficult to integrate professional language and knowledge with the general knowledge of the outside world in the process of data refinement processing. In order to solve the above problems, this paper designs an effective information representation model based on the teacher-student framework combined with the fine-tuning technology—a lightweight domain information representation model based on knowledge transfer and distillation. The model takes the general knowledge framework as the base, and distills specialized knowledge and general knowledge into a unified vector space. The model designed in this paper is faster, lighter and more effective than the general large model. It only needs to perform incremental learning on a hundred-level professional small samples, and then the general knowledge and professional knowledge can be efficiently integrated. In order to verify the effectiveness of the model, we conducted experiments on the text similarity calculation task. The experimental results show that the technical index NDCG@5 is improved by 5.76%. In addition, the model effectively reduces resource consumption and improves search efficiency.
作者 徐欢 王尧 萧展辉 沈宇红 XU Huan;WANG Yao;XIAO Zhanhui;SHEN Yuhong(China Southern Power Grid Co.,Ltd.,Guangzhou 510000,Guangdong,China;Digital Grid Research Institute of China Southern Power Grid Co.,Ltd.,Guangzhou 510000,Guangdong,China)
出处 《电力大数据》 2022年第9期37-44,共8页 Power Systems and Big Data
关键词 知识迁移 模型蒸馏 领域信息检索模型 数字化转型 knowledge transfer model distillation domain information retrieval model digital transformation
  • 相关文献

参考文献12

二级参考文献162

共引文献151

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部