
English Caption Generation Model Fused with Attention Mechanism of Spatial Position (cited by: 1)
Abstract To enable a caption generation model to produce fluent, coherent, and informative captions that convey chart-specific information, the Transformer Chart to Text (TransChartText) model is proposed on the basis of the Transformer architecture. A chart-based caption description dataset is built by screening scientific research papers and news article websites; its English caption descriptions cover a rich range of data categories and logical reasoning. Data variables are introduced to replace chart data values, which improves the model's content selection and helps it generate coherent captions. To strengthen the model's ability to learn positional relationships between words and to reduce the frequency of word-order errors, spatial position embedding is introduced into the encoder and beam search into the decoder. Experimental results show that TransChartText achieves better scores on content selection (CS), content ordering (CO), ROUGE, and BLEU, and generates high-quality chart-based English captions.
Authors WANG Qin; WANG Xin; YAN Jingke; ZHONG Meiling; ZENG Jing (Basic Teaching Department, Guilin University of Electronic Technology, Beihai, Guangxi 536000, China; School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China; School of Marine Engineering, Guilin University of Electronic Technology, Beihai, Guangxi 536000, China; College of Computer Engineering, Guilin University of Electronic Technology, Beihai, Guangxi 536000, China; School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610000, China)
Source Computer Engineering and Applications, indexed in CSCD and the Peking University Core Journals list, 2022, No. 12, pp. 139-148 (10 pages)
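Funding Guangxi Natural Science Foundation General Program (2019GXNSFAA245053); Guangxi Science and Technology Major Project (AA19254016); Guangxi Higher Education Undergraduate Teaching Reform Project (2020JGA182); Guangxi Universities Young and Middle-aged Teachers' Basic Research Ability Improvement Project (2021KY0184); Guangxi Master's Student Innovation Project (YCSW2021174); Beihai Science and Technology Planning Project (202082033); Beihai City Science and Technology Planning Project (202082023).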
Keywords language model; generative caption; Transformer; attention mechanism; beam search
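
The abstract names beam search as the decoding strategy used to reduce word-order errors. The sketch below illustrates the general technique only; it is not the authors' implementation. The function `beam_search`, the `toy_model` scoring table, and the `<eos>` token are hypothetical names introduced for illustration, and the probabilities are made up so that a width-1 (greedy) search and a wider beam choose different captions.

```python
import math
from typing import Callable, Dict, List, Tuple

Sequence = Tuple[str, ...]


def beam_search(
    next_log_probs: Callable[[Sequence], Dict[str, float]],
    beam_width: int = 3,
    max_len: int = 10,
    eos: str = "<eos>",
) -> List[Tuple[Sequence, float]]:
    """Return hypotheses sorted by total log-probability, best first."""
    beams: List[Tuple[Sequence, float]] = [((), 0.0)]
    finished: List[Tuple[Sequence, float]] = []
    for _ in range(max_len):
        # Extend every live hypothesis by every possible next token.
        candidates = [
            (seq + (token,), score + logp)
            for seq, score in beams
            for token, logp in next_log_probs(seq).items()
        ]
        if not candidates:
            break
        # Keep only the beam_width highest-scoring extensions.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for seq, score in candidates[:beam_width]:
            if seq[-1] == eos:
                finished.append((seq, score))
            else:
                beams.append((seq, score))
        if not beams:
            break
    # Hypotheses that hit max_len without emitting eos are still returned.
    finished.extend(beams)
    finished.sort(key=lambda c: c[1], reverse=True)
    return finished


def toy_model(prefix: Sequence) -> Dict[str, float]:
    """Hypothetical next-token distribution (illustrative numbers only)."""
    table = {
        (): {"sales": math.log(0.6), "revenue": math.log(0.4)},
        ("sales",): {"rose": math.log(0.5), "fell": math.log(0.5)},
        ("revenue",): {"rose": math.log(0.9), "fell": math.log(0.1)},
    }
    # Any other prefix ends the caption with probability 1.
    return table.get(prefix, {"<eos>": 0.0})


if __name__ == "__main__":
    for width in (1, 2):
        seq, score = beam_search(toy_model, beam_width=width, max_len=5)[0]
        print(width, " ".join(seq), round(math.exp(score), 2))
```

With a beam width of 1 the search commits to the locally most probable first word, whereas a width of 2 keeps the alternative alive long enough to find a caption with higher overall probability. Real decoders usually add length normalization, which this sketch omits.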