Visual Topic Semantic Enhanced Machine Translation for Multi-Modal Data Efficiency

导出

摘要 The scarcity of bilingual parallel corpus imposes limitations on exploiting the state-of-the-art supervised translation technology.One of the research directions is employing relations among multi-modal data to enhance perfor-mance.However,the reliance on manually annotated multi-modal datasets results in a high cost of data labeling.In this paper,the topic semantics of images is proposed to alleviate the above problem.First,topic-related images can be auto-matically collected from the Internet by search engines.Second,topic semantics is sufficient to encode the relations be-tween multi-modal data such as texts and images.Specifically,we propose a visual topic semantic enhanced translation(VTSE)model that utilizes topic-related images to construct a cross-lingual and cross-modal semantic space,allowing the VTSE model to simultaneously integrate the syntactic structure and semantic features.In the above process,topic similar texts and images are wrapped into groups so that the model can extract more robust topic semantics from a set of similar images and then further optimize the feature integration.The results show that our model outperforms competitive base-lines by a large margin on the Multi30k and the Ambiguous COCO datasets.Our model can use external images to bring gains to translation,improving data efficiency.

作者王超蔡思佳史北祥崇志宏 Chao Wang;Si-Jia Cai;Bei-Xiang Shi;Zhi-Hong Chong(School of Computer Science and Engineering,Southeast University,Nanjing 210096,China;School of Architecture,Southeast University,Nanjing 210096,China)

机构地区 School of Computer Science and Engineering School of Architecture

出处《Journal of Computer Science & Technology》 SCIE EI CSCD 2023年第6期1223-1236,共14页 计算机科学技术学报（英文版）

基金 supported by the National Natural Science Foundation of China under Grant No.52178034.

关键词 multi-modal machine translation visual topic semantics data efficiency

分类号 TP391.2 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1张钹,朱军,苏航.迈向第三代人工智能[J].中国科学：信息科学,2020,50(9):1281-1302. 被引量：170

二级参考文献1

1Jun Zhu,Jianfei Chen,Wenbo Hu,Bo Zhang.Big Learning with Bayesian methods[J].National Science Review,2017,4(4):627-651. 被引量：10

共引文献169

1龚善要.人工智能司法应用的实践审思与完善[J].国家检察官学院学报,2023,31(5):95-108. 被引量：6
2尚凡成,孔繁钰,詹可,朱仁传.基于神经网络的船舶剖面参数化建模与辐射水动力系数预测[J].水动力学研究与进展（A辑）,2022,37(6):751-756.
3刘三女牙.人工智能与教育双向赋能的人才培养模式创新和体系重构[J].科教发展研究,2022(2):42-56. 被引量：5
4王丽莉.一种具有自学习能力的用户感知人工智能测量方法[J].电子测量技术,2023,46(6):147-152. 被引量：1
5王沛然.从控制走向训导:通用人工智能的“直觉”与治理路径[J].东方法学,2023(6):188-198. 被引量：22
6刘云.论可解释的人工智能之制度构建[J].江汉论坛,2020(12):113-119. 被引量：22
7莫伯峰,张重生,门艺.AI缀合中的人机耦合[J].出土文献,2021(1):19-26. 被引量：12
8刘奕群,吴玥悦.信息化与智能化:司法语境下的辨析[J].中国应用法学,2021(2):14-30. 被引量：8
9孙永丹,邓辉文.深度学习的哲学反思[J].科教导刊（电子版）,2021(7):271-273.
10李宁,徐彬森,武宏亮,冯周,李雨生,王克文,刘鹏.人工智能在测井地层评价中的应用现状及前景[J].石油学报,2021,42(4):508-522. 被引量：55

1Shuo Feng,Aoran Cai,Yang Wang,Baicheng Zhang,Qinyu Qiao,Cheng Chen,Song Wang,Jun Jiang.A robotic AI-Chemist system for multi-modal AI-ready database[J].National Science Review,2023,10(12):4-6. 被引量：1
2Jiachen Yang,Yegang Li,Hao Zhang,Junpeng Hu,Rujiang Bai.Aspect-Level Sentiment Analysis Incorporating Semantic and Syntactic Information[J].Journal of Computer and Communications,2024,12(1):191-207.
3石闻达,杜劲松,李笛出乘.基于层次化多模态注意力机制循环神经网络的服装新品销售预测[J].Journal of Donghua University(English Edition),2024,41(1):21-27.
4Yan Li,Qiyuan Wang,Kaidi Jia.Enhancing Image Description Generation through Deep Reinforcement Learning:Fusing Multiple Visual Features and Reward Mechanisms[J].Computers, Materials & Continua,2024,78(2):2469-2489.
5赵丹,赵素云,陈红,刘睿瑄,李翠平,张晓莹.Hadamard Encoding Based Frequent Itemset Mining under Local Differential Privacy[J].Journal of Computer Science & Technology,2023,38(6):1403-1422. 被引量：1
6Instructions for Authors[J].Infectious Diseases & Immunity,2024,4(1):44-50.
7Jie ZHOU,Pei KE,Xipeng QIU,Minlie HUANG,Junping ZHANG.ChatGPT: potential, prospects, and limitations[J].Frontiers of Information Technology & Electronic Engineering,2024,25(1):6-11. 被引量：20
8REN ZiLiang,ZHANG QieShi,CHENG Qin,XU ZhenYu,YUAN Shuai,LUO DeLin.Segment differential aggregation representation and supervised compensation learning of ConvNets for human action recognition[J].Science China(Technological Sciences),2024,67(1):197-208.
9李奕群.网络与怀旧[J].疯狂英语（新悦读）,2024(2):25-29.
10Yuxin HUANG,Huailing GU,Zhengtao YU,Yumeng GAO,Tong PAN,Jialong XU.Enhancing low-resource cross-lingual summarization from noisy data with fine-grained reinforcement learning[J].Frontiers of Information Technology & Electronic Engineering,2024,25(1):121-134. 被引量：1

Journal of Computer Science & Technology

2023年第6期

浏览历史

内容加载中请稍等...

Visual Topic Semantic Enhanced Machine Translation for Multi-Modal Data Efficiency

参考文献1

二级参考文献1

共引文献169

相关作者

相关机构

相关主题

浏览历史