Improving neural sentence alignment with word translation (Cited by: 2)

Abstract: Sentence alignment is a basic task in natural language processing which aims to extract high-quality parallel sentences automatically. Motivated by the observation that aligned sentence pairs contain a larger number of aligned words than unaligned ones, we treat word translation as one of the most useful sources of external knowledge. In this paper, we show how to explicitly integrate word translation into neural sentence alignment. Specifically, this paper proposes three cross-lingual encoders to incorporate word translation: 1) a Mixed Encoder that learns annotation vectors for words and their translations over sequences in which words and their translations are mixed alternately; 2) a Factored Encoder that views word translations as features and encodes words and their translations by concatenating their embeddings; and 3) a Gated Encoder that uses a gate mechanism to selectively control the amount of word translation information moving forward. Experiments on the NIST MT and OpenSubtitles Chinese-English datasets in both non-monotonic and monotonic scenarios demonstrate that all the proposed encoders significantly improve sentence alignment performance.
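The three ways of combining a word with its translation that the abstract names can be illustrated with a small sketch. This is not the authors' implementation: the function names (`mix_tokens`, `factored_input`, `gated_input`), the scalar sigmoid gate, and the additive combination in the gated variant are all assumptions made for illustration, and the sketch covers only how the encoder input might be formed, not the encoders themselves.

```python
import math


def mix_tokens(words, translations):
    """Mixed input: interleave each source word with its translation."""
    if len(words) != len(translations):
        raise ValueError("each word needs exactly one translation")
    mixed = []
    for word, translation in zip(words, translations):
        mixed.extend([word, translation])
    return mixed


def factored_input(word_vec, trans_vec):
    """Factored input: concatenate the word and translation embeddings."""
    return list(word_vec) + list(trans_vec)


def gated_input(word_vec, trans_vec, gate_weights, gate_bias=0.0):
    """Gated input: a sigmoid gate scales how much translation is added.

    Assumption: one scalar gate computed from the concatenated embeddings;
    the paper's actual gate may be vector-valued or defined differently.
    """
    features = list(word_vec) + list(trans_vec)
    score = sum(w * f for w, f in zip(gate_weights, features)) + gate_bias
    gate = 1.0 / (1.0 + math.exp(-score))
    return [x + gate * t for x, t in zip(word_vec, trans_vec)]
```

With all-zero gate weights the gate evaluates to 0.5, so half of each translation embedding component is added to the word embedding. In a trained model the gate weights would be learned along with the rest of the network.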

Authors: Ying DING, Junhui LI, Zhengxian GONG, Guodong ZHOU

Affiliation: School of Computer Science and Technology

Source: Frontiers of Computer Science (SCIE, EI, CSCD), 2021, No. 1, pp. 81-90 (10 pages). Chinese journal title: 中国计算机科学前沿（英文版）

Funding: This work was supported by the National Natural Science Foundation of China (Grant Nos. 61876120, 61673290).

Keywords: sentence alignment; word translation; mixed encoder; factored encoder; gated encoder

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]


