期刊文献+

AI人工智能翻译中数据增广策略和语法错误分析技术研究

Research on Data Augmentation Strategies and Grammar Error Analysis Techniques in AI Artificial Intelligence Translation
原文传递
导出
摘要 人工智能的信息处理逻辑可以对语言系统进行学习理解,进而在翻译工作中给出最优结果,以满足实际应用需要。研究结合数据增广策略和语料库对语法错误生成、纠正及检测模型进行数据训练。研究分析基于规则的数据增广策略对其数据处理进行分析,进而提高训练数据的质量,采用学习者语料库对不同规模的语法纠错(grammatical error correction, GEC)模型进行结果分析,得出200 M左右的合成数据训练的GEC模型的精准率为45%、召回率最高为24%、F_0.5值最高为38%。再对优化后的GEC模型进行训练,得出其值分别为37%、24%和34%。最后在重排序策略下基于数据增广策略的语法错误模型的结果为75%、43%和65%。因此,证明基于数据增广策略的语法错误模型具有高检测精度,为人工智能翻译技术提供技术支持。 The information processing logic of artificial intelligence can learn and understand language systems,and then provide optimal results in translation work to meet practical application needs.Research combines data augmentation strategies and corpora to train grammar error generation,correction,and detection models.Research and analyze rule-based data augmentation strategies to improve the quality of training data.Using a learner corpus to analyze the results of GEC models of different scales,it was found that the accuracy of the GEC model trained with synthetic data of around 200M is 45%,with the highest recall rate of 24%,and F_The maximum value of 0.5 is 38%.Further training was conducted on the optimized GEC model,resulting in values of 37%,24%,and 34%,respectively.Finally,the results of the grammar error model based on data augmentation strategy under reordering strategy are 75%,43%,and 65%.Therefore,it is proven that the grammar error model based on data augmentation strategy has high detection accuracy and improves technical support for artificial intelligence translation technology.
作者 李潇 LI Xiao(Xianyang Normal University,Xianyang Shaanxi 712000,China)
机构地区 咸阳师范学院
出处 《自动化与仪器仪表》 2024年第7期243-246,共4页 Automation & Instrumentation
基金 陕西省教育厅专项科研计划项目《“人工智能时代”AIGC生成工具在大学英语课堂的应用探究与实践》(23JK0245)。
关键词 数据增广策略 学习者语料库 语法错误纠正 GEG模型 重排序策略 data augmentation strategy learner corpus grammar error correction GEG model reordering strategy
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部