期刊文献+
共找到1,755篇文章
< 1 2 88 >
每页显示 20 50 100
Neural Machine Translation by Fusing Key Information of Text
1
作者 Shijie Hu Xiaoyu Li +8 位作者 Jiayu Bai Hang Lei Weizhong Qian Sunqiang Hu Cong Zhang Akpatsa Samuel Kofi Qian Qiu Yong Zhou Shan Yang 《Computers, Materials & Continua》 SCIE EI 2023年第2期2803-2815,共13页
When the Transformer proposed by Google in 2017,it was first used for machine translation tasks and achieved the state of the art at that time.Although the current neural machine translation model can generate high qu... When the Transformer proposed by Google in 2017,it was first used for machine translation tasks and achieved the state of the art at that time.Although the current neural machine translation model can generate high quality translation results,there are still mistranslations and omissions in the translation of key information of long sentences.On the other hand,the most important part in traditional translation tasks is the translation of key information.In the translation results,as long as the key information is translated accurately and completely,even if other parts of the results are translated incorrect,the final translation results’quality can still be guaranteed.In order to solve the problem of mistranslation and missed translation effectively,and improve the accuracy and completeness of long sentence translation in machine translation,this paper proposes a key information fused neural machine translation model based on Transformer.The model proposed in this paper extracts the keywords of the source language text separately as the input of the encoder.After the same encoding as the source language text,it is fused with the output of the source language text encoded by the encoder,then the key information is processed and input into the decoder.With incorporating keyword information from the source language sentence,the model’s performance in the task of translating long sentences is very reliable.In order to verify the effectiveness of the method of fusion of key information proposed in this paper,a series of experiments were carried out on the verification set.The experimental results show that the Bilingual Evaluation Understudy(BLEU)score of the model proposed in this paper on theWorkshop on Machine Translation(WMT)2017 test dataset is higher than the BLEU score of Transformer proposed by Google on the WMT2017 test dataset.The experimental results show the advantages of the model proposed in this paper. 展开更多
关键词 Key information TRANSFORMER FUSION neural machine translation
下载PDF
Research on system combination of machine translation based on Transformer
2
作者 刘文斌 HE Yanqing +1 位作者 LAN Tian WU Zhenfeng 《High Technology Letters》 EI CAS 2023年第3期310-317,共8页
Influenced by its training corpus,the performance of different machine translation systems varies greatly.Aiming at achieving higher quality translations,system combination methods combine the translation results of m... Influenced by its training corpus,the performance of different machine translation systems varies greatly.Aiming at achieving higher quality translations,system combination methods combine the translation results of multiple systems through statistical combination or neural network combination.This paper proposes a new multi-system translation combination method based on the Transformer architecture,which uses a multi-encoder to encode source sentences and the translation results of each system in order to realize encoder combination and decoder combination.The experimental verification on the Chinese-English translation task shows that this method has 1.2-2.35 more bilingual evaluation understudy(BLEU)points compared with the best single system results,0.71-3.12 more BLEU points compared with the statistical combination method,and 0.14-0.62 more BLEU points compared with the state-of-the-art neural network combination method.The experimental results demonstrate the effectiveness of the proposed system combination method based on Transformer. 展开更多
关键词 TRANSFORMER system combination neural machine translation(NMT) attention mechanism multi-encoder
下载PDF
Neural Machine Translation Models with Attention-Based Dropout Layer
3
作者 Huma Israr Safdar Abbas Khan +3 位作者 Muhammad Ali Tahir Muhammad Khuram Shahzad Muneer Ahmad Jasni Mohamad Zain 《Computers, Materials & Continua》 SCIE EI 2023年第5期2981-3009,共29页
In bilingual translation,attention-based Neural Machine Translation(NMT)models are used to achieve synchrony between input and output sequences and the notion of alignment.NMT model has obtained state-of-the-art perfo... In bilingual translation,attention-based Neural Machine Translation(NMT)models are used to achieve synchrony between input and output sequences and the notion of alignment.NMT model has obtained state-of-the-art performance for several language pairs.However,there has been little work exploring useful architectures for Urdu-to-English machine translation.We conducted extensive Urdu-to-English translation experiments using Long short-term memory(LSTM)/Bidirectional recurrent neural networks(Bi-RNN)/Statistical recurrent unit(SRU)/Gated recurrent unit(GRU)/Convolutional neural network(CNN)and Transformer.Experimental results show that Bi-RNN and LSTM with attention mechanism trained iteratively,with a scalable data set,make precise predictions on unseen data.The trained models yielded competitive results by achieving 62.6%and 61%accuracy and 49.67 and 47.14 BLEU scores,respectively.From a qualitative perspective,the translation of the test sets was examined manually,and it was observed that trained models tend to produce repetitive output more frequently.The attention score produced by Bi-RNN and LSTM produced clear alignment,while GRU showed incorrect translation for words,poor alignment and lack of a clear structure.Therefore,we considered refining the attention-based models by defining an additional attention-based dropout layer.Attention dropout fixes alignment errors and minimizes translation errors at the word level.After empirical demonstration and comparison with their counterparts,we found improvement in the quality of the resulting translation system and a decrease in the perplexity and over-translation score.The ability of the proposed model was evaluated using Arabic-English and Persian-English datasets as well.We empirically concluded that adding an attention-based dropout layer helps improve GRU,SRU,and Transformer translation and is considerably more efficient in translation quality and speed. 展开更多
关键词 Natural language processing neural machine translation word embedding ATTENTION PERPLEXITY selective dropout regularization URDU PERSIAN Arabic BLEU
下载PDF
Devised Guidelines of Rapid Post-Editing in Machine Translation Output of Mark on Some Aspects
4
作者 屈亚媛 王庆怡 《海外英语》 2012年第21期130-133,共4页
One of uses of machine translation(MT),is helping readers to read for the gist of a foreign text through a draft transla tion produced by MT engines.Rapid post-editing,as Jeffrey Allen defines it as a"strictly mi... One of uses of machine translation(MT),is helping readers to read for the gist of a foreign text through a draft transla tion produced by MT engines.Rapid post-editing,as Jeffrey Allen defines it as a"strictly minimal editing on texts in order to re move blatant and significant errors without considering stylistic issues",can help present the reader with a roughly comprehensi ble translation as quickly as possible.The purpose of this article is on a proposed set of rapid post-editing guidelines for Biblical Chinese-English MT,with its application on editing the English MT version of Chapter one of Mark(马尔谷福音) of the Chi nese Catholic Bible(天主教思高本圣经) as an example. 展开更多
关键词 machine translation RAPID POST-EDITING MARK
下载PDF
Machine Translation
5
作者 张严心 《海外英语》 2015年第4期255-256,共2页
As a kind of ancillary translation tool, Machine Translation has been paid increasing attention to and received different kinds of study by a great deal of researchers and scholars for a long time. To know the definit... As a kind of ancillary translation tool, Machine Translation has been paid increasing attention to and received different kinds of study by a great deal of researchers and scholars for a long time. To know the definition of Machine Translation and to analyse its benefits and problems are significant for translators in order to make good use of Machine Translation, and helpful to develop and consummate Machine Translation Systems in the future. 展开更多
关键词 machine translation DEFINITION CHARACTERISTICS BENEFITS PROBLEMS
下载PDF
Development Process of Machine Translation during 1930s-1970s
6
作者 张严心 《海外英语》 2014年第23期171-171,178,共2页
Machine Translation are increasingly welcomed and used during recent years with the commonly application of Internet and the acceleration of the integration of world economy. To know about the history and development ... Machine Translation are increasingly welcomed and used during recent years with the commonly application of Internet and the acceleration of the integration of world economy. To know about the history and development process of Machine Translation during 1930s-1970 s could help researchers gain new insights through restudying old material. 展开更多
关键词 machine translation HISTORY development PROCESS 19
下载PDF
Progress in Machine Translation 被引量:1
7
作者 Haifeng Wang Hua Wu +2 位作者 Zhongjun He Liang Huang Kenneth Ward Church 《Engineering》 SCIE EI CAS 2022年第11期143-153,共11页
After more than 70 years of evolution,great achievements have been made in machine translation.Especially in recent years,translation quality has been greatly improved with the emergence of neural machine translation(... After more than 70 years of evolution,great achievements have been made in machine translation.Especially in recent years,translation quality has been greatly improved with the emergence of neural machine translation(NMT).In this article,we first review the history of machine translation from rule-based machine translation to example-based machine translation and statistical machine translation.We then introduce NMT in more detail,including the basic framework and the current dominant framework,Transformer,as well as multilingual translation models to deal with the data sparseness problem.In addition,we introduce cutting-edge simultaneous translation methods that achieve a balance between translation quality and latency.We then describe various products and applications of machine translation.At the end of this article,we briefly discuss challenges and future research directions in this field. 展开更多
关键词 machine translation Neural machine translation Simultaneous translation
下载PDF
Corpus Augmentation for Improving Neural Machine Translation 被引量:1
8
作者 Zijian Li Chengying Chi Yunyun Zhan 《Computers, Materials & Continua》 SCIE EI 2020年第7期637-650,共14页
The translation quality of neural machine translation(NMT)systems depends largely on the quality of large-scale bilingual parallel corpora available.Research shows that under the condition of limited resources,the per... The translation quality of neural machine translation(NMT)systems depends largely on the quality of large-scale bilingual parallel corpora available.Research shows that under the condition of limited resources,the performance of NMT is greatly reduced,and a large amount of high-quality bilingual parallel data is needed to train a competitive translation model.However,not all languages have large-scale and high-quality bilingual corpus resources available.In these cases,improving the quality of the corpora has become the main focus to increase the accuracy of the NMT results.This paper proposes a new method to improve the quality of data by using data cleaning,data expansion,and other measures to expand the data at the word and sentence-level,thus improving the richness of the bilingual data.The long short-term memory(LSTM)language model is also used to ensure the smoothness of sentence construction in the process of sentence construction.At the same time,it uses a variety of processing methods to improve the quality of the bilingual data.Experiments using three standard test sets are conducted to validate the proposed method;the most advanced fairseq-transformer NMT system is used in the training.The results show that the proposed method has worked well on improving the translation results.Compared with the state-of-the-art methods,the BLEU value of our method is increased by 2.34 compared with that of the baseline. 展开更多
关键词 Neural machine translation corpus argumentation model improvement deep learning data cleaning
下载PDF
Korean Morphological Analysis for Korean-Vietnamese Statistical Machine Translation
9
作者 Quang-Phuoc Nguyen Joon-Choul Shin Cheol-Young Ock 《Journal of Electronic Science and Technology》 CAS CSCD 2017年第4期413-419,共7页
This paper describes the experiments with Korean-to-Vietnamese statistical machine translation(SMT). The fact that Korean is a morphologically complex language that does not have clear optimal word boundaries causes a... This paper describes the experiments with Korean-to-Vietnamese statistical machine translation(SMT). The fact that Korean is a morphologically complex language that does not have clear optimal word boundaries causes a major problem of translating into or from Korean. To solve this problem, we present a method to conduct a Korean morphological analysis by using a pre-analyzed partial word-phrase dictionary(PWD).Besides, we build a Korean-Vietnamese parallel corpus for training SMT models by collecting text from multilingual magazines. Then, we apply such a morphology analysis to Korean sentences that are included in the collected parallel corpus as a preprocessing step. The experiment results demonstrate a remarkable improvement of Korean-to-Vietnamese translation quality in term of bi-lingual evaluation understudy(BLEU). 展开更多
关键词 Factored translation models Korean-Vietnamese parallel CORPUS MORPHOLOGICAL analysis statistical machine translation (SMT)
下载PDF
WORD BASED MACHINE TRANSLATION SYSTEM
10
作者 谢金宝 孙岗 杨振宇 《Journal of Shanghai Jiaotong university(Science)》 EI 1999年第2期104-108,共5页
This paper compared several methods of machine translation(MT) design, drew lessons from the idea of phrase structure, GPSG, HPSG and Corpus, took words as the core, built a set of word rules, and developed an English... This paper compared several methods of machine translation(MT) design, drew lessons from the idea of phrase structure, GPSG, HPSG and Corpus, took words as the core, built a set of word rules, and developed an English Chinese Machine Translation System based on it. The paper also discussed some technical problems on building MT system, and provided an estimation principle for using rules. With this principle the syntax ambiguities in MT system are solved better. 展开更多
关键词 machine translation WORD RULES DICTIONARY estimation PRINCIPLE AMBIGUITY ATTRIBUTE
下载PDF
Statistic-based Target Word Selection in English-Chinese Machine Translation
11
作者 刘小虎 李生 张波 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 1999年第1期46-49,共4页
How to select appropriate wolds in a translation is a significant problem in current studies of machine translation, because it directly decides the translation quality. This paper uses an unsupervised corpus-based st... How to select appropriate wolds in a translation is a significant problem in current studies of machine translation, because it directly decides the translation quality. This paper uses an unsupervised corpus-based statisticalmethod to select target word. Based on the concurrence probabilities, all ambiguous words in a sentence are disambiguated at the same time. Because a corpus of limited size cannot cover all the collocation of words, we use an effectivesmoothing method to increase the coverage of the corpus. In ceder to solve the problem in our English-Chinese MT system, we have applied the algorithm to disambiguate senses of verbs, nouns and adjectitves in target language, and theresult shows that the approach is very promising. 展开更多
关键词 TARGET WORD SELECTION SYNTACTIC TUPLE machine translation
下载PDF
Improving Parallel Corpus Quality for Chinese-Vietnamese Statistical Machine Translation
12
作者 Huu-anh Tran Yuhang Guo +2 位作者 Ping Jian Shumin Shi Heyan Huang 《Journal of Beijing Institute of Technology》 EI CAS 2018年第1期127-136,共10页
The performance of a machine translation system heavily depends on the quantity and quality of the bilingual language resource. However,getting a parallel corpus,which has a large scale and is of high quality,is a ver... The performance of a machine translation system heavily depends on the quantity and quality of the bilingual language resource. However,getting a parallel corpus,which has a large scale and is of high quality,is a very difficult task especially for low resource languages such as Chinese-Vietnamese. Fortunately,multilingual user generated contents( UGC),such as bilingual movie subtitles,provide us access to automatic construction of the parallel corpus. Although the amount of UGC parallel corpora can be considerable,the original corpus is not suitable for statistical machine translation( SMT) systems. The corpus may contain translation errors,sentence mismatching,free translations,etc. To improve the quality of the bilingual corpus for SMT systems,three filtering methods are proposed: sentence length difference,the semantic of sentence pairs,and machine learning. Experiments are conducted on the Chinese to Vietnamese translation corpus.Experimental results demonstrate that all the three methods effectively improve the corpus quality,and the machine translation performance( BLEU score) can be improved by 1. 32. 展开更多
关键词 parallel corpus filtering low resource languages bilingual movie subtitles machine translation Chinese-Vietnamese translation
下载PDF
Sentence-Level Paraphrasing for Machine Translation System Combination 被引量:1
13
作者 Junguo Zhu Muyun Yang +1 位作者 Sheng Li Tiejun Zhao 《国际计算机前沿大会会议论文集》 2016年第1期156-158,共3页
In this paper, we propose to enhance machine translation system combination (MTSC) with a sentence-level paraphrasing model trained by a neural network. This work extends the number of candidates in MTSC by paraphrasi... In this paper, we propose to enhance machine translation system combination (MTSC) with a sentence-level paraphrasing model trained by a neural network. This work extends the number of candidates in MTSC by paraphrasing the whole original MT translation sentences. First we train a neural paraphrasing model of Encoder-Decoder, and leverage the model to paraphrase the MT system outputs to generate synonymous candidates in the semantic space. Then we merge all of them into a single improved translation by a state-of-the-art system combination approach (MEMT) adding some new paraphrasing features. Our experimental results show a significant improvement of 0.28 BLEU points on the WMT2011 test data and 0.41 BLEU points without considering the out-of-vocabulary (OOV) words for the sentence-level paraphrasing model. 展开更多
关键词 machine translation System COMBINATION PARAPHRASING NEURAL network
下载PDF
A New Rules Processing Method Based on Decision Tree in Machine Translation
14
作者 荀恩东 《High Technology Letters》 EI CAS 1999年第2期36-39,共4页
This paper gives the representation of rules, the strategy of rule controlling and the existing problems in English Chinese Machine Translation(MT) named BT863 I. Then it puts forward a method for processing these rul... This paper gives the representation of rules, the strategy of rule controlling and the existing problems in English Chinese Machine Translation(MT) named BT863 I. Then it puts forward a method for processing these rules based on the decision tree. With this method, some problems such as rule conflic and rule redundancy occurring in BT863 I have been solved and the efficiency of MT system has been improved greatly. This method also has general meaning in the Rule based expert system. 展开更多
关键词 RULE DECISION TREE machine translation
下载PDF
Integrating Deep Learning and Machine Translation for Understanding Unrefined Languages
15
作者 Hong Geun Ji Soyoung Oh +2 位作者 Jina Kim Seong Choi Eunil Park 《Computers, Materials & Continua》 SCIE EI 2022年第1期669-678,共10页
In the field of natural language processing(NLP),the advancement of neural machine translation has paved the way for cross-lingual research.Yet,most studies in NLP have evaluated the proposed language models on well-r... In the field of natural language processing(NLP),the advancement of neural machine translation has paved the way for cross-lingual research.Yet,most studies in NLP have evaluated the proposed language models on well-refined datasets.We investigatewhether amachine translation approach is suitable for multilingual analysis of unrefined datasets,particularly,chat messages in Twitch.In order to address it,we collected the dataset,which included 7,066,854 and 3,365,569 chat messages from English and Korean streams,respectively.We employed several machine learning classifiers and neural networks with two different types of embedding:word-sequence embedding and the final layer of a pre-trained language model.The results of the employed models indicate that the accuracy difference between English,and English to Korean was relatively high,ranging from 3%to 12%.For Korean data(Korean,and Korean to English),it ranged from 0%to 2%.Therefore,the results imply that translation from a low-resource language(e.g.,Korean)into a high-resource language(e.g.,English)shows higher performance,in contrast to vice versa.Several implications and limitations of the presented results are also discussed.For instance,we suggest the feasibility of translation from resource-poor languages for using the tools of resource-rich languages in further analysis. 展开更多
关键词 TWITCH MULTILINGUAL machine translation machine learning
下载PDF
Dependency-Based Local Attention Approach to Neural Machine Translation
16
作者 Jing Qiu Yan Liu +4 位作者 Yuhan Chai Yaqi Si Shen Su Le Wang Yue Wu 《Computers, Materials & Continua》 SCIE EI 2019年第5期547-562,共16页
Recently dependency information has been used in different ways to improve neural machine translation.For example,add dependency labels to the hidden states of source words.Or the contiguous information of a source wo... Recently dependency information has been used in different ways to improve neural machine translation.For example,add dependency labels to the hidden states of source words.Or the contiguous information of a source word would be found according to the dependency tree and then be learned independently and be added into Neural Machine Translation(NMT)model as a unit in various ways.However,these works are all limited to the use of dependency information to enrich the hidden states of source words.Since many works in Statistical Machine Translation(SMT)and NMT have proven the validity and potential of using dependency information.We believe that there are still many ways to apply dependency information in the NMT structure.In this paper,we explore a new way to use dependency information to improve NMT.Based on the theory of local attention mechanism,we present Dependency-based Local Attention Approach(DLAA),a new attention mechanism that allowed the NMT model to trace the dependency words related to the current translating words.Our work also indicates that dependency information could help to supervise attention mechanism.Experiment results on WMT 17 Chineseto-English translation task shared training datasets show that our model is effective and perform distinctively on long sentence translation. 展开更多
关键词 Neural machine translation attention mechanism dependency parsing
下载PDF
A Novel Beam Search to Improve Neural Machine Translation for English-Chinese
17
作者 Xinyue Lin Jin Liu +1 位作者 Jianming Zhang Se-Jung Lim 《Computers, Materials & Continua》 SCIE EI 2020年第10期387-404,共18页
Neural Machine Translation(NMT)is an end-to-end learning approach for automated translation,overcoming the weaknesses of conventional phrase-based translation systems.Although NMT based systems have gained their popul... Neural Machine Translation(NMT)is an end-to-end learning approach for automated translation,overcoming the weaknesses of conventional phrase-based translation systems.Although NMT based systems have gained their popularity in commercial translation applications,there is still plenty of room for improvement.Being the most popular search algorithm in NMT,beam search is vital to the translation result.However,traditional beam search can produce duplicate or missing translation due to its target sequence selection strategy.Aiming to alleviate this problem,this paper proposed neural machine translation improvements based on a novel beam search evaluation function.And we use reinforcement learning to train a translation evaluation system to select better candidate words for generating translations.In the experiments,we conducted extensive experiments to evaluate our methods.CASIA corpus and the 1,000,000 pairs of bilingual corpora of NiuTrans are used in our experiments.The experiment results prove that the proposed methods can effectively improve the English to Chinese translation quality. 展开更多
关键词 Neural machine translation beam search reinforcement learning
下载PDF
Towards consistent machine translation of abbreviated terms in scientific literature
18
作者 何彦青 Sun Yueying +2 位作者 Wu Zhenfeng Pan You Zhang Junsheng 《High Technology Letters》 EI CAS 2021年第3期282-293,共12页
Scientific literature often contains abbreviated terms in English for brief.Machine translation(MT)systems can help to share knowledge in different languages among researchers.Current MT systems may translate the same... Scientific literature often contains abbreviated terms in English for brief.Machine translation(MT)systems can help to share knowledge in different languages among researchers.Current MT systems may translate the same abbreviated term in different sentences into different target terms.MT systems translate the abbreviated term in two ways:one is to use translation of the full name,the other is to use the abbreviated term directly.Abbreviated terms may be ambiguous and polysemous,and MT systems do not have an explicit strategy to decide which way to use without context information.To get the consistent translation for abbreviated terms in scientific literature,this paper proposes a translation model for abbreviated terms that integrates context information to get consistent translation of abbreviated terms.The context information includes the positions of abbreviated term and domain attributes of scientific literature.The first abbreviated term is translated in full name while the latter ones of the same abbreviated term will show the abbreviated form in the translation text.Experiments of translation from Chinese to English show the effectiveness of the proposed translation model. 展开更多
关键词 abbreviated term context information domain information machine translation(MT)
下载PDF
A Comparative Study of Machine Translation and Computer Aided Translation
19
作者 Li Qingzhao 《学术界》 CSSCI 北大核心 2015年第11期303-307,共5页
In the information era,computer translation has attracted more and more translators' interests.MT(Machine Translation)and CAT(Computer Aided Translation)are closely connected with difference in essence.MT is conve... In the information era,computer translation has attracted more and more translators' interests.MT(Machine Translation)and CAT(Computer Aided Translation)are closely connected with difference in essence.MT is convenient but with many defects,while CAT has a great advantage with translation memory core technology.The features of CAT will make it the inevitable trend of future translation. 展开更多
关键词 计算机辅助翻译 机器翻译 计算机翻译 信息时代 翻译记忆
下载PDF
Design of Chinese-English Machine Translation System CEMT-Ⅱ
20
作者 Zhou Ming Li Sheng Cheng Guangxi Hu Mingzeng (Department of Computer Science) 《哈尔滨工业大学学报》 EI CAS CSCD 北大核心 1990年第3期39-44,共6页
This paper describes the design of CEMT-Ⅱ, an interactive Chinese-English machine translation system. Based on the CEMT-Ⅰsystem, CEMT-Ⅱwill be developed to have the ability to translate Chinese scientific documents... This paper describes the design of CEMT-Ⅱ, an interactive Chinese-English machine translation system. Based on the CEMT-Ⅰsystem, CEMT-Ⅱwill be developed to have the ability to translate Chinese scientific documents into English. Now an user-friendly interface has been worked out to solve various complex ambiguities. The Chinese user need not know English well since all the questions and choices are expressed in Chinese. 展开更多
关键词 汉英机器翻译系统 CEMT-Ⅱ 句型分析 科技语言 设计
下载PDF
上一页 1 2 88 下一页 到第
使用帮助 返回顶部