期刊文献+
共找到7篇文章
< 1 >
每页显示 20 50 100
机器英译非遗外宣文本问题探析——以二十四节气为例 被引量:1
1
作者 李国兵 张敬 《湖北第二师范学院学报》 2023年第1期97-102,共6页
中华文化“走出去”是大势所趋,非遗译介迫在眉睫。为了解机器翻译非遗外宣文本存在的问题,本文选取中国日报双语新闻推送的“二十四节气”双语文本为研究对象,利用Google翻译得到机器译文,对比人工译本,发现机译在词法与句法层面存在... 中华文化“走出去”是大势所趋,非遗译介迫在眉睫。为了解机器翻译非遗外宣文本存在的问题,本文选取中国日报双语新闻推送的“二十四节气”双语文本为研究对象,利用Google翻译得到机器译文,对比人工译本,发现机译在词法与句法层面存在文化负载词误译、漏译与赘译、词性不当、用词欠妥、单复数错误、习俗语误译、时态错译、语序不当、照应不足和逻辑错译等问题,旨在为非遗译介提供借鉴,推动非遗文化走向世界。 展开更多
关键词 机器英译 非遗外宣 二十四节气
下载PDF
Hierarchical Semantic-Category-Tree Model for Chinese-English Machine Translation 被引量:1
2
作者 Zhu Xiaojian Jin Yaohong 《China Communications》 SCIE CSCD 2012年第12期80-92,共13页
We introduce a novel Sermntic-Category- Tree (SCT) model to present the sen-antic structure of a sentence for Chinese-English Machine Translation (MT). We use the SCT model to handle the reordering in a hierarchic... We introduce a novel Sermntic-Category- Tree (SCT) model to present the sen-antic structure of a sentence for Chinese-English Machine Translation (MT). We use the SCT model to handle the reordering in a hierarchical structure in which one reordering is dependent on the others. Different from other reordering approaches, we handle the reordering at three levels: sentence level, chunk level, and word level. The chunk-level reordering is dependent on the sentence-level reordering, and the word-level reordering is dependent on the chunk-level reordering. In this paper, we formally describe the SCT model and discuss the translation strategy based on the SCT model. Further, we present an algorithm for analyzing the source language in SCT and transforming the source SCT into the target SCT. We apply the SCT model to a role-based patent text MT to evaluate the ability of the SCT model. The experimental results show that SCT is efficient in handling the hierarehical reordering operation in MT. 展开更多
关键词 REORDERING SCT MT function word
下载PDF
Research on Parallel Corpus Based Chinese-English Lexicon Builder
3
作者 刘晓月 Yang +4 位作者 Muyun Zhao Tiejun Yajuan 《High Technology Letters》 EI CAS 2003年第4期61-66,共6页
Translation lexicons are fundamental to natural language processing tasks like machine translation and cross language information retrieval. This paper presents a lexicon builder that can auto extract (or assist lexic... Translation lexicons are fundamental to natural language processing tasks like machine translation and cross language information retrieval. This paper presents a lexicon builder that can auto extract (or assist lexicographer in compiling) the word translations from Chinese English parallel corpus. Key mechanisms in this builder system are further described, including co occurrence measure, indirection association resolution and multi word unit translation. Experiment results indicate the effectiveness of the authors’ method and the potentiality of the lexicon builder system. 展开更多
关键词 lexicon builder Chinese English parallel corpus co occurrence
下载PDF
Anchor-based English-Chinese Bilingual Chunk Alignment Model
4
作者 吴尉林 成长生 +1 位作者 徐良贤 陆汝占 《Journal of Donghua University(English Edition)》 EI CAS 2005年第2期35-39,共5页
Chunk alignment for the bilingual corpus is the base of Example-based Machine Translation. An anchor-based English-Chinese bilingual chunk alignment model and the corresponding algorithm of alignment are presented in ... Chunk alignment for the bilingual corpus is the base of Example-based Machine Translation. An anchor-based English-Chinese bilingual chunk alignment model and the corresponding algorithm of alignment are presented in this paper. It can effectively overcome the sparse data problem due to the limited size of the bilingual corpus. In this model, the chunk segmentation disarnbiguation is delayed to the alignment process, and hence the accuracy of chunk segmentation is improved. The experimental results demonstrate the feasibility and viability of this model. 展开更多
关键词 chunk alignment machine translation
下载PDF
MT-Oriented English PoS Tagging and Its Application to Noun Phrase Chunking
5
作者 Ma Jianjun Huang Degen +1 位作者 Liu Haixia Sheng Wenfeng 《China Communications》 SCIE CSCD 2012年第3期58-67,共10页
A hybrid approach to English Part-of-Speech(PoS) tagging with its target application being English-Chinese machine translation in business domain is presented,demonstrating how a present tagger can be adapted to learn... A hybrid approach to English Part-of-Speech(PoS) tagging with its target application being English-Chinese machine translation in business domain is presented,demonstrating how a present tagger can be adapted to learn from a small amount of data and handle unknown words for the purpose of machine translation.A small size of 998 k English annotated corpus in business domain is built semi-automatically based on a new tagset;the maximum entropy model is adopted,and rule-based approach is used in post-processing.The tagger is further applied in Noun Phrase(NP) chunking.Experiments show that our tagger achieves an accuracy of 98.14%,which is a quite satisfactory result.In the application to NP chunking,the tagger gives rise to 2.21% increase in F-score,compared with the results using Stanford tagger. 展开更多
关键词 English PoS tagging maximum entro- py rule-based approach machine translation NP chunking
下载PDF
Alignment of the Polish-English Parallel Text for a Statistical Machine "Translation
6
作者 Krzysztof Wolk Krzysztof Marasek 《Computer Technology and Application》 2013年第11期575-583,共9页
Text alignment is crucial to the accuracy of MT (Machine Translation) systems, some NLP (Natural Language Processing) tools or any other text processing tasks requiring bilingual data. This research proposes a lan... Text alignment is crucial to the accuracy of MT (Machine Translation) systems, some NLP (Natural Language Processing) tools or any other text processing tasks requiring bilingual data. This research proposes a language independent sentence alignment approach based on Polish (not position-sensitive language) to English experiments. This alignment approach was developed on the TED (Translanguage English Database) talks corpus, but can be used for any text domain or language pair. The proposed approach implements various heuristics for sentence recognition. Some of them value synonyms and semantic text structure analysis as a part of additional information. Minimization of data loss was ensured. The solution is compared to other sentence alignment implementations. Also an improvement in MT system score with text processed with the described tool is shown. 展开更多
关键词 Text alignment NLP tools machine learning text corpora processing
下载PDF
Cross-lingual implicit discourse relation recognition with co-training 被引量:1
7
作者 Yao-jie LU Mu XU +3 位作者 Chang-xing WU De-yi XIONG Hong-ji WANG Jin-song SU 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2018年第5期651-661,共11页
A lack of labeled corpora obstructs the research progress on implicit discourse relation recognition (DRR) for Chinese, while there are some available discourse corpora in other languages, such as English. In this p... A lack of labeled corpora obstructs the research progress on implicit discourse relation recognition (DRR) for Chinese, while there are some available discourse corpora in other languages, such as English. In this paper, we propose a cross-lingual implicit DRR framework that exploits an available English corpus for the Chinese DRR task. We use machine translation to generate Chinese instances from a labeled English discourse corpus. In this way, each instance has two independent views: Chinese and English views. Then we train two classifiers in Chinese and English in a co-training way, which exploits unlabeled Chinese data to implement better implicit DRR for Chinese. Experimental results demonstrate the effectiveness of our method. 展开更多
关键词 Cross-lingual Implicit discourse relation recognition CO-TRAINING
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部