期刊文献+
共找到10篇文章
< 1 >
每页显示 20 50 100
A Research on Bilingual Dictionary Based Sentence Alignment for Chinese English Parallel Corpus 被引量:3
1
作者 杨沐昀 Li +6 位作者 Sheng Zhao Tiejun Lu Yajuan Liu Zhanyi 《High Technology Letters》 EI CAS 2002年第1期8-11,共4页
Parallel corpus is of great importance to machine translation, and automatic sentence alignment is the first step towards its processing. This paper puts forward a bilingual dictionary based sentence alignment method ... Parallel corpus is of great importance to machine translation, and automatic sentence alignment is the first step towards its processing. This paper puts forward a bilingual dictionary based sentence alignment method for Chinese English parallel corpus, which differs from previous length based algorithm in its knowledge-rich approach. Experimental result shows that this method produces over 93% accuracy with usual English-Chinese dictionaries whose translations cover 31 88%~47 90% of the corpus. 展开更多
关键词 Sentence alignment parallel corpus Bilingual dictionary
下载PDF
Research on Parallel Corpus Based Chinese-English Lexicon Builder
2
作者 刘晓月 Yang +4 位作者 Muyun Zhao Tiejun Yajuan 《High Technology Letters》 EI CAS 2003年第4期61-66,共6页
Translation lexicons are fundamental to natural language processing tasks like machine translation and cross language information retrieval. This paper presents a lexicon builder that can auto extract (or assist lexic... Translation lexicons are fundamental to natural language processing tasks like machine translation and cross language information retrieval. This paper presents a lexicon builder that can auto extract (or assist lexicographer in compiling) the word translations from Chinese English parallel corpus. Key mechanisms in this builder system are further described, including co occurrence measure, indirection association resolution and multi word unit translation. Experiment results indicate the effectiveness of the authors’ method and the potentiality of the lexicon builder system. 展开更多
关键词 lexicon builder Chinese English parallel corpus co occurrence
下载PDF
The Design and Construction of Jiangxi Red Tourism Resource E-C/C-E Bilingual Parallel Corpus
3
作者 Zhijuan Zhu Liyong Zhu 《International Journal of Technology Management》 2015年第12期63-66,共4页
On the basis of description of the necessity in construction of the Jiangxi red tourism resource E-C/C-E bilingual parallel corpus, this paper discusses the design and construction of the corpus. In its design, it des... On the basis of description of the necessity in construction of the Jiangxi red tourism resource E-C/C-E bilingual parallel corpus, this paper discusses the design and construction of the corpus. In its design, it describes the general design and the framework of the corpus, then it describes its construction including data collection, the standard for the sorted data, data selection, data digitalization, data tagging and data aligning. With the construction, it will not only realize purposes and functions of the corpus, but also provide others with ways or means to use the corpus and to establish such kind of corpus. 展开更多
关键词 corpus CONSTRUCTION Jiangxi red tourism resource bilingual parallel corpus
下载PDF
Construction of Jiangxi Tourism Resource E-C/C-E Parallel Corpus
4
作者 Liyong Zhu Zhijuan Zhu 《International Journal of Technology Management》 2016年第11期65-67,共3页
This paper discusses the construction of Jiangxi Tourism Resource E-C Parallel Courpus on the basis of description of the necessity in its construction. It first describes its framework, then its construction. In cons... This paper discusses the construction of Jiangxi Tourism Resource E-C Parallel Courpus on the basis of description of the necessity in its construction. It first describes its framework, then its construction. In construction, it includes data collection, the standard for the sorted data, data selection, data digitalization, data tagging and data aligning. With the introduction of its construction, it realizes its purposes and functions of the corpus, but also provides others with ways or means to use the corpus and to establish such kind of corpus. 展开更多
关键词 Jiangxi tourism resource parallel corpus E-C construction and application
下载PDF
Improving Parallel Corpus Quality for Chinese-Vietnamese Statistical Machine Translation
5
作者 Huu-anh Tran Yuhang Guo +2 位作者 Ping Jian Shumin Shi Heyan Huang 《Journal of Beijing Institute of Technology》 EI CAS 2018年第1期127-136,共10页
The performance of a machine translation system heavily depends on the quantity and quality of the bilingual language resource. However,getting a parallel corpus,which has a large scale and is of high quality,is a ver... The performance of a machine translation system heavily depends on the quantity and quality of the bilingual language resource. However,getting a parallel corpus,which has a large scale and is of high quality,is a very difficult task especially for low resource languages such as Chinese-Vietnamese. Fortunately,multilingual user generated contents( UGC),such as bilingual movie subtitles,provide us access to automatic construction of the parallel corpus. Although the amount of UGC parallel corpora can be considerable,the original corpus is not suitable for statistical machine translation( SMT) systems. The corpus may contain translation errors,sentence mismatching,free translations,etc. To improve the quality of the bilingual corpus for SMT systems,three filtering methods are proposed: sentence length difference,the semantic of sentence pairs,and machine learning. Experiments are conducted on the Chinese to Vietnamese translation corpus.Experimental results demonstrate that all the three methods effectively improve the corpus quality,and the machine translation performance( BLEU score) can be improved by 1. 32. 展开更多
关键词 parallel corpus filtering low resource languages bilingual movie subtitles machine translation Chinese-Vietnamese translation
下载PDF
On Chinese Reporting Verb"shuo"and its English Translation Based on To Live Corpus
6
作者 郭灵芝 于建平 《海外英语》 2020年第5期157-158,共2页
Based on the Chinese-English parallel corpus of Yu Hua's novel To Live ( the first two sections), this paper focuses on analyzing Chinese reporting verb"shuo"and its translation through concordance, conc... Based on the Chinese-English parallel corpus of Yu Hua's novel To Live ( the first two sections), this paper focuses on analyzing Chinese reporting verb"shuo"and its translation through concordance, concordance plot, collocates, word list and other functions of AntConc. It aims to study translator Michael Berry's diction style and investigate how the source text has influenced the target text. 展开更多
关键词 parallel corpus REPORTING VERB DICTION FEATURES
下载PDF
Korean Morphological Analysis for Korean-Vietnamese Statistical Machine Translation
7
作者 Quang-Phuoc Nguyen Joon-Choul Shin Cheol-Young Ock 《Journal of Electronic Science and Technology》 CAS CSCD 2017年第4期413-419,共7页
This paper describes the experiments with Korean-to-Vietnamese statistical machine translation(SMT). The fact that Korean is a morphologically complex language that does not have clear optimal word boundaries causes a... This paper describes the experiments with Korean-to-Vietnamese statistical machine translation(SMT). The fact that Korean is a morphologically complex language that does not have clear optimal word boundaries causes a major problem of translating into or from Korean. To solve this problem, we present a method to conduct a Korean morphological analysis by using a pre-analyzed partial word-phrase dictionary(PWD).Besides, we build a Korean-Vietnamese parallel corpus for training SMT models by collecting text from multilingual magazines. Then, we apply such a morphology analysis to Korean sentences that are included in the collected parallel corpus as a preprocessing step. The experiment results demonstrate a remarkable improvement of Korean-to-Vietnamese translation quality in term of bi-lingual evaluation understudy(BLEU). 展开更多
关键词 Factored translation models Korean-Vietnamese parallel corpus morphological analysis statistical machine translation(SMT)
下载PDF
Generating a multilingual taxonomy based on multilingual terminology clustering
8
作者 Chengzhi ZHANG 《Chinese Journal of Library and Information Science》 2011年第2期27-40,共14页
Taxonomy denotes the hierarchical structure of a knowledge organization system. It has important applications in knowledge navigation, semantic annotation and semantic search. It is a useful instrument to study the mu... Taxonomy denotes the hierarchical structure of a knowledge organization system. It has important applications in knowledge navigation, semantic annotation and semantic search. It is a useful instrument to study the multilingual taxonomy generated automatically under the dynamic information environment in which massive amounts of information are processed and found. Multilingual taxonomy is the core component of the multilingual thesaurus or ontology. This paper presents two methods of bilingual generated taxonomy: Cross-language terminology clustering and mixed-language based terminology clustering. According to our experimental results of terminology clustering related to four specific subject domains, we found that if the parallel corpus is used to cluster multilingual terminologies, the method of using mixed-language based terminology clustering outperforms that of using the cross-language terminology clustering. 展开更多
关键词 Multilingual taxonomy multilingual terminology clustering cross-language terminology clustering parallel corpus mixed language
下载PDF
基于语料库的法律英语翻译能力培养模式研究 被引量:1
9
作者 王晓慧 高菊霞 《中国ESP研究》 2017年第2期78-86,162,共10页
本文旨在探讨平行语料库和平行文本模式在汉英国际服装外贸跟单翻译中的具体应用,主要从订单的文本信息、词语表达、句法结构及谋篇布局等层面探究其翻译规律,并从术语翻译效率、术语翻译决策、工具资源使用、术语翻译认知四个维度对两... 本文旨在探讨平行语料库和平行文本模式在汉英国际服装外贸跟单翻译中的具体应用,主要从订单的文本信息、词语表达、句法结构及谋篇布局等层面探究其翻译规律,并从术语翻译效率、术语翻译决策、工具资源使用、术语翻译认知四个维度对两组被试进行具体考察。实验表明,学生在翻译法律搭配方面存在严重问题,缺乏语言知识和语用意识,而将法律合同翻译的语料库研究引入跨文化翻译教学有利于全面激发学生的思考与参与,培养学生的翻译思维和跨文化交际意识。本文根据最新实证翻译研究的成果,分析法律平行语料库和专门术语库的建构对翻译初学者法律翻译能力提高和技能习得方面的作用,总结出一些适用于合同类文本汉英翻译的原则及策略。 展开更多
关键词 跨文化交际能力 法律英语 外贸跟单合同翻译 平行语料库
原文传递
Resolving the Terminological Mishmash in Teaching Link Words in EFL Writing
10
作者 Mehrdad YOUSEFPOORI-NAEIM Lawrence Jun ZHANG Sasan BALEGHIZADEH 《Chinese Journal of Applied Linguistics》 2018年第3期321-337,397,共18页
One of the characteristics of good writing is appropriate use of cohesive devices, which many learners often find difficult to do. Although there is a substantial body of research on cohesion in writing, little has be... One of the characteristics of good writing is appropriate use of cohesive devices, which many learners often find difficult to do. Although there is a substantial body of research on cohesion in writing, little has been documented on how to teach it to EFL students. The present study was an attempt to address this under-researched issue by taking a terminological approach, as the diverse and vague terminology used to describe link words has been found to be more of a hindrance than a facilitator in the teaching/learning process. A total of 16 cohesion-related terms were surveyed to find out about the quality and quantity of their use in a self-built corpus of 14 grammar books of different levels. Furthermore, a test was run to measure students' familiarity with these terms, and field-notes were taken while scoring the students' papers in their presence. The findings, in addition to giving insights into the usage points regarding each of the 16 terms, led to drawing a terminological network, which, if used consistently, could help in teaching and learning to use cohesion effectively. 展开更多
关键词 link words grammatical terminology EFL writing self-built corpus
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部