Abstract: Web search engines are important tools for lexicography. This paper takes the translation of two business terms ("e-commerce" and "e-business") as an example to illustrate how web search engines can be applied in English-Chinese dictionary translation. It covers methods for (1) finding potential Chinese equivalents of the English business terms and (2) selecting typical and proper Chinese equivalents according to the frequencies and the meanings of the English business terms, respectively.
Abstract: This article sets out principles for dictionary compilation. In summary, general rules are to be followed. The article also examines the differences among Chinese-Chinese dictionaries, Chinese-foreign language dictionaries, dictionaries for the preparation of certain exams, learner's dictionaries for reviewing errors, multimedia dictionaries, and dictionaries built via the Internet. One of the key factors in dictionary compilation is the professionalism of editors and writers. Training the people involved is vital and would contribute to a better-ordered market. Achievements in dictionary compilation should also be acknowledged. Dictionaries on the market that have already won acclaim need revision and polishing so that their influence is put to the best use.
Funding: The National Natural Science Foundation of China (No. 60496326); The Second Phase of 985 Project of Shanghai Jiaotong University
Abstract: This paper describes and exemplifies a semantic scoring system for students' on-line English-Chinese translation. To achieve accurate assessment, the system adopts a comprehensive method that combines semantic scoring with keyword-matching scoring. Four kinds of words are identified after parsing: verbs, adjectives, adverbs, and "the rest" (nouns, pronouns, idioms, prepositions, etc.). The system treats words differently according to their part-of-speech tags. It then calculates the semantic similarity between the words of the standard versions and those of the students' translations from the distinctive differences in their semantic features, with the aid of HowNet. The first semantic feature of verbs and the last semantic features of adjectives and adverbs are calculated. "The rest" is scored by keyword matching. The experimental results show that the semantic scoring system is applicable to the task of scoring students' on-line English-Chinese translations.
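The two-track scoring described in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' system: the part-of-speech labels, the `toy_similarity` lookup table (a stand-in for a HowNet-based similarity measure), and the equal weighting of all reference words are hypothetical choices made for the example.

```python
# Hypothetical sketch: semantic scoring for verbs/adjectives/adverbs,
# keyword matching for "the rest". Not the published implementation.

SEMANTIC_POS = {"verb", "adj", "adv"}  # assumed tag names


def toy_similarity(a, b, table):
    """Stand-in for a HowNet-based similarity: 1.0 for identical words,
    otherwise a value looked up in a small hand-made table (default 0.0)."""
    if a == b:
        return 1.0
    return table.get(frozenset((a, b)), 0.0)


def score_translation(reference, student, table):
    """Score a student translation against a standard version.

    reference, student: lists of (word, pos) pairs produced by some parser.
    Returns a value in [0, 1]; each reference word has equal weight (assumption).
    """
    if not reference:
        return 0.0
    student_words = {word for word, _ in student}
    total = 0.0
    for word, pos in reference:
        if pos in SEMANTIC_POS:
            # Semantic track: best similarity to any student word with the same tag.
            same_pos = [w for w, p in student if p == pos]
            total += max((toy_similarity(word, w, table) for w in same_pos),
                         default=0.0)
        else:
            # Keyword track: exact match only.
            total += 1.0 if word in student_words else 0.0
    return total / len(reference)


# Usage with made-up data: a near-synonymous verb earns partial credit.
table = {frozenset(("提高", "提升")): 0.8}
reference = [("提高", "verb"), ("效率", "rest")]
student = [("提升", "verb"), ("效率", "rest")]
print(round(score_translation(reference, student, table), 2))
```

In this sketch a synonym of a reference verb receives partial credit through the similarity table, whereas a noun must match exactly, which mirrors the split between semantic scoring and keyword matching that the abstract reports.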
Funding: The National Natural Science Foundation of China (No. 60496326)
Abstract: On-line assessment of English-Chinese translation is a challenging task because it involves natural language processing. YanFa, an on-line assessment system for English-Chinese translation, is a pilot research project on scoring students' translations on-line. Based on the theory of translation equivalence, an algorithm called "conceptual similarity matching" was developed. YanFa can promptly assess students' translations on-line, generate test papers automatically, and offer students standard versions of the translation along with the score for each sentence. An evaluation against the scores given by experts indicates that YanFa is practical.
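Abstract: For the task of named entity recognition (NER) in the domain of aviation unsafe events, this work takes the weekly aviation safety information bulletins as its data source, and analyzes and constructs an NER dataset and a domain dictionary for aviation unsafe events. To address the poor performance of traditional NER models in capturing the boundaries of domain entities, a method is proposed that enhances domain semantic information by fusing domain-dictionary embeddings, based on the BERT (bidirectional encoder representations from transformers) pre-trained language model. Multiple comparative experiments on the self-built dataset show that the proposed method further improves the recognition rate of entity boundaries, with performance about 5% higher than that of the traditional bi-directional long short-term memory-conditional random field (BiLSTM-CRF) NER model.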
Abstract: Chinese word segmentation is the basis of natural language processing. The dictionary mechanism significantly influences both the efficiency of word segmentation and the understanding of the user's intention implied in the user's query. Because traditional dictionary mechanisms cannot meet the current demands of personalized mobile search, this paper presents a new dictionary mechanism that contains word classification information. The paper also puts forward an approach for improving the traditional word bank structure and proposes an improved FMM segmentation algorithm. The results show that the new dictionary mechanism significantly increases query efficiency and better meets users' individual requirements.
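For orientation, the baseline that such work usually starts from can be sketched as below, assuming that FMM here stands for forward maximum matching, the usual meaning of the abbreviation in Chinese word segmentation. This is the textbook greedy algorithm, not the improved variant the abstract proposes; the `lexicon` mapping from words to category labels is a hypothetical illustration of a dictionary that carries word classification information.

```python
# Textbook forward maximum matching over a word -> category lexicon.
# The lexicon contents and category names are made up for illustration.

def fmm_segment(text, lexicon, max_len=None):
    """Greedily take the longest lexicon word starting at each position.

    Returns a list of (word, category) pairs; characters that match no
    lexicon entry are emitted as single-character tokens with category None.
    """
    if max_len is None:
        max_len = max((len(word) for word in lexicon), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest window first and shrink it until a word is found.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in lexicon:
                tokens.append((piece, lexicon.get(piece)))
                i += size
                break
    return tokens


lexicon = {
    "研究": "science",
    "研究生": "education",
    "生命": "science",
    "的": "function",
    "起源": "science",
}
print(fmm_segment("研究生命的起源", lexicon))
```

On this input the greedy strategy picks "研究生" first and leaves "命" stranded, a well-known weakness of plain forward matching. Returning the category alongside each token shows how classification information stored in the dictionary could be passed on to a later query-understanding step.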
Abstract: The study, conducted in the academic year 2008, explores the potential differences in the use of a dictionary in support of a standard writing task by two student groups at two different proficiency levels. Fifty-seven students working on a real classroom assignment were observed; to make sure that the subjects behaved as they normally would, they had not been informed that their dictionary behavior was to be observed. The study, which shows that the need for a dictionary is smaller in the case of more advanced students, may be of interest to those foreign language teachers who fear that giving students unlimited access to a dictionary may hamper the development of their expressive abilities. In turn, a marked preference on the part of more advanced students for an L1-L2 dictionary, paralleled by a sustained interest in information categories typically placed in foreign learner's dictionaries, suggests that advanced language learners writing in English would probably opt for a lexicographic product combining the best of both dictionary types: an L1-L2 and an MLD.