There are many idioms related to color words in English and Chinese.The use of color words in idioms adds beauty and vividness to the language.Due to the cultural differences,“color idioms”have gained different cult...There are many idioms related to color words in English and Chinese.The use of color words in idioms adds beauty and vividness to the language.Due to the cultural differences,“color idioms”have gained different cultural connotations with the development of English and Chinese languages.It is of great significance to accurately understand and grasp the meanings and differences of color-related idioms in Chinese and English.This paper intends to analyze and expound the cultural connotations of English and Chinese idioms related to several widely used basic color words with the aim of helping English learners know and use the idioms about color words better.展开更多
Good translations play a very important role in cultural exchange.The idea of reception aesthetics think the reader as the center in translation process.This paper will study the translation of culture-loaded words fr...Good translations play a very important role in cultural exchange.The idea of reception aesthetics think the reader as the center in translation process.This paper will study the translation of culture-loaded words from the perspective of reception aesthetics.It is divided into five parts.The first part mainly introduces the background and theoretical foundation of this paper,the second part introduces the writer’s works,the third part gives examples to analyze the culture-loaded words appearing in The Golden Cangue,the fourth part gives examples of the translation methods,and the fifth part is the conclusion.It aims to provide a reference for the English translation of culture-loaded words.展开更多
[目的/意义]在人工智能技术及应用快速发展与深刻变革背景下,机器学习领域不断出现新的研究主题和方法,深度学习和强化学习技术持续发展。因此,有必要探索不同领域机器学习研究主题演化过程,并识别出热点与新兴主题。[方法/过程]本文以...[目的/意义]在人工智能技术及应用快速发展与深刻变革背景下,机器学习领域不断出现新的研究主题和方法,深度学习和强化学习技术持续发展。因此,有必要探索不同领域机器学习研究主题演化过程,并识别出热点与新兴主题。[方法/过程]本文以图书情报领域中2011—2022年Web of Science数据库中的机器学习研究论文为例,融合LDA和Word2vec方法进行主题建模和主题演化分析,引入主题强度、主题影响力、主题关注度与主题新颖性指标识别热点主题与新兴热点主题。[结果/结论]研究结果表明,(1)Word2vec语义处理能力与LDA主题演化能力的结合能够更加准确地识别研究主题,直观展示研究主题的分阶段演化规律;(2)图书情报领域的机器学习研究主题主要分为自然语言处理与文本分析、数据挖掘与分析、信息与知识服务三大类范畴。各类主题之间的关联性较强,且具有主题关联演化特征;(3)设计的主题强度、主题影响力和主题关注度指标及综合指标能够较好地识别出2011—2014年、2015—2018年和2019—2022年3个不同周期阶段的热点主题。展开更多
We use a lot of devices in our daily life to communicate with others. In this modern world, people use email, Facebook, Twitter, and many other social network sites for exchanging information. People lose their valuab...We use a lot of devices in our daily life to communicate with others. In this modern world, people use email, Facebook, Twitter, and many other social network sites for exchanging information. People lose their valuable time misspelling and retyping, and some people are not happy to type large sentences because they face unnecessary words or grammatical issues. So, for this reason, word predictive systems help to exchange textual information more quickly, easier, and comfortably for all people. These systems predict the next most probable words and give users to choose of the needed word from these suggested words. Word prediction can help the writer by predicting the next word and helping complete the sentence correctly. This research aims to forecast the most suitable next word to complete a sentence for any given context. In this research, we have worked on the Bangla language. We have presented a process that can expect the next maximum probable and proper words and suggest a complete sentence using predicted words. In this research, GRU-based RNN has been used on the N-gram dataset to develop the proposed model. We collected a large dataset using multiple sources in the Bangla language and also compared it to the other approaches that have been used such as LSTM, and Naive Bayes. But this suggested approach provides excellent exactness than others. Here, the Unigram model provides 88.22%, Bi-gram model is 99.24%, Tri-gram model is 97.69%, and 4-gram and 5-gram models provide 99.43% and 99.78% on average accurateness. We think that our proposed method profound impression on Bangla search engines.展开更多
安全是民航业的核心主题。针对目前民航非计划事件分析严重依赖专家经验及分析效率低下的问题,文章提出一种结合Word2vec和双向长短期记忆(bidirectional long short-term memory,BiLSTM)神经网络模型的民航非计划事件分析方法。首先采...安全是民航业的核心主题。针对目前民航非计划事件分析严重依赖专家经验及分析效率低下的问题,文章提出一种结合Word2vec和双向长短期记忆(bidirectional long short-term memory,BiLSTM)神经网络模型的民航非计划事件分析方法。首先采用Word2vec模型针对事件文本语料进行词向量训练,缩小空间向量维度;然后通过BiLSTM模型自动提取特征,获取事件文本的完整序列信息和上下文特征向量;最后采用softmax函数对民航非计划事件进行分类。实验结果表明,所提出的方法分类效果更好,能达到更优的准确率和F 1值,对不平衡数据样本同样具有较稳定的分类性能,证明了该方法在民航非计划事件分析上的适用性和有效性。展开更多
微博作为当今热门的社交平台,其中蕴含着许多具有强烈主观性的用户评论文本。为挖掘微博评论文本中潜在的信息,针对传统的情感分析模型中存在的语义缺失以及过度依赖人工标注等问题,提出一种基于LSTM+Word2vec的深度学习情感分析模型。...微博作为当今热门的社交平台,其中蕴含着许多具有强烈主观性的用户评论文本。为挖掘微博评论文本中潜在的信息,针对传统的情感分析模型中存在的语义缺失以及过度依赖人工标注等问题,提出一种基于LSTM+Word2vec的深度学习情感分析模型。采用Word2vec中的连续词袋模型(continuous bag of words,CBOW),利用语境的上下文结构及语义关系将每个词语映射为向量空间,增强词向量之间的稠密度;采用长短时记忆神经网络模型实现对文本上下文序列的线性抓取,最后输出分类预测的结果。实验结果的准确率可达95.9%,通过对照实验得到情感词典、RNN、SVM三种模型的准确率分别为52.3%、92.7%、85.7%,对比发现基于LSTM+Word2vec的深度学习情感分析模型的准确率更高,具有一定的鲁棒性和泛化性,对用户个性化推送和网络舆情监控具有重要意义。展开更多
Culture-loaded words reveal the basic features of a country’s social life,customs and habits,religious beliefs,and so on.The meanings of culture-loaded words contain the profound cultural connotation of a country or ...Culture-loaded words reveal the basic features of a country’s social life,customs and habits,religious beliefs,and so on.The meanings of culture-loaded words contain the profound cultural connotation of a country or a nation.However,due to the influence of different cultures,the original author and the target readers lack a common cognitive context,which means they cannot successfully conduct a cultural communication.Therefore,correctly understanding and translating these culture-loaded words have become a major problem for translators in the translation process of literary works.Under the guidance of the dual ostensive-inferential model in relevance-theoretical translation theory,this paper,by emphasizing the importance of common cognitive context to a successful translation,analyzes the translation process of five categories of culture-loaded words and summarizes the translation strategies that can accurately convey the writing intention of the original author,which provides a new perspective for the translation of culture-loaded words.展开更多
This paper explores the translation strategies for Chinese reduplication to English from the perspective of Skopos theory. Drawing on previous research on the semantic functions of reduplication, the paper analyzes th...This paper explores the translation strategies for Chinese reduplication to English from the perspective of Skopos theory. Drawing on previous research on the semantic functions of reduplication, the paper analyzes the three main types of semantic changes in Chinese reduplicated words: intensification of meaning, moderation of meaning, and addition of meaning. The paper then proposes different translation strategies for Chinese reduplication, including same-word repetition, near-synonym repetition, “have a try” form, “noun(pl.) + of” form, and “all”, “each”, and “every” pattern. Finally, the paper emphasizes the importance of understanding the intended communicative function of the reduplicated words and the need for flexible handling in order to achieve a natural translation with better readability.展开更多
This paper deals with the translation strategies of Chinese Culture-Loaded Words from the perspective of adaptation theory.It is based on the translation text of the sixth episode“Silk Road”and the seventh episode“...This paper deals with the translation strategies of Chinese Culture-Loaded Words from the perspective of adaptation theory.It is based on the translation text of the sixth episode“Silk Road”and the seventh episode“Dunhuang”of the documentary Hexi Corridor.Many words with Chinese cultural connotations appear in the subtitles of this documentary.This paper will be divided into four parts.The first part and the second part deal with the basic theories,i.e.,definition of Chinese Culture-Loaded Words and of adaptation theory.The original text is analysed in the third part.This part deals with the background and specifics of the language of the documentary film Hexi Corridor.The fourth part deals with the difficulties encountered by the author in translation practice and the corresponding solutions adopted by the author.The translation difficulties are solved by five translation methods,namely transliteration,loan translation,substitution,interpretation,and adaptation.展开更多
Nowadays,Internet of Things(IoT)is widely deployed and brings great opportunities to change people's daily life.To realize more effective human-computer interaction in the IoT applications,the Question Answering(Q...Nowadays,Internet of Things(IoT)is widely deployed and brings great opportunities to change people's daily life.To realize more effective human-computer interaction in the IoT applications,the Question Answering(QA)systems implanted in the IoT services are supposed to improve the ability to understand natural language.Therefore,the distributed representation of words,which contains more semantic or syntactic information,has been playing a more and more important role in the QA systems.However,learning high-quality distributed word vectors requires lots of storage and computing resources,hence it cannot be deployed on the resource-constrained IoT devices.It is a good choice to outsource the data and computation to the cloud servers.Nevertheless,it could cause privacy risks to directly upload private data to the untrusted cloud.Therefore,realizing the word vector learning process over untrusted cloud servers without privacy leakage is an urgent and challenging task.In this paper,we present a novel efficient word vector learning scheme over encrypted data.We first design a series of arithmetic computation protocols.Then we use two non-colluding cloud servers to implement high-quality word vectors learning over encrypted data.The proposed scheme allows us to perform training word vectors on the remote cloud servers while protecting privacy.Security analysis and experiments over real data sets demonstrate that our scheme is more secure and efficient than existing privacy-preserving word vector learning schemes.展开更多
文摘There are many idioms related to color words in English and Chinese.The use of color words in idioms adds beauty and vividness to the language.Due to the cultural differences,“color idioms”have gained different cultural connotations with the development of English and Chinese languages.It is of great significance to accurately understand and grasp the meanings and differences of color-related idioms in Chinese and English.This paper intends to analyze and expound the cultural connotations of English and Chinese idioms related to several widely used basic color words with the aim of helping English learners know and use the idioms about color words better.
文摘Good translations play a very important role in cultural exchange.The idea of reception aesthetics think the reader as the center in translation process.This paper will study the translation of culture-loaded words from the perspective of reception aesthetics.It is divided into five parts.The first part mainly introduces the background and theoretical foundation of this paper,the second part introduces the writer’s works,the third part gives examples to analyze the culture-loaded words appearing in The Golden Cangue,the fourth part gives examples of the translation methods,and the fifth part is the conclusion.It aims to provide a reference for the English translation of culture-loaded words.
文摘[目的/意义]在人工智能技术及应用快速发展与深刻变革背景下,机器学习领域不断出现新的研究主题和方法,深度学习和强化学习技术持续发展。因此,有必要探索不同领域机器学习研究主题演化过程,并识别出热点与新兴主题。[方法/过程]本文以图书情报领域中2011—2022年Web of Science数据库中的机器学习研究论文为例,融合LDA和Word2vec方法进行主题建模和主题演化分析,引入主题强度、主题影响力、主题关注度与主题新颖性指标识别热点主题与新兴热点主题。[结果/结论]研究结果表明,(1)Word2vec语义处理能力与LDA主题演化能力的结合能够更加准确地识别研究主题,直观展示研究主题的分阶段演化规律;(2)图书情报领域的机器学习研究主题主要分为自然语言处理与文本分析、数据挖掘与分析、信息与知识服务三大类范畴。各类主题之间的关联性较强,且具有主题关联演化特征;(3)设计的主题强度、主题影响力和主题关注度指标及综合指标能够较好地识别出2011—2014年、2015—2018年和2019—2022年3个不同周期阶段的热点主题。
文摘We use a lot of devices in our daily life to communicate with others. In this modern world, people use email, Facebook, Twitter, and many other social network sites for exchanging information. People lose their valuable time misspelling and retyping, and some people are not happy to type large sentences because they face unnecessary words or grammatical issues. So, for this reason, word predictive systems help to exchange textual information more quickly, easier, and comfortably for all people. These systems predict the next most probable words and give users to choose of the needed word from these suggested words. Word prediction can help the writer by predicting the next word and helping complete the sentence correctly. This research aims to forecast the most suitable next word to complete a sentence for any given context. In this research, we have worked on the Bangla language. We have presented a process that can expect the next maximum probable and proper words and suggest a complete sentence using predicted words. In this research, GRU-based RNN has been used on the N-gram dataset to develop the proposed model. We collected a large dataset using multiple sources in the Bangla language and also compared it to the other approaches that have been used such as LSTM, and Naive Bayes. But this suggested approach provides excellent exactness than others. Here, the Unigram model provides 88.22%, Bi-gram model is 99.24%, Tri-gram model is 97.69%, and 4-gram and 5-gram models provide 99.43% and 99.78% on average accurateness. We think that our proposed method profound impression on Bangla search engines.
文摘安全是民航业的核心主题。针对目前民航非计划事件分析严重依赖专家经验及分析效率低下的问题,文章提出一种结合Word2vec和双向长短期记忆(bidirectional long short-term memory,BiLSTM)神经网络模型的民航非计划事件分析方法。首先采用Word2vec模型针对事件文本语料进行词向量训练,缩小空间向量维度;然后通过BiLSTM模型自动提取特征,获取事件文本的完整序列信息和上下文特征向量;最后采用softmax函数对民航非计划事件进行分类。实验结果表明,所提出的方法分类效果更好,能达到更优的准确率和F 1值,对不平衡数据样本同样具有较稳定的分类性能,证明了该方法在民航非计划事件分析上的适用性和有效性。
文摘微博作为当今热门的社交平台,其中蕴含着许多具有强烈主观性的用户评论文本。为挖掘微博评论文本中潜在的信息,针对传统的情感分析模型中存在的语义缺失以及过度依赖人工标注等问题,提出一种基于LSTM+Word2vec的深度学习情感分析模型。采用Word2vec中的连续词袋模型(continuous bag of words,CBOW),利用语境的上下文结构及语义关系将每个词语映射为向量空间,增强词向量之间的稠密度;采用长短时记忆神经网络模型实现对文本上下文序列的线性抓取,最后输出分类预测的结果。实验结果的准确率可达95.9%,通过对照实验得到情感词典、RNN、SVM三种模型的准确率分别为52.3%、92.7%、85.7%,对比发现基于LSTM+Word2vec的深度学习情感分析模型的准确率更高,具有一定的鲁棒性和泛化性,对用户个性化推送和网络舆情监控具有重要意义。
文摘Culture-loaded words reveal the basic features of a country’s social life,customs and habits,religious beliefs,and so on.The meanings of culture-loaded words contain the profound cultural connotation of a country or a nation.However,due to the influence of different cultures,the original author and the target readers lack a common cognitive context,which means they cannot successfully conduct a cultural communication.Therefore,correctly understanding and translating these culture-loaded words have become a major problem for translators in the translation process of literary works.Under the guidance of the dual ostensive-inferential model in relevance-theoretical translation theory,this paper,by emphasizing the importance of common cognitive context to a successful translation,analyzes the translation process of five categories of culture-loaded words and summarizes the translation strategies that can accurately convey the writing intention of the original author,which provides a new perspective for the translation of culture-loaded words.
文摘This paper explores the translation strategies for Chinese reduplication to English from the perspective of Skopos theory. Drawing on previous research on the semantic functions of reduplication, the paper analyzes the three main types of semantic changes in Chinese reduplicated words: intensification of meaning, moderation of meaning, and addition of meaning. The paper then proposes different translation strategies for Chinese reduplication, including same-word repetition, near-synonym repetition, “have a try” form, “noun(pl.) + of” form, and “all”, “each”, and “every” pattern. Finally, the paper emphasizes the importance of understanding the intended communicative function of the reduplicated words and the need for flexible handling in order to achieve a natural translation with better readability.
基金Research Startup Project for Doctors at the School of Foreign Languages,University of Shanghai for Science and Technology(Fund Project No.:1F-21-305-101).
文摘This paper deals with the translation strategies of Chinese Culture-Loaded Words from the perspective of adaptation theory.It is based on the translation text of the sixth episode“Silk Road”and the seventh episode“Dunhuang”of the documentary Hexi Corridor.Many words with Chinese cultural connotations appear in the subtitles of this documentary.This paper will be divided into four parts.The first part and the second part deal with the basic theories,i.e.,definition of Chinese Culture-Loaded Words and of adaptation theory.The original text is analysed in the third part.This part deals with the background and specifics of the language of the documentary film Hexi Corridor.The fourth part deals with the difficulties encountered by the author in translation practice and the corresponding solutions adopted by the author.The translation difficulties are solved by five translation methods,namely transliteration,loan translation,substitution,interpretation,and adaptation.
基金supported by the National Natural Science Foundation of China under Grant No.61672195,61872372the Open Foundation of State Key Laboratory of Cryptology No.MMKFKT201617the National University of Defense Technology Grant No.ZK19-38.
文摘Nowadays,Internet of Things(IoT)is widely deployed and brings great opportunities to change people's daily life.To realize more effective human-computer interaction in the IoT applications,the Question Answering(QA)systems implanted in the IoT services are supposed to improve the ability to understand natural language.Therefore,the distributed representation of words,which contains more semantic or syntactic information,has been playing a more and more important role in the QA systems.However,learning high-quality distributed word vectors requires lots of storage and computing resources,hence it cannot be deployed on the resource-constrained IoT devices.It is a good choice to outsource the data and computation to the cloud servers.Nevertheless,it could cause privacy risks to directly upload private data to the untrusted cloud.Therefore,realizing the word vector learning process over untrusted cloud servers without privacy leakage is an urgent and challenging task.In this paper,we present a novel efficient word vector learning scheme over encrypted data.We first design a series of arithmetic computation protocols.Then we use two non-colluding cloud servers to implement high-quality word vectors learning over encrypted data.The proposed scheme allows us to perform training word vectors on the remote cloud servers while protecting privacy.Security analysis and experiments over real data sets demonstrate that our scheme is more secure and efficient than existing privacy-preserving word vector learning schemes.