期刊文献+

基于共享词嵌入空间的跨语言相似问句挖掘

Cross-Language Similar Questions Mining Based on Shared Word Embeddings Space
下载PDF
导出
摘要 针对跨语言相似问句查找问题,提出一种基于共享词嵌入空间计算中文和英文问句相似性的方法。该方法首先用fastText训练中、英文词嵌入,之后训练中文词嵌入转换到英文词嵌入的线性矩阵,再对待处理的中、英文问句做相应处理,生成英文空间下句子嵌入,根据句子嵌入余弦相似性计算句子相似性。实验结果表明该方法是可行的。 Aiming at the problem of cross-language similar questions lookup, proposes a method based on Sentence2Embeddings to calculate the similarity between Chinese and English questions. This method first trains Chinese and English word embeddings with fastText, and then trains the Chinese word embeddings to convert to the linear matrix of English word embeddings, then deals with the Chinese questions and English questions sentence to be processed, and generates sentence embedding in English space. Sentence similarity is calculated based on sentence embedding. Experimental results show that the method is feasible.
作者 刘鹏 周安民 LIU Peng;ZHOU An-min(College of Electronic Information,Sichuan University,Chengdu 610065)
出处 《现代计算机》 2019年第8期16-21,共6页 Modern Computer
关键词 句子嵌入 问句相似 跨语言 fastText Sentence2Embeddings Sentence Embeddings Cross-Language Similar to the Question
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部