Chinese Lyrics Keyword Extraction Algorithm Based on word2vec
Cited by: 3
Abstract: To let users retrieve music quickly and accurately by lyric content, a Chinese lyrics keyword extraction algorithm based on word2vec, a neural word-embedding model, is proposed. The algorithm first uses word2vec to represent the lyrics as word vectors, then computes the similarity between words from those vectors, and finally obtains the lyric keywords with the K-means clustering algorithm. In a comparison with keyword extraction based on TF-IDF and LDA models, the proposed algorithm shows a clear gain in accuracy when 2 to 5 of its 10 extracted keywords are required to match the manually annotated keywords.
Authors: MENG Xiao-yan (蒙晓燕); YIN Yan-jun (殷雁君) (College of Computer, Inner Mongolia Normal University, Hohhot 010022, China)
Source: Journal of Inner Mongolia Normal University (Natural Science Edition) (《内蒙古师范大学学报(自然科学汉文版)》), CAS, 2018, No. 2, pp. 137-140 (4 pages)
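Funding: Supported by the Scientific Research Project Fund for Higher Education Institutions of the Inner Mongolia Autonomous Region (NJZY13047)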
Keywords: word2vec; word vector; lyrics keyword extraction; K-means
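The pipeline outlined in the abstract (word vectors, similarity between words, K-means clustering, keywords) can be illustrated with a minimal sketch. It is not the authors' implementation. Several things here are assumptions made for illustration only: the word vectors are taken as already trained (for example by word2vec) and are replaced by a tiny hand-made 2-D table `TOY_VECTORS`; the function names `cosine`, `kmeans` and `extract_keywords` are hypothetical; and the rule for turning clusters into keywords (take the word most cosine-similar to each cluster centroid) is a guess, since the abstract does not state how keywords are selected from the clusters.

```python
import math
import random


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def sq_dist(u, v):
    """Squared Euclidean distance, used by K-means for assignment."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def kmeans(vectors, k, iters=50, seed=0):
    """Plain Lloyd's K-means; returns (centroids, cluster label per vector)."""
    rng = random.Random(seed)
    centroids = [list(v) for v in rng.sample(vectors, k)]
    labels = None
    for _ in range(iters):
        # Assignment step: each vector joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: sq_dist(v, centroids[c])) for v in vectors
        ]
        if new_labels == labels:  # assignments stable, so converged
            break
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, labels


def extract_keywords(word_vectors, k, seed=0):
    """ASSUMED selection rule: one keyword per cluster, the word whose
    vector is most cosine-similar to the cluster centroid."""
    words = list(word_vectors)
    vectors = [word_vectors[w] for w in words]
    centroids, labels = kmeans(vectors, k, seed=seed)
    keywords = []
    for c in range(k):
        members = [w for w, lab in zip(words, labels) if lab == c]
        if members:
            keywords.append(
                max(members, key=lambda w: cosine(word_vectors[w], centroids[c]))
            )
    return sorted(keywords)


# Hypothetical stand-in for trained word2vec vectors of segmented lyric words.
TOY_VECTORS = {
    "kiss": (1.0, 0.0), "love": (0.9, 0.1), "heart": (0.8, 0.2),
    "wind": (0.0, 1.0), "rain": (0.1, 0.9), "snow": (0.2, 0.8),
}

print(extract_keywords(TOY_VECTORS, k=2))
```

In the setting the abstract describes, the vectors would come from a word2vec model trained on segmented Chinese lyrics, and the number of clusters would be chosen so that 10 keywords are returned per song. The toy table only keeps the sketch self-contained.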