
Telecom complaint hot topic detection method based on density peaks clustering

Cited by: 1
Abstract: To address the lack of effective methods for detecting hot topics in telecom industry complaints, a complaint hot topic detection method based on the density peaks clustering algorithm is proposed. First, a telecom-specific lexicon is built to segment the text of complaint samples, and the segmented text is represented with the vector space model. Next, the similarity and density of the segmented texts are computed, and the density peaks clustering algorithm is applied to cluster them. Finally, keywords are selected from each cluster and ranked to produce a description of each hot topic. The method was applied to complaint hot topic detection at a telecom company, and the results show that it is effective and of practical value.
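The pipeline outlined in the abstract (vector space model, similarity and density computation, density peaks clustering, keyword ranking per cluster) can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not give the weighting scheme, similarity measure, density kernel, cutoff or keyword ranking rule, so the TF-IDF weights, cosine distance, cutoff-count density, the value `d_c = 0.6` and the summed-weight keyword ranking are all assumptions made for illustration. The toy documents are hypothetical, pre-segmented English tokens standing in for complaint text segmented with a telecom-specific lexicon.

```python
import math
from collections import Counter, defaultdict


def tfidf_vectors(docs):
    """Represent tokenized documents as sparse TF-IDF vectors (vector space model)."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Smoothed IDF (an assumption) keeps every weight strictly positive.
        vectors.append({t: c / len(doc) * (math.log((1 + n) / (1 + df[t])) + 1)
                        for t, c in tf.items()})
    return vectors


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm = (math.sqrt(sum(w * w for w in u.values()))
            * math.sqrt(sum(w * w for w in v.values())))
    return 1.0 - dot / norm if norm else 1.0


def density_peaks(dist, d_c, n_clusters):
    """Density peaks clustering on a precomputed distance matrix."""
    n = len(dist)
    # Local density rho: number of other points closer than the cutoff d_c.
    rho = [sum(1 for j in range(n) if j != i and dist[i][j] < d_c)
           for i in range(n)]
    # Visit points from densest to sparsest; ties are broken by index.
    order = sorted(range(n), key=lambda i: (-rho[i], i))
    delta = [0.0] * n   # distance to the nearest point that ranks higher in density
    parent = [-1] * n   # index of that nearest higher-ranked point
    for rank, i in enumerate(order):
        if rank == 0:
            delta[i] = max(dist[i])  # convention for the densest point
            continue
        j = min(order[:rank], key=lambda k: dist[i][k])
        delta[i], parent[i] = dist[i][j], j
    # Cluster centers: the points with the largest rho * delta.
    centers = sorted(order, key=lambda i: -(rho[i] * delta[i]))[:n_clusters]
    labels = [-1] * n
    for c, i in enumerate(centers):
        labels[i] = c
    # Every other point inherits the label of its nearest higher-ranked point.
    for i in order:
        if labels[i] == -1:
            labels[i] = labels[parent[i]]
    return labels


def cluster_keywords(vectors, labels, top_n=2):
    """Rank the terms of each cluster by their summed TF-IDF weight."""
    weights = defaultdict(Counter)
    for vec, lab in zip(vectors, labels):
        weights[lab].update(vec)  # adds the weights term by term
    return {lab: [t for t, _ in w.most_common(top_n)]
            for lab, w in weights.items()}


# Hypothetical, already segmented complaint texts.
docs = [
    ["broadband", "outage", "repair"],
    ["broadband", "outage", "slow"],
    ["broadband", "repair", "outage"],
    ["bill", "overcharge", "refund"],
    ["bill", "refund", "overcharge"],
    ["bill", "overcharge", "fee"],
]
vectors = tfidf_vectors(docs)
dist = [[cosine_distance(u, v) for v in vectors] for u in vectors]
labels = density_peaks(dist, d_c=0.6, n_clusters=2)
keywords = cluster_keywords(vectors, labels)
print(labels)
print(keywords)
```

In this sketch the number of clusters is fixed in advance for simplicity. Density peaks clustering is often used by inspecting a plot of density against distance and picking the outlying points as centers, and the abstract does not say how the authors choose them.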
Authors: JIANG Jun (江俊); HUANG Hua (黄骅); REN Tiaojuan (任条娟); ZHANG Denghui (张登辉) (1. College of Information Science and Technology, Zhejiang Shuren University, Hangzhou 310015, China; 2. College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310058, China; 3. Eastern Communications Co., Ltd., Hangzhou 310053, China; 4. Wan Xiang Research Institute of Wan Xiang Group Corporation, Hangzhou 311215, China)
Source: Telecommunications Science (《电信科学》), 2019, No. 5, pp. 97-103 (7 pages)
Foundation Items: The Natural Science Foundation of Zhejiang Province of China (No. LGF18F030004, No. LGF19F010005)
Keywords: hot topic detection; text segmentation; cluster analysis

