摘要
该文根据现有部分人群的通讯数据,统计并分析所有用户的通话时间和通话次数.根据推导权值计算公式Wij=αcos(→ui,→uj)+βT^2ij/Ti*Tj N^2ij/Ni*Nj计算用户个体个性与用户通话时间和通话次数的加权相关性,构建出相应的权值矩阵,并分别用层次聚类法和K-Means聚类法对所有用户进行聚类,综合分析两种聚类方法,得出已知人群分四类最合适的结论,具体聚类结果见正文.
Based on the existing communication data of some people,this paper counts and analyzes the call time and call times of all users.According to the derivation weight calculation formula Wij=αcos(→ui,→uj)+βT^2ij/Ti*Tj N^2ij/Ni*Nj,the weighted correlation between the user's individual personality and the user's call time and call times is calculated,and a corresponding weight matrix is constructed.Then all users are clustered with hierarchical clustering and K-Means clustering,respectively.It is concluded that it is the most suitable to divide the known population into four categories through a comprehensive analysis of the two clustering methods.For the specific clustering results,see the text.
作者
吴建平
詹桢
王涛
文博
杨红超
Wu Jianping;Zhan Zhen;Wang Tao;Wen Bo;Yang Hongchao
出处
《科教文汇》
2020年第21期107-109,共3页
Journal of Science and Education
基金
永州市2018年度科技创新指导性项目(编号:2018ZD07)
湖南省大学生创新创业训练计划项目(编号:201301702020)
湖南科技学院教改课题(编号:湘科院教发[2018]41(29))
湖南省教改课题(编号:湘教通[2019]291(849))。