摘要
自然最近邻算法(TNDP)所得结果的类内差异很小,若待分类集合元素较多,则可能出现过度分割问题——具有相似特征元素的子集被分割为多个小类。为此,在使用TNDP算法后,若类内平方和差分小于给定的阀值,就把距离最近的两个子类合并为一个大类,如此反复,直到类间的距离足够大为止。这可保证“具有相似特征元素的多个子类”合并为一个较大类,从而提高最终分类结果的可解释性。
Since the results of the natural nearest neighbor algorithm(TNDP)have small intra-class differences,if there are many elements in the set to be classified,there may be an over-segmentation problem,a subset of elements with similar characteristics is divided into multiple small classes.This article suggests that after using the TNDP algorithm,if the difference of the sum of squares within the classes is less than a given threshold,the closest two subclasses can be combined into a new big class.Do this until the distance between each class is large enough.This can often ensure that"multiple classes with similar feature elements"are merged into one larger category,thereby improving the interpretability of the final classification result.
作者
李俊海
LI Junhai(College of Science,Henan University of Technology,Zhengzhou 450007,China)
出处
《新乡学院学报》
2020年第12期38-42,共5页
Journal of Xinxiang University
基金
河南省高等学校重点科研项目(20B416001)。
关键词
自然最近邻居
密度聚类算法
类间相似度
natural nearest neighbor
clustering algorithm based on density
similarity between clusters