Abstract
Hierarchical agglomerative clustering (HAC) is a widely used clustering method. This paper exploits the close relationship between phonemes and temporal continuity in speech features to improve the fast HAC algorithm for unsupervised segmentation of speech signals into phoneme-like units. The algorithm rests on the observation that features within the same segment are more similar to one another than features across segment boundaries. Similarity is measured by the Euclidean distance between adjacent features, and these distances are stored in a doubly linked list: each node holds the distance between two adjacent speech features and pointers to the preceding and following nodes. The algorithm traverses this list to find the minimum distance, merges the corresponding pair of adjacent features, and repeats until a single cluster remains or a threshold is met. The entire process is unsupervised. Compared with the fast HAC algorithm, the proposed method speeds up clustering by a factor of more than 65 and uses less memory, and it can be applied to zero-resource speech segmentation.
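The merge loop described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the names `Segment` and `adjacent_hac` are hypothetical, and representing each merged segment by the size-weighted mean of its frames is an assumption, since the abstract does not say how a merged segment is represented. Each list node stores the distance to its right-hand neighbour together with pointers to the previous and next nodes.

```python
import math


class Segment:
    """Node of a doubly linked list: one run of consecutive frames."""

    def __init__(self, index, frame):
        self.start = index          # first frame index (inclusive)
        self.end = index + 1        # last frame index (exclusive)
        self.mean = list(frame)     # representative vector (assumption: mean)
        self.dist_next = math.inf   # Euclidean distance to the next segment
        self.prev = None
        self.next = None


def _update_dist(seg):
    """Recompute the distance from seg to its right-hand neighbour."""
    seg.dist_next = math.dist(seg.mean, seg.next.mean) if seg.next else math.inf


def adjacent_hac(frames, n_segments=1, threshold=math.inf):
    """Merge adjacent segments until n_segments remain or the smallest
    adjacent distance exceeds threshold. Returns (start, end) index pairs."""
    if not frames:
        return []
    nodes = [Segment(i, f) for i, f in enumerate(frames)]
    for a, b in zip(nodes, nodes[1:]):
        a.next, b.prev = b, a
    for node in nodes:
        _update_dist(node)
    head, count = nodes[0], len(nodes)

    while count > max(n_segments, 1):
        # Linear scan of the list for the closest adjacent pair.
        best, node = head, head.next
        while node is not None:
            if node.dist_next < best.dist_next:
                best = node
            node = node.next
        if best.dist_next > threshold:
            break
        # Merge best with its right-hand neighbour (size-weighted mean).
        right = best.next
        n_a, n_b = best.end - best.start, right.end - right.start
        best.mean = [(n_a * x + n_b * y) / (n_a + n_b)
                     for x, y in zip(best.mean, right.mean)]
        best.end = right.end
        best.next = right.next
        if right.next is not None:
            right.next.prev = best
        # Only the two distances touching the merged segment change.
        _update_dist(best)
        if best.prev is not None:
            _update_dist(best.prev)
        count -= 1

    # Walk the list once more to collect the segment boundaries.
    out, node = [], head
    while node is not None:
        out.append((node.start, node.end))
        node = node.next
    return out
```

For example, `adjacent_hac([[0.0], [0.1], [5.0], [5.1], [5.2]], n_segments=2)` returns `[(0, 2), (2, 5)]`. Because only neighbouring segments are ever compared, the sketch keeps one distance per node instead of a full pairwise distance matrix, which matches the memory saving the abstract attributes to the adjacency constraint.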
Source
Computer Science and Application (《计算机科学与应用》)
2020, Issue 8, pp. 1464-1470 (7 pages)
Keywords
Unsupervised
Phoneme
HAC Algorithm
Speech Segmentation
Adjacent