摘要
为了提高哼唱检索旋律匹配的速度和精度,提出了一种基于帧-音符方式的匹配算法。该算法针对旋律曲线的形状特点,采用基频序列表示哼唱片段,采用音符序列表示模板片段,根据累积权重估计基频跳变点位置,然后计算哼唱片段和模板片段之间的编辑距离。在MIREX08数据库上进行的实验结果表明:该算法检索时间为动态时间规整算法的0.013倍;与动态时间规整算法结果进行融合,最终平均排序倒数精度指标可以达到91.2%。
This paper presents a frame-to-note(FTN) algorithm to improve the speed and precision of the melody match for querying by humming(QBH).According to the characteristics of tune curves,the humming phrase is denoted by the pitch sequence while the symbolizing template phrase is denoted by the note sequence.The pitch transition position is estimated based on the predefined weights,with the edit distance between the two phrases then calculated.Experimental results using the MIREX 2008 corpus show that the retrieval time of the FTN algorithm is 0.013 times that of the dynamic time warping(DTW) algorithm,and that the final fusion precision achieves a mean reciprocal rank of 91.2%.
出处
《清华大学学报(自然科学版)》
EI
CAS
CSCD
北大核心
2011年第4期561-565,共5页
Journal of Tsinghua University(Science and Technology)
基金
国家自然科学基金资助项目(61005019
90920302
60931160443)
国家"八六三"高技术项目(2008AA02Z414)
关键词
帧-音符方式算法
基频跳变点
旋律匹配
音乐信息检索
哼唱检索
frame-to-note algorithm
pitch transition position
melody match
music information retrieval
querying by humming