摘要
提出了利用音频子带能量动态范围特征实现两阶段快速音频检索的方法。在预处理阶段根据音频库的子带能量动态范围(DRSBE)特征首先建立1个索引库,检索时分为2步:第一步先计算输入参考音频片段的DRSBE特征,然后根据数据库中建立的索引找到候选音频;第二步计算参考音频和候选音频之间的相似度,输出最后结果。实验结果表明,基于DRSBE特征的快速音频检索方法对于同源音频检索的速度和精度都非常高,在高质量的广播音频检索中达到了实用要求。
A two-stage quick audio retrieval method is proposed based on the DRSBE (Dynamic Range of Sub- Band Energy) feature. In the perprocessing, an audio indexing database is first constructed according to the DRSBE feature of the audio in the database. The audio retrieval method is carried out in two stages. Firstly, the DRSBE feature of the input reference audio clip is calculated, and then the audio clip candidates are quickly extracted from the database by the indexing method. Secondly, the accurate similarity between the reference audio and the audio candidates is evaluated to refine the final output. The experiment show that this quick audio retrieval method based on DRSBE feature has good performance in speed and precision and is good enough for the retrieval of high quality audio in broadcasting application.
出处
《电声技术》
2009年第10期66-68,72,共4页
Audio Engineering
关键词
音频检索
子带能量
动态范围
索引
audio retrieval
sub-band energy
dynamic range
index