Abstract
This paper presents an audio-query-based spoken keyword search algorithm that works by acoustic template matching. The algorithm uses dynamic time warping to match audio queries (templates) against the speech to be searched, yielding the positions at which the keyword corresponding to each query occurs. To improve match quality, the queries are selected and preprocessed to obtain a set of multiple templates that represent the keyword better than the original queries, and this set is used as the matching units. Compared with matching directly against the original queries, the proposed query selection and preprocessing yield a relative improvement of 55.0%.
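The matching step described in the abstract can be illustrated with a minimal subsequence dynamic time warping sketch. The abstract does not specify the features, the frame distance, or the path constraints, so everything here is an assumption made for illustration: the function names `frame_distance` and `subsequence_dtw` are hypothetical, the frame distance is Euclidean, and the three standard step directions are used. It is not the authors' implementation and does not cover their query selection or preprocessing.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two feature frames (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def subsequence_dtw(query, utterance):
    """Locate the span of `utterance` that best matches `query`.

    Both arguments are sequences of equal-dimension feature frames.
    Returns (start, end, cost): inclusive utterance frame indices of the
    best-matching span and its accumulated alignment cost.
    """
    n, m = len(query), len(utterance)
    if n == 0 or m == 0:
        raise ValueError("query and utterance must be non-empty")

    # acc[i][j]: lowest cost of aligning query[0..i] to a span ending at frame j.
    acc = [[0.0] * m for _ in range(n)]
    # begin[i][j]: utterance frame at which that lowest-cost path started.
    begin = [[0] * m for _ in range(n)]

    # First query frame may align anywhere in the utterance at no extra cost.
    for j in range(m):
        acc[0][j] = frame_distance(query[0], utterance[j])
        begin[0][j] = j

    for i in range(1, n):
        # At the first utterance frame the path can only come from above.
        acc[i][0] = acc[i - 1][0] + frame_distance(query[i], utterance[0])
        begin[i][0] = begin[i - 1][0]
        for j in range(1, m):
            # Pick the cheapest predecessor: vertical, horizontal, or diagonal.
            prev_cost, prev_begin = min(
                (acc[i - 1][j], begin[i - 1][j]),
                (acc[i][j - 1], begin[i][j - 1]),
                (acc[i - 1][j - 1], begin[i - 1][j - 1]),
            )
            acc[i][j] = prev_cost + frame_distance(query[i], utterance[j])
            begin[i][j] = prev_begin

    # The match may end at any utterance frame; take the cheapest ending.
    end = min(range(m), key=lambda j: acc[n - 1][j])
    return begin[n - 1][end], end, acc[n - 1][end]


# Toy one-dimensional "features": the query occurs at frames 2 to 4.
query = [[1.0], [2.0], [3.0]]
utterance = [[0.0], [0.0], [1.0], [2.0], [3.0], [0.0]]
print(subsequence_dtw(query, utterance))  # (2, 4, 0.0)
```

In a multi-template setting like the one the abstract describes, one plausible use would be to run this search once per template and combine the resulting costs, though how the paper combines its templates is not stated here.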
Authors
张舸
张鹏远
刘建
颜永红
ZHANG Ge; ZHANG Pengyuan; LIU Jian; YAN Yonghong (The Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100190, China; Xinjiang Laboratory of Minority Speech and Language Information Processing, Xinjiang Technical Institute of Physics & Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China)
Source
《网络新媒体技术》
2019, No. 1, pp. 18-23 (6 pages)
Network New Media Technology
Funding
National Natural Science Foundation of China (U1536117, 11590770-4)
National Key Research and Development Program of China, Key Special Project (2016YFB0801203, 2016YFB0801200)
Major Science and Technology Project of Xinjiang Uygur Autonomous Region (2016A03007-1)