期刊文献+
共找到8篇文章
< 1 >
每页显示 20 50 100
基于模糊聚类决策树的分布式语者识别算法 被引量:1
1
作者 黄继鹏 陈志 +1 位作者 芮路 王宇虹 《计算机技术与发展》 2017年第8期79-82,87,共5页
为解决大规模语者识别问题中普遍存在的加性噪声、高计算复杂度等问题,提高大规模语者识别算法的抗噪性和鲁棒性,利用模糊聚类决策树,提出了一种分布式语者识别算法。该算法将训练数据等分成几个部分,对这几个部分分别使用基于模糊聚类... 为解决大规模语者识别问题中普遍存在的加性噪声、高计算复杂度等问题,提高大规模语者识别算法的抗噪性和鲁棒性,利用模糊聚类决策树,提出了一种分布式语者识别算法。该算法将训练数据等分成几个部分,对这几个部分分别使用基于模糊聚类的决策树算法进行训练;对于输入的测试样本,用建好的决策树进行分类,判断它属于哪棵树的哪个叶节点;在该选定的叶节点上使用梅尔频率倒谱系数和高斯混合模型识别方法识别该语者身份。对训练数据进行模糊聚类的过程主要包括四个步骤:根据相应的层提取语音特征;计算特征数据的均值和标准差得到信任间距集合;对集合使用Lloyd算法得到分隔向量;以分隔向量为基础进行聚类分组得到下一层的节点。实验结果表明,与传统的硬聚类算法相比,该算法能够提高语者识别的准确率和分类效率,对加性噪声具有良好的抗干扰能力。 展开更多
关键词 语者识别 模糊聚类 决策树 分布式计算
下载PDF
语者识别系统设计及Matlab实现
2
作者 王常衡 卢曼 +1 位作者 李嘉伟 罗钦 《计算机产品与流通》 2019年第5期136-136,共1页
本文使用15段不同说话人的音频进行了语者识别系统的设计,给出其整体设计思路,使用Matlab实现分步设计,并对系统性能进行测试。
关键词 语者识别 特征提取 GMM MATLAB
下载PDF
基于计算听觉场景分析和语者模型信息的语音识别鲁棒前端研究 被引量:2
3
作者 关勇 李鹏 +1 位作者 刘文举 徐波 《自动化学报》 EI CSCD 北大核心 2009年第4期410-416,共7页
传统抗噪算法无法解决人声背景下语音识别(Automatic speech recognition,ASR)系统的鲁棒性问题.本文提出了一种基于计算听觉场景分析(Computational auditory scene analysis,CASA)和语者模型信息的混合语音分离系统.该系统在CASA框架... 传统抗噪算法无法解决人声背景下语音识别(Automatic speech recognition,ASR)系统的鲁棒性问题.本文提出了一种基于计算听觉场景分析(Computational auditory scene analysis,CASA)和语者模型信息的混合语音分离系统.该系统在CASA框架下,利用语者模型信息和因大子最大矢量量化(Factorial-max vector quantization,MAXVQ)方法进行实值掩码估计,实现了两语者混合语音中有效地分离出目标说话人语音的目标,从而为ASR系统提供了鲁棒的识别前端.在语音分离挑战(Speech separation challenge,SSC)数据集上的评估表明,相比基线系统,本文所提出的系统的语音识别正确率提高了15.68%,相关的实验结果也验证了本文提出的多语者识别和实值掩码估计的有效性. 展开更多
关键词 计算听觉场景分析 语音分离 鲁棒语音识别 因子最大矢量量化 语者识别
下载PDF
利用谐波显著度和语者音色特征的混合语音中目标人基频轨迹提取 被引量:3
4
作者 后方帅 黎美琪 刘若伦 《声学技术》 CSCD 北大核心 2019年第4期408-413,共6页
从混合语音中提取出目标语者的基频轨迹,是语音监听、语音门禁、对话管理等应用的关键技术。为提高基频轨迹跟踪的准确率、增强抗八度误差的能力、降低系统复杂度,多基频估计以谐波乘积谱为核心,八度校正与基频分组均以元音段为基本单元... 从混合语音中提取出目标语者的基频轨迹,是语音监听、语音门禁、对话管理等应用的关键技术。为提高基频轨迹跟踪的准确率、增强抗八度误差的能力、降低系统复杂度,多基频估计以谐波乘积谱为核心,八度校正与基频分组均以元音段为基本单元,并结合了谐波显著度和语者音色特征。基于MIREX2005语音数据集的实验表明,MIREX的4种多基频估计性能指标均在75%以上,基频分组在混合语音中的判断准确率可达92%。 展开更多
关键词 多基频轨迹 谐波乘积谱 语者识别
下载PDF
方言语料数据库管理系统设计
5
作者 张颖 王钢 安然 《新乡学院学报》 2008年第3期57-58,共2页
由于网络通讯技术的发展,产生的“网络语言”也可以认为是一种“方言”。利用数据库技术、多媒体技术和网络技术建立方言语料数据库,应用于语言识别,作为公安机关侦察破案的辅助手段,也可以应用于方言识别和方言语料的研究。
关键词 方言 语料 语者识别 语音识别
下载PDF
Integrated search technique for parameter determination of SVM for speech recognition 被引量:2
6
作者 Teena Mittal R.K.Sharma 《Journal of Central South University》 SCIE EI CAS CSCD 2016年第6期1390-1398,共9页
Support vector machine(SVM)has a good application prospect for speech recognition problems;still optimum parameter selection is a vital issue for it.To improve the learning ability of SVM,a method for searching the op... Support vector machine(SVM)has a good application prospect for speech recognition problems;still optimum parameter selection is a vital issue for it.To improve the learning ability of SVM,a method for searching the optimal parameters based on integration of predator prey optimization(PPO)and Hooke-Jeeves method has been proposed.In PPO technique,population consists of prey and predator particles.The prey particles search the optimum solution and predator always attacks the global best prey particle.The solution obtained by PPO is further improved by applying Hooke-Jeeves method.Proposed method is applied to recognize isolated words in a Hindi speech database and also to recognize words in a benchmark database TI-20 in clean and noisy environment.A recognition rate of 81.5%for Hindi database and 92.2%for TI-20 database has been achieved using proposed technique. 展开更多
关键词 support vector machine (SVM) predator prey optimization speech recognition Mel-frequency cepstral coefficients wavelet packets Hooke-Jeeves method
下载PDF
A Combined Speaker Adaptation Method for Mandarin Speech Recognition
7
作者 徐向华 朱杰 《Journal of Shanghai Jiaotong university(Science)》 EI 2004年第4期21-24,共4页
A speaker adaptation method that combines transformation matrix linear interpolation with maximum a posteriori (MAP) was proposed. Firstly this method can keep the asymptotical characteristic of MAP. Secondly, as the ... A speaker adaptation method that combines transformation matrix linear interpolation with maximum a posteriori (MAP) was proposed. Firstly this method can keep the asymptotical characteristic of MAP. Secondly, as the method uses linear interpolation with several speaker-dependent (SD) transformation matrixes, it can fully use the prior knowledge and keep fast adaptation. The experimental results show that the combined method achieves an 8.24% word error rate reduction with only one adaptation utterance, and keeps asymptotic to the performance of SD model for large amounts of adaptation data. 展开更多
关键词 speech recognition speaker adaptation maximum a posteriori (MAP) maximum likelihood model interpolation (MLMI)
下载PDF
A Case Study on Intermediate CSL Learners' Word Recognition Processes and Strategies in Contextual Reading Settings 被引量:1
8
作者 Shaoqian Luo Xiaohui SUN 《Chinese Journal of Applied Linguistics》 2018年第3期288-305,396,共19页
This study investigates word recognition processes and strategies of intermediate learners of Chinese as a Second Language (CSL) in contextual reading settings. Two intermediate CSL learners were chosen as research ... This study investigates word recognition processes and strategies of intermediate learners of Chinese as a Second Language (CSL) in contextual reading settings. Two intermediate CSL learners were chosen as research participants, and think-aloud methods and retrospective interviews were used to collect data. The data were analyzed by using Moustakas' data analysis procedure, CresweU's three steps and Bogdon and Biklen's data analysis methods. Results indicated that intermediate CSL learners go through different processes of word recognition as it might be automatic, based on context, pronunciation, previous knowledge and the meaning of characters, or, in case of word recognition failure, skipping the words or skipping them but reading them again later; and their word recognition strategies in contextual reading settings mainly include cognitive strategies and self-regulatory strategies. Among these strategies, cognitive strategies consist of direct transformation, translation, interpretation, guessing, inferring and finding key words; and self-regulatory strategies include metacognitive strategies, behavior regulating strategies, emotion regulating strategies and motivation regulating strategies. A model of intermediate CSL learners' word recognition strategies can be constructed based on the results. The present study provides both theoretical and pedagogical implications in the field of CSL vocabulary acquisition and teaching. 展开更多
关键词 intermediate CSL learners' word recognition processes strategies contextual reading settings
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部