期刊文献+
共找到1,183篇文章
< 1 2 60 >
每页显示 20 50 100
Implementation of Hybrid Deep Reinforcement Learning Technique for Speech Signal Classification
1
作者 R.Gayathri K.Sheela Sobana Rani 《Computer Systems Science & Engineering》 SCIE EI 2023年第7期43-56,共14页
Classification of speech signals is a vital part of speech signal processing systems.With the advent of speech coding and synthesis,the classification of the speech signal is made accurate and faster.Conventional meth... Classification of speech signals is a vital part of speech signal processing systems.With the advent of speech coding and synthesis,the classification of the speech signal is made accurate and faster.Conventional methods are considered inaccurate due to the uncertainty and diversity of speech signals in the case of real speech signal classification.In this paper,we use efficient speech signal classification using a series of neural network classifiers with reinforcement learning operations.Prior classification of speech signals,the study extracts the essential features from the speech signal using Cepstral Analysis.The features are extracted by converting the speech waveform to a parametric representation to obtain a relatively minimized data rate.Hence to improve the precision of classification,Generative Adversarial Networks are used and it tends to classify the speech signal after the extraction of features from the speech signal using the cepstral coefficient.The classifiers are trained with these features initially and the best classifier is chosen to perform the task of classification on new datasets.The validation of testing sets is evaluated using RL that provides feedback to Classifiers.Finally,at the user interface,the signals are played by decoding the signal after being retrieved from the classifier back based on the input query.The results are evaluated in the form of accuracy,recall,precision,f-measure,and error rate,where generative adversarial network attains an increased accuracy rate than other methods:Multi-Layer Perceptron,Recurrent Neural Networks,Deep belief Networks,and Convolutional Neural Networks. 展开更多
关键词 Neural network(NN) reinforcement learning(RL) cepstral coefficient speech signal classification
下载PDF
Enhancing Parkinson’s Disease Diagnosis Accuracy Through Speech Signal Algorithm Modeling 被引量:1
2
作者 Omar M.El-Habbak Abdelrahman M.Abdelalim +5 位作者 Nour H.Mohamed Habiba M.Abd-Elaty Mostafa A.Hammouda Yasmeen Y.Mohamed Mohanad A.Taifor Ali W.Mohamed 《Computers, Materials & Continua》 SCIE EI 2022年第2期2953-2969,共17页
Parkinson’s disease(PD),one of whose symptoms is dysphonia,is a prevalent neurodegenerative disease.The use of outdated diagnosis techniques,which yield inaccurate and unreliable results,continues to represent an obs... Parkinson’s disease(PD),one of whose symptoms is dysphonia,is a prevalent neurodegenerative disease.The use of outdated diagnosis techniques,which yield inaccurate and unreliable results,continues to represent an obstacle in early-stage detection and diagnosis for clinical professionals in the medical field.To solve this issue,the study proposes using machine learning and deep learning models to analyze processed speech signals of patients’voice recordings.Datasets of these processed speech signals were obtained and experimented on by random forest and logistic regression classifiers.Results were highly successful,with 90%accuracy produced by the random forest classifier and 81.5%by the logistic regression classifier.Furthermore,a deep neural network was implemented to investigate if such variation in method could add to the findings.It proved to be effective,as the neural network yielded an accuracy of nearly 92%.Such results suggest that it is possible to accurately diagnose early-stage PD through merely testing patients’voices.This research calls for a revolutionary diagnostic approach in decision support systems,and is the first step in a market-wide implementation of healthcare software dedicated to the aid of clinicians in early diagnosis of PD. 展开更多
关键词 Early diagnosis logistic regression neural network Parkinson’s disease random forest speech signal processing algorithms
下载PDF
COMPRESSED SPEECH SIGNAL SENSING BASED ON THE STRUCTURED BLOCK SPARSITY WITH PARTIAL KNOWLEDGE OF SUPPORT 被引量:1
3
作者 JiYunyun YangZhen XuQian 《Journal of Electronics(China)》 2012年第1期62-71,共10页
Structural and statistical characteristics of signals can improve the performance of Compressed Sensing (CS). Two kinds of features of Discrete Cosine Transform (DCT) coefficients of voiced speech signals are discusse... Structural and statistical characteristics of signals can improve the performance of Compressed Sensing (CS). Two kinds of features of Discrete Cosine Transform (DCT) coefficients of voiced speech signals are discussed in this paper. The first one is the block sparsity of DCT coefficients of voiced speech formulated from two different aspects which are the distribution of the DCT coefficients of voiced speech and the comparison of reconstruction performance between the mixed program and Basis Pursuit (BP). The block sparsity of DCT coefficients of voiced speech means that some algorithms of block-sparse CS can be used to improve the recovery performance of speech signals. It is proved by the simulation results of the mixed program which is an improved version of the mixed program. The second one is the well known large DCT coefficients of voiced speech focus on low frequency. In line with this feature, a special Gaussian and Partial Identity Joint (GPIJ) matrix is constructed as the sensing matrix for voiced speech signals. Simulation results show that the GPIJ matrix outperforms the classical Gaussian matrix for speech signals of male and female adults. 展开更多
关键词 Compressed Sensing (CS) speech signals Sensing matrix Block sparsity
下载PDF
Speech Signal Detection Based on Bayesian Estimation by Observing Air-Conducted Speech under Existence of Surrounding Noise with the Aid of Bone-Conducted Speech 被引量:1
4
作者 Hisako Orimoto Akira Ikuta Kouji Hasegawa 《Intelligent Information Management》 2021年第4期199-213,共15页
In order to apply speech recognition systems to actual circumstances such as inspection and maintenance operations in industrial factories to recording and reporting routines at construction sites, etc. where hand-wri... In order to apply speech recognition systems to actual circumstances such as inspection and maintenance operations in industrial factories to recording and reporting routines at construction sites, etc. where hand-writing is difficult, some countermeasure methods for surrounding noise are indispensable. In this study, a signal detection method to remove the noise for actual speech signals is proposed by using Bayesian estimation with the aid of bone-conducted speech. More specifically, by introducing Bayes’ theorem based on the observation of air-conducted speech contaminated by surrounding background noise, a new type of algorithm for noise removal is theoretically derived. In the proposed speech detection method, bone-conducted speech is utilized in order to obtain precise estimation for speech signals. The effectiveness of the proposed method is experimentally confirmed by applying it to air- and bone-conducted speeches measured in real environment under the existence of surrounding background noise. 展开更多
关键词 speech signal Detection Bayesian Estimation Air- and Bone-Conducted speeches Surrounding Noise
下载PDF
A DISTRIBUTED COMPRESSED SENSING APPROACH FOR SPEECH SIGNAL DENOISING
5
作者 Ji Yunyun Yang Zhen 《Journal of Electronics(China)》 2011年第4期509-517,共9页
Compressed sensing,a new area of signal processing rising in recent years,seeks to minimize the number of samples that is necessary to be taken from a signal for precise reconstruction.The precondition of compressed s... Compressed sensing,a new area of signal processing rising in recent years,seeks to minimize the number of samples that is necessary to be taken from a signal for precise reconstruction.The precondition of compressed sensing theory is the sparsity of signals.In this paper,two methods to estimate the sparsity level of the signal are formulated.And then an approach to estimate the sparsity level directly from the noisy signal is presented.Moreover,a scheme based on distributed compressed sensing for speech signal denoising is described in this work which exploits multiple measurements of the noisy speech signal to construct the block-sparse data and then reconstruct the original speech signal using block-sparse model-based Compressive Sampling Matching Pursuit(CoSaMP) algorithm.Several simulation results demonstrate the accuracy of the estimated sparsity level and that this de-noising system for noisy speech signals can achieve favorable performance especially when speech signals suffer severe noise. 展开更多
关键词 Distributed compressed sensing Sparsity estimation speech signal DENOISING
下载PDF
Analysis of Deaf Speakers’ Speech Signal for Understanding the Acoustic Characteristics by Territory Specific Utterances
6
作者 Nirmaladevi Jaganathan Bommannaraja Kanagaraj 《Circuits and Systems》 2016年第8期1709-1721,共13页
An important concern with the deaf community is inability to hear partially or totally. This may affect the development of language during childhood, which limits their habitual existence. Consequently to facilitate s... An important concern with the deaf community is inability to hear partially or totally. This may affect the development of language during childhood, which limits their habitual existence. Consequently to facilitate such deaf speakers through certain assistive mechanism, an effort has been taken to understand the acoustic characteristics of deaf speakers by evaluating the territory specific utterances. Speech signals are acquired from 32 normal and 32 deaf speakers by uttering ten Indian native Tamil language words. The speech parameters like pitch, formants, signal-to-noise ratio, energy, intensity, jitter and shimmer are analyzed. From the results, it has been observed that the acoustic characteristics of deaf speakers differ significantly and their quantitative measure dominates the normal speakers for the words considered. The study also reveals that the informative part of speech in a normal and deaf speakers may be identified using the acoustic features. In addition, these attributes may be used for differential corrections of deaf speaker’s speech signal and facilitate listeners to understand the conveyed information. 展开更多
关键词 Deaf Speaker Hard of Hearing Deaf speech Processing Assistive Mechanism for Deaf Speaker speech Correction speech signal Processing
下载PDF
SELECTION OF PROPER EMBEDDING DIMENSION IN PHASE SPACE RECONSTRUCTION OF SPEECH SIGNALS
7
作者 Lin Jiayu Huang Zhiping Wang Yueke Shen Zhenken (Dept.4 and Dept.8, Nat/onaJ University of Defence Technology, Changsha 410073) 《Journal of Electronics(China)》 2000年第2期161-169,共9页
In phase space reconstruction of time series, the selection of embedding dimension is important. Based on the idea of checking the behavior of near neighbors in the reconstruction dimension, a new method to determine ... In phase space reconstruction of time series, the selection of embedding dimension is important. Based on the idea of checking the behavior of near neighbors in the reconstruction dimension, a new method to determine proper minimum embedding dimension is constructed. This method has a sound theoretical basis and can lead to good result. It can indicate the noise level in the data to be reconstructed, and estimate the reconstruction quality. It is applied to speech signal reconstruction and the generic embedding dimension of speech signals is deduced. 展开更多
关键词 speech signals CHAOS Phase space RECONSTRUCTION EMBEDDING DIMENSION False nearest NEIGHBOR Noise level estimation RECONSTRUCTION quality
下载PDF
Heart Rate Extraction from Vowel Speech Signals 被引量:6
8
作者 Abdelwadood Mesleh Dmitriy Skopin +1 位作者 Sergey Baglikov Anas Quteishat 《Journal of Computer Science & Technology》 SCIE EI CSCD 2012年第6期1243-1251,共9页
This paper presents a novel non-contact heart rate extraction method from vowel speech signals. The proposed method is based on modeling the relationship between speech production of vowel speech signals and heart act... This paper presents a novel non-contact heart rate extraction method from vowel speech signals. The proposed method is based on modeling the relationship between speech production of vowel speech signals and heart activities for humans where it is observed that the moment of heart beat causes a short increment (evolution) of vowel speech formants. The short-time Fourier transform (STFT) is used to detect the formant maximum peaks so as to accurately estimate the heart rate. Compared with traditional contact pulse oximeter, the average accuracy of the proposed non-contact heart rate extraction method exceeds 95%. The proposed non-contact heart rate extraction method is expected to play an important role in modern medical applications. 展开更多
关键词 ELECTROCARDIOGRAM feature extraction heart rate short-tlme Fourier transform vowel speech signal
原文传递
A coherent method for finding arrival directions of speech signals and its application for noise reduction in microphone array
9
作者 Zheng Liu and Fumitada Itakura (Department of Electrical Engineering, Faculty of Engineering,Nagoya University Furo-Cho, Chikusa-Ku, Nagoya, 464-01, Japan ) 《Chinese Journal of Acoustics》 1997年第3期214-228,共15页
The research on finding the arrival directions of speech signals by microphone arrny is proposed. We first analyze the uniform microphone array and give the design for microphone array applied in the hand-free speech ... The research on finding the arrival directions of speech signals by microphone arrny is proposed. We first analyze the uniform microphone array and give the design for microphone array applied in the hand-free speech recognition. Combining the traditional direction finding technique of MUltiple SIgnal Classification (MUSIC) with the focusing matrix method, we improve the resolving power of the microphone array for multiple speech sources.As one application of finding Direction of Arrival (DOA), a new microphone-array system for noise reduction is proposed. The new system is based on maximum likelihood estimate technique which reconstruct superimposed signals from different directions by using DOA information. The DOA information is got in terms of focusing MUSIC method which has been proven to have high performance than conventional MUSIC method on speaker localization[1]. 展开更多
关键词 IEEE ASSP A coherent method for finding arrival directions of speech signals and its application for noise reduction in microphone array
原文传递
Acquirement and enhancement of remote speech signals 被引量:4
10
作者 吕韬 郭劲 +2 位作者 张合勇 晏春回 王灿进 《Optoelectronics Letters》 EI 2017年第4期275-278,共4页
To address the challenges of non-cooperative and remote acoustic detection, an all-fiber laser Doppler vibrometer(LDV) is established. The all-fiber LDV system can offer the advantages of smaller size, lightweight des... To address the challenges of non-cooperative and remote acoustic detection, an all-fiber laser Doppler vibrometer(LDV) is established. The all-fiber LDV system can offer the advantages of smaller size, lightweight design and robust structure, hence it is a better fit for remote speech detection. In order to improve the performance and the efficiency of LDV for long-range hearing, the speech enhancement technology based on optimally modified log-spectral amplitude(OM-LSA) algorithm is used. The experimental results show that the comprehensible speech signals within the range of 150 m can be obtained by the proposed LDV. The signal-to-noise ratio(SNR) and mean opinion score(MOS) of the LDV speech signal can be increased by 100% and 27%, respectively, by using the speech enhancement technology. This all-fiber LDV, which combines the speech enhancement technology, can meet the practical demand in engineering. 展开更多
关键词 语音信号 远程 LDV系统 激光多普勒测速仪 modified 语音增强技术 全光纤 声学检测
原文传递
基于小波变换的语音信号去噪算法优化
11
作者 王红娟 尚莹莹 《电声技术》 2024年第5期67-69,共3页
深入研究基于小波变换的语音信号去噪方法,并针对传统方法在复杂噪声环境下处理效果不佳的问题,提出一种基于自适应阈值的小波变换去噪优化方法。首先,分析小波变换去噪的基本原理。其次,深入研究自适应阈值技术的数学模型,并将其应用... 深入研究基于小波变换的语音信号去噪方法,并针对传统方法在复杂噪声环境下处理效果不佳的问题,提出一种基于自适应阈值的小波变换去噪优化方法。首先,分析小波变换去噪的基本原理。其次,深入研究自适应阈值技术的数学模型,并将其应用于小波变换,通过动态调整阈值来适应不同噪声环境的需求。最后,采用Aurora数据集进行实验验证。实验结果表明,该方法能够有效去除噪声。 展开更多
关键词 小波变换 语音去噪 自适应阈值 语音信号
下载PDF
中文版hearWHO应用程序在听力筛查中的验证
12
作者 李静 米玛玉珍 +1 位作者 古鑫 傅新星 《中国耳鼻咽喉头颈外科》 CSCD 2024年第6期364-368,共5页
目的验证中文版hearWHO应用程序,一款噪声下数字言语测试工具,在听力筛查中的有效性。方法招募听力正常者12名,听力损失患者92例。所有受试者依次完成纯音听阈测试、声导抗测试以及中文版hearWHO测试。分析双耳PTA0.5~4kHz和hearWHO得... 目的验证中文版hearWHO应用程序,一款噪声下数字言语测试工具,在听力筛查中的有效性。方法招募听力正常者12名,听力损失患者92例。所有受试者依次完成纯音听阈测试、声导抗测试以及中文版hearWHO测试。分析双耳PTA0.5~4kHz和hearWHO得分的相关性,采用受试者工作特性曲线及曲线下面积等参数,确定hearWHO得分在最佳诊断截点下的灵敏度和特异度。结果hearWHO得分与较好耳和较差耳纯音听阈均值PTA0.5~4kHz均呈显著负相关性,相关系数分别为-0.552和-0.693(P<0.001)。按PTA0.5~4kHz≥20dBHL为存在听力损失的标准,当hearWHO得分≤60时,提示至少单耳出现听力下降,灵敏度和特异度分别为0.733、0.690;当hearWHO得分≤50时,提示受试者可能存在双耳听力下降,灵敏度和特异度分别为0.815、0.917。结论hearWHO对检出单耳或双耳听力损失的灵敏度和特异度均较高,当hearWHO得分≤60时,提示受试者有必要转诊进行进一步的听力检测及诊断。 展开更多
关键词 噪声 言语识别测验 信噪比 噪声下数字言语测试 言语识别阈 听力筛查
下载PDF
助听器配戴对听配能随任务变化趋势的影响
13
作者 田宏斌 李佳龄 +1 位作者 张敏 韩朝 《中国听力语言康复科学杂志》 2024年第2期132-138,共7页
目的探讨助听器配戴是否影响听配能随任务难度变化的趋势,以及听觉任务诱发的主观听配能评价与日常生活经验相关的听配能评价之间的关系。方法选取24名双耳配戴助听器的成年患者,分别在未助听和助听的条件下,采用语速和信噪比两个听觉任... 目的探讨助听器配戴是否影响听配能随任务难度变化的趋势,以及听觉任务诱发的主观听配能评价与日常生活经验相关的听配能评价之间的关系。方法选取24名双耳配戴助听器的成年患者,分别在未助听和助听的条件下,采用语速和信噪比两个听觉任务(分别包括6个难度级别)诱导进行听配能主观评估,并结合日常生活场景相关的听配能评价。通过线性回归模型对听配能评分随任务负荷变化趋势进行分析,线性回归线斜率和截距为听配能因变量,分析助听前后听配能的变化以及两种主观听配能评价间的相关性。结果在语速和信噪比听觉任务下,助听器配戴前听配能回归线截距均显著高于助听器配戴后,即在较容易的聆听环境下助听器配戴使听配能明显减少(P<0.05),助听器配戴后听配能回归线斜率显著高于助听器配戴前,即助听后听配能在难易任务中的差距更明显(P<0.05);听配能的减少幅度与每日配戴助听器的时数有关,每日配戴时间越长,听配能减少越多(P<0.05);助听后听觉任务诱导的主观听配能评分与常见生活场景相关的主观听配能评分无关(P>0.05)。结论在典型的听力临床门诊环境下,采用具有较宽任务难度梯度的听觉任务诱导听配能自陈评估的方法能够有效地反映助听器配戴对优化听配能投入的积极效果。每日助听器配戴时间是获得听配能收益的重要影响因素。主观评价听配能时需要充分考虑情景诱导的差异性。 展开更多
关键词 助听器 听配能 语速 信噪比
下载PDF
非对称性听力损失使用信号对传式助听器干预效果研究
14
作者 田沛然 史文迪 +3 位作者 鲍小欢 陈佳慧 王旖旎 张建 《中国眼耳鼻喉科杂志》 2024年第1期32-35,39,共5页
目的采用真耳分析和不同信噪比下的言语识别率测试初步评价双耳信号对传式(BICROS)助听器的助听效果。方法选自2019年2月~2020年1月20例一侧全聋、一侧听力为中度听力损失(简称听损)患者在通过真耳分析验证达到验配公式目标值后,分别在... 目的采用真耳分析和不同信噪比下的言语识别率测试初步评价双耳信号对传式(BICROS)助听器的助听效果。方法选自2019年2月~2020年1月20例一侧全聋、一侧听力为中度听力损失(简称听损)患者在通过真耳分析验证达到验配公式目标值后,分别在单耳佩戴助听器和双耳佩戴BICROS助听器干预前后的安静环境和噪声环境进行言语测试和声场助听听阈评估。结果患者单耳佩戴助听器与双耳佩戴BICROS助听器在0.5~4 kHz的声场助听听阈高度相关(P值均<0.001);声场下65 dB SPL言语识别率测试显示:安静环境、噪声环境(信噪比为5 dB)下双耳佩戴BICROS助听器与仅较好耳佩戴助听器相比差异均有统计学意义(P值均<0.001)。安静环境下裸耳和双耳佩戴BICROS的最大言语识别率显著相关(r=0.911,P<0.001)。结论安静环境、噪音环境下单耳极重度非对称性听损患者通过较好耳单侧佩戴助听器和使用BICROS助听器均能受益,双耳佩戴BICROS助听器在噪声环境下受益更明显。 展开更多
关键词 单侧聋 非对称性 双耳信号对传式助听器 言语识别率
下载PDF
基于语音信号的跨语种交互翻译机器人语义纠错方法
15
作者 付曼 《信息与电脑》 2024年第5期31-33,共3页
传统的跨语种交互翻译机器人语义纠错方法通常是单向的,效率较低,导致识别错误率较高。为此,文章提出基于语音信号的跨语种交互翻译机器人语义纠错方法。在基础语音识别的基础上,通过交互标定和特征提取来修正语义错误位置,并设计语音... 传统的跨语种交互翻译机器人语义纠错方法通常是单向的,效率较低,导致识别错误率较高。为此,文章提出基于语音信号的跨语种交互翻译机器人语义纠错方法。在基础语音识别的基础上,通过交互标定和特征提取来修正语义错误位置,并设计语音信号翻译机器人的语义纠错模型,采用随时间反向传播(Backpropagation Through Time,BPTT)循环训练核验方式,以确保纠错的准确性。测试结果显示,经过3个阶段测试,选定的5段语音材料的纠错识别率成功控制在10%以下,表明基于语音信号的跨语种交互翻译机器人语义纠错方法高效,具有实际应用价值。 展开更多
关键词 语音信号 跨语种交互 交互翻译 机器人语义 语义纠错 纠错方法
下载PDF
BLIND SPEECH SEPARATION FOR ROBOTS WITH INTELLIGENT HUMAN-MACHINE INTERACTION
16
作者 Huang Yulei Ding Zhizhong +1 位作者 Dai Lirong Chen Xiaoping 《Journal of Electronics(China)》 2012年第3期286-293,共8页
Speech recognition rate will deteriorate greatly in human-machine interaction when the speaker's speech mixes with a bystander's voice. This paper proposes a time-frequency approach for Blind Source Seperation... Speech recognition rate will deteriorate greatly in human-machine interaction when the speaker's speech mixes with a bystander's voice. This paper proposes a time-frequency approach for Blind Source Seperation (BSS) for intelligent Human-Machine Interaction(HMI). Main idea of the algorithm is to simultaneously diagonalize the correlation matrix of the pre-whitened signals at different time delays for every frequency bins in time-frequency domain. The prososed method has two merits: (1) fast convergence speed; (2) high signal to interference ratio of the separated signals. Numerical evaluations are used to compare the performance of the proposed algorithm with two other deconvolution algorithms. An efficient algorithm to resolve permutation ambiguity is also proposed in this paper. The algorithm proposed saves more than 10% of computational time with properly selected parameters and achieves good performances for both simulated convolutive mixtures and real room recorded speeches. 展开更多
关键词 Blind Source Separation (BSS) Blind deconvolution speech signal processing Human-machine interaction Simultaneous diagonalization
下载PDF
Speech Encryption with Fractional Watermark
17
作者 Yan Sun Cun Zhu Qi Cui 《Computers, Materials & Continua》 SCIE EI 2022年第10期1817-1825,共9页
Research on the feature of speech and image signals are carried out from two perspectives,the time domain and the frequency domain.The speech and image signals are a non-stationary signal,so FT is not used for the non... Research on the feature of speech and image signals are carried out from two perspectives,the time domain and the frequency domain.The speech and image signals are a non-stationary signal,so FT is not used for the non-stationary characteristics of the signal.When short-term stable speech is obtained by windowing and framing the subsequent processing of the signal is completed by the Discrete Fourier Transform(DFT).The Fast Discrete Fourier Transform is a commonly used analysis method for speech and image signal processing in frequency domain.It has the problem of adjusting window size to a for desired resolution.But the Fractional Fourier Transform can have both time domain and frequency domain processing capabilities.This paper performs global processing speech encryption by combining speech with image of Fractional Fourier Transform.The speech signal is embedded watermark image that is processed by fractional transformation,and the embedded watermark has the effect of rotation and superposition,which improves the security of the speech.The paper results show that the proposed speech encryption method has a higher security level by Fractional Fourier Transform.The technology is easy to extend to practical applications. 展开更多
关键词 Fractional Fourier Transform WATERMARK speech signal processing image processing
下载PDF
耳蜗死区对不同信噪比条件下轻、中度感音神经性听力损失患者言语识别的影响
18
作者 孟超 郭倩倩 +1 位作者 吕静 陈雪清 《听力学及言语疾病杂志》 CAS CSCD 北大核心 2024年第3期228-231,共4页
目的初步探究安静及不同信噪比条件下,有、无耳蜗死区的轻、中度感音神经性听力损失患者的言语识别变化规律,以及不同测试条件下两组患者的言语识别差异。方法通过纯音测听选出符合纳入标准的轻、中度感音神经性听力损失患者44耳,采用... 目的初步探究安静及不同信噪比条件下,有、无耳蜗死区的轻、中度感音神经性听力损失患者的言语识别变化规律,以及不同测试条件下两组患者的言语识别差异。方法通过纯音测听选出符合纳入标准的轻、中度感音神经性听力损失患者44耳,采用均衡噪声阈(threshold equalizing noise,TEN)测试将患者分为非耳蜗死区组24耳和耳蜗死区组20耳。选取汉语(普通话)测听词汇表(Mandarin speech test materials,MSTMs)中单音节词言语测听词表,对两组患者在安静环境和言语谱噪声环境(SNR=6、3、0、-3 dB)下进行言语识别率测试。结果耳蜗死区组患者耳蜗死区1~2个,频率集中在3~4 kHz;两组患者不同测试环境下言语识别率均随信噪比降低而下降(P<0.05);耳蜗死区组患者在各测试条件下言语识别率得分较非耳蜗死区组低,且均有显著差异(P<0.05)。结论无论有无耳蜗死区,轻、中感音神经性听力损失患者的言语识别率均随着信噪比下降呈显著下降的趋势。在本研究各信噪比条件下耳蜗死区组患者的言语识别率均较非耳蜗死区组患者低,对于轻、中度感音神经性听力损失患者开展耳蜗死区测试是必要的。 展开更多
关键词 耳蜗死区 言语识别 均衡噪声阈测试 信噪比
下载PDF
基于OBE理念的“双线混融”教学模式改革研究——以《语音信号处理》课程为例
19
作者 董胡 童欣 +1 位作者 钟新跃 刘刚 《办公自动化》 2024年第2期42-44,共3页
本研究以《语音信号处理》课程为例,探讨基于OBE(Outcome-Based Education)理念的“双线混融”教学模式改革。传统的教学模式强调知识的传授,存在知识单一、缺乏实际应用和互动性差等问题;而OBE理念注重学生的能力培养和实践应用能力的... 本研究以《语音信号处理》课程为例,探讨基于OBE(Outcome-Based Education)理念的“双线混融”教学模式改革。传统的教学模式强调知识的传授,存在知识单一、缺乏实际应用和互动性差等问题;而OBE理念注重学生的能力培养和实践应用能力的提升。本研究提出一种基于OBE理念的“双线混融”的教学模式,在课堂教学与实践环节中相互支持,促进学生的综合能力发展。实践结果表明,采用“双线混融”教学模式后,学生的学习动力和学习效果有明显的提升。学生能够更好地理解和应用语音信号处理相关知识,提高解决问题和实践操作能力。 展开更多
关键词 OBE理念 教学模式改革 双线混融 语音信号处理
下载PDF
人工智能在音频信号处理中的应用与挑战
20
作者 武堂颖 杨璐 徐丽丽 《电声技术》 2024年第5期31-34,共4页
人工智能可以通过智能化的算法和模型处理音频信号,从而实现音频的增强、识别及转换等功能。然而,人工智能在音频处理领域的应用也面临一些挑战。首先从自动语音识别、语音合成、音频去噪与增强、情感识别与音频分析4个方面分析人工智... 人工智能可以通过智能化的算法和模型处理音频信号,从而实现音频的增强、识别及转换等功能。然而,人工智能在音频处理领域的应用也面临一些挑战。首先从自动语音识别、语音合成、音频去噪与增强、情感识别与音频分析4个方面分析人工智能在音频信号处理中的应用,其次从音频信号的复杂性和多变性、数据获取与标注问题、计算资源与效率问题以及隐私与安全问题4个方面分析人工智能在音频信号处理中面临的挑战,最后深入分析应对挑战的对策。 展开更多
关键词 人工智能 音频信号处理 语音识别
下载PDF
上一页 1 2 60 下一页 到第
使用帮助 返回顶部