In order to assist physically handicapped persons in their movements,we developed an embedded isolated word speech recognition system(ASR)applied to voice control of smart wheelchairs.However,in spite of the existence...In order to assist physically handicapped persons in their movements,we developed an embedded isolated word speech recognition system(ASR)applied to voice control of smart wheelchairs.However,in spite of the existence in the industrial market of several kinds of electric wheelchairs,the problem remains the need to manually control this device by hand via joystick;which limits their use especially by people with severe disabilities.Thus,a significant number of disabled people cannot use a standard electric wheelchair or drive it with difficulty.The proposed solution is to use the voice to control and drive the wheelchair instead of classical joysticks.The intelligent chair is equipped with an obstacle detection system consisting of ultrasonic sensors,a moving navigation algorithm and a speech acquisition and recognition module for voice control embedded in a DSP card.The ASR architecture consists of two main modules.The first one is the speech parameterization module(features extraction)and the second module is the classifier which identifies the speech and generates the control word to motors power unit.The training and recognition phases are based on Hidden Markov Models(HMM),K-means,Baum-Welch and Viterbi algorithms.The database consists of 39 isolated speaker words(13 words pronounced 3 times under different environments and conditions).The simulations are tested under Matlab environment and the real-time implementation is performed by C language with code composer studio embedded in a TMS 320 C6416 DSP kit.The results and experiments obtained gave promising recognition ratio and accuracy around 99%in clean environment.However,the system accuracy decreases considerably in noisy environments,especially for SNR values below 5 dB(in street:78%,in factory:52%).展开更多
目的评价早期嗓音康复训练对甲状腺切除术后嗓音异常及发音障碍患者的影响。方法检索Pub Med、Embase、Web of Science、Cochrane Library、CNKI、万方等数据库,搜集从建库至2023年10月25日发表的关于早期嗓音康复训练的随机对照试验,...目的评价早期嗓音康复训练对甲状腺切除术后嗓音异常及发音障碍患者的影响。方法检索Pub Med、Embase、Web of Science、Cochrane Library、CNKI、万方等数据库,搜集从建库至2023年10月25日发表的关于早期嗓音康复训练的随机对照试验,筛选、纳入文献,评价纳入文献质量。采用Rev Man 5.4软件,从嗓音障碍程度、客观声学指标、最长发声时间、声门闭合程度及嗓音总嘶哑度来评估甲状腺切除术后嗓音异常患者进行早期嗓音康复训练的有效性。结果最终共纳入9篇文献,共654名患者。结果显示,嗓音障碍程度(MD=-7.73,95%CI:-13.42,-2.03,P=0.008)、客观声学指标[基频F0(MD=18.19,95%CI:11.27,25.12,P<0.001),基频微扰Jitter(MD=-0.12,95%CI=-0.20,-0.03,P=0.006),振幅微扰Shimmer(MD=-0.51,95%CI:-0.98,-0.05,P=0.03)]、最长发声时间(MD=2.21,95%CI:0.91,3.51,P<0.001)、声门闭合程度(MD=-0.38,95%CI:-0.48,-0.29,P<0.001)、总嘶哑度(MD=-0.13,95%CI:-0.19,-0.07,P<0.001)。结论早期嗓音康复训练可改善甲状腺切除术后嗓音异常患者的发声能力、嗓音粗糙度、嘶哑度、最长发声时间、声门闭合程度及嗓音障碍程度,对声带运动产生积极作用,可改善声门间隙闭合。展开更多
针对Tri-training算法利用无标记样例时会引入噪声且限制无标记样例的利用率而导致分类性能下降的缺点,提出了AR-Tri-training(Tri-training with assistant and rich strategy)算法。提出辅助学习策略,结合富信息策略设计辅助学习器,...针对Tri-training算法利用无标记样例时会引入噪声且限制无标记样例的利用率而导致分类性能下降的缺点,提出了AR-Tri-training(Tri-training with assistant and rich strategy)算法。提出辅助学习策略,结合富信息策略设计辅助学习器,并将辅助学习器应用在Tri-training训练以及说话声识别中。实验结果表明,辅助学习器在Tri-training训练的基础上不仅降低每次迭代可能产生的误标记样例数,而且能够充分地利用无标记样例以及在验证集上的错分样例信息。从实验结果可以得出,该算法能够弥补Tri-training算法的缺点,进一步提高测试率。展开更多
A sensory substitution device (SSD) is a technology that translates information for one sensory modality, like vision, into information for use by another, like touch. Though SSDs have been in existence for over fou...A sensory substitution device (SSD) is a technology that translates information for one sensory modality, like vision, into information for use by another, like touch. Though SSDs have been in existence for over four decades, effective training techniques for their use are rarely discussed. In this paper, we compare three training strategies on a particular SSD known as the vOICe. These comparisons were conducted using a minimal but active search and localization task of luminescent discs. These studies show that an interactive training paradigm, which combines the efforts of two trainees at the same time in a tag-like game, is more effective than passive training at a computer console or active training involving search and localization of luminescent discs. This finding supports philosophical views of perception that take interaction with the environment seriously. In particular, we argue that these findings are in support of certain features of the extended mind view as proposed by Andy Clark (2008). In particular, the study suggests that when environmental conditions are responsive to one's behavioral activity, the activity itself is enhanced, as is predicted by the extended mind view.展开更多
文摘In order to assist physically handicapped persons in their movements,we developed an embedded isolated word speech recognition system(ASR)applied to voice control of smart wheelchairs.However,in spite of the existence in the industrial market of several kinds of electric wheelchairs,the problem remains the need to manually control this device by hand via joystick;which limits their use especially by people with severe disabilities.Thus,a significant number of disabled people cannot use a standard electric wheelchair or drive it with difficulty.The proposed solution is to use the voice to control and drive the wheelchair instead of classical joysticks.The intelligent chair is equipped with an obstacle detection system consisting of ultrasonic sensors,a moving navigation algorithm and a speech acquisition and recognition module for voice control embedded in a DSP card.The ASR architecture consists of two main modules.The first one is the speech parameterization module(features extraction)and the second module is the classifier which identifies the speech and generates the control word to motors power unit.The training and recognition phases are based on Hidden Markov Models(HMM),K-means,Baum-Welch and Viterbi algorithms.The database consists of 39 isolated speaker words(13 words pronounced 3 times under different environments and conditions).The simulations are tested under Matlab environment and the real-time implementation is performed by C language with code composer studio embedded in a TMS 320 C6416 DSP kit.The results and experiments obtained gave promising recognition ratio and accuracy around 99%in clean environment.However,the system accuracy decreases considerably in noisy environments,especially for SNR values below 5 dB(in street:78%,in factory:52%).
文摘目的评价早期嗓音康复训练对甲状腺切除术后嗓音异常及发音障碍患者的影响。方法检索Pub Med、Embase、Web of Science、Cochrane Library、CNKI、万方等数据库,搜集从建库至2023年10月25日发表的关于早期嗓音康复训练的随机对照试验,筛选、纳入文献,评价纳入文献质量。采用Rev Man 5.4软件,从嗓音障碍程度、客观声学指标、最长发声时间、声门闭合程度及嗓音总嘶哑度来评估甲状腺切除术后嗓音异常患者进行早期嗓音康复训练的有效性。结果最终共纳入9篇文献,共654名患者。结果显示,嗓音障碍程度(MD=-7.73,95%CI:-13.42,-2.03,P=0.008)、客观声学指标[基频F0(MD=18.19,95%CI:11.27,25.12,P<0.001),基频微扰Jitter(MD=-0.12,95%CI=-0.20,-0.03,P=0.006),振幅微扰Shimmer(MD=-0.51,95%CI:-0.98,-0.05,P=0.03)]、最长发声时间(MD=2.21,95%CI:0.91,3.51,P<0.001)、声门闭合程度(MD=-0.38,95%CI:-0.48,-0.29,P<0.001)、总嘶哑度(MD=-0.13,95%CI:-0.19,-0.07,P<0.001)。结论早期嗓音康复训练可改善甲状腺切除术后嗓音异常患者的发声能力、嗓音粗糙度、嘶哑度、最长发声时间、声门闭合程度及嗓音障碍程度,对声带运动产生积极作用,可改善声门间隙闭合。
文摘针对Tri-training算法利用无标记样例时会引入噪声且限制无标记样例的利用率而导致分类性能下降的缺点,提出了AR-Tri-training(Tri-training with assistant and rich strategy)算法。提出辅助学习策略,结合富信息策略设计辅助学习器,并将辅助学习器应用在Tri-training训练以及说话声识别中。实验结果表明,辅助学习器在Tri-training训练的基础上不仅降低每次迭代可能产生的误标记样例数,而且能够充分地利用无标记样例以及在验证集上的错分样例信息。从实验结果可以得出,该算法能够弥补Tri-training算法的缺点,进一步提高测试率。
文摘A sensory substitution device (SSD) is a technology that translates information for one sensory modality, like vision, into information for use by another, like touch. Though SSDs have been in existence for over four decades, effective training techniques for their use are rarely discussed. In this paper, we compare three training strategies on a particular SSD known as the vOICe. These comparisons were conducted using a minimal but active search and localization task of luminescent discs. These studies show that an interactive training paradigm, which combines the efforts of two trainees at the same time in a tag-like game, is more effective than passive training at a computer console or active training involving search and localization of luminescent discs. This finding supports philosophical views of perception that take interaction with the environment seriously. In particular, we argue that these findings are in support of certain features of the extended mind view as proposed by Andy Clark (2008). In particular, the study suggests that when environmental conditions are responsive to one's behavioral activity, the activity itself is enhanced, as is predicted by the extended mind view.