Journal articles: 513 results found
Prosodic Modification of Chinese Speech Based on Sinusoidal Model  Cited by: 1
1
Authors: Jiang-yang Zhou, Fang-jing Zheng, Quan Sha, Pei-gi Chai. Advances in Manufacturing (SCIE, CAS), 2000, No. 4, pp. 299-303 (5 pages)
This paper presents time-scale and pitch-scale modification of Chinese syllables based on a sinusoidal model. First, the short-term speech is decomposed into a sum of sinusoidal waves of different magnitudes and phases. Then the vocal tract system and the excitation are obtained using a homomorphic technique. Lastly, speech with the desired time scale and pitch scale is obtained by changing the frequency and phase of the excitation while the parameters of the vocal tract system are changed accordingly. The results show that the algorithm allows a large adjustable range for both pitch scale and time scale, and that it is suitable for the analysis and synthesis of Chinese speech.
Keywords: Chinese speech; sinusoidal model; pitch scale; time scale; prosodic modification
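The first step of the abstract above, representing a short frame as a sum of sinusoids, can be illustrated with a minimal sketch. It is not the authors' algorithm: it omits the homomorphic separation of vocal tract and excitation, so scaling the frequencies here also shifts the spectral envelope. The function name `synthesize` and the `pitch_scale` parameter are hypothetical.

```python
import math

def synthesize(partials, n_samples, sample_rate, pitch_scale=1.0):
    """Resynthesize one frame as a sum of sinusoids.

    partials: list of (amplitude, frequency_hz, phase_rad) tuples.
    pitch_scale: hypothetical factor applied to every frequency.
    Simplification: amplitudes are kept as-is, so the spectral envelope
    is NOT preserved (the paper adjusts vocal tract parameters for that).
    """
    out = []
    for n in range(n_samples):
        t = n / sample_rate
        sample = sum(
            a * math.cos(2.0 * math.pi * f * pitch_scale * t + p)
            for a, f, p in partials
        )
        out.append(sample)
    return out

# One 100 Hz partial sampled at 400 Hz, original pitch and one octave up.
frame = synthesize([(1.0, 100.0, 0.0)], 4, 400)
octave_up = synthesize([(1.0, 100.0, 0.0)], 4, 400, pitch_scale=2.0)
```

For a stationary frame, time-scale modification in this simplified setting amounts to requesting more or fewer samples from the same partials.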
Emotional Speech Synthesis Based on Prosodic Feature Modification  Cited by: 2
2
Authors: Ling He, Hua Huang, Margaret Lech. Engineering (Scientific Research), 2013, No. 10, pp. 73-77 (5 pages)
The synthesis of emotional speech has wide applications in human-computer interaction, medicine, industry, and other fields. In this work, an emotional speech synthesis system is proposed based on prosodic feature modification and the Time Domain Pitch Synchronous OverLap Add (TD-PSOLA) waveform concatenation algorithm. The system produces synthesized speech with four types of emotion: angry, happy, sad, and bored. The experimental results show that the proposed system performs well. The produced utterances present clear emotional expression, and the subjective test reaches high classification accuracy for the different types of synthesized emotional utterances.
Keywords: emotional speech synthesis; prosodic features; time domain pitch synchronous overlap add
The Implication of Prosodic Features' Pragmatic Functions-For Teaching Listening and Speaking
3
Author: She Guiting (佘桂婷). Overseas English (《海外英语》), 2011, No. 8X, pp. 28-30 (3 pages)
Interactive communication is not straightforward but complicated. Prosodic features play an influential role in English communication. They can be used to signal certain pragmatic purposes in real situations so that listeners and speakers reach mutual understanding. Identifying the pragmatic functions of prosodic features will facilitate the teaching of listening and speaking. English teachers need to clarify and emphasize the relationship between prosodic features and their pragmatic functions, and to work out how to integrate them into teaching so that students learn to communicate effectively.
Keywords: communication; prosodic features; pragmatic functions; listening teaching; speaking teaching
Identification of Question and Non-Question Segments in Arabic Monologues Using Prosodic Features: Novel Type-2 Fuzzy Logic and Sensitivity-Based Linear Learning Approaches
4
Authors: Sunday Olusanya Olatunji, Lahouari Cheded, Wasfi G. Al-Khatib, Omair Khan. Journal of Intelligent Learning Systems and Applications, 2013, No. 3, pp. 165-175 (11 pages)
In this paper, we extend our previous study of the important problem of automatically identifying question and non-question segments in Arabic monologues using prosodic features. We propose two novel classification approaches to this problem: one based on type-2 fuzzy logic systems (type-2 FLS) and the other on the discriminative sensitivity-based linear learning method (SBLLM). Prosodic features have been used in a plethora of practical speech-related applications, such as speaker and word recognition, emotion and accent identification, topic and sentence segmentation, and text-to-speech. In this paper, we continue to focus specifically on the Arabic language, as other languages have received a lot of attention in this regard. Moreover, we aim to improve on the performance of our previously used techniques, of which the support vector machine (SVM) method performed best, by applying the two classification approaches mentioned above. The recorded continuous speech is first segmented into sentences using both energy and time duration parameters. The prosodic features are then extracted from each sentence and fed into each of the two proposed classifiers to classify the sentence as a Question or a Non-Question. Our extensive simulation work, based on a moderately sized database, showed that the two proposed classifiers outperform SVM in all of the experiments carried out, with the type-2 FLS classifier consistently exhibiting the best performance because of its ability to handle all forms of uncertainty.
Keywords: Arabic monologues; prosodic features; type-2 fuzzy logic systems; sensitivity-based linear learning method; support vector machines
A Contrastive Analysis of the Structure of Prosodic Words in Mandarin and English
5
Author: GUO Zhongzi. Journal of Beijing International Studies University (《北京第二外国语学院学报》), 2016, No. 5, p. 137 (1 page)
This paper, focusing particularly on the pitch of prosodic words, conducts a contrastive study of the structure of prosodic words in English and Mandarin. It reports a corpus study of Mandarin monologue speech, an experimental phonetic investigation of the pitch of trisyllabic prosodic words in Mandarin monologue. In addition, taking the characteristics of English prosodic words into consideration, the paper makes a contrastive analysis of prosodic words in English and Mandarin. The study finds that the pitch of trisyllabic prosodic words in Mandarin is inevitably affected by structural factors. For the left syllable, the grammatical category, the prosodic hierarchical boundary, the position within the intonational phrase where the syllable is located, the mid syllable, and the right syllable may influence its pitch contour. For the mid syllable, the grammatical category, the left syllable, the right syllable, and the position within the intonational phrase may influence its pitch contour. For the right syllable, the prosodic hierarchical boundary where the syllable is located and the mid syllable may affect its pitch contour. Unlike previous findings based on read-speech corpora, this study shows that the mid syllable has not only dissimilatory but also assimilatory effects on the pitch of its preceding syllable. The left syllable has anticipatory effects on the onset pitch of the mid syllable, and the right syllable has coarticulation effects on the offset pitch of the mid syllable.
Keywords: prosodic words; structural factors; pitch; regressive effects; coarticulation
Prosodically Rich Speech Synthesis Interface Using Limited Data of Celebrity Voice
6
Authors: Takashi Nose, Taiki Kamei. Journal of Computer and Communications, 2016, No. 16, pp. 79-94 (16 pages)
To enhance communication between humans and robots at home in the future, speech synthesis interfaces that can generate expressive speech are indispensable. In addition, synthesizing celebrity voices is commercially important. To address these issues, this paper proposes techniques for synthesizing natural-sounding speech with a rich prosodic personality using a limited amount of data in a text-to-speech (TTS) system. As a target speaker, we chose a well-known prime minister of Japan, Shinzo Abe, who has a good prosodic personality in his speeches. To synthesize natural-sounding and prosodically rich speech, accurate phrasing, robust duration prediction, and rich intonation modeling are important. For these purposes, we propose pause position prediction based on conditional random fields (CRFs), phone-duration prediction using random forests, and mora-based emphasis context labeling. We examine the effectiveness of these techniques through objective and subjective evaluations.
Keywords: parametric speech synthesis; hidden Markov model (HMM); prosodic personality; prosody modeling; conditional random field (CRF); random forest; emphasis context
Enhancement of Voice Quality and Bit-rate Reduction by Prosodic Control in G.723.1 Vocoder
7
Author: Jong-kuk KIM. Journal of Measurement Science and Instrumentation (CAS), 2011, No. 2, pp. 179-183 (5 pages)
Speech coding techniques have been studied not only to reduce complexity and bit rate but also to improve sound quality. CELP-type vocoders, used as a standard, deliver high sound quality even at low bit rates. In this paper, the input speech is preprocessed to reduce the bit rate, unlike in a conventional vocoder. Different kinds of parameters are used for the preprocessing and compared with one another to find the most appropriate parameter for the vocoder. Because the parameters are used to synthesize the speech rather than to encode or decode it, we propose a simple algorithm that does not affect the processing time or the computation time. The preprocessing step involves speaking rate, duration, and the PSOLA technique.
Keywords: speech quality; low bit rate; vocoder; speech coding techniques; control; prosody; computation time; speech coder
Melody- Usul - Poetic Prosodic Meter Relations in Ottoman-Turkish Music
8
Author: Gozde Colakoglu SARI. Journal of Literature and Art Studies, 2015, No. 2, pp. 128-140 (13 pages)
Keywords: music culture; Turkey; UL; United States; prosody; poetic; structural engineering; history and culture
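A Poetry Generation Model Incorporating Prosodic Features
9
Authors: Wu Lindong (吴林东), He Xiangzhen (何向真), Wan Fucheng (万福成). Computer Engineering and Applications (《计算机工程与应用》; CSCD, PKU Core), 2024, No. 13, pp. 162-170 (9 pages)
Prosodic conformity and thematic consistency in poetry generation have long been research hotspots in natural language generation. To improve prosodic conformity in generated poetry, a poetry generation model that combines the Transformer with prosodic features (Transformer and prosodic features poetry generation model, TPPG) is proposed. Based on prosodic features, a level-and-oblique (ping-ze, 平仄) tonal-pattern lexicon and a level-tone rhyme lexicon are built, and ping-ze tonal-pattern encoding is introduced into the Transformer encoder, so that during training the model can capture more information about ping-ze prosodic features and learn a variety of poetic meters. Finally, the rhymes of the generated poems are regulated according to the level-tone rhyme lexicon, and the maximum a posteriori probability is used to select, from the candidates, the line that best conforms to the prosodic features, improving the overall regularity and fluency of the poems. Experimental results show that poems generated by the TPPG model conform well to prosodic rules, with improvements in both human and automatic evaluation.
Keywords: poetry generation; prosody lexicon; prosodic encoding; prosodic features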
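A Non-Autoregressive Speech Synthesis Method Combined with Lightweight Convolution
10
Authors: Zhong Qiaoxia (钟巧霞), Zeng Bi (曾碧), Lin Zhentao (林镇涛), Lin Wei (林伟). Computer Engineering and Design (《计算机工程与设计》; PKU Core), 2024, No. 4, pp. 1166-1172 (7 pages)
This work studies how to effectively capture the relationships between phonemes and how to synthesize prosodically rich audio, and proposes LCTTS, a non-autoregressive speech synthesis model combined with lightweight convolution. Lightweight convolution is introduced to establish connections between phonemes, addressing pronunciation errors. Pitch and energy predictors are added to predict the prosody of the generated speech, addressing the lack of prosody in the audio. The trained model produces mel spectrograms, which a pre-trained vocoder converts into audio. Experimental results show that the proposed LCTTS model outperforms the previously proposed SpeedySpeech model: on the Emotional Speech Database dataset, the mean opinion score improves by 2.8% and the mel-cepstral distortion decreases by 0.15.
Keywords: speech synthesis; lightweight convolution; prosody synthesis; mel-spectrogram generation; non-autoregressive method; deep learning; natural language processing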
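A Semantic and Prosodic Feature Embedding Method for Emotional Speech Synthesis
11
Authors: Shi Fan (石凡), Yang Jian (杨鉴). Information Technology (《信息技术》), 2024, No. 7, pp. 26-33 (8 pages)
To address the problem that audio synthesized by current emotional speech synthesis methods tends to ignore the semantic information of the text, the BERT pre-trained model is introduced into the text encoder to help it capture the semantic features of the text, and a semantic and prosodic feature embedding method is proposed. Because the scarcity of Burmese emotional corpora makes it difficult for a model to synthesize high-quality emotional speech, the paper explores a training approach for a Burmese emotional speech synthesis model that fine-tunes the parameters of the individual network modules. Experimental results show that the proposed feature embedding method and training method can still synthesize high-quality emotional speech when emotional corpora are scarce, with mean emotion opinion scores of 4.16 and 4.18, respectively.
Keywords: Burmese; emotional speech synthesis; semantic features; prosodic features; fine-tuning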
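Acoustic Characteristics and Communicative Realization of Prosodic Boundaries in News Broadcasting
12
Authors: Liu Wen (刘文), Chen Yanting (陈彦婷). Applied Linguistics (《语言文字应用》; PKU Core), 2024, No. 1, pp. 128-141 (14 pages)
Prosodic boundaries are important cues in spoken communicative interaction, and their perception depends heavily on acoustic cues. Taking read speech from Xinwen Lianbo (《新闻联播》) as its object of study, this paper systematically examines prosodic boundary syllables by acoustic means. The results show that prosodic boundary syllables are longer than non-boundary syllables, while their pitch and intensity are lower. In addition, the rising tone (yangping) and the dipping tone (shangsheng) at prosodic boundaries exhibit creaky voice. In terms of voice tuning, both male and female speakers show good resonance with enhanced high-frequency energy, and male speakers exhibit a speaker's formant. The paper explores the prosodic characteristics of news broadcasting, and its findings may provide a reference for guiding the teaching of broadcasting.
Keywords: news broadcasting; prosodic boundary; speech rate; voice quality; speaker's formant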
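The Effect of Prosodic Boundary Level on the Input Time Interval Between Adjacent Letters in Chinese Pinyin Input
13
Authors: Liu Weiren (柳韦任), Lian Xiangyi (连湘怡), Zhuang Xiangling (庄想灵), Ma Guojie (马国杰). Chinese Journal of Applied Psychology (《应用心理学》), 2024, No. 4, pp. 357-364 (8 pages)
Because Chinese has many homophones and pinyin input does not distinguish tones, the conversion from pinyin letters to Chinese characters is not unique, which reduces the efficiency of pinyin input methods. Based on the psychomotor processes involved in Chinese input, this study examines how prosodic boundaries affect the time interval between pinyin letter keystrokes. Two experiments were conducted, one with natural passages and one with ambiguous pinyin strings, in which participants typed the pinyin of specified Chinese texts in full-pinyin mode while the input time intervals between adjacent pinyin letters were recorded. The results show that the prosodic boundary level of the pinyin affects the input time interval between letters: the higher the level, the longer the interval.
Keywords: pinyin input; pinyin disambiguation; prosodic boundary level; ambiguous pinyin strings; speech production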
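Prosodic Realization of the Scope and Focus of Negation in Mei (“没”) Sentences  Cited by: 1
14
Authors: Huang Caiyu (黄彩玉), Zhao Yuting (赵雨婷). TCSOL Studies (《华文教学与研究》), 2023, No. 1, pp. 19-27 (9 pages)
Taking as examples negative sentences in which mei (“没”) is followed by a two-part verbal constituent with an adverbial-head structure, this paper examines the prosodic realization of the scope and focus of negation. Phonetic experiments show that the scope of negation of mei covers all the constituents that follow it, and that its most prominent projection in intonation is pitch-range compression within the scope. In negative sentences with natural focus, speakers prefer the constituent immediately following the negator as the focus of negation, which together with mei forms the joint focus of the sentence. The focus of negation and the focus of the negative sentence may be separate, or may coincide fully or partially, depending on the structure within the scope of negation and on the presence and position of emphatic focus.
Keywords: mei (“没”); scope of negation; focus of negation; prosodic realization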
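The Prosodic Domain of Tone Sandhi in the Yantai Dialect  Cited by: 1
15
Authors: Zhang Qi (张琦), Ma Qiuwu (马秋武). Linguistic Sciences (《语言科学》; PKU Core), 2023, No. 1, pp. 44-55 (12 pages)
Disyllabic tone sandhi in the Yantai dialect is contextual sandhi built on the citation tones of Northern Mandarin, whereas trisyllabic tone sandhi is templatic sandhi built on Wu-type word tones. Drawing on prosody-syntax interaction and the theory of prosodic phonology, an analysis of the prosodic domain of Yantai tone sandhi shows that the minimal prosodic phrase (ωmin), formed under the joint action of the prosody-syntax Match principles and prosodic markedness constraints, is the domain in which tone sandhi occurs. According to Match Theory, this prosodic domain is recursive. Its delimitation follows a ranking in which the prosodic markedness constraints outrank the Match constraints. Introducing constraints simplifies the way prosodic domains are defined and is of general significance for analyzing the Jiaodong dialects, of which the Yantai dialect is representative.
Keywords: prosodic domain; tone sandhi; Match Theory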
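Repeated-Rhyme Chunks of Modal Particles  Cited by: 1
16
Author: Wang Jue (王珏). Chinese Language Learning (《汉语学习》; PKU Core), 2023, No. 5, pp. 3-15 (13 pages)
The quasi-discourse modal particle a (“啊”) can only, or can also, be used repeatedly at intervals, attaching to coordinated, contrasted, and reiterated words in topics and comments. Syntactically, it regroups the items it attaches to into enumerative phrases, antithetical phrases, descriptive phrases, and affect-enhancing or mild-reproach phrases. Semantically, it assigns the regrouped phrases, in that order, a generic meaning, a meaning of long-term alternating repetition, a meaning of long-term duration, and affect-enhancing vocative and mild-reproach meanings. Prosodically, serving as the rhyme, it regroups enumerative phrases into repeated-rhyme chunks of the "equal-length" type and the other four kinds of phrases into repeated-rhyme chunks of the "long-then-short" type. Overall, a repeated-rhyme chunks are embedded in everyday spoken sentences and indirectly keep the number of sentence modules within the memory constant.
Keywords: a (“啊”); repeated-rhyme chunk; syntactic function; semantic function; prosodic function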
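The Prosodic Turn in Second Language Speech Research
17
Authors: Li Xinglian (李杏莲), Fang Xijun (方喜军), Liu Xirui (刘希瑞). Journal of Henan University of Technology (Social Science Edition) (《河南工业大学学报(社会科学版)》), 2023, No. 3, pp. 118-124 (7 pages)
Combining bibliometric visualization analysis with a traditional literature review, this paper analyzes domestic and international research trends in second language (L2) speech and examines the research topics of the international conference on L2 pronunciation teaching (PSLLT) over the past 20 years. It finds that early L2 speech research focused mainly on the segmental level, such as vowels and consonants, and that around 2010 there was a marked shift from the segmental level to the prosodic level, mainly involving comprehensibility, prosody, intonation, fluency, duration, and stress. By reviewing seven research articles in the special issue "Prosody in Second Language Acquisition" (《二语习得中的韵律》) of the leading international journal Studies in Second Language Acquisition (《二语习得研究》), the paper discusses specific practices in L2 prosody research.
Keywords: second language speech; prosodic turn; segments; acquisition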
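Sentence-Level Alignment of Imperfectly Matched Speech and Text
18
Authors: Xu Kai (徐锴), Tao Ye (陶冶), Li Hui (李辉). Computer Systems & Applications (《计算机系统应用》), 2023, No. 4, pp. 300-307 (8 pages)
Automatic speech-text alignment is widely used in speech recognition and synthesis, content production, and other fields. Its main purpose is to align speech with the corresponding reference text at the level of units such as sentences, words, and phonemes, and to obtain the timing correspondence between the speech and the reference text. Most recent state-of-the-art alignment methods are based on speech recognition. On the one hand, their accuracy is limited by recognition quality: alignment precision drops markedly when the character error rate is high. On the other hand, such methods cannot effectively align long speech recordings and texts that do not fully match. This paper proposes a speech-text alignment method based on anchors and prosodic information. The corpus is divided into aligned and unaligned segments through segment labeling based on boundary-anchor weighting. For the unaligned segments, prosodic information is extracted with a double-threshold endpoint detection method and sentence boundaries are detected, which reduces the dependence of recognition-based alignment on recognition quality. Experimental results show that, compared with current state-of-the-art recognition-based alignment methods, the proposed method improves alignment accuracy by more than 45% even at a character error rate of 0.52, and by 3% when the audio-text mismatch level is 0.5.
Keywords: speech-text alignment; prosodic information; anchors; automatic speech recognition; endpoint detection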
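The abstract above mentions double-threshold endpoint detection without giving details. As a rough illustration of the classical idea only (the paper's features, thresholds, and frame settings are not stated, so everything below is an assumption), a segment is triggered where short-time energy exceeds a high threshold and is then extended in both directions while energy stays above a low threshold. The function name `detect_segments` and its parameters are hypothetical.

```python
def detect_segments(energies, high, low):
    """Classical double-threshold detection on per-frame energies.

    Returns a list of (start, end) frame indices, end exclusive.
    Assumption: `energies` are precomputed short-time energies and
    `high` >= `low`; the paper's actual settings are not given.
    """
    if high < low:
        raise ValueError("high threshold must be >= low threshold")
    segments = []
    i, n = 0, len(energies)
    while i < n:
        if energies[i] > high:
            # Extend backward while the energy stays above the low threshold.
            start = i
            while start > 0 and energies[start - 1] > low:
                start -= 1
            # Extend forward while the energy stays above the low threshold.
            end = i
            while end < n and energies[end] > low:
                end += 1
            segments.append((start, end))
            i = end
        else:
            i += 1
    return segments

energies = [0.1, 0.3, 0.9, 0.4, 0.1, 0.1, 0.35, 0.2, 0.8, 0.1]
print(detect_segments(energies, high=0.5, low=0.25))  # [(1, 4), (8, 9)]
```

Frame 6 (energy 0.35) rises above the low threshold but never crosses the high one, so it is not reported as a segment; that is the point of using two thresholds instead of one.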
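An Optimality-Theoretic Analysis of Tone Sandhi in the Ganyu Dialect  Cited by: 1
19
Author: Gu Yiming (顾一鸣). Linguistic Sciences (《语言科学》; PKU Core), 2023, No. 1, pp. 27-43 (17 pages)
Tone sandhi in the Ganyu dialect in the absence of focus and neutral tone raises three basic questions: What are the underlying tones? What is the relevant prosodic structure? Which constraints can account for the sandhi? The article identifies three major classes of constraints: tonal prominence constraints, prosodic harmony constraints, and tonal realizability constraints. Three important aspects, namely prosodic structure, the realization of underlying targets, and the dependency of particular phonological elements, not only affect tone sandhi but also constrain the realization of intonation and of segmental-level phones.
Keywords: Optimality Theory; tone sandhi; Ganyu dialect; prosodic structure; dependency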
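Effects of Prosodic Pattern and Morpheme Positional Probability on Chinese Learners' Segmentation of Ambiguous Words
20
Authors: Lu Shiyi (鹿士义), Huang Yun (黄韵). TCSOL Studies (《华文教学与研究》), 2023, No. 1, pp. 43-51 (9 pages)
How readers segment and recognize words when reading Chinese text, and what the basic processing unit is, have long been debated. Using an online word-naming task with native Chinese speakers and advanced second language (L2) learners as participants, this paper investigates how prosodic pattern and morpheme positional probability affect the segmentation and recognition of Chinese words, and on that basis discusses the size of the basic processing unit. The study finds that prosodic factors and morpheme positional probability significantly affect word segmentation and recognition. The biggest difference between native speakers and L2 learners is that the learners' processing units are shorter, possibly because of their narrower perceptual span. The size of the basic processing unit in Chinese largely follows a "longer words first" principle and is also affected by prosodic features, morphemic features, language proficiency, and other factors.
Keywords: word segmentation; prosodic pattern; morpheme positional probability; basic processing unit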