期刊文献+
共找到614篇文章
< 1 2 31 >
每页显示 20 50 100
VOICINGDECISIONUSINGCONTINUOUSNONLINEARNETWORK
1
作者 周志杰 胡光锐 李群 《Journal of Shanghai Jiaotong university(Science)》 EI 1998年第2期50-53,共4页
A voicing decision algorithm using continuous nonlinear network is discussed. A five dimensional feature vector is used to describe the voicing characteristic of speech segment, and a continuous network is trained wi... A voicing decision algorithm using continuous nonlinear network is discussed. A five dimensional feature vector is used to describe the voicing characteristic of speech segment, and a continuous network is trained with a gradient descent algorithm is served as the voicing decision maker. Computer simulation shows that this algorithm is an outperform way to make voicing decision. The correct rate of this method reaches 97.8%. 展开更多
关键词 SPEECH processing NEURAL network voicing DECISION PITCH EXTRACTION
下载PDF
Attention-Enhanced Voice Portrait Model Using Generative Adversarial Network
2
作者 Jingyi Mao Yuchen Zhou +3 位作者 YifanWang Junyu Li Ziqing Liu Fanliang Bu 《Computers, Materials & Continua》 SCIE EI 2024年第4期837-855,共19页
Voice portrait technology has explored and established the relationship between speakers’ voices and their facialfeatures, aiming to generate corresponding facial characteristics by providing the voice of an unknown ... Voice portrait technology has explored and established the relationship between speakers’ voices and their facialfeatures, aiming to generate corresponding facial characteristics by providing the voice of an unknown speaker.Due to its powerful advantages in image generation, Generative Adversarial Networks (GANs) have now beenwidely applied across various fields. The existing Voice2Face methods for voice portraits are primarily based onGANs trained on voice-face paired datasets. However, voice portrait models solely constructed on GANs facelimitations in image generation quality and struggle to maintain facial similarity. Additionally, the training processis relatively unstable, thereby affecting the overall generative performance of the model. To overcome the abovechallenges,wepropose a novel deepGenerativeAdversarialNetworkmodel for audio-visual synthesis, namedAVPGAN(Attention-enhanced Voice Portrait Model using Generative Adversarial Network). This model is based ona convolutional attention mechanism and is capable of generating corresponding facial images from the voice ofan unknown speaker. Firstly, to address the issue of training instability, we integrate convolutional neural networkswith deep GANs. In the network architecture, we apply spectral normalization to constrain the variation of thediscriminator, preventing issues such as mode collapse. Secondly, to enhance the model’s ability to extract relevantfeatures between the two modalities, we propose a voice portrait model based on convolutional attention. Thismodel learns the mapping relationship between voice and facial features in a common space from both channeland spatial dimensions independently. Thirdly, to enhance the quality of generated faces, we have incorporated adegradation removal module and utilized pretrained facial GANs as facial priors to repair and enhance the clarityof the generated facial images. Experimental results demonstrate that our AVP-GAN achieved a cosine similarity of0.511, outperforming the performance of our comparison model, and effectively achieved the generation of highqualityfacial images corresponding to a speaker’s voice. 展开更多
关键词 Cross-modal generation GANs voice portrait technology face synthesis
下载PDF
Classification research of TCM pulse conditions based on multi-label voice analysis
3
作者 Haoran Shen Junjie Cao +5 位作者 Lin Zhang Jing Li Jianghong Liu Zhiyuan Chu Shifeng Wang Yanjiang Qiao 《Journal of Traditional Chinese Medical Sciences》 CAS 2024年第2期172-179,共8页
Objective:To explore the feasibility of remotely obtaining complex information on traditional Chinese medicine(TCM)pulse conditions through voice signals.Methods: We used multi-label pulse conditions as the entry poin... Objective:To explore the feasibility of remotely obtaining complex information on traditional Chinese medicine(TCM)pulse conditions through voice signals.Methods: We used multi-label pulse conditions as the entry point and modeled and analyzed TCM pulse diagnosis by combining voice analysis and machine learning.Audio features were extracted from voice recordings in the TCM pulse condition dataset.The obtained features were combined with information from tongue and facial diagnoses.A multi-label pulse condition voice classification DNN model was built using 10-fold cross-validation,and the modeling methods were validated using publicly available datasets.Results: The analysis showed that the proposed method achieved an accuracy of 92.59%on the public dataset.The accuracies of the three single-label pulse manifestation models in the test set were 94.27%,96.35%,and 95.39%.The absolute accuracy of the multi-label model was 92.74%.Conclusion: Voice data analysis may serve as a remote adjunct to the TCM diagnostic method for pulse condition assessment. 展开更多
关键词 Pulse conditions TCM pulse diagnosis Voice analysis Multi-label classification Machine learning
下载PDF
Comprehensive Analysis of Gender Classification Accuracy across Varied Geographic Regions through the Application of Deep Learning Algorithms to Speech Signals
4
作者 Abhishek Singhal Devendra Kumar Sharma 《Computer Systems Science & Engineering》 2024年第3期609-625,共17页
This article presents an exhaustive comparative investigation into the accuracy of gender identification across diverse geographical regions,employing a deep learning classification algorithm for speech signal analysi... This article presents an exhaustive comparative investigation into the accuracy of gender identification across diverse geographical regions,employing a deep learning classification algorithm for speech signal analysis.In this study,speech samples are categorized for both training and testing purposes based on their geographical origin.Category 1 comprises speech samples from speakers outside of India,whereas Category 2 comprises live-recorded speech samples from Indian speakers.Testing speech samples are likewise classified into four distinct sets,taking into consideration both geographical origin and the language spoken by the speakers.Significantly,the results indicate a noticeable difference in gender identification accuracy among speakers from different geographical areas.Indian speakers,utilizing 52 Hindi and 26 English phonemes in their speech,demonstrate a notably higher gender identification accuracy of 85.75%compared to those speakers who predominantly use 26 English phonemes in their conversations when the system is trained using speech samples from Indian speakers.The gender identification accuracy of the proposed model reaches 83.20%when the system is trained using speech samples from speakers outside of India.In the analysis of speech signals,Mel Frequency Cepstral Coefficients(MFCCs)serve as relevant features for the speech data.The deep learning classification algorithm utilized in this research is based on a Bidirectional Long Short-Term Memory(BiLSTM)architecture within a Recurrent Neural Network(RNN)model. 展开更多
关键词 Deep learning recurrent neural network voice signal mel frequency cepstral coefficients geographical area GENDER
下载PDF
Research on Multi-modal In-Vehicle Intelligent Personal Assistant Design
5
作者 WANG Jia-rou TANG Cheng-xin SHUAI Liang-ying 《印刷与数字媒体技术研究》 CAS 北大核心 2024年第4期136-146,共11页
Intelligent personal assistants play a pivotal role in in-vehicle systems,significantly enhancing life efficiency,driving safety,and decision-making support.In this study,the multi-modal design elements of intelligent... Intelligent personal assistants play a pivotal role in in-vehicle systems,significantly enhancing life efficiency,driving safety,and decision-making support.In this study,the multi-modal design elements of intelligent personal assistants within the context of visual,auditory,and somatosensory interactions with drivers were discussed.Their impact on the driver’s psychological state through various modes such as visual imagery,voice interaction,and gesture interaction were explored.The study also introduced innovative designs for in-vehicle intelligent personal assistants,incorporating design principles such as driver-centricity,prioritizing passenger safety,and utilizing timely feedback as a criterion.Additionally,the study employed design methods like driver behavior research and driving situation analysis to enhance the emotional connection between drivers and their vehicles,ultimately improving driver satisfaction and trust. 展开更多
关键词 Intelligent personal assistants Multi-modal design User psychology In-vehicle interaction Voice interaction Emotional design
下载PDF
Fuzzy Proportional Integral Derivative control of a voice coil actuator system for adaptive deformable mirrors
6
作者 Ziqiang Cui Heng Zuo +4 位作者 Weikang Qiao Hao Li Fujia Du Yifan Wang Jinrui Guo 《Astronomical Techniques and Instruments》 CSCD 2024年第3期179-186,共8页
Research on adaptive deformable mirror technology for voice coil actuators(VCAs)is an important trend in the development of large ground-based telescopes.A voice coil adaptive deformable mirror contains a large number... Research on adaptive deformable mirror technology for voice coil actuators(VCAs)is an important trend in the development of large ground-based telescopes.A voice coil adaptive deformable mirror contains a large number of actuators,and there are problems with structural coupling and large temperature increases in their internal coils.Additionally,parameters of the traditional proportional integral derivative(PID)control cannot be adjusted in real-time to adapt to system changes.These problems can be addressed by introducing fuzzy control methods.A table lookup method is adopted to replace real-time calculations of the regular fuzzy controller during the control process,and a prototype platform has been established to verify the effectiveness and robustness of this process.Experimental tests compare the control performance of traditional and fuzzy proportional integral derivative(Fuzzy-PID)controllers,showing that,in system step response tests,the fuzzy control system reduces rise time by 20.25%,decreases overshoot by 78.24%,and shortens settling time by 67.59%.In disturbance rejection experiments,fuzzy control achieves a 46.09%reduction in the maximum deviation,indicating stronger robustness.The Fuzzy-PID controller,based on table lookup,outperforms the standard controller significantly,showing excellent potential for enhancing the dynamic performance and disturbance rejection capability of the voice coil motor actuator system. 展开更多
关键词 Adaptive optics Deformable mirror Voice coil actuator Fuzzy control
下载PDF
Aims & Scope
7
《Contemporary Social Sciences》 2024年第3期F0002-F0002,共1页
Contemporary Social Sciences publishes outstanding research in the field of social sciences in China and also includes high-quality research work by foreign scholars on the development of China’s western regions and ... Contemporary Social Sciences publishes outstanding research in the field of social sciences in China and also includes high-quality research work by foreign scholars on the development of China’s western regions and its reform and opening up. The aim is to help promote China’s academic achievements to the world and give China a stronger voice in the global community of social sciences. 展开更多
关键词 outstanding VOICE REFORM
下载PDF
Analysis of Smooth Cepstral Peak Prominence in Hypokinetic Dysarthria Associated With Parkinson’s Disease
8
作者 Qiang LI Abigail WALLACE +4 位作者 Wesley DAVIS Beau ROTH Laura LANGHOFER Shalini NARAYANA Michael CANNITO 《Chinese Journal of Applied Linguistics》 2024年第4期657-669,688,共14页
Smoothed cepstral peak prominence(CPPs)is a measurement of the distance from the prominent cepstral peak to the linear regression line directly beneath it.Variations of CPPs data acquisition and analysis lead to the c... Smoothed cepstral peak prominence(CPPs)is a measurement of the distance from the prominent cepstral peak to the linear regression line directly beneath it.Variations of CPPs data acquisition and analysis lead to the complexity of the clinical cut-off values,and there are no agreeable values for a specific voice disorder,such as hypokinetic dysarthria associated with Parkinson’s disease(PD).This study examined the CPPs in people with hypokinetic dysarthria associated with PD compared with healthy participants.Results demonstrated significant differences in speech tasks of sustained vowel and connected speech,with CPPs of connected speech more sensitive to dysphonia and gender difference in PD participants.Males in PD participants presented higher CPPs for sustained vowels and lower CPPs for connected speech than females.It is implied that a consistent clinical application protocol is necessary,and multiple acoustic measures are needed to ensure the accuracy of clinical decisions. 展开更多
关键词 cepstral peak prominence hypokinetic dysarthria VOICE Parkinson’s disease motor speech disorders
下载PDF
Letters of the Hebrew Alphabet:As Sound Notation
9
作者 Max Stern 《Journal of Literature and Art Studies》 2024年第5期365-368,共4页
While Bronze Age Proto-Sinaic and Proto-Canaanite syllabic inscriptions were found engraved on fragments of pottery and stone,evidence of early alphabetic script was also inscribed in ink onto a massive parchment scro... While Bronze Age Proto-Sinaic and Proto-Canaanite syllabic inscriptions were found engraved on fragments of pottery and stone,evidence of early alphabetic script was also inscribed in ink onto a massive parchment scroll,known as the Torah.Albeit the contours of those original characters transformed over time,it took the clairvoyant genius of Moses,and later the scribes of Ancient Israel,to configure and adapt ancient semitic prototypes into phonetic letters,producing the greatest literary document in the history of the world,the Bible.This article summarizes the acoustic properties of that alphabet,with further historical considerations. 展开更多
关键词 TORAH sacred voice VOWELS cantillation chironomy music notation ARTICULATION dynamic emphasis
下载PDF
All Voices Should Be Heard and Heeded in a True Democracy
10
作者 ZHANG HUI 《China Today》 2024年第5期46-49,共4页
For all its different forms,democracy is expected to promote people’s well-being,instead of being weaponized to justify hegemony,as democracy is also a principle of global governance.
关键词 TRUE VOICE SHOULD
下载PDF
Raising the Chinese Voice
11
作者 MENG JIAXIN 《China Today》 2024年第10期74-75,共2页
As China celebrates the 40th anniversary of joining the International Atomic Energy Agency,an expert talks about how her organization is nurturing talents for overseas projects.
关键词 VOICE OVERSEAS ANNIVERSARY
下载PDF
Young Voices for a Better Future
12
作者 Guo Xixian Chang Xiang 《China Report ASEAN》 2024年第10期58-59,共2页
On September 4,the"Strengthening Youth Exchange to Build a Friendly Future"China-Laos Youth Dialogue was held at the National University of Laos (NUOL) in Vientiane with an aim to consolidate bilateral frien... On September 4,the"Strengthening Youth Exchange to Build a Friendly Future"China-Laos Youth Dialogue was held at the National University of Laos (NUOL) in Vientiane with an aim to consolidate bilateral friendship,promote the building of a ChinaLaos community with a shared future,and strengthen exchanges and cooperation between the youth of both countries. 展开更多
关键词 VOICE YOUTH FUTURE
下载PDF
Greater Voice for Global South
13
作者 MAHASHA RAMPEDI 《ChinAfrica》 2024年第1期37-37,共1页
The African Union's(AU)admission as a new G20 member is a diplomatic breakthrough and a major step towards a more balanced world order in favour of the developing nations.In September,the G20 Summit in Delhi,India... The African Union's(AU)admission as a new G20 member is a diplomatic breakthrough and a major step towards a more balanced world order in favour of the developing nations.In September,the G20 Summit in Delhi,India,accepted the AU as its new member,giving the continent a greater voice in the global economic affairs. 展开更多
关键词 BREAKTHROUGH VOICE AFFAIRS
下载PDF
Emoti-Shing: Detecting Vishing Attacks by Learning Emotion Dynamics through Hidden Markov Models
14
作者 Virgile Simé Nyassi Franklin Tchakounté +3 位作者 Blaise Omer Yenké Duplex Elvis Houpa Danga Magnuss Dufe Ngoran Jean Louis Kedieng Ebongue Fendji 《Journal of Intelligent Learning Systems and Applications》 2024年第3期274-315,共42页
This study examines vishing, a form of social engineering scam using voice communication to deceive individuals into revealing sensitive information or losing money. With the rise of smartphone usage, people are more ... This study examines vishing, a form of social engineering scam using voice communication to deceive individuals into revealing sensitive information or losing money. With the rise of smartphone usage, people are more susceptible to vishing attacks. The proposed Emoti-Shing model analyzes potential victims’ emotions using Hidden Markov Models to track vishing scams by examining the emotional content of phone call audio conversations. This approach aims to detect vishing scams using biological features of humans, specifically emotions, which cannot be easily masked or spoofed. Experimental results on 30 generated emotions indicate the potential for increased vishing scam detection through this approach. 展开更多
关键词 Social Engineering Hidden Markov Model Vishing Voice Mining
下载PDF
《夏洛的网》中的“说”
15
作者 王慧琴 《疯狂英语(新悦读)》 2024年第7期52-61,77,78,共12页
必备好词一、轻声“说”1.whisper低声说出(常指耳语、窃窃私语)2.murmur/mutter喃喃自语(多指别人不易听到的低语)3.moan/groan/complain/grumble抱怨着说4.mumble/grunt咕哝5.gossip(对别人的隐私)说长道短6.breathe低声说7.sigh叹着... 必备好词一、轻声“说”1.whisper低声说出(常指耳语、窃窃私语)2.murmur/mutter喃喃自语(多指别人不易听到的低语)3.moan/groan/complain/grumble抱怨着说4.mumble/grunt咕哝5.gossip(对别人的隐私)说长道短6.breathe低声说7.sigh叹着气说8.in a soft/gentle/mild tone/voice用温柔的语气说9.in a whisper低声说。 展开更多
关键词 GENTLE 《夏洛的网》 VOICE
下载PDF
康开丽:向西方介绍中国当代戏剧
16
作者 崔潇月 康开丽 《国际比较文学(中英文)》 2023年第4期184-188,共5页
Introduction Claire Conceison is one of the leading figures in the research field of contemporary Chinese theater. She has multiple roles: a scholar, translator, and director. She is a professor of Chinese Culture and... Introduction Claire Conceison is one of the leading figures in the research field of contemporary Chinese theater. She has multiple roles: a scholar, translator, and director. She is a professor of Chinese Culture and Theater Arts at MIT and the author of two books——Significant Other: Staging the American in China(2004), Voices Carry: Behind Bars and Backstage during China’s Revolution and Reform(2009). 展开更多
关键词 TRANSLATOR VOICE CONTEMPORARY
下载PDF
Research trends in methods for controlling macro-micro motion platforms 被引量:2
17
作者 Lufan Zhang Pengqi Zhang +1 位作者 Boshi Jiang Heng Yan 《Nanotechnology and Precision Engineering》 EI CAS CSCD 2023年第3期64-78,共15页
With ongoing economic,scientific,and technological developments,the electronic devices used in daily lives are developing toward precision and miniaturization,and so the demand for high-precision manufacturing machine... With ongoing economic,scientific,and technological developments,the electronic devices used in daily lives are developing toward precision and miniaturization,and so the demand for high-precision manufacturing machinery is expanding.The most important piece of equipment in modern high-precision manufacturing is the macro-micro motion platform(M3P),which offers high speed,precision,and efficiency and has macro-micro motion coupling characteristics due to its mechanical design and composition of its driving components.Therefore,the design of the control system is crucial for the overall precision of the platform;conventional proportional–integral–derivative control cannot meet the system requirements,and so M3Ps are the subject of a growing range of modern control strategies.This paper begins by describing the development history of M3Ps,followed by their platform structure and motion control system components,and then in-depth assessments of the macro,micro,and macro-micro control systems.In addition to examining the advantages and disadvantages of current macro-micro motion control,recent technological breakthroughs are noted.Finally,based on existing problems,future directions for M3P control systems are given,and the present conclusions offer guidelines for future work on M3Ps. 展开更多
关键词 Macro-micro motion platform Precision positioning Control method Piezoelectric driver Voice coil motor
下载PDF
Estill Voice Training在流行音乐演唱中的应用——技巧和效果探究 被引量:1
18
作者 宇文荧彬 《黄河之声》 2023年第13期132-135,共4页
本研究旨在探究Estill Voice Training在流行音乐演唱中的应用,以揭示其在提升歌手演唱技巧和效果方面的潜力。通过对Estill Voice Training的核心原理和技巧进行阐述,并结合实际案例和研究数据,探讨其在流行音乐领域的实际应用价值。... 本研究旨在探究Estill Voice Training在流行音乐演唱中的应用,以揭示其在提升歌手演唱技巧和效果方面的潜力。通过对Estill Voice Training的核心原理和技巧进行阐述,并结合实际案例和研究数据,探讨其在流行音乐领域的实际应用价值。本文旨在为流行音乐演唱者、声乐教育者和音乐产业相关人士提供借鉴,促进流行音乐演唱技术的进步和发展。 展开更多
关键词 Estill Voice Training 流行音乐演唱 演唱技巧 声乐训练
下载PDF
Age and Gender Classification Using Backpropagation and Bagging Algorithms
19
作者 Ammar Almomani Mohammed Alweshah +6 位作者 Waleed Alomoush Mohammad Alauthman Aseel Jabai Anwar Abbass Ghufran Hamad Meral Abdalla Brij B.Gupta 《Computers, Materials & Continua》 SCIE EI 2023年第2期3045-3062,共18页
Voice classification is important in creating more intelligent systems that help with student exams,identifying criminals,and security systems.The main aim of the research is to develop a system able to predicate and ... Voice classification is important in creating more intelligent systems that help with student exams,identifying criminals,and security systems.The main aim of the research is to develop a system able to predicate and classify gender,age,and accent.So,a newsystem calledClassifyingVoice Gender,Age,and Accent(CVGAA)is proposed.Backpropagation and bagging algorithms are designed to improve voice recognition systems that incorporate sensory voice features such as rhythm-based features used to train the device to distinguish between the two gender categories.It has high precision compared to other algorithms used in this problem,as the adaptive backpropagation algorithm had an accuracy of 98%and the Bagging algorithm had an accuracy of 98.10%in the gender identification data.Bagging has the best accuracy among all algorithms,with 55.39%accuracy in the voice common dataset and age classification and accent accuracy in a speech accent of 78.94%. 展开更多
关键词 Classify voice gender ACCENT age bagging algorithms back propagation algorithms AI classifiers
下载PDF
The Dilemma of Women’s Voices in the Post-Epidemic Era:Misconceptions of Feminism on Digital Media Platforms-The Example of Sina Weibo
20
作者 LIU Yuewen 《Psychology Research》 2023年第2期88-103,共16页
Weibo,one of China’s largest digital media platforms,has become a major platform for women’s voices to fight for equality.However,misconceptions of feminism on Weibo have become obstacles to women’s voices,for whic... Weibo,one of China’s largest digital media platforms,has become a major platform for women’s voices to fight for equality.However,misconceptions of feminism on Weibo have become obstacles to women’s voices,for which the platforms did not post women’s views prominently.From the perspective of women themselves,this paper adopted a questionnaire to study the misunderstanding of feminism and its impact on women’s expression on Weibo. 展开更多
关键词 FEMINISM women’s voices digital media platforms the post-epidemic era
下载PDF
上一页 1 2 31 下一页 到第
使用帮助 返回顶部