In this paper, a new method for making the voiced/unvoiced (v/uv) decision is developed. It uses a multi-feature v/uv classification algorithm based on the cepstral peak, zero-crossing rate, and autocorrelation function (ACF) peak of short-time segments of the speech signal, analyzed with clustering methods. The v/uv classifier achieved excellent results in identifying voiced and unvoiced segments of speech.
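As a rough illustration of two of the features named in the abstract above, the sketch below computes the zero-crossing rate and the normalized ACF peak of a single frame. It is not the paper's algorithm: the cepstral peak is omitted, the clustering step is replaced by fixed thresholds, and the 8 kHz sampling rate, lag range, and threshold values are all assumptions chosen for the example.

```python
import math
import random

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def acf_peak(frame, min_lag, max_lag):
    """Largest autocorrelation value, normalized by frame energy, over the lag range."""
    energy = sum(x * x for x in frame)
    if energy == 0.0:
        return 0.0
    best = 0.0
    for lag in range(min_lag, max_lag + 1):
        r = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        best = max(best, r / energy)
    return best

def is_voiced(frame, min_lag=20, max_lag=160, zcr_max=0.25, acf_min=0.5):
    """Threshold rule standing in for the paper's clustering (assumed values).

    With an assumed 8 kHz sampling rate, lags 20..160 cover pitch
    frequencies from 400 Hz down to 50 Hz.
    """
    low_zcr = zero_crossing_rate(frame) < zcr_max
    periodic = acf_peak(frame, min_lag, max_lag) > acf_min
    return low_zcr and periodic

# Example frames (50 ms at the assumed 8 kHz rate): a 100 Hz tone and white noise.
sine_frame = [math.sin(2 * math.pi * 100 * n / 8000) for n in range(400)]
rng = random.Random(0)
noise_frame = [rng.uniform(-1.0, 1.0) for _ in range(400)]

print(is_voiced(sine_frame), is_voiced(noise_frame))
```

The periodic tone has few zero crossings and a strong ACF peak near its period, whereas the noise frame changes sign on roughly half of its samples, so the two features separate these frames under the assumed thresholds.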
Voiced reading (reading aloud) is an important way of learning English, and rhythm is the most critical factor in reading well. This article illustrates the relationship between rhythm and voiced reading, the importance of rhythm, and methods for developing a sense of rhythm.
In conventional source-filter models, voiced and unvoiced components are considered independently. In practice, however, it is difficult to separate the source into two parts: an actual source is a mixture of the two, and the ratio varies with the content or the intention of the speaker. This study investigates separating the voiced and unvoiced components under different source models. Source signals were modeled from the residual signal obtained by inverse filtering. Three different source models were assumed, and the parameters of each model were optimized against the original speech signal using a genetic algorithm. The resulting parameters were compared in terms of the mel-cepstral distance to the original signal, the spectrogram, and the spectral envelope of the synthesized signal. The optimization method achieves an improvement of 15% for the Klatt model, but there is little improvement in the modified residual case.
Unvoiced/voiced classification of speech is a challenging problem, especially at low signal-to-noise ratios or in non-white, non-stationary noise environments. To solve this problem, an algorithm for speech classification and a technique for estimating the pairwise magnitude frequency in voiced speech are proposed. The algorithm uses the third-order spectrum of the speech signal to remove noise, and uses the least spectrum difference to obtain a refined pitch and the maximum harmonic number. It also uses the spectral envelope to estimate the signal-to-noise ratio of the speech harmonics. The speech classification, the voicing probability, and the harmonic parameters of the voiced frame can then be obtained. Simulation results indicate that, under complicated background noise and especially Gaussian noise, the proposed algorithm classifies speech effectively and estimates the voicing probability and the voiced parameters with high accuracy.
Voice portrait technology has explored and established the relationship between speakers’ voices and their facial features, aiming to generate corresponding facial characteristics from the voice of an unknown speaker. Owing to their powerful advantages in image generation, Generative Adversarial Networks (GANs) have been widely applied across various fields. Existing Voice2Face methods for voice portraits are primarily based on GANs trained on voice-face paired datasets. However, voice portrait models constructed solely on GANs face limitations in image generation quality and struggle to maintain facial similarity. Additionally, the training process is relatively unstable, which affects the overall generative performance of the model. To overcome these challenges, we propose a novel deep Generative Adversarial Network model for audio-visual synthesis, named AVP-GAN (Attention-enhanced Voice Portrait Model using Generative Adversarial Network). This model is based on a convolutional attention mechanism and is capable of generating corresponding facial images from the voice of an unknown speaker. Firstly, to address training instability, we integrate convolutional neural networks with deep GANs. In the network architecture, we apply spectral normalization to constrain the variation of the discriminator, preventing issues such as mode collapse. Secondly, to enhance the model’s ability to extract relevant features between the two modalities, we propose a voice portrait model based on convolutional attention. This model learns the mapping between voice and facial features in a common space from the channel and spatial dimensions independently. Thirdly, to enhance the quality of generated faces, we incorporate a degradation removal module and use pretrained facial GANs as facial priors to repair and sharpen the generated facial images. Experimental results demonstrate that our AVP-GAN achieved a cosine similarity of 0.511, outperforming our comparison model, and effectively generated high-quality facial images corresponding to a speaker’s voice.
This article presents an exhaustive comparative investigation into the accuracy of gender identification across diverse geographical regions, employing a deep learning classification algorithm for speech signal analysis. In this study, speech samples are categorized for both training and testing purposes based on their geographical origin. Category 1 comprises speech samples from speakers outside of India, whereas Category 2 comprises live-recorded speech samples from Indian speakers. Testing speech samples are likewise classified into four distinct sets, taking into consideration both geographical origin and the language spoken by the speakers. Significantly, the results indicate a noticeable difference in gender identification accuracy among speakers from different geographical areas. Indian speakers, utilizing 52 Hindi and 26 English phonemes in their speech, demonstrate a notably higher gender identification accuracy of 85.75% compared to those speakers who predominantly use 26 English phonemes in their conversations when the system is trained using speech samples from Indian speakers. The gender identification accuracy of the proposed model reaches 83.20% when the system is trained using speech samples from speakers outside of India. In the analysis of speech signals, Mel Frequency Cepstral Coefficients (MFCCs) serve as relevant features for the speech data. The deep learning classification algorithm utilized in this research is based on a Bidirectional Long Short-Term Memory (BiLSTM) architecture within a Recurrent Neural Network (RNN) model.
Intelligent personal assistants play a pivotal role in in-vehicle systems, significantly enhancing life efficiency, driving safety, and decision-making support. In this study, the multi-modal design elements of intelligent personal assistants within the context of visual, auditory, and somatosensory interactions with drivers were discussed. Their impact on the driver’s psychological state through various modes such as visual imagery, voice interaction, and gesture interaction was explored. The study also introduced innovative designs for in-vehicle intelligent personal assistants, incorporating design principles such as driver-centricity, prioritizing passenger safety, and utilizing timely feedback as a criterion. Additionally, the study employed design methods like driver behavior research and driving situation analysis to enhance the emotional connection between drivers and their vehicles, ultimately improving driver satisfaction and trust.
Research on adaptive deformable mirror technology for voice coil actuators (VCAs) is an important trend in the development of large ground-based telescopes. A voice coil adaptive deformable mirror contains a large number of actuators, and there are problems with structural coupling and large temperature increases in their internal coils. Additionally, parameters of the traditional proportional integral derivative (PID) control cannot be adjusted in real time to adapt to system changes. These problems can be addressed by introducing fuzzy control methods. A table lookup method is adopted to replace real-time calculations of the regular fuzzy controller during the control process, and a prototype platform has been established to verify the effectiveness and robustness of this process. Experimental tests compare the control performance of traditional and fuzzy proportional integral derivative (Fuzzy-PID) controllers, showing that, in system step response tests, the fuzzy control system reduces rise time by 20.25%, decreases overshoot by 78.24%, and shortens settling time by 67.59%. In disturbance rejection experiments, fuzzy control achieves a 46.09% reduction in the maximum deviation, indicating stronger robustness. The Fuzzy-PID controller, based on table lookup, outperforms the standard controller significantly, showing excellent potential for enhancing the dynamic performance and disturbance rejection capability of the voice coil motor actuator system.
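The table lookup idea in the abstract above can be sketched as follows: a gain-adjustment surface is evaluated once offline on a coarse grid of error and error-rate levels, and the control loop then only quantizes its inputs and reads the table. This is a minimal sketch under stated assumptions, not the authors' controller: the seven-level grid, the `placeholder_kp_rule` surface, the choice to adjust only the proportional gain, and all names are hypothetical.

```python
LEVELS = 7  # assumed number of quantization levels per input

def quantize(value, limit):
    """Map a value in [-limit, limit] to a table index 0..LEVELS-1, clamping outliers."""
    clamped = max(-limit, min(limit, value))
    return round((clamped + limit) / (2 * limit) * (LEVELS - 1))

def build_table(rule):
    """Evaluate a (possibly slow) rule once per grid cell, offline."""
    return [[rule(i, j) for j in range(LEVELS)] for i in range(LEVELS)]

def placeholder_kp_rule(i, j):
    """Hypothetical surface, not from the paper: raise Kp away from the grid centre."""
    centre = (LEVELS - 1) / 2
    return 0.05 * (abs(i - centre) + abs(j - centre))

class TableLookupPID:
    """PID loop whose proportional gain is corrected by a precomputed table."""

    def __init__(self, kp, ki, kd, e_limit, de_limit, dkp_table):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.e_limit, self.de_limit = e_limit, de_limit
        self.dkp_table = dkp_table
        self.integral = 0.0
        self.prev_error = 0.0

    def step(self, error, dt):
        de = (error - self.prev_error) / dt
        # Online cost is two quantizations and one table read, no fuzzy inference.
        row = quantize(error, self.e_limit)
        col = quantize(de, self.de_limit)
        kp = self.kp + self.dkp_table[row][col]
        self.integral += error * dt
        self.prev_error = error
        return kp * error + self.ki * self.integral + self.kd * de

table = build_table(placeholder_kp_rule)
controller = TableLookupPID(kp=1.0, ki=0.0, kd=0.0, e_limit=1.0,
                            de_limit=10.0, dkp_table=table)
print(controller.step(error=1.0, dt=0.1))
```

In a real design the table entries would come from evaluating the fuzzy rule base and defuzzification offline; the placeholder rule here only keeps the example self-contained.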
Contemporary Social Sciences publishes outstanding research in the field of social sciences in China and also includes high-quality research work by foreign scholars on the development of China’s western regions and its reform and opening up. The aim is to help promote China’s academic achievements to the world and give China a stronger voice in the global community of social sciences.
While Bronze Age Proto-Sinaitic and Proto-Canaanite syllabic inscriptions were found engraved on fragments of pottery and stone, evidence of early alphabetic script was also inscribed in ink onto a massive parchment scroll, known as the Torah. Although the contours of those original characters transformed over time, it took the clairvoyant genius of Moses, and later the scribes of Ancient Israel, to configure and adapt ancient Semitic prototypes into phonetic letters, producing the greatest literary document in the history of the world, the Bible. This article summarizes the acoustic properties of that alphabet, with further historical considerations.
For all its different forms, democracy is expected to promote people’s well-being, instead of being weaponized to justify hegemony, as democracy is also a principle of global governance.
As China celebrates the 40th anniversary of joining the International Atomic Energy Agency, an expert talks about how her organization is nurturing talents for overseas projects.
On September 4, the "Strengthening Youth Exchange to Build a Friendly Future" China-Laos Youth Dialogue was held at the National University of Laos (NUOL) in Vientiane with an aim to consolidate bilateral friendship, promote the building of a China-Laos community with a shared future, and strengthen exchanges and cooperation between the youth of both countries.
The African Union's (AU) admission as a new G20 member is a diplomatic breakthrough and a major step towards a more balanced world order in favour of the developing nations. In September, the G20 Summit in Delhi, India, accepted the AU as its new member, giving the continent a greater voice in global economic affairs.
This study examines vishing, a form of social engineering scam using voice communication to deceive individuals into revealing sensitive information or losing money. With the rise of smartphone usage, people are more susceptible to vishing attacks. The proposed Emoti-Shing model analyzes potential victims’ emotions using Hidden Markov Models to track vishing scams by examining the emotional content of phone call audio conversations. This approach aims to detect vishing scams using biological features of humans, specifically emotions, which cannot be easily masked or spoofed. Experimental results on 30 generated emotions indicate the potential for increased vishing scam detection through this approach.
Essential vocabulary. I. Ways of "saying" something softly: 1. whisper: to say in a low voice (usually whispering in someone's ear or speaking in hushed tones); 2. murmur/mutter: to mumble to oneself (usually low speech that others cannot easily hear); 3. moan/groan/complain/grumble: to say complainingly; 4. mumble/grunt: to mutter indistinctly; 5. gossip: to talk about other people's private affairs; 6. breathe: to say quietly; 7. sigh: to say with a sigh; 8. in a soft/gentle/mild tone/voice: to say in a gentle tone; 9. in a whisper: to say in a low voice.
Introduction: Claire Conceison is one of the leading figures in the research field of contemporary Chinese theater. She has multiple roles: scholar, translator, and director. She is a professor of Chinese Culture and Theater Arts at MIT and the author of two books: Significant Other: Staging the American in China (2004) and Voices Carry: Behind Bars and Backstage during China’s Revolution and Reform (2009).
With ongoing economic, scientific, and technological developments, the electronic devices used in daily life are developing toward precision and miniaturization, and so the demand for high-precision manufacturing machinery is expanding. The most important piece of equipment in modern high-precision manufacturing is the macro-micro motion platform (M3P), which offers high speed, precision, and efficiency and has macro-micro motion coupling characteristics due to its mechanical design and the composition of its driving components. Therefore, the design of the control system is crucial for the overall precision of the platform; conventional proportional–integral–derivative control cannot meet the system requirements, and so M3Ps are the subject of a growing range of modern control strategies. This paper begins by describing the development history of M3Ps, followed by their platform structure and motion control system components, and then in-depth assessments of the macro, micro, and macro-micro control systems. In addition to examining the advantages and disadvantages of current macro-micro motion control, recent technological breakthroughs are noted. Finally, based on existing problems, future directions for M3P control systems are given, and the present conclusions offer guidelines for future work on M3Ps.
Voice classification is important in creating more intelligent systems that help with student exams, identifying criminals, and security systems. The main aim of the research is to develop a system able to predict and classify gender, age, and accent. To this end, a new system called Classifying Voice Gender, Age, and Accent (CVGAA) is proposed. Backpropagation and bagging algorithms are designed to improve voice recognition systems that incorporate sensory voice features, such as rhythm-based features, used to train the device to distinguish between the two gender categories. The proposed system has high precision compared to other algorithms used for this problem: on the gender identification data, the adaptive backpropagation algorithm had an accuracy of 98% and the bagging algorithm had an accuracy of 98.10%. Bagging has the best accuracy among all algorithms, with 55.39% accuracy for age classification on the voice common dataset and 78.94% accuracy for accent classification on a speech accent dataset.
Funding (source-filter model study): supported by the Second Stage of Brain Korea 21 Projects.
Funding (AVP-GAN voice portrait study): the Double First-Class Innovation Research Project for People’s Public Security University of China (No. 2023SYL08).
Funding (voice coil deformable mirror study): supported by the National Key R&D Program of China (2022YFA1603001, 2021YFC2801402), the National Natural Science Foundation of China (12073053), and the Science and Technology Plan of Inner Mongolia (2021GG0245).
Funding (macro-micro motion platform review): this research was supported financially by the China Postdoctoral Science Foundation, the National Natural Science Foundation of China (Grant No. 51705132), the Young Backbone Teacher Training Program in Henan University of Technology, the Education Department of Henan Province Natural Science Project (Grant No. 21A460006), and the Natural Science Project of Henan Provincial Department of Science and Technology (Grant No. 222102220088).