Voice portrait technology has explored and established the relationship between speakers’ voices and their facialfeatures, aiming to generate corresponding facial characteristics by providing the voice of an unknown ...Voice portrait technology has explored and established the relationship between speakers’ voices and their facialfeatures, aiming to generate corresponding facial characteristics by providing the voice of an unknown speaker.Due to its powerful advantages in image generation, Generative Adversarial Networks (GANs) have now beenwidely applied across various fields. The existing Voice2Face methods for voice portraits are primarily based onGANs trained on voice-face paired datasets. However, voice portrait models solely constructed on GANs facelimitations in image generation quality and struggle to maintain facial similarity. Additionally, the training processis relatively unstable, thereby affecting the overall generative performance of the model. To overcome the abovechallenges,wepropose a novel deepGenerativeAdversarialNetworkmodel for audio-visual synthesis, namedAVPGAN(Attention-enhanced Voice Portrait Model using Generative Adversarial Network). This model is based ona convolutional attention mechanism and is capable of generating corresponding facial images from the voice ofan unknown speaker. Firstly, to address the issue of training instability, we integrate convolutional neural networkswith deep GANs. In the network architecture, we apply spectral normalization to constrain the variation of thediscriminator, preventing issues such as mode collapse. Secondly, to enhance the model’s ability to extract relevantfeatures between the two modalities, we propose a voice portrait model based on convolutional attention. Thismodel learns the mapping relationship between voice and facial features in a common space from both channeland spatial dimensions independently. Thirdly, to enhance the quality of generated faces, we have incorporated adegradation removal module and utilized pretrained facial GANs as facial priors to repair and enhance the clarityof the generated facial images. Experimental results demonstrate that our AVP-GAN achieved a cosine similarity of0.511, outperforming the performance of our comparison model, and effectively achieved the generation of highqualityfacial images corresponding to a speaker’s voice.展开更多
The African Union's(AU)admission as a new G20 member is a diplomatic breakthrough and a major step towards a more balanced world order in favour of the developing nations.In September,the G20 Summit in Delhi,India...The African Union's(AU)admission as a new G20 member is a diplomatic breakthrough and a major step towards a more balanced world order in favour of the developing nations.In September,the G20 Summit in Delhi,India,accepted the AU as its new member,giving the continent a greater voice in the global economic affairs.展开更多
Introduction Claire Conceison is one of the leading figures in the research field of contemporary Chinese theater. She has multiple roles: a scholar, translator, and director. She is a professor of Chinese Culture and...Introduction Claire Conceison is one of the leading figures in the research field of contemporary Chinese theater. She has multiple roles: a scholar, translator, and director. She is a professor of Chinese Culture and Theater Arts at MIT and the author of two books——Significant Other: Staging the American in China(2004), Voices Carry: Behind Bars and Backstage during China’s Revolution and Reform(2009).展开更多
With ongoing economic,scientific,and technological developments,the electronic devices used in daily lives are developing toward precision and miniaturization,and so the demand for high-precision manufacturing machine...With ongoing economic,scientific,and technological developments,the electronic devices used in daily lives are developing toward precision and miniaturization,and so the demand for high-precision manufacturing machinery is expanding.The most important piece of equipment in modern high-precision manufacturing is the macro-micro motion platform(M3P),which offers high speed,precision,and efficiency and has macro-micro motion coupling characteristics due to its mechanical design and composition of its driving components.Therefore,the design of the control system is crucial for the overall precision of the platform;conventional proportional–integral–derivative control cannot meet the system requirements,and so M3Ps are the subject of a growing range of modern control strategies.This paper begins by describing the development history of M3Ps,followed by their platform structure and motion control system components,and then in-depth assessments of the macro,micro,and macro-micro control systems.In addition to examining the advantages and disadvantages of current macro-micro motion control,recent technological breakthroughs are noted.Finally,based on existing problems,future directions for M3P control systems are given,and the present conclusions offer guidelines for future work on M3Ps.展开更多
Voice classification is important in creating more intelligent systems that help with student exams,identifying criminals,and security systems.The main aim of the research is to develop a system able to predicate and ...Voice classification is important in creating more intelligent systems that help with student exams,identifying criminals,and security systems.The main aim of the research is to develop a system able to predicate and classify gender,age,and accent.So,a newsystem calledClassifyingVoice Gender,Age,and Accent(CVGAA)is proposed.Backpropagation and bagging algorithms are designed to improve voice recognition systems that incorporate sensory voice features such as rhythm-based features used to train the device to distinguish between the two gender categories.It has high precision compared to other algorithms used in this problem,as the adaptive backpropagation algorithm had an accuracy of 98%and the Bagging algorithm had an accuracy of 98.10%in the gender identification data.Bagging has the best accuracy among all algorithms,with 55.39%accuracy in the voice common dataset and age classification and accent accuracy in a speech accent of 78.94%.展开更多
Weibo,one of China’s largest digital media platforms,has become a major platform for women’s voices to fight for equality.However,misconceptions of feminism on Weibo have become obstacles to women’s voices,for whic...Weibo,one of China’s largest digital media platforms,has become a major platform for women’s voices to fight for equality.However,misconceptions of feminism on Weibo have become obstacles to women’s voices,for which the platforms did not post women’s views prominently.From the perspective of women themselves,this paper adopted a questionnaire to study the misunderstanding of feminism and its impact on women’s expression on Weibo.展开更多
User authentication is critical to the security of any information system. The traditional text-based passwords and even biometric systems based on face and fingerprint validation suffer from various drawbacks. Voice-...User authentication is critical to the security of any information system. The traditional text-based passwords and even biometric systems based on face and fingerprint validation suffer from various drawbacks. Voice-based authentication systems have emerged as an effective alternative method. Within the user authentication systems, the server-side voice authentication systems added advantages. The purpose of this paper is to present an innovative approach to the use of voice verification for user authentication. This paper describes a new framework for the implementation of server-side voice authentication, ensuring that only the users who are authenticated and validated can access the system. In addition to providing enhanced security and a more pleasant user experience, this technology has potential applications in a wide range of fields.展开更多
This paper examines how advisors perceive the voices made by Mr.A(pseudonym),the founder of after-school support for children(Initiative Z:pseudonym)in Japan,to advisors who support children.Furthermore,the purpose of...This paper examines how advisors perceive the voices made by Mr.A(pseudonym),the founder of after-school support for children(Initiative Z:pseudonym)in Japan,to advisors who support children.Furthermore,the purpose of this study is to find out how advisors think about the voices and that the voices have led or not advisors to support children,if to do so,what points are key to continuing support for children.Therefore,in Initiative Z,I conducted a survey of two advisors who were approached by Mr.A,who is involved in supporting children as an advisor.As a result of analyzing the narratives obtained from interviews with the two advisors,it was found that the advisor had a sense of being recognized by Mr.A because Mr.A acknowledged the advisor’s way of life.This feeling on the part of the advisor led to trust in Mr.A,and the advisor was in tune with Mr.A’s thoughts on after-school support,suggesting that the advisor was providing support to the child.展开更多
In the telecommunications sector, companies suffer serious damages due to fraud, especially in Africa. One of the main types of fraud is SIM box bypass fraud, which includes using SIM cards to divert incoming internat...In the telecommunications sector, companies suffer serious damages due to fraud, especially in Africa. One of the main types of fraud is SIM box bypass fraud, which includes using SIM cards to divert incoming international calls from mobile operators creating massive losses of revenue. In order to provide a solution to these shortcomings that apply almost to all network operators, we developed intelligent algorithms that exploit huge amounts of data from mobile operators and that detect fraud by analyzing CDRs from voice calls. In this paper we used three classification techniques: Random Forest, Support Vector Machine (SVM) and XGBoost to detect this type of fraud;we compared the performance of these different algorithms to evaluate the model by using data collected from an operator’s network in Cameroon. The algorithm that produced a better performance was the Random Forest with 92% accuracy, so we effectuated the detection of existing fraudulent numbers on the telecommunications operator’s network.展开更多
As the opening work of“Fishing the Sloe-Black River Stories,”Irish-American writer Colum McCann’s short story“Sisters”has attracted scholars’interest with its profuse historical background and themes of the time...As the opening work of“Fishing the Sloe-Black River Stories,”Irish-American writer Colum McCann’s short story“Sisters”has attracted scholars’interest with its profuse historical background and themes of the times.From the perspective of narratology,this paper considers that the writer uses narrative techniques such as non-linear narration and embedded structure to express multi-level narrative voices through a first-person narrator,Sheona.These narrative techniques help to fully reflect the theme of the work and endow the work with great aesthetic and ideological value.展开更多
During this year’s Two Sessions,which are of great importance in the country’s political calendar,representatives from all walks of life gathered in Beijing in March to discuss important topics of common concerns.Th...During this year’s Two Sessions,which are of great importance in the country’s political calendar,representatives from all walks of life gathered in Beijing in March to discuss important topics of common concerns.Their insights and voices of the vital role of standards in supporting high-quality development are showcased in the SPECIAL REPORT column.展开更多
During this year’s Two Sessions,the First Session of the 14th National People’s Congress(NPC,the top legislative body)and the First Session of the 14th National Committee of the Chinese People’s Political Consultat...During this year’s Two Sessions,the First Session of the 14th National People’s Congress(NPC,the top legislative body)and the First Session of the 14th National Committee of the Chinese People’s Political Consultative Conference(CPPCC,the top advisory body),in March,nine press briefings were held at the Great Hall of the People in Beijing,at which 18 NPC deputies,24 CPPCC National Committee members,and nine leaders of government ministries and commissions shared stories of how they performed their duties,responded to questions of common concern,and looked forward to the next stage of China’s development.展开更多
Since 2010, Beijing Mulan Community Service Center has been dedicated to providing female migrant workers with services and help to adapt to urban life.IN January 2023, an original short documentary called Song of Fre...Since 2010, Beijing Mulan Community Service Center has been dedicated to providing female migrant workers with services and help to adapt to urban life.IN January 2023, an original short documentary called Song of Freedom was released. It takes an in-depth look into the lives and backstories of a special group of photography enthusiasts at the Mulan Community Service Center in Beijing.展开更多
Passive voice is an important grammatical phenomenon,and the translation of English passive voice is a hot issue in translation research.In translation,some linguistic phenomena of the source language cannot be expres...Passive voice is an important grammatical phenomenon,and the translation of English passive voice is a hot issue in translation research.In translation,some linguistic phenomena of the source language cannot be expressed in the target language,and the translation has limits of translatability.However,fewer scholars have studied the loss of textual function caused by the translation of passive voice.The authors argue that the conversion of voice in the translation process can,to a certain extent,cause the loss of meaning.In this paper,the authors analyze the loss of textual function caused by the conversion of English passive voice to Chinese active voice from the perspective of the limits of translatability.The authors believe that this phenomenon is common and unavoidable.Therefore,when dealing with the passive voice,the translator should preserve its discourse function as much as possible,rather than just“converting passive to active voice”.展开更多
语音门户是利用了CTI技术实现电话网与互联网集成的重要部件,支持了用户通过普通电话访问互联网获取信息,是由IVR(Interactive Voice Response)、TTS(Text To Speech)、ASR(Automatic Speech Recognition)、Voice XML 4个子系统组成,该...语音门户是利用了CTI技术实现电话网与互联网集成的重要部件,支持了用户通过普通电话访问互联网获取信息,是由IVR(Interactive Voice Response)、TTS(Text To Speech)、ASR(Automatic Speech Recognition)、Voice XML 4个子系统组成,该文在一个实用的语音门户系统的基础上,讨论了系统结构以及4个模块的设计实现,系统设计采用面向对象技术、自动机技术将板卡、通道以其语音合成、识别等资源有机集成在一个系统内,方便了系统设计与功能扩充。展开更多
基金the Double First-Class Innovation Research Projectfor People’s Public Security University of China (No. 2023SYL08).
文摘Voice portrait technology has explored and established the relationship between speakers’ voices and their facialfeatures, aiming to generate corresponding facial characteristics by providing the voice of an unknown speaker.Due to its powerful advantages in image generation, Generative Adversarial Networks (GANs) have now beenwidely applied across various fields. The existing Voice2Face methods for voice portraits are primarily based onGANs trained on voice-face paired datasets. However, voice portrait models solely constructed on GANs facelimitations in image generation quality and struggle to maintain facial similarity. Additionally, the training processis relatively unstable, thereby affecting the overall generative performance of the model. To overcome the abovechallenges,wepropose a novel deepGenerativeAdversarialNetworkmodel for audio-visual synthesis, namedAVPGAN(Attention-enhanced Voice Portrait Model using Generative Adversarial Network). This model is based ona convolutional attention mechanism and is capable of generating corresponding facial images from the voice ofan unknown speaker. Firstly, to address the issue of training instability, we integrate convolutional neural networkswith deep GANs. In the network architecture, we apply spectral normalization to constrain the variation of thediscriminator, preventing issues such as mode collapse. Secondly, to enhance the model’s ability to extract relevantfeatures between the two modalities, we propose a voice portrait model based on convolutional attention. Thismodel learns the mapping relationship between voice and facial features in a common space from both channeland spatial dimensions independently. Thirdly, to enhance the quality of generated faces, we have incorporated adegradation removal module and utilized pretrained facial GANs as facial priors to repair and enhance the clarityof the generated facial images. Experimental results demonstrate that our AVP-GAN achieved a cosine similarity of0.511, outperforming the performance of our comparison model, and effectively achieved the generation of highqualityfacial images corresponding to a speaker’s voice.
文摘The African Union's(AU)admission as a new G20 member is a diplomatic breakthrough and a major step towards a more balanced world order in favour of the developing nations.In September,the G20 Summit in Delhi,India,accepted the AU as its new member,giving the continent a greater voice in the global economic affairs.
文摘Introduction Claire Conceison is one of the leading figures in the research field of contemporary Chinese theater. She has multiple roles: a scholar, translator, and director. She is a professor of Chinese Culture and Theater Arts at MIT and the author of two books——Significant Other: Staging the American in China(2004), Voices Carry: Behind Bars and Backstage during China’s Revolution and Reform(2009).
基金This research was supported financially by the China Postdoctoral Science Foundation,the National Natural Science Foundation of China(Grant No.51705132)the Young Backbone Teacher Training Program in Henan University of Technology,the Education Department of Henan Province Natural Science Project(Grant No.21A460006)the Natural Science Project of Henan Provincial Department of Science and Technology(Grant No.222102220088).
文摘With ongoing economic,scientific,and technological developments,the electronic devices used in daily lives are developing toward precision and miniaturization,and so the demand for high-precision manufacturing machinery is expanding.The most important piece of equipment in modern high-precision manufacturing is the macro-micro motion platform(M3P),which offers high speed,precision,and efficiency and has macro-micro motion coupling characteristics due to its mechanical design and composition of its driving components.Therefore,the design of the control system is crucial for the overall precision of the platform;conventional proportional–integral–derivative control cannot meet the system requirements,and so M3Ps are the subject of a growing range of modern control strategies.This paper begins by describing the development history of M3Ps,followed by their platform structure and motion control system components,and then in-depth assessments of the macro,micro,and macro-micro control systems.In addition to examining the advantages and disadvantages of current macro-micro motion control,recent technological breakthroughs are noted.Finally,based on existing problems,future directions for M3P control systems are given,and the present conclusions offer guidelines for future work on M3Ps.
文摘Voice classification is important in creating more intelligent systems that help with student exams,identifying criminals,and security systems.The main aim of the research is to develop a system able to predicate and classify gender,age,and accent.So,a newsystem calledClassifyingVoice Gender,Age,and Accent(CVGAA)is proposed.Backpropagation and bagging algorithms are designed to improve voice recognition systems that incorporate sensory voice features such as rhythm-based features used to train the device to distinguish between the two gender categories.It has high precision compared to other algorithms used in this problem,as the adaptive backpropagation algorithm had an accuracy of 98%and the Bagging algorithm had an accuracy of 98.10%in the gender identification data.Bagging has the best accuracy among all algorithms,with 55.39%accuracy in the voice common dataset and age classification and accent accuracy in a speech accent of 78.94%.
文摘Weibo,one of China’s largest digital media platforms,has become a major platform for women’s voices to fight for equality.However,misconceptions of feminism on Weibo have become obstacles to women’s voices,for which the platforms did not post women’s views prominently.From the perspective of women themselves,this paper adopted a questionnaire to study the misunderstanding of feminism and its impact on women’s expression on Weibo.
文摘User authentication is critical to the security of any information system. The traditional text-based passwords and even biometric systems based on face and fingerprint validation suffer from various drawbacks. Voice-based authentication systems have emerged as an effective alternative method. Within the user authentication systems, the server-side voice authentication systems added advantages. The purpose of this paper is to present an innovative approach to the use of voice verification for user authentication. This paper describes a new framework for the implementation of server-side voice authentication, ensuring that only the users who are authenticated and validated can access the system. In addition to providing enhanced security and a more pleasant user experience, this technology has potential applications in a wide range of fields.
基金This research is a revised version of the free research presentation at the 42nd Annual Conference of the Japan Society for Lifelong Education,“Prospects for the Role of Advisors in After-School Support for Children”.We would like to express my deepest gratitude to everyone who cooperated with this research.
文摘This paper examines how advisors perceive the voices made by Mr.A(pseudonym),the founder of after-school support for children(Initiative Z:pseudonym)in Japan,to advisors who support children.Furthermore,the purpose of this study is to find out how advisors think about the voices and that the voices have led or not advisors to support children,if to do so,what points are key to continuing support for children.Therefore,in Initiative Z,I conducted a survey of two advisors who were approached by Mr.A,who is involved in supporting children as an advisor.As a result of analyzing the narratives obtained from interviews with the two advisors,it was found that the advisor had a sense of being recognized by Mr.A because Mr.A acknowledged the advisor’s way of life.This feeling on the part of the advisor led to trust in Mr.A,and the advisor was in tune with Mr.A’s thoughts on after-school support,suggesting that the advisor was providing support to the child.
文摘In the telecommunications sector, companies suffer serious damages due to fraud, especially in Africa. One of the main types of fraud is SIM box bypass fraud, which includes using SIM cards to divert incoming international calls from mobile operators creating massive losses of revenue. In order to provide a solution to these shortcomings that apply almost to all network operators, we developed intelligent algorithms that exploit huge amounts of data from mobile operators and that detect fraud by analyzing CDRs from voice calls. In this paper we used three classification techniques: Random Forest, Support Vector Machine (SVM) and XGBoost to detect this type of fraud;we compared the performance of these different algorithms to evaluate the model by using data collected from an operator’s network in Cameroon. The algorithm that produced a better performance was the Random Forest with 92% accuracy, so we effectuated the detection of existing fraudulent numbers on the telecommunications operator’s network.
文摘As the opening work of“Fishing the Sloe-Black River Stories,”Irish-American writer Colum McCann’s short story“Sisters”has attracted scholars’interest with its profuse historical background and themes of the times.From the perspective of narratology,this paper considers that the writer uses narrative techniques such as non-linear narration and embedded structure to express multi-level narrative voices through a first-person narrator,Sheona.These narrative techniques help to fully reflect the theme of the work and endow the work with great aesthetic and ideological value.
文摘During this year’s Two Sessions,which are of great importance in the country’s political calendar,representatives from all walks of life gathered in Beijing in March to discuss important topics of common concerns.Their insights and voices of the vital role of standards in supporting high-quality development are showcased in the SPECIAL REPORT column.
文摘During this year’s Two Sessions,the First Session of the 14th National People’s Congress(NPC,the top legislative body)and the First Session of the 14th National Committee of the Chinese People’s Political Consultative Conference(CPPCC,the top advisory body),in March,nine press briefings were held at the Great Hall of the People in Beijing,at which 18 NPC deputies,24 CPPCC National Committee members,and nine leaders of government ministries and commissions shared stories of how they performed their duties,responded to questions of common concern,and looked forward to the next stage of China’s development.
文摘Since 2010, Beijing Mulan Community Service Center has been dedicated to providing female migrant workers with services and help to adapt to urban life.IN January 2023, an original short documentary called Song of Freedom was released. It takes an in-depth look into the lives and backstories of a special group of photography enthusiasts at the Mulan Community Service Center in Beijing.
文摘Passive voice is an important grammatical phenomenon,and the translation of English passive voice is a hot issue in translation research.In translation,some linguistic phenomena of the source language cannot be expressed in the target language,and the translation has limits of translatability.However,fewer scholars have studied the loss of textual function caused by the translation of passive voice.The authors argue that the conversion of voice in the translation process can,to a certain extent,cause the loss of meaning.In this paper,the authors analyze the loss of textual function caused by the conversion of English passive voice to Chinese active voice from the perspective of the limits of translatability.The authors believe that this phenomenon is common and unavoidable.Therefore,when dealing with the passive voice,the translator should preserve its discourse function as much as possible,rather than just“converting passive to active voice”.
文摘语音门户是利用了CTI技术实现电话网与互联网集成的重要部件,支持了用户通过普通电话访问互联网获取信息,是由IVR(Interactive Voice Response)、TTS(Text To Speech)、ASR(Automatic Speech Recognition)、Voice XML 4个子系统组成,该文在一个实用的语音门户系统的基础上,讨论了系统结构以及4个模块的设计实现,系统设计采用面向对象技术、自动机技术将板卡、通道以其语音合成、识别等资源有机集成在一个系统内,方便了系统设计与功能扩充。