Voice portrait technology has explored and established the relationship between speakers’ voices and their facialfeatures, aiming to generate corresponding facial characteristics by providing the voice of an unknown ...Voice portrait technology has explored and established the relationship between speakers’ voices and their facialfeatures, aiming to generate corresponding facial characteristics by providing the voice of an unknown speaker.Due to its powerful advantages in image generation, Generative Adversarial Networks (GANs) have now beenwidely applied across various fields. The existing Voice2Face methods for voice portraits are primarily based onGANs trained on voice-face paired datasets. However, voice portrait models solely constructed on GANs facelimitations in image generation quality and struggle to maintain facial similarity. Additionally, the training processis relatively unstable, thereby affecting the overall generative performance of the model. To overcome the abovechallenges,wepropose a novel deepGenerativeAdversarialNetworkmodel for audio-visual synthesis, namedAVPGAN(Attention-enhanced Voice Portrait Model using Generative Adversarial Network). This model is based ona convolutional attention mechanism and is capable of generating corresponding facial images from the voice ofan unknown speaker. Firstly, to address the issue of training instability, we integrate convolutional neural networkswith deep GANs. In the network architecture, we apply spectral normalization to constrain the variation of thediscriminator, preventing issues such as mode collapse. Secondly, to enhance the model’s ability to extract relevantfeatures between the two modalities, we propose a voice portrait model based on convolutional attention. Thismodel learns the mapping relationship between voice and facial features in a common space from both channeland spatial dimensions independently. Thirdly, to enhance the quality of generated faces, we have incorporated adegradation removal module and utilized pretrained facial GANs as facial priors to repair and enhance the clarityof the generated facial images. Experimental results demonstrate that our AVP-GAN achieved a cosine similarity of0.511, outperforming the performance of our comparison model, and effectively achieved the generation of highqualityfacial images corresponding to a speaker’s voice.展开更多
Objective To explore the feasibility of remotely obtaining complex information on traditional Chinese medicine(TCM)pulse conditions through voice signals.Methods We used multi-label pulse conditions as the entry point...Objective To explore the feasibility of remotely obtaining complex information on traditional Chinese medicine(TCM)pulse conditions through voice signals.Methods We used multi-label pulse conditions as the entry point and modeled and analyzed TCM pulse diagnosis by combining voice analysis and machine learning.Audio features were extracted from voice recordings in the TCM pulse condition dataset.The obtained features were combined with information from tongue and facial diagnoses.A multi-label pulse condition voice classification DNN model was built using 10-fold cross-validation,and the modeling methods were validated using publicly available datasets.Results The analysis showed that the proposed method achieved an accuracy of 92.59%on the public dataset.The accuracies of the three single-label pulse manifestation models in the test set were 94.27%,96.35%,and 95.39%.The absolute accuracy of the multi-label model was 92.74%.Conclusion Voice data analysis may serve as a remote adjunct to the TCM diagnostic method for pulse condition assessment.展开更多
Research on adaptive deformable mirror technology for voice coil actuators(VCAs)is an important trend in the development of large ground-based telescopes.A voice coil adaptive deformable mirror contains a large number...Research on adaptive deformable mirror technology for voice coil actuators(VCAs)is an important trend in the development of large ground-based telescopes.A voice coil adaptive deformable mirror contains a large number of actuators,and there are problems with structural coupling and large temperature increases in their internal coils.Additionally,parameters of the traditional proportional integral derivative(PID)control cannot be adjusted in real-time to adapt to system changes.These problems can be addressed by introducing fuzzy control methods.A table lookup method is adopted to replace real-time calculations of the regular fuzzy controller during the control process,and a prototype platform has been established to verify the effectiveness and robustness of this process.Experimental tests compare the control performance of traditional and fuzzy proportional integral derivative(Fuzzy-PID)controllers,showing that,in system step response tests,the fuzzy control system reduces rise time by 20.25%,decreases overshoot by 78.24%,and shortens settling time by 67.59%.In disturbance rejection experiments,fuzzy control achieves a 46.09%reduction in the maximum deviation,indicating stronger robustness.The Fuzzy-PID controller,based on table lookup,outperforms the standard controller significantly,showing excellent potential for enhancing the dynamic performance and disturbance rejection capability of the voice coil motor actuator system.展开更多
For all its different forms,democracy is expected to promote people’s well-being,instead of being weaponized to justify hegemony,as democracy is also a principle of global governance.
As China celebrates the 40th anniversary of joining the International Atomic Energy Agency,an expert talks about how her organization is nurturing talents for overseas projects.
The African Union's(AU)admission as a new G20 member is a diplomatic breakthrough and a major step towards a more balanced world order in favour of the developing nations.In September,the G20 Summit in Delhi,India...The African Union's(AU)admission as a new G20 member is a diplomatic breakthrough and a major step towards a more balanced world order in favour of the developing nations.In September,the G20 Summit in Delhi,India,accepted the AU as its new member,giving the continent a greater voice in the global economic affairs.展开更多
Weibo,one of China’s largest digital media platforms,has become a major platform for women’s voices to fight for equality.However,misconceptions of feminism on Weibo have become obstacles to women’s voices,for whic...Weibo,one of China’s largest digital media platforms,has become a major platform for women’s voices to fight for equality.However,misconceptions of feminism on Weibo have become obstacles to women’s voices,for which the platforms did not post women’s views prominently.From the perspective of women themselves,this paper adopted a questionnaire to study the misunderstanding of feminism and its impact on women’s expression on Weibo.展开更多
User authentication is critical to the security of any information system. The traditional text-based passwords and even biometric systems based on face and fingerprint validation suffer from various drawbacks. Voice-...User authentication is critical to the security of any information system. The traditional text-based passwords and even biometric systems based on face and fingerprint validation suffer from various drawbacks. Voice-based authentication systems have emerged as an effective alternative method. Within the user authentication systems, the server-side voice authentication systems added advantages. The purpose of this paper is to present an innovative approach to the use of voice verification for user authentication. This paper describes a new framework for the implementation of server-side voice authentication, ensuring that only the users who are authenticated and validated can access the system. In addition to providing enhanced security and a more pleasant user experience, this technology has potential applications in a wide range of fields.展开更多
This paper examines how advisors perceive the voices made by Mr.A(pseudonym),the founder of after-school support for children(Initiative Z:pseudonym)in Japan,to advisors who support children.Furthermore,the purpose of...This paper examines how advisors perceive the voices made by Mr.A(pseudonym),the founder of after-school support for children(Initiative Z:pseudonym)in Japan,to advisors who support children.Furthermore,the purpose of this study is to find out how advisors think about the voices and that the voices have led or not advisors to support children,if to do so,what points are key to continuing support for children.Therefore,in Initiative Z,I conducted a survey of two advisors who were approached by Mr.A,who is involved in supporting children as an advisor.As a result of analyzing the narratives obtained from interviews with the two advisors,it was found that the advisor had a sense of being recognized by Mr.A because Mr.A acknowledged the advisor’s way of life.This feeling on the part of the advisor led to trust in Mr.A,and the advisor was in tune with Mr.A’s thoughts on after-school support,suggesting that the advisor was providing support to the child.展开更多
During this year’s Two Sessions,the First Session of the 14th National People’s Congress(NPC,the top legislative body)and the First Session of the 14th National Committee of the Chinese People’s Political Consultat...During this year’s Two Sessions,the First Session of the 14th National People’s Congress(NPC,the top legislative body)and the First Session of the 14th National Committee of the Chinese People’s Political Consultative Conference(CPPCC,the top advisory body),in March,nine press briefings were held at the Great Hall of the People in Beijing,at which 18 NPC deputies,24 CPPCC National Committee members,and nine leaders of government ministries and commissions shared stories of how they performed their duties,responded to questions of common concern,and looked forward to the next stage of China’s development.展开更多
During this year’s Two Sessions,which are of great importance in the country’s political calendar,representatives from all walks of life gathered in Beijing in March to discuss important topics of common concerns.Th...During this year’s Two Sessions,which are of great importance in the country’s political calendar,representatives from all walks of life gathered in Beijing in March to discuss important topics of common concerns.Their insights and voices of the vital role of standards in supporting high-quality development are showcased in the SPECIAL REPORT column.展开更多
Since 2010, Beijing Mulan Community Service Center has been dedicated to providing female migrant workers with services and help to adapt to urban life.IN January 2023, an original short documentary called Song of Fre...Since 2010, Beijing Mulan Community Service Center has been dedicated to providing female migrant workers with services and help to adapt to urban life.IN January 2023, an original short documentary called Song of Freedom was released. It takes an in-depth look into the lives and backstories of a special group of photography enthusiasts at the Mulan Community Service Center in Beijing.展开更多
Passive voice is an important grammatical phenomenon,and the translation of English passive voice is a hot issue in translation research.In translation,some linguistic phenomena of the source language cannot be expres...Passive voice is an important grammatical phenomenon,and the translation of English passive voice is a hot issue in translation research.In translation,some linguistic phenomena of the source language cannot be expressed in the target language,and the translation has limits of translatability.However,fewer scholars have studied the loss of textual function caused by the translation of passive voice.The authors argue that the conversion of voice in the translation process can,to a certain extent,cause the loss of meaning.In this paper,the authors analyze the loss of textual function caused by the conversion of English passive voice to Chinese active voice from the perspective of the limits of translatability.The authors believe that this phenomenon is common and unavoidable.Therefore,when dealing with the passive voice,the translator should preserve its discourse function as much as possible,rather than just“converting passive to active voice”.展开更多
基金the Double First-Class Innovation Research Projectfor People’s Public Security University of China (No. 2023SYL08).
文摘Voice portrait technology has explored and established the relationship between speakers’ voices and their facialfeatures, aiming to generate corresponding facial characteristics by providing the voice of an unknown speaker.Due to its powerful advantages in image generation, Generative Adversarial Networks (GANs) have now beenwidely applied across various fields. The existing Voice2Face methods for voice portraits are primarily based onGANs trained on voice-face paired datasets. However, voice portrait models solely constructed on GANs facelimitations in image generation quality and struggle to maintain facial similarity. Additionally, the training processis relatively unstable, thereby affecting the overall generative performance of the model. To overcome the abovechallenges,wepropose a novel deepGenerativeAdversarialNetworkmodel for audio-visual synthesis, namedAVPGAN(Attention-enhanced Voice Portrait Model using Generative Adversarial Network). This model is based ona convolutional attention mechanism and is capable of generating corresponding facial images from the voice ofan unknown speaker. Firstly, to address the issue of training instability, we integrate convolutional neural networkswith deep GANs. In the network architecture, we apply spectral normalization to constrain the variation of thediscriminator, preventing issues such as mode collapse. Secondly, to enhance the model’s ability to extract relevantfeatures between the two modalities, we propose a voice portrait model based on convolutional attention. Thismodel learns the mapping relationship between voice and facial features in a common space from both channeland spatial dimensions independently. Thirdly, to enhance the quality of generated faces, we have incorporated adegradation removal module and utilized pretrained facial GANs as facial priors to repair and enhance the clarityof the generated facial images. Experimental results demonstrate that our AVP-GAN achieved a cosine similarity of0.511, outperforming the performance of our comparison model, and effectively achieved the generation of highqualityfacial images corresponding to a speaker’s voice.
基金supported by Fundamental Research Funds from the Beijing University of Chinese Medicine(2023-JYB-KYPT-13)the Developmental Fund of Beijing University of Chinese Medicine(2020-ZXFZJJ-083).
文摘Objective To explore the feasibility of remotely obtaining complex information on traditional Chinese medicine(TCM)pulse conditions through voice signals.Methods We used multi-label pulse conditions as the entry point and modeled and analyzed TCM pulse diagnosis by combining voice analysis and machine learning.Audio features were extracted from voice recordings in the TCM pulse condition dataset.The obtained features were combined with information from tongue and facial diagnoses.A multi-label pulse condition voice classification DNN model was built using 10-fold cross-validation,and the modeling methods were validated using publicly available datasets.Results The analysis showed that the proposed method achieved an accuracy of 92.59%on the public dataset.The accuracies of the three single-label pulse manifestation models in the test set were 94.27%,96.35%,and 95.39%.The absolute accuracy of the multi-label model was 92.74%.Conclusion Voice data analysis may serve as a remote adjunct to the TCM diagnostic method for pulse condition assessment.
基金supported by the National Key R&D Program of China (2022YFA1603001,2021YFC2801402)the National Nature Science Foundation of China (12073053)the Science and Technology Plan of Inner Mongolia (2021GG0245).
文摘Research on adaptive deformable mirror technology for voice coil actuators(VCAs)is an important trend in the development of large ground-based telescopes.A voice coil adaptive deformable mirror contains a large number of actuators,and there are problems with structural coupling and large temperature increases in their internal coils.Additionally,parameters of the traditional proportional integral derivative(PID)control cannot be adjusted in real-time to adapt to system changes.These problems can be addressed by introducing fuzzy control methods.A table lookup method is adopted to replace real-time calculations of the regular fuzzy controller during the control process,and a prototype platform has been established to verify the effectiveness and robustness of this process.Experimental tests compare the control performance of traditional and fuzzy proportional integral derivative(Fuzzy-PID)controllers,showing that,in system step response tests,the fuzzy control system reduces rise time by 20.25%,decreases overshoot by 78.24%,and shortens settling time by 67.59%.In disturbance rejection experiments,fuzzy control achieves a 46.09%reduction in the maximum deviation,indicating stronger robustness.The Fuzzy-PID controller,based on table lookup,outperforms the standard controller significantly,showing excellent potential for enhancing the dynamic performance and disturbance rejection capability of the voice coil motor actuator system.
文摘For all its different forms,democracy is expected to promote people’s well-being,instead of being weaponized to justify hegemony,as democracy is also a principle of global governance.
文摘As China celebrates the 40th anniversary of joining the International Atomic Energy Agency,an expert talks about how her organization is nurturing talents for overseas projects.
文摘The African Union's(AU)admission as a new G20 member is a diplomatic breakthrough and a major step towards a more balanced world order in favour of the developing nations.In September,the G20 Summit in Delhi,India,accepted the AU as its new member,giving the continent a greater voice in the global economic affairs.
文摘Weibo,one of China’s largest digital media platforms,has become a major platform for women’s voices to fight for equality.However,misconceptions of feminism on Weibo have become obstacles to women’s voices,for which the platforms did not post women’s views prominently.From the perspective of women themselves,this paper adopted a questionnaire to study the misunderstanding of feminism and its impact on women’s expression on Weibo.
文摘User authentication is critical to the security of any information system. The traditional text-based passwords and even biometric systems based on face and fingerprint validation suffer from various drawbacks. Voice-based authentication systems have emerged as an effective alternative method. Within the user authentication systems, the server-side voice authentication systems added advantages. The purpose of this paper is to present an innovative approach to the use of voice verification for user authentication. This paper describes a new framework for the implementation of server-side voice authentication, ensuring that only the users who are authenticated and validated can access the system. In addition to providing enhanced security and a more pleasant user experience, this technology has potential applications in a wide range of fields.
基金This research is a revised version of the free research presentation at the 42nd Annual Conference of the Japan Society for Lifelong Education,“Prospects for the Role of Advisors in After-School Support for Children”.We would like to express my deepest gratitude to everyone who cooperated with this research.
文摘This paper examines how advisors perceive the voices made by Mr.A(pseudonym),the founder of after-school support for children(Initiative Z:pseudonym)in Japan,to advisors who support children.Furthermore,the purpose of this study is to find out how advisors think about the voices and that the voices have led or not advisors to support children,if to do so,what points are key to continuing support for children.Therefore,in Initiative Z,I conducted a survey of two advisors who were approached by Mr.A,who is involved in supporting children as an advisor.As a result of analyzing the narratives obtained from interviews with the two advisors,it was found that the advisor had a sense of being recognized by Mr.A because Mr.A acknowledged the advisor’s way of life.This feeling on the part of the advisor led to trust in Mr.A,and the advisor was in tune with Mr.A’s thoughts on after-school support,suggesting that the advisor was providing support to the child.
文摘During this year’s Two Sessions,the First Session of the 14th National People’s Congress(NPC,the top legislative body)and the First Session of the 14th National Committee of the Chinese People’s Political Consultative Conference(CPPCC,the top advisory body),in March,nine press briefings were held at the Great Hall of the People in Beijing,at which 18 NPC deputies,24 CPPCC National Committee members,and nine leaders of government ministries and commissions shared stories of how they performed their duties,responded to questions of common concern,and looked forward to the next stage of China’s development.
文摘During this year’s Two Sessions,which are of great importance in the country’s political calendar,representatives from all walks of life gathered in Beijing in March to discuss important topics of common concerns.Their insights and voices of the vital role of standards in supporting high-quality development are showcased in the SPECIAL REPORT column.
文摘Since 2010, Beijing Mulan Community Service Center has been dedicated to providing female migrant workers with services and help to adapt to urban life.IN January 2023, an original short documentary called Song of Freedom was released. It takes an in-depth look into the lives and backstories of a special group of photography enthusiasts at the Mulan Community Service Center in Beijing.
文摘Passive voice is an important grammatical phenomenon,and the translation of English passive voice is a hot issue in translation research.In translation,some linguistic phenomena of the source language cannot be expressed in the target language,and the translation has limits of translatability.However,fewer scholars have studied the loss of textual function caused by the translation of passive voice.The authors argue that the conversion of voice in the translation process can,to a certain extent,cause the loss of meaning.In this paper,the authors analyze the loss of textual function caused by the conversion of English passive voice to Chinese active voice from the perspective of the limits of translatability.The authors believe that this phenomenon is common and unavoidable.Therefore,when dealing with the passive voice,the translator should preserve its discourse function as much as possible,rather than just“converting passive to active voice”.