Voice portrait technology has explored and established the relationship between speakers’ voices and their facialfeatures, aiming to generate corresponding facial characteristics by providing the voice of an unknown ...Voice portrait technology has explored and established the relationship between speakers’ voices and their facialfeatures, aiming to generate corresponding facial characteristics by providing the voice of an unknown speaker.Due to its powerful advantages in image generation, Generative Adversarial Networks (GANs) have now beenwidely applied across various fields. The existing Voice2Face methods for voice portraits are primarily based onGANs trained on voice-face paired datasets. However, voice portrait models solely constructed on GANs facelimitations in image generation quality and struggle to maintain facial similarity. Additionally, the training processis relatively unstable, thereby affecting the overall generative performance of the model. To overcome the abovechallenges,wepropose a novel deepGenerativeAdversarialNetworkmodel for audio-visual synthesis, namedAVPGAN(Attention-enhanced Voice Portrait Model using Generative Adversarial Network). This model is based ona convolutional attention mechanism and is capable of generating corresponding facial images from the voice ofan unknown speaker. Firstly, to address the issue of training instability, we integrate convolutional neural networkswith deep GANs. In the network architecture, we apply spectral normalization to constrain the variation of thediscriminator, preventing issues such as mode collapse. Secondly, to enhance the model’s ability to extract relevantfeatures between the two modalities, we propose a voice portrait model based on convolutional attention. Thismodel learns the mapping relationship between voice and facial features in a common space from both channeland spatial dimensions independently. Thirdly, to enhance the quality of generated faces, we have incorporated adegradation removal module and utilized pretrained facial GANs as facial priors to repair and enhance the clarityof the generated facial images. Experimental results demonstrate that our AVP-GAN achieved a cosine similarity of0.511, outperforming the performance of our comparison model, and effectively achieved the generation of highqualityfacial images corresponding to a speaker’s voice.展开更多
Research on adaptive deformable mirror technology for voice coil actuators(VCAs)is an important trend in the development of large ground-based telescopes.A voice coil adaptive deformable mirror contains a large number...Research on adaptive deformable mirror technology for voice coil actuators(VCAs)is an important trend in the development of large ground-based telescopes.A voice coil adaptive deformable mirror contains a large number of actuators,and there are problems with structural coupling and large temperature increases in their internal coils.Additionally,parameters of the traditional proportional integral derivative(PID)control cannot be adjusted in real-time to adapt to system changes.These problems can be addressed by introducing fuzzy control methods.A table lookup method is adopted to replace real-time calculations of the regular fuzzy controller during the control process,and a prototype platform has been established to verify the effectiveness and robustness of this process.Experimental tests compare the control performance of traditional and fuzzy proportional integral derivative(Fuzzy-PID)controllers,showing that,in system step response tests,the fuzzy control system reduces rise time by 20.25%,decreases overshoot by 78.24%,and shortens settling time by 67.59%.In disturbance rejection experiments,fuzzy control achieves a 46.09%reduction in the maximum deviation,indicating stronger robustness.The Fuzzy-PID controller,based on table lookup,outperforms the standard controller significantly,showing excellent potential for enhancing the dynamic performance and disturbance rejection capability of the voice coil motor actuator system.展开更多
Objective To explore the feasibility of remotely obtaining complex information on traditional Chinese medicine(TCM)pulse conditions through voice signals.Methods We used multi-label pulse conditions as the entry point...Objective To explore the feasibility of remotely obtaining complex information on traditional Chinese medicine(TCM)pulse conditions through voice signals.Methods We used multi-label pulse conditions as the entry point and modeled and analyzed TCM pulse diagnosis by combining voice analysis and machine learning.Audio features were extracted from voice recordings in the TCM pulse condition dataset.The obtained features were combined with information from tongue and facial diagnoses.A multi-label pulse condition voice classification DNN model was built using 10-fold cross-validation,and the modeling methods were validated using publicly available datasets.Results The analysis showed that the proposed method achieved an accuracy of 92.59%on the public dataset.The accuracies of the three single-label pulse manifestation models in the test set were 94.27%,96.35%,and 95.39%.The absolute accuracy of the multi-label model was 92.74%.Conclusion Voice data analysis may serve as a remote adjunct to the TCM diagnostic method for pulse condition assessment.展开更多
For all its different forms,democracy is expected to promote people’s well-being,instead of being weaponized to justify hegemony,as democracy is also a principle of global governance.
On September 4,the"Strengthening Youth Exchange to Build a Friendly Future"China-Laos Youth Dialogue was held at the National University of Laos (NUOL) in Vientiane with an aim to consolidate bilateral frien...On September 4,the"Strengthening Youth Exchange to Build a Friendly Future"China-Laos Youth Dialogue was held at the National University of Laos (NUOL) in Vientiane with an aim to consolidate bilateral friendship,promote the building of a ChinaLaos community with a shared future,and strengthen exchanges and cooperation between the youth of both countries.展开更多
The African Union's(AU)admission as a new G20 member is a diplomatic breakthrough and a major step towards a more balanced world order in favour of the developing nations.In September,the G20 Summit in Delhi,India...The African Union's(AU)admission as a new G20 member is a diplomatic breakthrough and a major step towards a more balanced world order in favour of the developing nations.In September,the G20 Summit in Delhi,India,accepted the AU as its new member,giving the continent a greater voice in the global economic affairs.展开更多
User authentication is critical to the security of any information system. The traditional text-based passwords and even biometric systems based on face and fingerprint validation suffer from various drawbacks. Voice-...User authentication is critical to the security of any information system. The traditional text-based passwords and even biometric systems based on face and fingerprint validation suffer from various drawbacks. Voice-based authentication systems have emerged as an effective alternative method. Within the user authentication systems, the server-side voice authentication systems added advantages. The purpose of this paper is to present an innovative approach to the use of voice verification for user authentication. This paper describes a new framework for the implementation of server-side voice authentication, ensuring that only the users who are authenticated and validated can access the system. In addition to providing enhanced security and a more pleasant user experience, this technology has potential applications in a wide range of fields.展开更多
During this year’s Two Sessions,the First Session of the 14th National People’s Congress(NPC,the top legislative body)and the First Session of the 14th National Committee of the Chinese People’s Political Consultat...During this year’s Two Sessions,the First Session of the 14th National People’s Congress(NPC,the top legislative body)and the First Session of the 14th National Committee of the Chinese People’s Political Consultative Conference(CPPCC,the top advisory body),in March,nine press briefings were held at the Great Hall of the People in Beijing,at which 18 NPC deputies,24 CPPCC National Committee members,and nine leaders of government ministries and commissions shared stories of how they performed their duties,responded to questions of common concern,and looked forward to the next stage of China’s development.展开更多
During this year’s Two Sessions,which are of great importance in the country’s political calendar,representatives from all walks of life gathered in Beijing in March to discuss important topics of common concerns.Th...During this year’s Two Sessions,which are of great importance in the country’s political calendar,representatives from all walks of life gathered in Beijing in March to discuss important topics of common concerns.Their insights and voices of the vital role of standards in supporting high-quality development are showcased in the SPECIAL REPORT column.展开更多
Since 2010, Beijing Mulan Community Service Center has been dedicated to providing female migrant workers with services and help to adapt to urban life.IN January 2023, an original short documentary called Song of Fre...Since 2010, Beijing Mulan Community Service Center has been dedicated to providing female migrant workers with services and help to adapt to urban life.IN January 2023, an original short documentary called Song of Freedom was released. It takes an in-depth look into the lives and backstories of a special group of photography enthusiasts at the Mulan Community Service Center in Beijing.展开更多
This paper examines how advisors perceive the voices made by Mr.A(pseudonym),the founder of after-school support for children(Initiative Z:pseudonym)in Japan,to advisors who support children.Furthermore,the purpose of...This paper examines how advisors perceive the voices made by Mr.A(pseudonym),the founder of after-school support for children(Initiative Z:pseudonym)in Japan,to advisors who support children.Furthermore,the purpose of this study is to find out how advisors think about the voices and that the voices have led or not advisors to support children,if to do so,what points are key to continuing support for children.Therefore,in Initiative Z,I conducted a survey of two advisors who were approached by Mr.A,who is involved in supporting children as an advisor.As a result of analyzing the narratives obtained from interviews with the two advisors,it was found that the advisor had a sense of being recognized by Mr.A because Mr.A acknowledged the advisor’s way of life.This feeling on the part of the advisor led to trust in Mr.A,and the advisor was in tune with Mr.A’s thoughts on after-school support,suggesting that the advisor was providing support to the child.展开更多
Weibo,one of China’s largest digital media platforms,has become a major platform for women’s voices to fight for equality.However,misconceptions of feminism on Weibo have become obstacles to women’s voices,for whic...Weibo,one of China’s largest digital media platforms,has become a major platform for women’s voices to fight for equality.However,misconceptions of feminism on Weibo have become obstacles to women’s voices,for which the platforms did not post women’s views prominently.From the perspective of women themselves,this paper adopted a questionnaire to study the misunderstanding of feminism and its impact on women’s expression on Weibo.展开更多
基金the Double First-Class Innovation Research Projectfor People’s Public Security University of China (No. 2023SYL08).
文摘Voice portrait technology has explored and established the relationship between speakers’ voices and their facialfeatures, aiming to generate corresponding facial characteristics by providing the voice of an unknown speaker.Due to its powerful advantages in image generation, Generative Adversarial Networks (GANs) have now beenwidely applied across various fields. The existing Voice2Face methods for voice portraits are primarily based onGANs trained on voice-face paired datasets. However, voice portrait models solely constructed on GANs facelimitations in image generation quality and struggle to maintain facial similarity. Additionally, the training processis relatively unstable, thereby affecting the overall generative performance of the model. To overcome the abovechallenges,wepropose a novel deepGenerativeAdversarialNetworkmodel for audio-visual synthesis, namedAVPGAN(Attention-enhanced Voice Portrait Model using Generative Adversarial Network). This model is based ona convolutional attention mechanism and is capable of generating corresponding facial images from the voice ofan unknown speaker. Firstly, to address the issue of training instability, we integrate convolutional neural networkswith deep GANs. In the network architecture, we apply spectral normalization to constrain the variation of thediscriminator, preventing issues such as mode collapse. Secondly, to enhance the model’s ability to extract relevantfeatures between the two modalities, we propose a voice portrait model based on convolutional attention. Thismodel learns the mapping relationship between voice and facial features in a common space from both channeland spatial dimensions independently. Thirdly, to enhance the quality of generated faces, we have incorporated adegradation removal module and utilized pretrained facial GANs as facial priors to repair and enhance the clarityof the generated facial images. Experimental results demonstrate that our AVP-GAN achieved a cosine similarity of0.511, outperforming the performance of our comparison model, and effectively achieved the generation of highqualityfacial images corresponding to a speaker’s voice.
基金supported by the National Key R&D Program of China (2022YFA1603001,2021YFC2801402)the National Nature Science Foundation of China (12073053)the Science and Technology Plan of Inner Mongolia (2021GG0245).
文摘Research on adaptive deformable mirror technology for voice coil actuators(VCAs)is an important trend in the development of large ground-based telescopes.A voice coil adaptive deformable mirror contains a large number of actuators,and there are problems with structural coupling and large temperature increases in their internal coils.Additionally,parameters of the traditional proportional integral derivative(PID)control cannot be adjusted in real-time to adapt to system changes.These problems can be addressed by introducing fuzzy control methods.A table lookup method is adopted to replace real-time calculations of the regular fuzzy controller during the control process,and a prototype platform has been established to verify the effectiveness and robustness of this process.Experimental tests compare the control performance of traditional and fuzzy proportional integral derivative(Fuzzy-PID)controllers,showing that,in system step response tests,the fuzzy control system reduces rise time by 20.25%,decreases overshoot by 78.24%,and shortens settling time by 67.59%.In disturbance rejection experiments,fuzzy control achieves a 46.09%reduction in the maximum deviation,indicating stronger robustness.The Fuzzy-PID controller,based on table lookup,outperforms the standard controller significantly,showing excellent potential for enhancing the dynamic performance and disturbance rejection capability of the voice coil motor actuator system.
基金supported by Fundamental Research Funds from the Beijing University of Chinese Medicine(2023-JYB-KYPT-13)the Developmental Fund of Beijing University of Chinese Medicine(2020-ZXFZJJ-083).
文摘Objective To explore the feasibility of remotely obtaining complex information on traditional Chinese medicine(TCM)pulse conditions through voice signals.Methods We used multi-label pulse conditions as the entry point and modeled and analyzed TCM pulse diagnosis by combining voice analysis and machine learning.Audio features were extracted from voice recordings in the TCM pulse condition dataset.The obtained features were combined with information from tongue and facial diagnoses.A multi-label pulse condition voice classification DNN model was built using 10-fold cross-validation,and the modeling methods were validated using publicly available datasets.Results The analysis showed that the proposed method achieved an accuracy of 92.59%on the public dataset.The accuracies of the three single-label pulse manifestation models in the test set were 94.27%,96.35%,and 95.39%.The absolute accuracy of the multi-label model was 92.74%.Conclusion Voice data analysis may serve as a remote adjunct to the TCM diagnostic method for pulse condition assessment.
文摘For all its different forms,democracy is expected to promote people’s well-being,instead of being weaponized to justify hegemony,as democracy is also a principle of global governance.
文摘On September 4,the"Strengthening Youth Exchange to Build a Friendly Future"China-Laos Youth Dialogue was held at the National University of Laos (NUOL) in Vientiane with an aim to consolidate bilateral friendship,promote the building of a ChinaLaos community with a shared future,and strengthen exchanges and cooperation between the youth of both countries.
文摘本文通过阐述上行增强技术,对提升高丢包指标的相关参数进行分析研究,提出通过 VOLTE 数据优先级提升参 数、语音业务的目标 bler 参数、PDCCH 自适应参数的优化设置,分析在数传过程中对丢包的影响,并通过西安移动网络,进行了 参数实际验证,证明了优化此类参数对改善丢包指标的作用,找出满足实际网络需求的参数设置,从而优化降低 VoLTE 高丢包 率,探索出一种能改善丢包率及语音质量的有效方法,希望提供简单有效的降低 VoLTE 高丢包率的优化方法,对无线网络优化工 作有所帮助。
文摘The African Union's(AU)admission as a new G20 member is a diplomatic breakthrough and a major step towards a more balanced world order in favour of the developing nations.In September,the G20 Summit in Delhi,India,accepted the AU as its new member,giving the continent a greater voice in the global economic affairs.
文摘User authentication is critical to the security of any information system. The traditional text-based passwords and even biometric systems based on face and fingerprint validation suffer from various drawbacks. Voice-based authentication systems have emerged as an effective alternative method. Within the user authentication systems, the server-side voice authentication systems added advantages. The purpose of this paper is to present an innovative approach to the use of voice verification for user authentication. This paper describes a new framework for the implementation of server-side voice authentication, ensuring that only the users who are authenticated and validated can access the system. In addition to providing enhanced security and a more pleasant user experience, this technology has potential applications in a wide range of fields.
文摘During this year’s Two Sessions,the First Session of the 14th National People’s Congress(NPC,the top legislative body)and the First Session of the 14th National Committee of the Chinese People’s Political Consultative Conference(CPPCC,the top advisory body),in March,nine press briefings were held at the Great Hall of the People in Beijing,at which 18 NPC deputies,24 CPPCC National Committee members,and nine leaders of government ministries and commissions shared stories of how they performed their duties,responded to questions of common concern,and looked forward to the next stage of China’s development.
文摘During this year’s Two Sessions,which are of great importance in the country’s political calendar,representatives from all walks of life gathered in Beijing in March to discuss important topics of common concerns.Their insights and voices of the vital role of standards in supporting high-quality development are showcased in the SPECIAL REPORT column.
文摘Since 2010, Beijing Mulan Community Service Center has been dedicated to providing female migrant workers with services and help to adapt to urban life.IN January 2023, an original short documentary called Song of Freedom was released. It takes an in-depth look into the lives and backstories of a special group of photography enthusiasts at the Mulan Community Service Center in Beijing.
基金This research is a revised version of the free research presentation at the 42nd Annual Conference of the Japan Society for Lifelong Education,“Prospects for the Role of Advisors in After-School Support for Children”.We would like to express my deepest gratitude to everyone who cooperated with this research.
文摘This paper examines how advisors perceive the voices made by Mr.A(pseudonym),the founder of after-school support for children(Initiative Z:pseudonym)in Japan,to advisors who support children.Furthermore,the purpose of this study is to find out how advisors think about the voices and that the voices have led or not advisors to support children,if to do so,what points are key to continuing support for children.Therefore,in Initiative Z,I conducted a survey of two advisors who were approached by Mr.A,who is involved in supporting children as an advisor.As a result of analyzing the narratives obtained from interviews with the two advisors,it was found that the advisor had a sense of being recognized by Mr.A because Mr.A acknowledged the advisor’s way of life.This feeling on the part of the advisor led to trust in Mr.A,and the advisor was in tune with Mr.A’s thoughts on after-school support,suggesting that the advisor was providing support to the child.
文摘Weibo,one of China’s largest digital media platforms,has become a major platform for women’s voices to fight for equality.However,misconceptions of feminism on Weibo have become obstacles to women’s voices,for which the platforms did not post women’s views prominently.From the perspective of women themselves,this paper adopted a questionnaire to study the misunderstanding of feminism and its impact on women’s expression on Weibo.