This study examines the integration of classical aesthetics into the silent curriculum of higher vocational education,emphasizing its potential to significantly enhance emotional and social skills.Incorporating aesthe...This study examines the integration of classical aesthetics into the silent curriculum of higher vocational education,emphasizing its potential to significantly enhance emotional and social skills.Incorporating aesthetic principles into education emphasizes the importance of nurturing emotional intelligence,creativity,and cultural awareness in students-skills that go beyond the classroom and are essential for their growth,adaptability,and future careers.It explores theoretical foundations and practical implementations and addresses challenges such as the need for specialized educator training,overcoming institutional resistance,and securing adequate resources.Advocating for empirical research and strategic cultural partnerships,the paper proposes a transformative approach to vocational education,aligning it with contemporary societal and labor market demands,and underscores the vital role of classical aesthetics in enriching vocational training and enhancing student outcomes.展开更多
In the era of cultural economy or aesthetic capitalism, the relationship between humanities and science has undergone great changes that include four aspects: (1) the humanities are compatible with and dependent on...In the era of cultural economy or aesthetic capitalism, the relationship between humanities and science has undergone great changes that include four aspects: (1) the humanities are compatible with and dependent on science rather than antagonism and conflict; (2) the content and form of humanities have so greatly changed that the new humanities discipline has appeared; (3) in contemporary scientific and cultural development pattern, new humanities have gradually become the principal discipline; (4) aesthetics and art gradually highlight its importance. Based on the thoughts of Karl Marx's Economic and Philosophic Manuscripts of 1844, this paper intends to discuss the essence and significances of the modernity problem in contemporary society. And then, it analyzes "how to think about the future" in the era of aesthetic capitalism or cultural economy. In the last part of the paper, several characteristics of the new humanities are studied from the perspective of contemporary aesthetics.展开更多
Throughout Western music history, pre-existing material has long been the aesthetic core of a new composition. Yet there has never been such an epoch as our time in which using pre-existing material, melodic quotation...Throughout Western music history, pre-existing material has long been the aesthetic core of a new composition. Yet there has never been such an epoch as our time in which using pre-existing material, melodic quotation in particular, features so extensively in works of many of the composers. The aim of this paper is to investigate how the use of quoted tunes in a musical piece operates in an interwoven complex where time and space are of the essence. A quote is able to oscillate perpetually between one’s mental worlds of the memorable past and the imaginative present when it is highlighted enough to be recognizable from its surrounding context. Upon interpreting the use of quotation in various contexts, the aesthetic object, I argue, is the shift from original to quoted music, and vice versa. And listeners can respond aesthetically to the quotation itself even without knowledge of its provenance and textual or referential content.展开更多
The essentialismabout beauty and art of Tolstoy is based on the criticismof the traditional trinity ideology of the true,the good and the beautiful. H e resolutely denies the discussion of the essence of beauty of the...The essentialismabout beauty and art of Tolstoy is based on the criticismof the traditional trinity ideology of the true,the good and the beautiful. H e resolutely denies the discussion of the essence of beauty of the traditional aesthetics,denying that the essence of beauty is in the pleasant sensation and the traditional fine arts. At the same time,he believes that the concept of art cannot be based on beauty and that the essence of art is determined —art is nothing but communication of emotion,putting forward the pure,clear and sincere aesthetical standard and revealing the essence of emotion and the functions of art. Tolstoy is the milestone in the history of the western aesthetics and is of great significance to the development of the western modern aesthetics.展开更多
In recent years,speech synthesis systems have allowed for the produc-tion of very high-quality voices.Therefore,research in this domain is now turning to the problem of integrating emotions into speech.However,the met...In recent years,speech synthesis systems have allowed for the produc-tion of very high-quality voices.Therefore,research in this domain is now turning to the problem of integrating emotions into speech.However,the method of con-structing a speech synthesizer for each emotion has some limitations.First,this method often requires an emotional-speech data set with many sentences.Such data sets are very time-intensive and labor-intensive to complete.Second,training each of these models requires computers with large computational capabilities and a lot of effort and time for model tuning.In addition,each model for each emotion failed to take advantage of data sets of other emotions.In this paper,we propose a new method to synthesize emotional speech in which the latent expressions of emotions are learned from a small data set of professional actors through a Flow-tron model.In addition,we provide a new method to build a speech corpus that is scalable and whose quality is easy to control.Next,to produce a high-quality speech synthesis model,we used this data set to train the Tacotron 2 model.We used it as a pre-trained model to train the Flowtron model.We applied this method to synthesize Vietnamese speech with sadness and happiness.Mean opi-nion score(MOS)assessment results show that MOS is 3.61 for sadness and 3.95 for happiness.In conclusion,the proposed method proves to be more effec-tive for a high degree of automation and fast emotional sentence generation,using a small emotional-speech data set.展开更多
The Japanese movie“0.5mm”connects the life clips between the nursing-care helper Sawa and several old people in the form of a road movie,highlighting many thought-provoking social problems,and revealing how the elem...The Japanese movie“0.5mm”connects the life clips between the nursing-care helper Sawa and several old people in the form of a road movie,highlighting many thought-provoking social problems,and revealing how the elements of emotional abuse hidden between the old people and their relatives and friends affect people’s dignity and decency,and at the same time trying to offset the persecution from the emotional abuse with the warm kindness between the elderly and the care workers.The film’s implicit description of emotional abuse and explicit display of good deeds are blended in the quiet and mysterious narrative character,achieving the effect of synchronizing the artistic narrative rhythm with the flow of life,reflecting the unique aesthetic characteristics of Japanese films.展开更多
Due to the lack of large-scale emotion databases,it is hard to obtain comparable improvement in multimodal emotion recognition of the deep neural network by deep learning,which has made great progress in other areas.W...Due to the lack of large-scale emotion databases,it is hard to obtain comparable improvement in multimodal emotion recognition of the deep neural network by deep learning,which has made great progress in other areas.We use transfer learning to improve its performance with pretrained models on largescale data.Audio is encoded using deep speech recognition networks with 500 hours’speech and video is encoded using convolutional neural networks with over 110,000 images.The extracted audio and visual features are fed into Long Short-Term Memory to train models respectively.Logistic regression and ensemble method are performed in decision level fusion.The experiment results indicate that 1)audio features extracted from deep speech recognition networks achieve better performance than handcrafted audio features;2)the visual emotion recognition obtains better performance than audio emotion recognition;3)the ensemble method gets better performance than logistic regression and prior knowledge from micro-F1 value further improves the performance and robustness,achieving accuracy of 67.00%for“happy”,54.90%for“an?gry”,and 51.69%for“sad”.展开更多
Background A crucial element of human-machine interaction,the automatic detection of emotional states from human speech has long been regarded as a challenging task for machine learning models.One vital challenge in s...Background A crucial element of human-machine interaction,the automatic detection of emotional states from human speech has long been regarded as a challenging task for machine learning models.One vital challenge in speech emotion recognition(SER)is learning robust and discriminative representations from speech.Although machine learning methods have been widely applied in SER research,the inadequate amount of available annotated data has become a bottleneck impeding the extended application of such techniques(e.g.,deep neural networks).To address this issue,we present a deep learning method that combines knowledge transfer and self-attention for SER tasks.Herein,we apply the log-Mel spectrogram with deltas and delta-deltas as inputs.Moreover,given that emotions are time dependent,we apply temporal convolutional neural networks to model the variations in emotions.We further introduce an attention transfer mechanism,which is based on a self-attention algorithm to learn long-term dependencies.The self-attention transfer network(SATN)in our proposed approach takes advantage of attention transfer to learn attention from speech recognition,followed by transferring this knowledge into SER.An evaluation built on Interactive Emotional Dyadic Motion Capture(IEMOCAP)dataset demonstrates the effectiveness of the proposed model.展开更多
Infertility is considered to be a growing problem worldwide. In sub-Saharan Africa, at least 20%-50% of couples of reproductive age experience a fertility problem and 30% are diagnosed with infertility. This study exp...Infertility is considered to be a growing problem worldwide. In sub-Saharan Africa, at least 20%-50% of couples of reproductive age experience a fertility problem and 30% are diagnosed with infertility. This study explores the experiences of women in South Africa who are involuntary childless and explores their psychological and emotional experiences of In Vitro Fertilisation and Embryo Transfer (IVF-ET). Utilising a qualitative methodology, a diverse group of 21 married women diagnosed with infertility and who had undergone at least two cycles of IVF-ET were recruited. Semi-structured, in-depth individual interviews were conducted and the data were analysed using thematic analysis. The results of the study indicated that the women perceived themselves as not conforming to a dominant belief system and as a result felt compelled to explore all the medical options available. They reported emotional turmoil characterised by primary binary emotions of anxiety-excitement and nervousness-optimistic. These emotions were experienced throughout the five stages of the IVF-ET treatment cycles. A synopsis of the psychological and emotional responses to the IVF-ET treatment is discussed. The findings of this study suggest the need for the incorporation of a mandatory psychosocial intervention as part of infertility management. Greater attention to the psychological and emotional repercussions of infertility treatment could lead to a more personalised client-approach which, in turn, would prepare infertile women and couples for the emotional demands of the treatment.展开更多
In order to improve the efficiency of speech emotion recognition across corpora,a speech emotion transfer learning method based on the deep sparse auto-encoder is proposed.The algorithm first reconstructs a small amou...In order to improve the efficiency of speech emotion recognition across corpora,a speech emotion transfer learning method based on the deep sparse auto-encoder is proposed.The algorithm first reconstructs a small amount of data in the target domain by training the deep sparse auto-encoder,so that the encoder can learn the low-dimensional structural representation of the target domain data.Then,the source domain data and the target domain data are coded by the trained deep sparse auto-encoder to obtain the reconstruction data of the low-dimensional structural representation close to the target domain.Finally,a part of the reconstructed tagged target domain data is mixed with the reconstructed source domain data to jointly train the classifier.This part of the target domain data is used to guide the source domain data.Experiments on the CASIA,SoutheastLab corpus show that the model recognition rate after a small amount of data transferred reached 89.2%and 72.4%on the DNN.Compared to the training results of the complete original corpus,it only decreased by 2%in the CASIA corpus,and only 3.4%in the SoutheastLab corpus.Experiments show that the algorithm can achieve the effect of labeling all data in the extreme case that the data set has only a small amount of data tagged.展开更多
Since 1949, modern Chinese language has, in the course o f its development in China's Mainland, twice witnessed large-scale transfers in its word emotive overtones. The first began in 1949 and went on all the way ...Since 1949, modern Chinese language has, in the course o f its development in China's Mainland, twice witnessed large-scale transfers in its word emotive overtones. The first began in 1949 and went on all the way till the end o f the Cultural Revolution in 1977. Derogation manifested itself in that period, during which the derogatory words enjoyed their greatest number, widest usages and highest frequency in the history o f the Chinese language. The second began from the Reform and Opening Up Policy in 1978 and lasted untill now. De-derogation has manifested itself in this period, during which the derogatory words have had the smallest number, least usages and lowest frequency in the history o f the Chinese language. The two large-scale transfers result from their specific social backgrounds and the development o f the Chinese language itself.展开更多
基金Philosophy and Social Sciences Research Program in Colleges and Universities of Jiangsu Provincial Department of Education(2024SJSZ0285)Higher Education Reform Research Project of Jiangsu Higher Education Association(2023JSJG649)+1 种基金Scientific Research Cultivation Project of Shenzhen Institute of Information Technology(SZIIT2021KJ033)Scientific Research Foundation for High-level Talents(RC2023-006)。
文摘This study examines the integration of classical aesthetics into the silent curriculum of higher vocational education,emphasizing its potential to significantly enhance emotional and social skills.Incorporating aesthetic principles into education emphasizes the importance of nurturing emotional intelligence,creativity,and cultural awareness in students-skills that go beyond the classroom and are essential for their growth,adaptability,and future careers.It explores theoretical foundations and practical implementations and addresses challenges such as the need for specialized educator training,overcoming institutional resistance,and securing adequate resources.Advocating for empirical research and strategic cultural partnerships,the paper proposes a transformative approach to vocational education,aligning it with contemporary societal and labor market demands,and underscores the vital role of classical aesthetics in enriching vocational training and enhancing student outcomes.
基金Acknowledgements: This paper was sponsored by China National Social Science Foundation "Research on the Fundamental Problems of the Contemporary Aesthetics and Criticism Patterns" (15ZDB023).
文摘In the era of cultural economy or aesthetic capitalism, the relationship between humanities and science has undergone great changes that include four aspects: (1) the humanities are compatible with and dependent on science rather than antagonism and conflict; (2) the content and form of humanities have so greatly changed that the new humanities discipline has appeared; (3) in contemporary scientific and cultural development pattern, new humanities have gradually become the principal discipline; (4) aesthetics and art gradually highlight its importance. Based on the thoughts of Karl Marx's Economic and Philosophic Manuscripts of 1844, this paper intends to discuss the essence and significances of the modernity problem in contemporary society. And then, it analyzes "how to think about the future" in the era of aesthetic capitalism or cultural economy. In the last part of the paper, several characteristics of the new humanities are studied from the perspective of contemporary aesthetics.
文摘Throughout Western music history, pre-existing material has long been the aesthetic core of a new composition. Yet there has never been such an epoch as our time in which using pre-existing material, melodic quotation in particular, features so extensively in works of many of the composers. The aim of this paper is to investigate how the use of quoted tunes in a musical piece operates in an interwoven complex where time and space are of the essence. A quote is able to oscillate perpetually between one’s mental worlds of the memorable past and the imaginative present when it is highlighted enough to be recognizable from its surrounding context. Upon interpreting the use of quotation in various contexts, the aesthetic object, I argue, is the shift from original to quoted music, and vice versa. And listeners can respond aesthetically to the quotation itself even without knowledge of its provenance and textual or referential content.
文摘The essentialismabout beauty and art of Tolstoy is based on the criticismof the traditional trinity ideology of the true,the good and the beautiful. H e resolutely denies the discussion of the essence of beauty of the traditional aesthetics,denying that the essence of beauty is in the pleasant sensation and the traditional fine arts. At the same time,he believes that the concept of art cannot be based on beauty and that the essence of art is determined —art is nothing but communication of emotion,putting forward the pure,clear and sincere aesthetical standard and revealing the essence of emotion and the functions of art. Tolstoy is the milestone in the history of the western aesthetics and is of great significance to the development of the western modern aesthetics.
基金funded by the Hanoi University of Science and Technology(HUST)under grant number T2018-PC-210.
文摘In recent years,speech synthesis systems have allowed for the produc-tion of very high-quality voices.Therefore,research in this domain is now turning to the problem of integrating emotions into speech.However,the method of con-structing a speech synthesizer for each emotion has some limitations.First,this method often requires an emotional-speech data set with many sentences.Such data sets are very time-intensive and labor-intensive to complete.Second,training each of these models requires computers with large computational capabilities and a lot of effort and time for model tuning.In addition,each model for each emotion failed to take advantage of data sets of other emotions.In this paper,we propose a new method to synthesize emotional speech in which the latent expressions of emotions are learned from a small data set of professional actors through a Flow-tron model.In addition,we provide a new method to build a speech corpus that is scalable and whose quality is easy to control.Next,to produce a high-quality speech synthesis model,we used this data set to train the Tacotron 2 model.We used it as a pre-trained model to train the Flowtron model.We applied this method to synthesize Vietnamese speech with sadness and happiness.Mean opi-nion score(MOS)assessment results show that MOS is 3.61 for sadness and 3.95 for happiness.In conclusion,the proposed method proves to be more effec-tive for a high degree of automation and fast emotional sentence generation,using a small emotional-speech data set.
文摘The Japanese movie“0.5mm”connects the life clips between the nursing-care helper Sawa and several old people in the form of a road movie,highlighting many thought-provoking social problems,and revealing how the elements of emotional abuse hidden between the old people and their relatives and friends affect people’s dignity and decency,and at the same time trying to offset the persecution from the emotional abuse with the warm kindness between the elderly and the care workers.The film’s implicit description of emotional abuse and explicit display of good deeds are blended in the quiet and mysterious narrative character,achieving the effect of synchronizing the artistic narrative rhythm with the flow of life,reflecting the unique aesthetic characteristics of Japanese films.
文摘Due to the lack of large-scale emotion databases,it is hard to obtain comparable improvement in multimodal emotion recognition of the deep neural network by deep learning,which has made great progress in other areas.We use transfer learning to improve its performance with pretrained models on largescale data.Audio is encoded using deep speech recognition networks with 500 hours’speech and video is encoded using convolutional neural networks with over 110,000 images.The extracted audio and visual features are fed into Long Short-Term Memory to train models respectively.Logistic regression and ensemble method are performed in decision level fusion.The experiment results indicate that 1)audio features extracted from deep speech recognition networks achieve better performance than handcrafted audio features;2)the visual emotion recognition obtains better performance than audio emotion recognition;3)the ensemble method gets better performance than logistic regression and prior knowledge from micro-F1 value further improves the performance and robustness,achieving accuracy of 67.00%for“happy”,54.90%for“an?gry”,and 51.69%for“sad”.
基金the National Natural Science Foundation of China(62071330)the National Science Fund for Distinguished Young Scholars(61425017)+3 种基金the Key Program of the National Natural Science Foundation(61831022)the Key Program of the Natural Science Foundation of Tianjin(18JCZDJC36300)the Open Projects Program of the National Laboratory of Pattern Recognition and the Senior Visiting Scholar Program of Tianjin Normal Universitythe Innovative Medicines Initiative 2 Joint Undertaking(115902),which receives support from the European Union's Horizon 2020 research and innovation program and EFPIA.
文摘Background A crucial element of human-machine interaction,the automatic detection of emotional states from human speech has long been regarded as a challenging task for machine learning models.One vital challenge in speech emotion recognition(SER)is learning robust and discriminative representations from speech.Although machine learning methods have been widely applied in SER research,the inadequate amount of available annotated data has become a bottleneck impeding the extended application of such techniques(e.g.,deep neural networks).To address this issue,we present a deep learning method that combines knowledge transfer and self-attention for SER tasks.Herein,we apply the log-Mel spectrogram with deltas and delta-deltas as inputs.Moreover,given that emotions are time dependent,we apply temporal convolutional neural networks to model the variations in emotions.We further introduce an attention transfer mechanism,which is based on a self-attention algorithm to learn long-term dependencies.The self-attention transfer network(SATN)in our proposed approach takes advantage of attention transfer to learn attention from speech recognition,followed by transferring this knowledge into SER.An evaluation built on Interactive Emotional Dyadic Motion Capture(IEMOCAP)dataset demonstrates the effectiveness of the proposed model.
文摘Infertility is considered to be a growing problem worldwide. In sub-Saharan Africa, at least 20%-50% of couples of reproductive age experience a fertility problem and 30% are diagnosed with infertility. This study explores the experiences of women in South Africa who are involuntary childless and explores their psychological and emotional experiences of In Vitro Fertilisation and Embryo Transfer (IVF-ET). Utilising a qualitative methodology, a diverse group of 21 married women diagnosed with infertility and who had undergone at least two cycles of IVF-ET were recruited. Semi-structured, in-depth individual interviews were conducted and the data were analysed using thematic analysis. The results of the study indicated that the women perceived themselves as not conforming to a dominant belief system and as a result felt compelled to explore all the medical options available. They reported emotional turmoil characterised by primary binary emotions of anxiety-excitement and nervousness-optimistic. These emotions were experienced throughout the five stages of the IVF-ET treatment cycles. A synopsis of the psychological and emotional responses to the IVF-ET treatment is discussed. The findings of this study suggest the need for the incorporation of a mandatory psychosocial intervention as part of infertility management. Greater attention to the psychological and emotional repercussions of infertility treatment could lead to a more personalised client-approach which, in turn, would prepare infertile women and couples for the emotional demands of the treatment.
基金The National Natural Science Foundation of China(No.61871213,61673108,61571106)Six Talent Peaks Project in Jiangsu Province(No.2016-DZXX-023)
文摘In order to improve the efficiency of speech emotion recognition across corpora,a speech emotion transfer learning method based on the deep sparse auto-encoder is proposed.The algorithm first reconstructs a small amount of data in the target domain by training the deep sparse auto-encoder,so that the encoder can learn the low-dimensional structural representation of the target domain data.Then,the source domain data and the target domain data are coded by the trained deep sparse auto-encoder to obtain the reconstruction data of the low-dimensional structural representation close to the target domain.Finally,a part of the reconstructed tagged target domain data is mixed with the reconstructed source domain data to jointly train the classifier.This part of the target domain data is used to guide the source domain data.Experiments on the CASIA,SoutheastLab corpus show that the model recognition rate after a small amount of data transferred reached 89.2%and 72.4%on the DNN.Compared to the training results of the complete original corpus,it only decreased by 2%in the CASIA corpus,and only 3.4%in the SoutheastLab corpus.Experiments show that the algorithm can achieve the effect of labeling all data in the extreme case that the data set has only a small amount of data tagged.
文摘Since 1949, modern Chinese language has, in the course o f its development in China's Mainland, twice witnessed large-scale transfers in its word emotive overtones. The first began in 1949 and went on all the way till the end o f the Cultural Revolution in 1977. Derogation manifested itself in that period, during which the derogatory words enjoyed their greatest number, widest usages and highest frequency in the history o f the Chinese language. The second began from the Reform and Opening Up Policy in 1978 and lasted untill now. De-derogation has manifested itself in this period, during which the derogatory words have had the smallest number, least usages and lowest frequency in the history o f the Chinese language. The two large-scale transfers result from their specific social backgrounds and the development o f the Chinese language itself.