自发性知觉经络反应(autonomous sensory meridian response,ASMR),指人体通过视、听、触、嗅等感知上的刺激,在颅内、头皮、背部或身体其他部位产生的令人愉悦的独特刺激感[1],又名耳音、颅内高潮等。Poerio等[2]邀请了91名健康人观看A...自发性知觉经络反应(autonomous sensory meridian response,ASMR),指人体通过视、听、触、嗅等感知上的刺激,在颅内、头皮、背部或身体其他部位产生的令人愉悦的独特刺激感[1],又名耳音、颅内高潮等。Poerio等[2]邀请了91名健康人观看ASMR视频,结果提示超过半数的人有ASMR类似体验。这提示ASMR可能普遍存在于正常人群中。虽然生理学暂时没有明确的证据阐明该现象的机制,但大量案例提示,ASMR在睡眠、抗抑郁和治疗慢性疼痛方面可能有促进作用。本研究将对ASMR的相关研究进行综述,具体如下。展开更多
A machine learning based speech enhancement method is proposed to improve the intelligibility of whispered speech. A binary mask estimated by a two-class support vector machine (SVM) classifier is used to synthesize...A machine learning based speech enhancement method is proposed to improve the intelligibility of whispered speech. A binary mask estimated by a two-class support vector machine (SVM) classifier is used to synthesize the enhanced whisper. A novel noise robust feature called Gammatone feature cosine coefficients (GFCCs) extracted by an auditory periphery model is derived and used for the binary mask estimation. The intelligibility performance of the proposed method is evaluated and compared with the traditional speech enhancement methods. Objective and subjective evaluation results indicate that the proposed method can effectively improve the intelligibility of whispered speech which is contaminated by noise. Compared with the power subtract algorithm and the log-MMSE algorithm, both of which do not improve the intelligibility in lower signal-to-noise ratio (SNR) environments, the proposed method has good performance in improving the intelligibility of noisy whisper. Additionally, the intelligibility of the enhanced whispered speech using the proposed method also outperforms that of the corresponding unprocessed noisy whispered speech.展开更多
Some factors influencing the intelligibility of the enhanced whisper in the joint time-frequency domain are evaluated. Specifically, both the spectrum density and different regions of the enhanced spectrum are analyze...Some factors influencing the intelligibility of the enhanced whisper in the joint time-frequency domain are evaluated. Specifically, both the spectrum density and different regions of the enhanced spectrum are analyzed. Experimental results show that for a spectrum of some density, the joint time-frequency gain-modification based speech enhancement algorithm achieves significant improvement in intelligibility. Additionally, the spectrum region where the estimated spectrum is smaller than the clean spectrum, is the most important region contributing to intelligibility improvement for the enhanced whisper. The spectrum region where the estimated spectrum is larger than twice the size of the clean spectrum is detrimental to speech intelligibility perception within the whisper context.展开更多
The cognitive performance-based dimensional emotion recognition in whispered speech is studied.First,the whispered speech emotion databases and data collection methods are compared, and the character of emotion expres...The cognitive performance-based dimensional emotion recognition in whispered speech is studied.First,the whispered speech emotion databases and data collection methods are compared, and the character of emotion expression in whispered speech is studied,especially the basic types of emotions.Secondly,the emotion features for whispered speech is analyzed,and by reviewing the latest references,the related valence features and the arousal features are provided. The effectiveness of valence and arousal features in whispered speech emotion classification is studied.Finally,the Gaussian mixture model is studied and applied to whispered speech emotion recognition. The cognitive performance is also considered in emotion recognition so that the recognition errors of whispered speech emotion can be corrected.Based on the cognitive scores,the emotion recognition results can be improved.The results show that the formant features are not significantly related to arousal dimension,while the short-term energy features are related to the emotion changes in arousal dimension.Using the cognitive scores,the recognition results can be improved.展开更多
AIM: To determine whether listening to music decreases the requirement for dosages of sedative drugs, patients' anxiety, pain and dissatisfaction feelings during colonoscopy and makes the procedure more comfortable ...AIM: To determine whether listening to music decreases the requirement for dosages of sedative drugs, patients' anxiety, pain and dissatisfaction feelings during colonoscopy and makes the procedure more comfortable and acceptable. METHODS: Patients undergoing elective colonoscopy between October 2005 and February 2006 were randomized into either listening to music (Group 1, n = 30) or not listening to music (Group 2, n = 30). Anxiolytic and analgesic drugs (intravenous midazolam and meperidine) were given according to the patients' demand. Administered medications were monitored. We determined their levels of anxiety using the State-Trait Anxiety Inventory Test form. Patients' satisfaction, pain, and willingness to undergo a repeated procedure were self-assessed using a visual analog scale. RESULTS: The mean dose of sedative and analgesic drugs used in group 1 (midazolam: 2.1 ± 1.4, meperidine: 18.1 ± 11.7) was smaller than group 2 (midazolam: 2.4 ± 1.0, meperidine: 20.6 ± 11.5), but without a significant difference (P 〉 0.05). The mean anxiety level in group 1 was lower than group 2 (36.7 ± 2.2 vs 251.0 ± 1.9, P 〈 0.001). The mean satisfaction score was higher in group 1 compared to group 2 (87.8 ± 3.1 vs 58.1 ± 3.4, P 〈 0.001). The mean pain score in group i was lower than group 2 (74.1 ± 4.7 vs 39.0 ± 3.9, P 〈 0.001). CONCLUSION: Listening to music during colonoscopy helps reduce the dose of sedative medications, as well as patients' anxiety, pain, dissatisfaction during the procedure. Therefore, we believe that listening to music can play an adjunctive role to sedation in colonoscopy. It is a simple, inexpensive way to improve patients' comfort during the procedure.展开更多
A filter algorithm based on cochlear mechanics and neuron filter mechanism is proposed from the view point of vibration.It helps to solve the problem that the non-linear amplification is rarely considered in studying ...A filter algorithm based on cochlear mechanics and neuron filter mechanism is proposed from the view point of vibration.It helps to solve the problem that the non-linear amplification is rarely considered in studying the auditory filters.A cochlear mechanical transduction model is built to illustrate the audio signals processing procedure in cochlea,and then the neuron filter mechanism is modeled to indirectly obtain the outputs with the cochlear properties of frequency tuning and non-linear amplification.The mathematic description of the proposed algorithm is derived by the two models.The parameter space,the parameter selection rules and the error correction of the proposed algorithm are discussed.The unit impulse responses in the time domain and the frequency domain are simulated and compared to probe into the characteristics of the proposed algorithm.Then a 24-channel filter bank is built based on the proposed algorithm and applied to the enhancements of the audio signals.The experiments and comparisons verify that,the proposed algorithm can effectively divide the audio signals into different frequencies,significantly enhance the high frequency parts,and provide positive impacts on the performance of speech enhancement in different noise environments,especially for the babble noise and the volvo noise.展开更多
The Autoregressive Moving Average (ARMA) model for whispered speech is proposed. with normal speech, whispered speech has no fundamental frequency because of the glottis being semi-opened and turbulent flow being cr...The Autoregressive Moving Average (ARMA) model for whispered speech is proposed. with normal speech, whispered speech has no fundamental frequency because of the glottis being semi-opened and turbulent flow being created, and formant shifting exists in the lower frequency region due to the narrowing of the tract in the false vocal fold regions and weak acoustic coupling with the aubglottal system. Analysis shows that the effect of the subglottal system is to introduce additional pole-zero pairs into the vocal tract transfer function. Theoretically, the method based on an ARMA process is superior to that based on an AR process in the spectral analysis of the whispered speech. Two methods, the least squared modified Yule-Walker likelihood estimate (LSMY) algorithm and the Frequency-Domain Steiglitz-Mcbide (FDSM) algorithm, are applied to the ARMA mfldel for the whispered speech. The performance evaluation shows that the ARMA model is much more appropriate for representing the whispered speech than the AR model, and the FDSM algorithm provides a name acorate estimation of the whispered speech spectral envelope than the LSMY algorithm with higher conputational complexity.展开更多
Language, literature, customs and traditions, music and art are cultural items that were transmitted from generation to generation throughout history. In this context, literature is an important source of music cultur...Language, literature, customs and traditions, music and art are cultural items that were transmitted from generation to generation throughout history. In this context, literature is an important source of music culture that takes inspiration from the customs and traditions of a society. Prosodic meter is echoed in form, usul and general structure in works composed from the divan literature and almost lives in the work. In the same way, when examples of folk literature composed by composers and performed by poets and a^lks are examined, it is observed that there are parallels between literary features and form, structure and rhythmic features. The aim of this paper is to reveal the integral link between Melody-Usul and Meter in Ottoman Turkish Music展开更多
Mete Sakpmar, who was born in Ankara, May 5, 1954, is a contemporary Turkish composer. His compositions were played in various countries like Turkey, France (1980-1983), Canada, Australia, Sweden, Norway, Hungary, A...Mete Sakpmar, who was born in Ankara, May 5, 1954, is a contemporary Turkish composer. His compositions were played in various countries like Turkey, France (1980-1983), Canada, Australia, Sweden, Norway, Hungary, Albania, Belgium, Holland, and German, and he also joined lots of seminars, radio, and television programs. His inspirational sources are traditional Turkish music, modern French music, American cultural traditions, as well as jazz, acoustic, and electronic music. His music, in summary, is about taking a little from tonality, twelve-note system, coincidences, improvisations, etc., and joining them with his own experiences like emotions, knowledge, and intelligence. Sakpmar always defends that the composer of the day has to benefit from lots of various sources. He is doing this as well as trying new forms in every new piece. It does not matter if these are tonal, atonal, serial, or modal but it must be personal. All the borders and capacities of the instruments have to be forced.展开更多
Animals have special solution to the problem of communication in high levels of background noise. A small group of vertebrates (bats,dolphins and whales,and some rodents) that use ultrasound for communication.Our rese...Animals have special solution to the problem of communication in high levels of background noise. A small group of vertebrates (bats,dolphins and whales,and some rodents) that use ultrasound for communication.Our research first demonstrated that the concave-eared torrent frog is the first non- mammalian vertebrate found to be capable of pro- ducing and detecting ultrasounds for communica- tion.This study may provide a clue for understand- ing why humans have ear canals and how animals auditory systems have evolved,and inspire in de- veloping bionic tecnology for improving hearing in noise.展开更多
文摘自发性知觉经络反应(autonomous sensory meridian response,ASMR),指人体通过视、听、触、嗅等感知上的刺激,在颅内、头皮、背部或身体其他部位产生的令人愉悦的独特刺激感[1],又名耳音、颅内高潮等。Poerio等[2]邀请了91名健康人观看ASMR视频,结果提示超过半数的人有ASMR类似体验。这提示ASMR可能普遍存在于正常人群中。虽然生理学暂时没有明确的证据阐明该现象的机制,但大量案例提示,ASMR在睡眠、抗抑郁和治疗慢性疼痛方面可能有促进作用。本研究将对ASMR的相关研究进行综述,具体如下。
基金The National Natural Science Foundation of China (No.61231002,61273266,51075068,60872073,60975017, 61003131)the Ph.D.Programs Foundation of the Ministry of Education of China(No.20110092130004)+1 种基金the Science Foundation for Young Talents in the Educational Committee of Anhui Province(No. 2010SQRL018)the 211 Project of Anhui University(No.2009QN027B)
文摘A machine learning based speech enhancement method is proposed to improve the intelligibility of whispered speech. A binary mask estimated by a two-class support vector machine (SVM) classifier is used to synthesize the enhanced whisper. A novel noise robust feature called Gammatone feature cosine coefficients (GFCCs) extracted by an auditory periphery model is derived and used for the binary mask estimation. The intelligibility performance of the proposed method is evaluated and compared with the traditional speech enhancement methods. Objective and subjective evaluation results indicate that the proposed method can effectively improve the intelligibility of whispered speech which is contaminated by noise. Compared with the power subtract algorithm and the log-MMSE algorithm, both of which do not improve the intelligibility in lower signal-to-noise ratio (SNR) environments, the proposed method has good performance in improving the intelligibility of noisy whisper. Additionally, the intelligibility of the enhanced whispered speech using the proposed method also outperforms that of the corresponding unprocessed noisy whispered speech.
基金The National Natural Science Foundation of China(No.61301295,61273266,61301219,61201326,61003131)the Natural Science Foundation of Anhui Province(No.1308085QF100,1408085MF113)+2 种基金the Natural Science Foundation of Jiangsu Province(No.BK20130241)the Natural Science Foundation of Higher Education Institutions of Jiangsu Province(No.12KJB510021)the Doctoral Fund of Anhui University
文摘Some factors influencing the intelligibility of the enhanced whisper in the joint time-frequency domain are evaluated. Specifically, both the spectrum density and different regions of the enhanced spectrum are analyzed. Experimental results show that for a spectrum of some density, the joint time-frequency gain-modification based speech enhancement algorithm achieves significant improvement in intelligibility. Additionally, the spectrum region where the estimated spectrum is smaller than the clean spectrum, is the most important region contributing to intelligibility improvement for the enhanced whisper. The spectrum region where the estimated spectrum is larger than twice the size of the clean spectrum is detrimental to speech intelligibility perception within the whisper context.
基金The National Natural Science Foundation of China(No.11401412)
文摘The cognitive performance-based dimensional emotion recognition in whispered speech is studied.First,the whispered speech emotion databases and data collection methods are compared, and the character of emotion expression in whispered speech is studied,especially the basic types of emotions.Secondly,the emotion features for whispered speech is analyzed,and by reviewing the latest references,the related valence features and the arousal features are provided. The effectiveness of valence and arousal features in whispered speech emotion classification is studied.Finally,the Gaussian mixture model is studied and applied to whispered speech emotion recognition. The cognitive performance is also considered in emotion recognition so that the recognition errors of whispered speech emotion can be corrected.Based on the cognitive scores,the emotion recognition results can be improved.The results show that the formant features are not significantly related to arousal dimension,while the short-term energy features are related to the emotion changes in arousal dimension.Using the cognitive scores,the recognition results can be improved.
文摘AIM: To determine whether listening to music decreases the requirement for dosages of sedative drugs, patients' anxiety, pain and dissatisfaction feelings during colonoscopy and makes the procedure more comfortable and acceptable. METHODS: Patients undergoing elective colonoscopy between October 2005 and February 2006 were randomized into either listening to music (Group 1, n = 30) or not listening to music (Group 2, n = 30). Anxiolytic and analgesic drugs (intravenous midazolam and meperidine) were given according to the patients' demand. Administered medications were monitored. We determined their levels of anxiety using the State-Trait Anxiety Inventory Test form. Patients' satisfaction, pain, and willingness to undergo a repeated procedure were self-assessed using a visual analog scale. RESULTS: The mean dose of sedative and analgesic drugs used in group 1 (midazolam: 2.1 ± 1.4, meperidine: 18.1 ± 11.7) was smaller than group 2 (midazolam: 2.4 ± 1.0, meperidine: 20.6 ± 11.5), but without a significant difference (P 〉 0.05). The mean anxiety level in group 1 was lower than group 2 (36.7 ± 2.2 vs 251.0 ± 1.9, P 〈 0.001). The mean satisfaction score was higher in group 1 compared to group 2 (87.8 ± 3.1 vs 58.1 ± 3.4, P 〈 0.001). The mean pain score in group i was lower than group 2 (74.1 ± 4.7 vs 39.0 ± 3.9, P 〈 0.001). CONCLUSION: Listening to music during colonoscopy helps reduce the dose of sedative medications, as well as patients' anxiety, pain, dissatisfaction during the procedure. Therefore, we believe that listening to music can play an adjunctive role to sedation in colonoscopy. It is a simple, inexpensive way to improve patients' comfort during the procedure.
基金Project(17KJB510029)supported by the Natural Science Foundation of the Jiangsu Higher Education Institutions,ChinaProject(GXL2017004)supported by the Scientific Research Foundation of Nanjing Forestry University,China+3 种基金Project(202102210132)supported by the Important Project of Science and Technology of Henan Province,ChinaProject(B2019-51)supported by the Scientific Research Foundation of Henan Polytechnic University,ChinaProject(51521003)supported by the Foundation for Innovative Research Groups of the National Natural Science Foundation of ChinaProject(KQTD2016112515134654)supported by Shenzhen Science and Technology Program,China。
文摘A filter algorithm based on cochlear mechanics and neuron filter mechanism is proposed from the view point of vibration.It helps to solve the problem that the non-linear amplification is rarely considered in studying the auditory filters.A cochlear mechanical transduction model is built to illustrate the audio signals processing procedure in cochlea,and then the neuron filter mechanism is modeled to indirectly obtain the outputs with the cochlear properties of frequency tuning and non-linear amplification.The mathematic description of the proposed algorithm is derived by the two models.The parameter space,the parameter selection rules and the error correction of the proposed algorithm are discussed.The unit impulse responses in the time domain and the frequency domain are simulated and compared to probe into the characteristics of the proposed algorithm.Then a 24-channel filter bank is built based on the proposed algorithm and applied to the enhancements of the audio signals.The experiments and comparisons verify that,the proposed algorithm can effectively divide the audio signals into different frequencies,significantly enhance the high frequency parts,and provide positive impacts on the performance of speech enhancement in different noise environments,especially for the babble noise and the volvo noise.
基金supported by the Independent Innovation Foundation of Shandong University(No.2009JC004)the Natural Science Foundation of Shandong Province(No.Y2007G31)
文摘The Autoregressive Moving Average (ARMA) model for whispered speech is proposed. with normal speech, whispered speech has no fundamental frequency because of the glottis being semi-opened and turbulent flow being created, and formant shifting exists in the lower frequency region due to the narrowing of the tract in the false vocal fold regions and weak acoustic coupling with the aubglottal system. Analysis shows that the effect of the subglottal system is to introduce additional pole-zero pairs into the vocal tract transfer function. Theoretically, the method based on an ARMA process is superior to that based on an AR process in the spectral analysis of the whispered speech. Two methods, the least squared modified Yule-Walker likelihood estimate (LSMY) algorithm and the Frequency-Domain Steiglitz-Mcbide (FDSM) algorithm, are applied to the ARMA mfldel for the whispered speech. The performance evaluation shows that the ARMA model is much more appropriate for representing the whispered speech than the AR model, and the FDSM algorithm provides a name acorate estimation of the whispered speech spectral envelope than the LSMY algorithm with higher conputational complexity.
文摘Language, literature, customs and traditions, music and art are cultural items that were transmitted from generation to generation throughout history. In this context, literature is an important source of music culture that takes inspiration from the customs and traditions of a society. Prosodic meter is echoed in form, usul and general structure in works composed from the divan literature and almost lives in the work. In the same way, when examples of folk literature composed by composers and performed by poets and a^lks are examined, it is observed that there are parallels between literary features and form, structure and rhythmic features. The aim of this paper is to reveal the integral link between Melody-Usul and Meter in Ottoman Turkish Music
文摘Mete Sakpmar, who was born in Ankara, May 5, 1954, is a contemporary Turkish composer. His compositions were played in various countries like Turkey, France (1980-1983), Canada, Australia, Sweden, Norway, Hungary, Albania, Belgium, Holland, and German, and he also joined lots of seminars, radio, and television programs. His inspirational sources are traditional Turkish music, modern French music, American cultural traditions, as well as jazz, acoustic, and electronic music. His music, in summary, is about taking a little from tonality, twelve-note system, coincidences, improvisations, etc., and joining them with his own experiences like emotions, knowledge, and intelligence. Sakpmar always defends that the composer of the day has to benefit from lots of various sources. He is doing this as well as trying new forms in every new piece. It does not matter if these are tonal, atonal, serial, or modal but it must be personal. All the borders and capacities of the instruments have to be forced.
文摘Animals have special solution to the problem of communication in high levels of background noise. A small group of vertebrates (bats,dolphins and whales,and some rodents) that use ultrasound for communication.Our research first demonstrated that the concave-eared torrent frog is the first non- mammalian vertebrate found to be capable of pro- ducing and detecting ultrasounds for communica- tion.This study may provide a clue for understand- ing why humans have ear canals and how animals auditory systems have evolved,and inspire in de- veloping bionic tecnology for improving hearing in noise.