Funding: Supported by the National Natural Science Foundation of China (Nos. 61175016, 61304250); Key Fund project No. 61233009; financial support from the CASS Innovation Project "Articulatory model for pronunciation training".
Abstract: Conventional acoustic-to-articulatory inversion methods usually train the mapping model using a maximum likelihood or least squares criterion, which assumes that all articulatory channels are equally important. In this paper, the importance of each articulatory channel at each time instant is modeled as an exponential function of its velocity profile and incorporated into the conventional least squares loss function. The loss function is applied to optimize a batch-normalized deep neural network (DNN). The results show that the DNN trained with the proposed cost function outperforms the one trained with the traditional cost function.
Funding: Supported by the National Natural Science Foundation of China (No. 61977049); Advanced Innovation Center for Language Resource and Intelligence (KYR17005); National Major Social Sciences Foundation of China (15ZDB103); Innovation Program of the Chinese Academy of Social Science.
Abstract: Conventional acoustic-to-articulatory inversion methods usually train mappings using a maximum likelihood or least squares criterion, which assumes that all articulatory channels are equally important. However, different articulatory channels play different roles in speech production. In this paper, to account for this in acoustic-to-articulatory inversion, the importance of each articulatory channel is modeled as an exponential function of its corresponding velocity profile and incorporated into the conventional least squares loss function. The proposed loss function is applied to optimize a batch-normalized deep neural network (DNN) for acoustic-to-articulatory inversion. The results indicate that the DNN trained with the proposed cost function outperforms the DNN trained with the traditional cost function for most articulatory channels.
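The velocity-weighted loss described in the two abstracts above can be illustrated with a minimal NumPy sketch. The abstracts do not give the exact weighting formula, so everything specific here is an assumption: the weight is taken as `exp(alpha * |velocity|)`, velocity is estimated with finite differences, and the function names and the parameter `alpha` (including its sign) are hypothetical.

```python
import numpy as np

def velocity_weights(target, alpha):
    """Per-frame, per-channel weights from the velocity profile.

    target: (T, C) array of articulatory trajectories (T frames, C channels).
    alpha:  hypothetical scale; its sign and magnitude are NOT given in the
            abstracts. alpha = 0 recovers the unweighted least squares loss.
    """
    # Finite-difference estimate of the velocity along the time axis.
    speed = np.abs(np.gradient(target, axis=0))
    return np.exp(alpha * speed)

def weighted_least_squares_loss(pred, target, alpha):
    """Mean of the weighted squared errors over all frames and channels."""
    weights = velocity_weights(target, alpha)
    return float(np.mean(weights * (pred - target) ** 2))

if __name__ == "__main__":
    # Channel 0 moves at constant speed, channel 1 is stationary.
    target = np.array([[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]])
    pred = target + 1.0
    print(weighted_least_squares_loss(pred, target, alpha=0.0))
    print(weighted_least_squares_loss(pred, target, alpha=-1.0))
```

With a negative `alpha`, errors on fast-moving channels are down-weighted; with a positive `alpha` they are emphasized. Which of the two matches the papers cannot be determined from the abstracts alone.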
Funding: Funded by the JSPS Ronpaku Program and NSFC Project No. 60975081; CASS innovation project.
Abstract: This research investigated the articulatory and acoustic cues of emotional vowels from the production/coding perspective. Chinese and Japanese emotional speech and EMA data were recorded. The acoustic and articulatory spaces of isolated vowels across different emotions were analyzed to understand the coding scheme in cross-cultural emotion production. The results suggest that, on the one hand, the two languages share some common features: (1) significant differences in the acoustic space exist across emotions; (2) 'Sad' and 'Angry' vowels have a higher tongue position than vowels of other emotions; (3) the lip protrusion feature is shared by both 'Angry' and 'Sad' emotions, except for [a] or [A]. We suggest that the exception is caused by the increase in mouth opening; mouth opening and the degree of lip protrusion are a pair of complementary features. On the other hand, differences also exist between the two languages: (1) in Chinese, the acoustic space moves downwards with increasing emotion intensity, whereas in Japanese 'Angry' is characterized by a horizontally compressed acoustic space and 'Sad' by a vertically compressed acoustic space; (2) in Chinese, emotional factors cause only tongue raising, whereas in Japanese they cause both tongue raising and significant tongue retraction, which can be enhanced by 'Angry' and 'Sad' emotions.
Funding: Supported by the National Science and Technology Council under Grant NSTC 112-2221-E-320-002 and the Buddhist Tzu Chi Medical Foundation in Taiwan under Grant TCMMP 112-02-02.
Abstract: In many Eastern and Western countries, falling birth rates have led to the gradual aging of society. Older adults are often left alone at home or live in long-term care centers, which leaves them susceptible to unsafe events (such as falls) that can have disastrous consequences. However, automatically detecting falls from video data is challenging, and automatic fall detection methods usually require large volumes of training data, which can be difficult to acquire. To address this problem, video kinematic data can be used as training data, thereby avoiding the need to create a large fall data set. This study integrated an improved particle swarm optimization (IPSO) method into a double interactively recurrent fuzzy cerebellar model articulation controller (D-IRFCMAC) model to develop a cost-effective and accurate fall detection system. First, it obtained an optical flow (OF) trajectory diagram from image sequences by using the OF method, and it solved problems related to focal length and object offset by employing the discrete Fourier transform (DFT) algorithm. Second, it developed the D-IRFCMAC model, which combines spatial and temporal (recurrent) information. Third, it designed an IPSO algorithm that effectively strengthens the exploratory capabilities of the proposed D-IRFCMAC model in the global search space. The proposed approach outperforms existing state-of-the-art methods in terms of action recognition accuracy on the UR-Fall, UP-Fall, and PRECIS HAR data sets. Average accuracy was 93.13% on the UCF11 dataset and 92.19% on the UCF101 dataset. Accuracy was 100% on the UR-Fall dataset, 99.25% on the UP-Fall dataset, and 99.07% on the PRECIS HAR dataset.
Abstract: While Bronze Age Proto-Sinaitic and Proto-Canaanite syllabic inscriptions were found engraved on fragments of pottery and stone, evidence of early alphabetic script was also inscribed in ink onto a massive parchment scroll, known as the Torah. Although the contours of those original characters transformed over time, it took the clairvoyant genius of Moses, and later the scribes of Ancient Israel, to configure and adapt ancient Semitic prototypes into phonetic letters, producing the greatest literary document in the history of the world, the Bible. This article summarizes the acoustic properties of that alphabet, with further historical considerations.
Abstract: The superplasticity of AZ31 magnesium matrix composites reinforced with 10 vol% SiC (2 μm) particulate is investigated over a temperature range from 365℃ to 565℃ and strain rates from 2.08×10⁻³ to 5.21×10⁻¹ s⁻¹. The maximum total elongation of 228% is obtained at a strain rate of 2.08×10⁻¹ s⁻¹. A strain rate sensitivity exponent (m) higher than 0.3 is observed when the strain rate is higher than 10⁻¹ s⁻¹ at 525℃. When the test temperature is increased to 540℃, a maximum total elongation exceeding 195% is achieved at a strain rate of 5.21×10⁻¹ s⁻¹, which is higher than the corresponding strain rate at 525℃. SiC in the AZ31/SiCp composite can refine the matrix grain size. Filaments are observed on the fracture surfaces of the specimens showing superplasticity.
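Abstract: Microsoft PowerPoint (PPT) currently dominates multimedia courseware authoring software, but PPT courseware has inherent shortcomings when used in networked, digital teaching. This article compares three mainstream rapid courseware authoring tools built as PPT extensions (Articulate Studio 13, iSpring Presenter 7, and Adobe Presenter 9) across five aspects: multimedia elements, PPT compatibility, platform (Flash vs. HTML5), technical usability, and the pedagogical value of the technology. It analyzes their advantages for producing networked multimedia courseware and summarizes development trends in rapid courseware authoring, with the aim of providing a reference for creating multimedia courseware that meets the needs of networked, digital teaching.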
Funding: The National Natural Science Foundation of China (Nos. 60443007, 50390063).
Abstract: A principal component analysis-cerebellar model articulation controller (PCA-CMAC) model is proposed for machine performance degradation assessment. PCA is used for feature selection, which eliminates redundant information among the features extracted from the sensor signals and reduces the dimension of the input to the CMAC. The CMAC is used to assess degradation states quantitatively based on its local generalization ability. The implementation of the model is presented, and the model is applied to a drilling machine to assess the states of the cutting tool. The results show that the model can assess the wear states quantitatively relative to the normal state of the cutting tool. The influence of the quantization parameter g and the generalization parameter r of the CMAC model on the assessment results is analyzed. If g is larger, the generalization ability is better, but the differences between degradation states are less obvious. If r is smaller, the different states are more distinct, but the memory required to store the weights is larger. The principle for selecting the two parameters is that the memory storing the weights should be small while the degradation states should remain easy to distinguish.
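The PCA stage mentioned in the abstract above (reducing the dimension of the sensor features before they reach the CMAC) can be sketched as follows. This is only an illustration under stated assumptions: it uses a standard SVD-based PCA, the function name `pca_reduce` is hypothetical, and the CMAC itself and the authors' actual feature set are not reproduced.

```python
import numpy as np

def pca_reduce(features, n_components):
    """Project features onto their leading principal components.

    features: (N, D) array, one row per observation (e.g. sensor features).
    Returns (scores, components, mean), where scores has shape
    (N, n_components) and components has shape (n_components, D).
    """
    mean = features.mean(axis=0)
    centered = features - mean
    # Rows of vt are the principal directions, ordered by explained variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:n_components]
    scores = centered @ components.T
    return scores, components, mean

if __name__ == "__main__":
    # Two perfectly correlated features collapse onto one component.
    x = np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 2.0], [3.0, 3.0]])
    scores, components, mean = pca_reduce(x, n_components=1)
    print(scores.shape)
    print(np.allclose(scores @ components + mean, x))
```

In a pipeline like the one the abstract describes, the reduced `scores` would be the input to the downstream assessment model; how many components to keep is a design choice the abstract does not specify.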
Funding: Supported by the National Basic Research Program of China (973 Program, Grant No. 2013CB035502); International Science and Technology Cooperation Project with Russia (Grant No. 2010DFR70270); National Natural Science Foundation of China (Grant No. 51275106); "111" Project (Grant No. B07018); Key Laboratory Opening Funding of Aerospace Mechanism and Control, China (Grant No. HIT.KLOF.2010057).
Abstract: The accuracy of an articulated torque analysis influences the overall performance of heavy-duty multi-legged robots. Currently, the extremal estimation method and some complex methods are employed to calculate the articulated torques, which results in either a large safety margin or a large number of calculations. To quickly obtain accurate articulated torques, an analysis method for the articulated torque is presented for an electrically driven heavy-duty six-legged robot. First, the rearmost leg, which experiences the maximum normal contact force, is identified for the case in which the robot traverses a slope. Based on the ant-type and crab-type tripod gaits, the formulas of classical mechanics and MATLAB software are employed to theoretically analyze the relevant static torques of the joints. As the joint angles of the abductor joint, hip joint, and knee joint change, variable tendency charts and extreme curves are obtained for the static articulated torques. The maximum static articulated torques and the corresponding poses of the robot are also obtained. Using the poses of the robot under the maximum static articulated torques, ADAMS software is used to carry out a static simulation analysis. Based on the relevant simulation curves of the articulated torques, the maximum static articulated torques are acquired. A comparative analysis of the maximum static articulated torques shows that the theoretical calculation values are higher than the static simulation values, and the maximum error value is approximately 10%. The proposed method lays a foundation for quickly determining accurate articulated torques to develop heavy-duty six-legged robots.
Funding: Project supported by the Second Stage of Brain Korea 21 Projects and by the Basic Science Research Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Education, Science and Technology (2011-0013902).
Abstract: In general, the orientation interpolation of industrial robots has been based on the Euler angle system, which can result in a singular point (so-called gimbal lock). Quaternion interpolation, however, has the advantage of natural (specifically, smooth) orientation interpolation without gimbal lock. This work presents the application of quaternion interpolation, specifically Spherical Linear IntERPolation (SLERP), to the orientation control of a 6-axis articulated robot (RS2) using LabVIEW and RecurDyn. To compare SLERP with linear Euler interpolation in terms of the smoothness of the joint angle and torque profiles, the two methods are dynamically simulated on RS2 using both LabVIEW and RecurDyn. Finally, our original work, specifically the implementation of SLERP and linear Euler interpolation on the actual robot (RS2), is carried out using the LabVIEW motion control toolkit. The SLERP orientation control is shown to be effective in terms of smooth joint motion and torque when compared with conventional (linear) Euler interpolation.
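The SLERP technique named in the abstract above can be sketched in a few lines of NumPy. This is a generic textbook formulation, not the authors' LabVIEW implementation; the function name `slerp`, the (w, x, y, z) component order, and the threshold used for the near-parallel fallback are assumptions made for illustration.

```python
import numpy as np

def slerp(q0, q1, t):
    """Spherical linear interpolation between two unit quaternions.

    q0, q1: quaternions as length-4 sequences in (w, x, y, z) order (assumed).
    t:      interpolation parameter in [0, 1].
    """
    q0 = np.asarray(q0, dtype=float)
    q1 = np.asarray(q1, dtype=float)
    q0 = q0 / np.linalg.norm(q0)
    q1 = q1 / np.linalg.norm(q1)
    dot = float(np.dot(q0, q1))
    # q and -q encode the same rotation; flip one to take the shorter arc.
    if dot < 0.0:
        q1, dot = -q1, -dot
    # Nearly parallel quaternions: fall back to normalized linear
    # interpolation to avoid dividing by a very small sin(theta).
    if dot > 0.9995:
        q = q0 + t * (q1 - q0)
        return q / np.linalg.norm(q)
    theta = np.arccos(dot)
    return (np.sin((1.0 - t) * theta) * q0 + np.sin(t * theta) * q1) / np.sin(theta)

if __name__ == "__main__":
    identity = [1.0, 0.0, 0.0, 0.0]
    # 90 degree rotation about the z axis.
    quarter_turn = [np.cos(np.pi / 4), 0.0, 0.0, np.sin(np.pi / 4)]
    # Halfway should be a 45 degree rotation about the z axis.
    print(slerp(identity, quarter_turn, 0.5))
```

Because the interpolated quaternion moves along a great arc at constant angular rate, the resulting orientation changes uniformly, which is the property the abstract contrasts with linear interpolation of Euler angles.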