Since it is difficult to fit measured parameters using the conventional traffic model, a new traffic density and average speed model is introduced in this paper. To determine traffic model structures accurately, a mod...Since it is difficult to fit measured parameters using the conventional traffic model, a new traffic density and average speed model is introduced in this paper. To determine traffic model structures accurately, a model identification method for uncertain nonlinear system is developed. To simplify uncertain nonlinear problem, this paper presents a new robust criterion to identify the multi-section traffic model structure of freeway efficiently. In the new model identification criterion, numerically efficient U-D factofization is used to avoid computing the determinant values of two complex matrices. By estimating the values of U-D factor of data matrix, both the upper and lower bounds of system uncertainties are described. Thus a model structure identification algorithm is proposed. Comparisons between identification outputs and simulation outputs of traffic states show that the traffic states can be accurately predicted by means of the new traffic models and the structure identification criterion.展开更多
The Mel-frequency cepstral coefficient (MFCC) is the most widely used feature in speech and speaker recognition. However, MFCC is very sensitive to noise interference, which tends to drastically de- grade the perfor...The Mel-frequency cepstral coefficient (MFCC) is the most widely used feature in speech and speaker recognition. However, MFCC is very sensitive to noise interference, which tends to drastically de- grade the performance of recognition systems because of the mismatches between training and testing. In this paper, the logarithmic transformation in the standard MFCC analysis is replaced by a combined function to improve the noisy sensitivity. The proposed feature extraction process is also combined with speech en- hancement methods, such as spectral subtraction and median-filter to further suppress the noise. Experi- ments show that the proposed robust MFCC-based feature significantly reduces the recognition error rate over a wide signal-to-noise ratio range.展开更多
Unseen handset mismatch is the major source of performance degradation in speaker identification in telecommunication environments. To alleviate the problem, a maximum likelihood a priori knowledge interpolation (ML-...Unseen handset mismatch is the major source of performance degradation in speaker identification in telecommunication environments. To alleviate the problem, a maximum likelihood a priori knowledge interpolation (ML-AKI)-based handset mismatch compensation approach is proposed. It first collects a set of handset characteristics of seen handsets to use as the a priori knowledge for representing the space of handsets. During evaluation the characteristics of an unknown test handset are optimally estimated by interpolation from the set of the a priori knowledge. Experimental results on the HTIMIT database show that the ML-AKI method can improve the average speaker identification rate from 60.0% to 74.6% as compared with conventional maximum a posteriori-adapted Gaussian mixture models. The proposed ML-AKI method is a promising method for robust speaker identification.展开更多
基金The work was supported by Chinese Science Foundation (No .60134010) .
文摘Since it is difficult to fit measured parameters using the conventional traffic model, a new traffic density and average speed model is introduced in this paper. To determine traffic model structures accurately, a model identification method for uncertain nonlinear system is developed. To simplify uncertain nonlinear problem, this paper presents a new robust criterion to identify the multi-section traffic model structure of freeway efficiently. In the new model identification criterion, numerically efficient U-D factofization is used to avoid computing the determinant values of two complex matrices. By estimating the values of U-D factor of data matrix, both the upper and lower bounds of system uncertainties are described. Thus a model structure identification algorithm is proposed. Comparisons between identification outputs and simulation outputs of traffic states show that the traffic states can be accurately predicted by means of the new traffic models and the structure identification criterion.
基金Supported by the National Natural Science Foundation of China(No. 6007201)
文摘The Mel-frequency cepstral coefficient (MFCC) is the most widely used feature in speech and speaker recognition. However, MFCC is very sensitive to noise interference, which tends to drastically de- grade the performance of recognition systems because of the mismatches between training and testing. In this paper, the logarithmic transformation in the standard MFCC analysis is replaced by a combined function to improve the noisy sensitivity. The proposed feature extraction process is also combined with speech en- hancement methods, such as spectral subtraction and median-filter to further suppress the noise. Experi- ments show that the proposed robust MFCC-based feature significantly reduces the recognition error rate over a wide signal-to-noise ratio range.
基金the Science Council of Taiwan, China (No. NSC95-2221-E-027-102)
文摘Unseen handset mismatch is the major source of performance degradation in speaker identification in telecommunication environments. To alleviate the problem, a maximum likelihood a priori knowledge interpolation (ML-AKI)-based handset mismatch compensation approach is proposed. It first collects a set of handset characteristics of seen handsets to use as the a priori knowledge for representing the space of handsets. During evaluation the characteristics of an unknown test handset are optimally estimated by interpolation from the set of the a priori knowledge. Experimental results on the HTIMIT database show that the ML-AKI method can improve the average speaker identification rate from 60.0% to 74.6% as compared with conventional maximum a posteriori-adapted Gaussian mixture models. The proposed ML-AKI method is a promising method for robust speaker identification.