In the article, hypothesis test for coefficients in high dimensional regression models is considered. I develop simultaneous test statistic for the hypothesis test in both linear and partial linear models. The derived...In the article, hypothesis test for coefficients in high dimensional regression models is considered. I develop simultaneous test statistic for the hypothesis test in both linear and partial linear models. The derived test is designed for growing p and fixed n where the conventional F-test is no longer appropriate. The asymptotic distribution of the proposed test statistic under the null hypothesis is obtained.展开更多
This paper proposes a test procedure for testing the regression coefficients in high dimensional partially linear models based on the F-statistic. In the partially linear model, the authors first estimate the unknown ...This paper proposes a test procedure for testing the regression coefficients in high dimensional partially linear models based on the F-statistic. In the partially linear model, the authors first estimate the unknown nonlinear component by some nonparametric methods and then generalize the F-statistic to test the regression coefficients under some regular conditions. During this procedure, the estimation of the nonlinear component brings much challenge to explore the properties of generalized F-test. The authors obtain some asymptotic properties of the generalized F-test in more general cases,including the asymptotic normality and the power of this test with p/n ∈(0, 1) without normality assumption. The asymptotic result is general and by adding some constraint conditions we can obtain the similar conclusions in high dimensional linear models. Through simulation studies, the authors demonstrate good finite-sample performance of the proposed test in comparison with the theoretical results. The practical utility of our method is illustrated by a real data example.展开更多
In this paper,we focus on the partially linear varying-coefficient quantile regression with missing observations under ultra-high dimension,where the missing observations include either responses or covariates or the ...In this paper,we focus on the partially linear varying-coefficient quantile regression with missing observations under ultra-high dimension,where the missing observations include either responses or covariates or the responses and part of the covariates are missing at random,and the ultra-high dimension implies that the dimension of parameter is much larger than sample size.Based on the B-spline method for the varying coefficient functions,we study the consistency of the oracle estimator which is obtained only using active covariates whose coefficients are nonzero.At the same time,we discuss the asymptotic normality of the oracle estimator for the linear parameter.Note that the active covariates are unknown in practice,non-convex penalized estimator is investigated for simultaneous variable selection and estimation,whose oracle property is also established.Finite sample behavior of the proposed methods is investigated via simulations and real data analysis.展开更多
This paper considers tests for regression coefficients in high dimensional partially linear Models.The authors first use the B-spline method to estimate the unknown smooth function so that it could be linearly express...This paper considers tests for regression coefficients in high dimensional partially linear Models.The authors first use the B-spline method to estimate the unknown smooth function so that it could be linearly expressed.Then,the authors propose an empirical likelihood method to test regression coefficients.The authors derive the asymptotic chi-squared distribution with two degrees of freedom of the proposed test statistics under the null hypothesis.In addition,the method is extended to test with nuisance parameters.Simulations show that the proposed method have a good performance in control of type-I error rate and power.The proposed method is also employed to analyze a data of Skin Cutaneous Melanoma(SKCM).展开更多
Based on a comparison between the oxygen isotope records of benthic and plank tonic foraminifers from core 8KL of the South China Sea and sea-level change records derived from the Huon Peninsula, New Guinea, it is fou...Based on a comparison between the oxygen isotope records of benthic and plank tonic foraminifers from core 8KL of the South China Sea and sea-level change records derived from the Huon Peninsula, New Guinea, it is found that both records are very similar from 72 K a B.P. to the present, especially for the benthic oxygen isotope record. The linear regression shows that δ18O changes (0.9995‰ for benthic foraminifers and 1.022‰ for planktonic foraminifers) are equal to 100 m in sea-level fluctuation. After making temperature correction in the δ18O record of benthic foraminifers from 72 to 120 Ka B.P., the curve of sea-level oscillation of the South China Sea since 186 Ka B.P. has been reconstructed. The lowermost sea - level that occurred in the last glacial maximum and oxygen isotope stage 6 is approximately - 130 m.展开更多
For Microwave Humidity and Temperature sounder(MWHTS) measurements over the ocean, a cloud filtering method is presented to filter out cloud-and precipitation-affected observations by analyzing the sensitivity of the ...For Microwave Humidity and Temperature sounder(MWHTS) measurements over the ocean, a cloud filtering method is presented to filter out cloud-and precipitation-affected observations by analyzing the sensitivity of the simulated brightness temperatures of MWHTS to cloud liquid water, and using the root mean square error(RMSE)between observation and simulation in clear sky as a reference standard. The atmospheric temperature and humidity profiles are retrieved using MWHTS measurements with and without filtering by multiple linear regression(MLR),artificial neural networks(ANN) and one-dimensional variational(1DVAR) retrieval methods, respectively, and the effects of the filtering method on the retrieval accuracies are analyzed. The numerical results show that the filtering method can improve the retrieval accuracies of the MLR and the 1DVAR retrieval methods, but have little influence on that of the ANN. In addition, the dependencies of the retrieval methods upon the testing samples of brightness temperature are studied, and the results show that the 1DVAR retrieval method has great stability due to that the testing samples have great impact on the retrieval accuracies of the MLR and the ANN, but have little impact on that of the 1DVAR.展开更多
The recognition of pathological voice is considered a difficult task for speech analysis.Moreover,otolaryngologists needed to rely on oral communication with patients to discover traces of voice pathologies like dysph...The recognition of pathological voice is considered a difficult task for speech analysis.Moreover,otolaryngologists needed to rely on oral communication with patients to discover traces of voice pathologies like dysphonia that are caused by voice alteration of vocal folds and their accuracy is between 60%–70%.To enhance detection accuracy and reduce processing speed of dysphonia detection,a novel approach is proposed in this paper.We have leveraged Linear Discriminant Analysis(LDA)to train multiple Machine Learning(ML)models for dysphonia detection.Several ML models are utilized like Support Vector Machine(SVM),Logistic Regression,and K-nearest neighbor(K-NN)to predict the voice pathologies based on features like Mel-Frequency Cepstral Coefficients(MFCC),Fundamental Frequency(F0),Shimmer(%),Jitter(%),and Harmonic to Noise Ratio(HNR).The experiments were performed using Saarbrucken Voice Data-base(SVD)and a privately collected dataset.The K-fold cross-validation approach was incorporated to increase the robustness and stability of the ML models.According to the experimental results,our proposed approach has a 70%increase in processing speed over Principal Component Analysis(PCA)and performs remarkably well with a recognition accuracy of 95.24%on the SVD dataset surpassing the previous best accuracy of 82.37%.In the case of the private dataset,our proposed method achieved an accuracy rate of 93.37%.It can be an effective non-invasive method to detect dysphonia.展开更多
This paper concerns computational problems of the concave penalized linear regression model.We propose a fixed point iterative algorithm to solve the computational problem based on the fact that the penalized estimato...This paper concerns computational problems of the concave penalized linear regression model.We propose a fixed point iterative algorithm to solve the computational problem based on the fact that the penalized estimator satisfies a fixed point equation.The convergence property of the proposed algorithm is established.Numerical studies are conducted to evaluate the finite sample performance of the proposed algorithm.展开更多
文摘In the article, hypothesis test for coefficients in high dimensional regression models is considered. I develop simultaneous test statistic for the hypothesis test in both linear and partial linear models. The derived test is designed for growing p and fixed n where the conventional F-test is no longer appropriate. The asymptotic distribution of the proposed test statistic under the null hypothesis is obtained.
基金supported by the Natural Science Foundation of China under Grant Nos.11231010,11471223,11501586BCMIIS and Key Project of Beijing Municipal Educational Commission under Grant No.KZ201410028030
文摘This paper proposes a test procedure for testing the regression coefficients in high dimensional partially linear models based on the F-statistic. In the partially linear model, the authors first estimate the unknown nonlinear component by some nonparametric methods and then generalize the F-statistic to test the regression coefficients under some regular conditions. During this procedure, the estimation of the nonlinear component brings much challenge to explore the properties of generalized F-test. The authors obtain some asymptotic properties of the generalized F-test in more general cases,including the asymptotic normality and the power of this test with p/n ∈(0, 1) without normality assumption. The asymptotic result is general and by adding some constraint conditions we can obtain the similar conclusions in high dimensional linear models. Through simulation studies, the authors demonstrate good finite-sample performance of the proposed test in comparison with the theoretical results. The practical utility of our method is illustrated by a real data example.
基金Supported by National Natural Science Foundation of China(Grant No.12071348)Fundamental Research Funds for Central Universities,China(Grant No.2023-3-2D-04)。
文摘In this paper,we focus on the partially linear varying-coefficient quantile regression with missing observations under ultra-high dimension,where the missing observations include either responses or covariates or the responses and part of the covariates are missing at random,and the ultra-high dimension implies that the dimension of parameter is much larger than sample size.Based on the B-spline method for the varying coefficient functions,we study the consistency of the oracle estimator which is obtained only using active covariates whose coefficients are nonzero.At the same time,we discuss the asymptotic normality of the oracle estimator for the linear parameter.Note that the active covariates are unknown in practice,non-convex penalized estimator is investigated for simultaneous variable selection and estimation,whose oracle property is also established.Finite sample behavior of the proposed methods is investigated via simulations and real data analysis.
基金supported by the University of Chinese Academy of Sciences under Grant No.Y95401TXX2Beijing Natural Science Foundation under Grant No.Z190004Key Program of Joint Funds of the National Natural Science Foundation of China under Grant No.U19B2040。
文摘This paper considers tests for regression coefficients in high dimensional partially linear Models.The authors first use the B-spline method to estimate the unknown smooth function so that it could be linearly expressed.Then,the authors propose an empirical likelihood method to test regression coefficients.The authors derive the asymptotic chi-squared distribution with two degrees of freedom of the proposed test statistics under the null hypothesis.In addition,the method is extended to test with nuisance parameters.Simulations show that the proposed method have a good performance in control of type-I error rate and power.The proposed method is also employed to analyze a data of Skin Cutaneous Melanoma(SKCM).
基金Project 49206062 funded by the National Natural Science Foundation of China
文摘Based on a comparison between the oxygen isotope records of benthic and plank tonic foraminifers from core 8KL of the South China Sea and sea-level change records derived from the Huon Peninsula, New Guinea, it is found that both records are very similar from 72 K a B.P. to the present, especially for the benthic oxygen isotope record. The linear regression shows that δ18O changes (0.9995‰ for benthic foraminifers and 1.022‰ for planktonic foraminifers) are equal to 100 m in sea-level fluctuation. After making temperature correction in the δ18O record of benthic foraminifers from 72 to 120 Ka B.P., the curve of sea-level oscillation of the South China Sea since 186 Ka B.P. has been reconstructed. The lowermost sea - level that occurred in the last glacial maximum and oxygen isotope stage 6 is approximately - 130 m.
基金Key Fostering Project of National Space Science Center,Chinese Academy of Sciences(Y62112f37s)National 863 Project of China(2015AA8126027)
文摘For Microwave Humidity and Temperature sounder(MWHTS) measurements over the ocean, a cloud filtering method is presented to filter out cloud-and precipitation-affected observations by analyzing the sensitivity of the simulated brightness temperatures of MWHTS to cloud liquid water, and using the root mean square error(RMSE)between observation and simulation in clear sky as a reference standard. The atmospheric temperature and humidity profiles are retrieved using MWHTS measurements with and without filtering by multiple linear regression(MLR),artificial neural networks(ANN) and one-dimensional variational(1DVAR) retrieval methods, respectively, and the effects of the filtering method on the retrieval accuracies are analyzed. The numerical results show that the filtering method can improve the retrieval accuracies of the MLR and the 1DVAR retrieval methods, but have little influence on that of the ANN. In addition, the dependencies of the retrieval methods upon the testing samples of brightness temperature are studied, and the results show that the 1DVAR retrieval method has great stability due to that the testing samples have great impact on the retrieval accuracies of the MLR and the ANN, but have little impact on that of the 1DVAR.
文摘The recognition of pathological voice is considered a difficult task for speech analysis.Moreover,otolaryngologists needed to rely on oral communication with patients to discover traces of voice pathologies like dysphonia that are caused by voice alteration of vocal folds and their accuracy is between 60%–70%.To enhance detection accuracy and reduce processing speed of dysphonia detection,a novel approach is proposed in this paper.We have leveraged Linear Discriminant Analysis(LDA)to train multiple Machine Learning(ML)models for dysphonia detection.Several ML models are utilized like Support Vector Machine(SVM),Logistic Regression,and K-nearest neighbor(K-NN)to predict the voice pathologies based on features like Mel-Frequency Cepstral Coefficients(MFCC),Fundamental Frequency(F0),Shimmer(%),Jitter(%),and Harmonic to Noise Ratio(HNR).The experiments were performed using Saarbrucken Voice Data-base(SVD)and a privately collected dataset.The K-fold cross-validation approach was incorporated to increase the robustness and stability of the ML models.According to the experimental results,our proposed approach has a 70%increase in processing speed over Principal Component Analysis(PCA)and performs remarkably well with a recognition accuracy of 95.24%on the SVD dataset surpassing the previous best accuracy of 82.37%.In the case of the private dataset,our proposed method achieved an accuracy rate of 93.37%.It can be an effective non-invasive method to detect dysphonia.
基金Supported by the National Natural Science Foundation of China(11701571)
文摘This paper concerns computational problems of the concave penalized linear regression model.We propose a fixed point iterative algorithm to solve the computational problem based on the fact that the penalized estimator satisfies a fixed point equation.The convergence property of the proposed algorithm is established.Numerical studies are conducted to evaluate the finite sample performance of the proposed algorithm.