期刊文献+
共找到5篇文章
< 1 >
每页显示 20 50 100
Optimal decorrelated score subsampling for generalized linear models with massive data
1
作者 Junzhuo Gao Lei Wang heng lian 《Science China Mathematics》 SCIE CSCD 2024年第2期405-430,共26页
In this paper, we consider the unified optimal subsampling estimation and inference on the lowdimensional parameter of main interest in the presence of the nuisance parameter for low/high-dimensionalgeneralized linear... In this paper, we consider the unified optimal subsampling estimation and inference on the lowdimensional parameter of main interest in the presence of the nuisance parameter for low/high-dimensionalgeneralized linear models (GLMs) with massive data. We first present a general subsampling decorrelated scorefunction to reduce the influence of the less accurate nuisance parameter estimation with the slow convergencerate. The consistency and asymptotic normality of the resultant subsample estimator from a general decorrelatedscore subsampling algorithm are established, and two optimal subsampling probabilities are derived under theA- and L-optimality criteria to downsize the data volume and reduce the computational burden. The proposedoptimal subsampling probabilities provably improve the asymptotic efficiency of the subsampling schemes in thelow-dimensional GLMs and perform better than the uniform subsampling scheme in the high-dimensional GLMs.A two-step algorithm is further proposed to implement, and the asymptotic properties of the correspondingestimators are also given. Simulations show satisfactory performance of the proposed estimators, and twoapplications to census income and Fashion-MNIST datasets also demonstrate its practical applicability. 展开更多
关键词 A-OPTIMALITY decorrelated score subsampling high-dimensional inference L-optimality massive data
原文传递
A general framework for frequentist model averaging In Celebration of Professor Lincheng Zhao's 75th Birthday 被引量:3
2
作者 Priyam Mitra heng lian +2 位作者 Ritwik Mitra Hua liang Min-ge Xie 《Science China Mathematics》 SCIE CSCD 2019年第2期205-226,共22页
Model selection strategies have been routinely employed to determine a model for data analysis in statistics, and further study and inference then often proceed as though the selected model were the true model that we... Model selection strategies have been routinely employed to determine a model for data analysis in statistics, and further study and inference then often proceed as though the selected model were the true model that were known a priori. Model averaging approaches, on the other hand, try to combine estimators for a set of candidate models. Specifically, instead of deciding which model is the 'right' one, a model averaging approach suggests to fit a set of candidate models and average over the estimators using data adaptive weights.In this paper we establish a general frequentist model averaging framework that does not set any restrictions on the set of candidate models. It broaden, the scope of the existing methodologies under the frequentist model averaging development. Assuming the data is from an unknown model, we derive the model averaging estimator and study its limiting distributions and related predictions while taking possible modeling biases into account.We propose a set of optimal weights to combine the individual estimators so that the expected mean squared error of the average estimator is minimized. Simulation studies are conducted to compare the performance of the estimator with that of the existing methods. The results show the benefits of the proposed approach over traditional model selection approaches as well as existing model averaging methods. 展开更多
关键词 ASYMPTOTIC distribution bias variance trade-off local mis-specification model AVERAGING ESTIMATORS optimal weight selection
原文传递
Variable Selection for Fixed Effects Varying Coefficient Models 被引量:4
3
作者 Gao Rong LI heng lian +1 位作者 Peng LAI heng PENG 《Acta Mathematica Sinica,English Series》 SCIE CSCD 2015年第1期91-110,共20页
We consider the problem of variable selection for the fixed effects varying coefficient models. A variable selection procedure is developed using basis function approximations and group nonconcave penalized functions,... We consider the problem of variable selection for the fixed effects varying coefficient models. A variable selection procedure is developed using basis function approximations and group nonconcave penalized functions, and the fixed effects are removed using the proper weight matrices. The proposed procedure simultaneously removes the fixed individual effects, selects the significant variables and estimates the nonzero coefficient functions. With appropriate selection of the tuning parameters, an asymptotic theory for the resulting estimates is established under suitable conditions. Simulation studies are carried out to assess the performance of our proposed method, and a real data set is analyzed for further illustration. 展开更多
关键词 Varying coefficient model fixed effect variable selection basis function
原文传递
Life History Recorded in the Vagino-cervical Microbiome Along with Multi-omes 被引量:2
4
作者 Zhuye Jie Chen Chen +38 位作者 Lilan Hao Fei Li Liju Song Xiaowei Zhang Jie Zhu Liu Tian Xin Tong Kaiye Cai Zhe Zhang Yanmei Ju Xinlei Yu Ying Li Hongcheng Zhou Haorong Lu Xuemei Qiu Qiang Li Yunli Liao Dongsheng Zhou heng lian Yong Zuo Xiaomin Chen Weiqiao Rao Yan Ren Yuan Wang Jin Zi Rong Wang Na Liu Jinghua Wu Wei Zhang Xiao Liu Yang Zong Weibin Liu liang Xiao Yong Hou Xun Xu Huanming Yang Jian Wang Karsten Kristiansen Huijue Jia 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2022年第2期304-321,共18页
The vagina contains at least a billion microbial cells,dominated by lactobacilli.Here we perform metagenomic shotgun sequencing on cervical and fecal samples from a cohort of 516 Chinese women of reproductive age,as w... The vagina contains at least a billion microbial cells,dominated by lactobacilli.Here we perform metagenomic shotgun sequencing on cervical and fecal samples from a cohort of 516 Chinese women of reproductive age,as well as cervical,fecal,and salivary samples from a second cohort of 632 women.Factors such as pregnancy history,delivery history,cesarean section,and breastfeeding were all more important than menstrual cycle in shaping the microbiome,and such information would be necessary before trying to interpret differences between vagino-cervical microbiome data.Greater proportion of Bifidobacterium breve was seen with older age at sexual debut.The relative abundance of lactobacilli especially Lactobacillus crispatus was negatively associated with pregnancy history.Potential markers for lack of menstrual regularity,heavy flow,dysmenorrhea,and contraceptives were also identified.Lactobacilli were rare during breastfeeding or post-menopause.Other features such as mood fluctuations and facial speckles could potentially be predicted from the vagino-cervical microbiome.Gut and salivary microbiomes,plasma vitamins,metals,amino acids,and hormones showed associations with the vagino-cervical microbiome.Our results offer an unprecedented glimpse into the microbiota of the female reproductive tract and call for international collaborations to better understand its long-term health impact other than in the settings of infection or pre-term birth. 展开更多
关键词 Vagino-cervical microbiome Metagenomic shotgun sequencing Pregnancy history Delivery history BREASTFEEDING
原文传递
Discussion of the paper‘A review of distributed statistical inference’
5
作者 heng lian 《Statistical Theory and Related Fields》 2022年第2期100-101,共2页
The authors should be congratulated on their timely contribution to this emerging field with a compre-hensive review,which will certainly attract more researchers into this area.In the simplest one-shot approach,the e... The authors should be congratulated on their timely contribution to this emerging field with a compre-hensive review,which will certainly attract more researchers into this area.In the simplest one-shot approach,the entire dataset is distributed on multiple machines,and each machine computes a local estimate based on local data only,and a central machine per-forms an aggregation calculation as a final processing step.In more complicated settings,multiple communi-cations are carried out,typically passing also first-order information(gradient)and/or second-order informa-tion(Hession matrix)between local machines and the central machine.This review clearly separates the exist-ing works in this area into several sections,considering parameter regression,nonparametric regression,and other models including principal component analysis and variable screening. 展开更多
关键词 LOCAL typically AGGREGATION
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部