摘要
本文目的是介绍复杂抽样调查设计多值名义资料一水平多重logistic回归模型构建,并探讨不同策略之间的差异。采用SAS中的LOGISTIC过程和SURVEYLOGISTIC过程,分别按照是否考虑抽样设计与是否考虑抽样权重共4种分析策略对数据构建广义logistic回归模型,并比较结果。不同分析策略所得结果显示,不仅参数估计值、回归系数标准误、OR值及其置信区间的估计值有所差别,而且对纳入模型的解释变量也有影响。因此,在对复杂抽样调查设计多值名义资料构建广义logistics回归模型时,既要考虑抽样设计,又要兼顾抽样权重,否则即使样本量足够大,也会导致错误的推断结论。
The purpose of this article was to introduce the construction of multiple logistic regression models with multi-value nominal data collected from the complex sampling survey design,and to explore the differences between different strategies.Using the LOGISTIC procedure and the SURVEYLOGISTIC procedure in SAS software,generalized logistics regression models were constructed based on whether the sampling design or the sampling weights were considered,and the results were compared.The results obtained by different analysis strategies showed that not only the values of parameter estimation,the standard error of the regression coefficients,the OR value and its confidence intervals were different,but also the explanatory variables in the established models were also different.When constructing a generalized logistics regression model for multi-value nominal data of complex sampling design,both the sampling design and the sampling weights should be considered.Otherwise,even if the sample size was large enough,it would lead to the erroneous inference conclusions.
作者
刘媛媛
李长平
胡良平
Liu Yuanyuan;Li Changping;Hu Liangping(Department of Health Statistics,School of Public Health,Tianjin Medical University,Tianjin 300070,China;Specialty Committee of Clinical Scientific Research Statistics of World Federation of Chinese Medicine Societies,Beijing 100029,China;Graduate School,Academy of Military Sciences PLA China,Beijing 100850,China)
出处
《四川精神卫生》
2019年第6期490-494,共5页
Sichuan Mental Health
基金
国家高技术研究发展计划课题资助(2015AA020102)
国家自然科学基金项目(81803333)