TY - JOUR
T1 - Variable selection in joint modelling of the mean and variance for hierarchical data
AU - Charalambous, Christiana
AU - Pan, Jianxin
AU - Tranmer, Mark
N1 - Funding for the project was provided by the Engineering and Physical Sciences Research Council (EPSRC) and Economic and Social Research Council (ESRC) collaborative PhD scholarship.
PY - 2014/11/26
Y1 - 2014/11/26
N2 - We propose to extend the use of penalized likelihood variable selection to hierarchical generalized linear models (HGLMs) for jointly modelling the mean and variance structures. We assume a two-level hierarchical data structure, with subjects nested within groups. A generalized linear mixed model (GLMM) is fitted for the mean, with a structured dispersion in the form of a generalized linear model (GLM) for the between-group variation. To do variable selection, we use the smoothly clipped absolute deviation (SCAD) penalty, which simultaneously shrinks the coefficients of redundant variables to 0 and estimates the coefficients of the remaining important covariates. We run simulation studies and real data analysis for the joint mean–variance models, to assess the performance of the proposed procedure against a similar process which excludes variable selection. The results indicate that our method can successfully identify the zero/non-zero components in our models and can also significantly improve the efficiency of the resulting penalized estimates.
AB - We propose to extend the use of penalized likelihood variable selection to hierarchical generalized linear models (HGLMs) for jointly modelling the mean and variance structures. We assume a two-level hierarchical data structure, with subjects nested within groups. A generalized linear mixed model (GLMM) is fitted for the mean, with a structured dispersion in the form of a generalized linear model (GLM) for the between-group variation. To do variable selection, we use the smoothly clipped absolute deviation (SCAD) penalty, which simultaneously shrinks the coefficients of redundant variables to 0 and estimates the coefficients of the remaining important covariates. We run simulation studies and real data analysis for the joint mean–variance models, to assess the performance of the proposed procedure against a similar process which excludes variable selection. The results indicate that our method can successfully identify the zero/non-zero components in our models and can also significantly improve the efficiency of the resulting penalized estimates.
KW - Generalized linear mixed models
KW - H-likelihood
KW - Mean-covariance modelling
KW - Multilevel data
KW - Smoothly clipped absolute deviation penalty
U2 - 10.1177/1471082X13520424
DO - 10.1177/1471082X13520424
M3 - Article
VL - 15
SP - 1
EP - 27
JO - Statistical Modelling
JF - Statistical Modelling
SN - 1477-0342
IS - 1
ER -