Direct effects testing: A two-stage procedure to test for effect size and variable importance for correlated binary predictors and a binary response

M. Sperrin, T. Jaki

    Research output: Contribution to journalArticlepeer-review

    Abstract

    In applications such as medical statistics and genetics, we encounter situations where a large number of highly correlated predictors explain a response. For example, the response may be a disease indicator and the predictors may be treatment indicators or single nucleotide polymorphisms (SNPs). Constructing a good predictive model in such cases is well studied. Less well understood is how to recover the 'true sparsity pattern', that is finding which predictors have direct effects on the response, and indicating the statistical significance of the results. Restricting attention to binary predictors and response, we study the recovery of the true sparsity pattern using a two-stage method that separates establishing the presence of effects from inferring their exact relationship with the predictors. Simulations and a real data application demonstrate that the method discriminates well between associations and direct effects. Comparisons with lasso-based methods demonstrate favourable performance of the proposed method. Copyright © 2010 John Wiley & Sons, Ltd.
    Original languageEnglish
    Pages (from-to)2544-2556
    Number of pages12
    JournalStatistics in medicine
    Volume29
    Issue number24
    DOIs
    Publication statusPublished - 30 Oct 2010

    Keywords

    • Contingency table
    • Direct effect
    • High dimensional
    • Lasso
    • Noncentral hypergeometric distribution
    • Sparsity

    Fingerprint

    Dive into the research topics of 'Direct effects testing: A two-stage procedure to test for effect size and variable importance for correlated binary predictors and a binary response'. Together they form a unique fingerprint.

    Cite this