Karolinska Institutet
Browse

Statistical methods for twin and sibling designs

Download (404.93 kB)
thesis
posted on 2024-09-02, 21:45 authored by Johan Zetterqvist

Twin and sibling studies are valuable in that they allow adjustment for potential confounding factors that are impossible or hard to measure. By measuring associations ‘within-cluster’ it is possible to adjust for many factors that are shared between individuals in the same cluster.

Using Swedish national registers, it is possible to obtain information about a large number of potential confounders. While this gives medical researchers great opportunities to control for confounding, it also increases the risk of model misspecification leading to biased estimates. One strategy to reduce the risk of such bias is to use doubly robust(DR) estimation. In DR estimation two working models are combined in such a way that the resulting estimate will remain asymptotically unbiased when one of the models is misspecified.

In study I, we implement existing DR estimators for parameters in linear, log-linear and logistic regression models in the R package drgee. In study II, we propose a new class of DR estimators for ‘within-cluster’ association measures in linear and log-linear regression models. In study III we propose a DR estimator for the ‘within-cluster’ log odds ratio parameter in logistic regression models. The estimators proposed in studies II and III are also implemented in the R package drgee.

In study IV, we discuss what shared factors the ‘within-cluster’ association actually is adjusted for. Using the formal theory of causal diagrams we demonstrate that the standard methods for estimating ‘within-cluster’ association parameters implicitly adjust for shared confounders, shared mediators, but not shared colliders. Therefore, the estimated parameter may have a causal interpretation as a direct effect, i.e. as the part of the causal effect that is not mediated through shared factors.

List of scientific papers

I. Johan Zetterqvist and Arvid Sjölander. Doubly robust estimation with the R package drgee. Epidemiologic Methods. 4(1):69–86, 2015.
https://doi.org/10.1515/em-2014-0021

II. Johan Zetterqvist, Stijn Vansteelandt, Yudi Pawitan, and Arvid Sjölander. Doubly robust methods for handling confounding by cluster. Biostatistics. 17(2):264–276, 2016.
https://doi.org/10.1093/biostatistics/kxv041

III. Johan Zetterqvist, Karel Vermeulen, Stijn Vansteelandt, and Arvid Sjölander. Doubly robust conditional logistic regression. [Submitted]

IV. Arvid Sjölander and Johan Zetterqvist. Confounding, mediation and colliding in conditional maximum likelihood estimation. Epidemiology. [Accepted]
https://doi.org/10.1097/EDE.0000000000000649

History

Defence date

2017-05-11

Department

  • Department of Medical Epidemiology and Biostatistics

Publisher/Institution

Karolinska Institutet

Main supervisor

Sjölander, Arvid

Co-supervisors

Pawitan, Yudi; Lichtenstein, Paul; Larsson, Henrik

Publication year

2017

Thesis type

  • Doctoral thesis

ISBN

978-91-7676-670-5

Number of supporting papers

4

Language

  • eng

Original publication date

2017-04-20

Author name in thesis

Zetterqvist, Johan

Original department name

Department of Medical Epidemiology and Biostatistics

Place of publication

Stockholm

Usage metrics

    Theses

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC