Karolinska Institutet
Browse

Truncation and missing family links in population-based registers

Download (1.36 MB)
thesis
posted on 2024-09-03, 01:47 authored by Monica Leu

Studies of familial aggregation of disease routinely use linked population registers to construct retrospective cohorts. Although such resources have provided numerous estimates of familial risk, little is known regarding the sensitivity of the estimates to assumed disease models and incompleteness of the data, such as truncation and/or missing family links. Furthermore, there are no standard tools for testing the validity of estimates from standard epidemiologic designs using register data.

We first introduce a software package, Poplab for simulating realistic populations of related individuals, with complete family information, using easily available vital statistics (fertility and mortality rates) and disease incidence rates. Incidence events are influenced by the familial model of disease, and other disease-related features such as value of familial association, case mortality ratio or biological relationship of aggregation.

We illustrate by mimicking the Swedish population evolving dynamically over the calendar period 1955 - 2002, with female breast cancer aggregating in families. The simulated population agrees well on important demographic features with the real population, and the simulated parameters of familial aggregation are faithfully recovered.

Next, virtual populations are used to investigate the impact of left-truncation of family history and missing family links due to death on familial risk estimates. The missing familial links had no effect, except when there was differential mortality for familial and non-familial cases. Bias due to left-truncation is most pronounced for high familial risks and for registers with a short life-span. The age distribution of disease and the magnitude of background incidence rates also affected the magnitude of bias.

In the third study, we develop a method for correcting the bias in familial risk estimates due to left-truncation. The required sensitivity of exposure is estimated from virtual populations, and was found non-differential for cases and healthy individuals. In all the situations studied, the bias-corrected estimates are in excellent agreement with the true values.

In our last study, we use the bias-correction methodology to evaluate the bias in the apparent familial risks due to left-truncation for the common cancer sites in Sweden. The study cohorts are based on the Swedish MultiGeneration Register linked to the Swedish Cancer Register for the period 1961-2002. We found that corrected age-group specific and overall estimates of the familial risks for colorectum, lung, breast and prostate cancer were close to the apparent relative risks, with overall values of 1.99 95%CI (1.85, 2.14), 2.05 (1.86, 2.26), 1.84 (1.76, 1.92) and 2.33 (2.19, 2.48), respectively. For melanoma, the apparent estimate, 2.68 (2.35, 3.07) was somewhat smaller than the corrected estimate, 3.18 (2.73, 3.64), and was dramatically different with the exposure defined as a parent affected at a younger age, thus changing from 4.07 (3.21, 5.16) to 5.67 (4.51, 6.83) after correction.

In conclusion, we found that left-truncation induces bias in familial risk estimates especially for high values of risk. For common cancers, analyses of the Swedish MultiGeneration cohorts produced generally unbiased familial risk estimates, which was in agreement with our expectations for diseases with older ages of onset and familial risks of relatively low magnitude. However, where the exposure of interest is early age of onset in a parent, commonly considered to be an indication of genetically determined cancers, estimates may be biased, especially where familial risk is high. Our simulation method can correct for such biases and offers a feasible alternative to the use of validation samples.

List of scientific papers

I. Leu M, Czene K, Reilly M (2007). ""Population lab": the creation of virtual populations for genetic epidemiology research." Epidemiology 18(4): 433-40
https://pubmed.ncbi.nlm.nih.gov/17486019

II. Leu M, Czene K, Reilly M (2007). "The impact of truncation and missing family links in population-based registers on familial risk estimates." Am J Epidemiol 166(12): 1461-7. Epub 2007 Sep 17
https://pubmed.ncbi.nlm.nih.gov/17878173

III. Leu M, Czene K, Reilly M (2008). "Bias-correction of estimates of familial risk from population based registers." (Submitted)

IV. Leu M, Reilly M, Czene K (2008). "Evaluation of bias in familial risk estimates: a study of common cancers using Swedish population-based registers." (Submitted)

History

Defence date

2008-03-28

Department

  • Department of Medical Epidemiology and Biostatistics

Publisher/Institution

Karolinska Institutet

Publication year

2008

Thesis type

  • Doctoral thesis

ISBN

978-91-7357-552-2

Number of supporting papers

4

Language

  • eng

Original publication date

2008-03-07

Author name in thesis

Leu, Monica

Original department name

Department of Medical Epidemiology and Biostatistics

Place of publication

Stockholm

Usage metrics

    Theses

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC