Statistical genetic analysis of family data
The importance of genetic determinants and risk factors of diseases has been consistently recognized in genetic epidemiology, which is one of the fastest growing areas in genomic medicine. Familial clustering is a common characteristic of genetically related phenotypes, providing vital insights into the etiology of diseases by establishing the relative contribution of genetic and environmental factors. The availability of family data has opened up new opportunities for studying genetic and environmental contributions to diverse diseases. Family data not only overcome the limited statistical power common in twin data analysis, but also enhance the breadth of genetic information.
The generalized linear mixed model has provided a central conceptual framework that allows estimation of the genetic and environmental contributions with adjustment for various epidemiological risk factors. However, estimation often requires high-dimensional integration to integrate out the random effects, and in the models that we considered this is generally analytically intractable. Since we have to deal with large datasets with sparse binary outcomes, computation has been another stumbling block in the analysis of realistic models.
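To make the source of the intractability concrete, the marginal likelihood contribution of a single family can be sketched as follows. The notation is assumed here for illustration and is not taken from the thesis: y_{ij} is the binary trait of member j in family i, x_{ij} the covariates, u_i the vector of random effects, g a link function such as the probit or logit, K_i a kinship matrix and C_i an indicator matrix for shared environment.

```latex
\begin{align}
  g\{P(y_{ij} = 1 \mid u_i)\} &= x_{ij}^{\top}\beta + u_{ij},
    \qquad u_i \sim N\bigl(0, \Sigma_i(\theta)\bigr), \\
  \Sigma_i(\theta) &= \sigma_g^{2}\, 2K_i + \sigma_c^{2}\, C_i,
    \qquad \theta = (\sigma_g^{2}, \sigma_c^{2}), \\
  L_i(\beta, \theta) &= \int_{\mathbb{R}^{n_i}}
    \prod_{j=1}^{n_i} p_{ij}^{\,y_{ij}} (1 - p_{ij})^{1 - y_{ij}}\,
    \phi\bigl(u_i; 0, \Sigma_i(\theta)\bigr)\, du_i,
    \qquad p_{ij} = g^{-1}\bigl(x_{ij}^{\top}\beta + u_{ij}\bigr).
\end{align}
```

Under these assumptions the integral has dimension n_i, the number of members in family i, and in general has no closed form, so approximate or numerical methods are required.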
This thesis focuses on the analysis of population-based family data, with applications to cancer, perinatal diseases and psychiatric disorders. We have closely investigated the marginal and hierarchical-likelihood approaches, and also considered ascertainment approaches for both binary traits and age-at-onset traits. We demonstrate that the newly developed methodologies for the analysis of family data are highly flexible and allow straightforward handling of covariates.
List of scientific papers
I. Noh M, Yip B, Lee Y, Pawitan Y (2006). Multicomponent variance estimation for binary traits in family-based studies. Genet Epidemiol. 30(1): 37-47
https://pubmed.ncbi.nlm.nih.gov/16265627
II. Yip BH, Björk C, Lichtenstein P, Hultman CM, Pawitan Y (2008). Covariance component models for multivariate binary traits in family data analysis. Stat Med. 27(7): 1086-105
https://pubmed.ncbi.nlm.nih.gov/17634971
III. Yip BH, Reilly M, Cnattingius S, Pawitan Y (2008). Matched ascertainment of informative families for complex genetic modelling. [Submitted]
IV. Yip BH, Moger TA, Pawitan Y (2008). Genetic analysis of age-at-onset traits based on case-control family data. [Submitted]
History
Defence date
2008-09-12
Department
- Department of Medical Epidemiology and Biostatistics
Publication year
2008
Thesis type
- Doctoral thesis
ISBN
978-91-7409-137-3
Number of supporting papers
4
Language
- eng