Bioinformatic methods in rare disease genomics
The larger goal of medical genetics is to map genotype to phenotype and to understand how genomic variation affects human health. In the field of rare disease genomics, there is a mendelian assumption that states: one disease one variant. This is simplified and means that when we observe the phenotype of a rare disease patient, we suspect that there is one or two genetic variations in one gene that cause the disease. It might sound like a simple problem to solve at first, especially compared to other fields in genomics, such as cancer and common disease where multiple loci, unrelated, together are expected to cause the biological state. However, it can be a daunting task to find this variant among the handful of million variants that each human individual is carrying in the genome. This thesis is focused on the problem of finding the causative variants in patients with suspected rare inherited disorders even though some of the tools and methods are applicable in other areas as well.
Many challenges arise in the sequencing analysis as the amount of data grows, requiring development of novel methods and algorithms to enable handling and interpretation of the massive amounts of data. Hundreds of millions of short sequence reads are produced for a single individual in a whole genome sequencing experiment. These are mapped to a reference genome and the positions and regions that differ from the reference are identified or “called” as variants. The variants are annotated with as much relevant information as possible, so that prediction algorithms and humans can determine which variant or small number of variants among the millions identified that are pathogenic in a particular genomic or phenotypic context.
This thesis was created in parallel with the process of establishing a genomics platform in the Stockholm region, to provide the hospitals with state-of-the-art genome analysis. The tools and methods that were developed during these years were implemented and tested in a production setting immediately. In this thesis work I will illustrate the field of Clinical Genomics from different perspectives, from the components of a rare disease analysis pipeline to the integration of whole genome sequencing in a clinical setting via a close-up case study.
List of scientific papers
I. MultiQC: summarize analysis results for multiple tools and samples in a single report. Ewels P, Magnusson M, Lundin S, Käller M. Bioinformatics. 2016 Oct 1;32(19):3047-8.
https://doi.org/10.1093/bioinformatics/btw354
II. Loqusdb: added value of an observations database of local genomic variation. Magnusson M, Eisfeldt J, Nilsson D, Rosenbaum A, Wirta V, Lindstrand A, Wedell A, Stranneheim H. BMC Bioinformatics. 2020 Jul 1;21(1):273.
https://doi.org/10.1186/s12859-020-03609-z
III. Integration of whole genome sequencing into a healthcare setting: high diagnostic rates across multiple clinical entities in 3219 rare disease patients. Stranneheim H*, Lagerstedt-Robinson K*, Magnusson M, Kvarnung M, Nilsson D, Lesko N, Engvall M, Anderlid BM, Arnell H, Johansson CB, Barbaro M, Björck E, Bruhn H, Eisfeldt J, Freyer C, Grigelioniene G, Gustavsson P, Hammarsjö A, Hellström-Pigg M, Iwarsson E, Jemt A, Laaksonen M, Enoksson SL, Malmgren H, Naess K, Nordenskjöld M, Oscarson M, Pettersson M, Rasi C, Rosenbaum A, Sahlin E, Sardh E, Stödberg T, Tesi B, Tham E, Thonberg H, Töhönen V, von Döbeln U, Vassiliou D, Vonlanthen S, Wikström AC, Wincent J, Winqvist O, Wredenberg A, Ygberg S, Zetterström RH, Marits P, Soller MJ, Nordgren A, Wirta V, Lindstrand A†, Wedell A†. Genome Med. 2021 Mar 17;13(1):40. *Shared first authorship, †Shared senior authorship.
https://doi.org/10.1186/s13073-021-00855-5
IV. SLC12A2 mutations cause NKCC1 deficiency with encephalopathy and impaired secretory epithelia. Stödberg T*, Magnusson M*, Lesko N, Wredenberg A, Martin Munoz D, Stranneheim H, Wedell A. Neurol Genet. 2020 Jul 2;6(4):e478. *Shared first authorship.
https://doi.org/10.1212/NXG.0000000000000478
History
Defence date
2021-10-22Department
- Department of Molecular Medicine and Surgery
Publisher/Institution
Karolinska InstitutetMain supervisor
Wedell, AnnaCo-supervisors
Stranneheim, Henrik; Nilsson, DanielPublication year
2021Thesis type
- Doctoral thesis
ISBN
978-91-8016-276-0Number of supporting papers
4Language
- eng