Karolinska Institutet
Browse

Bioinformatic methods in rare disease genomics

Download (1.16 MB)
thesis
posted on 2024-09-03, 04:37 authored by Måns Magnusson

The larger goal of medical genetics is to map genotype to phenotype and to understand how genomic variation affects human health. In the field of rare disease genomics, there is a mendelian assumption that states: one disease one variant. This is simplified and means that when we observe the phenotype of a rare disease patient, we suspect that there is one or two genetic variations in one gene that cause the disease. It might sound like a simple problem to solve at first, especially compared to other fields in genomics, such as cancer and common disease where multiple loci, unrelated, together are expected to cause the biological state. However, it can be a daunting task to find this variant among the handful of million variants that each human individual is carrying in the genome. This thesis is focused on the problem of finding the causative variants in patients with suspected rare inherited disorders even though some of the tools and methods are applicable in other areas as well.

Many challenges arise in the sequencing analysis as the amount of data grows, requiring development of novel methods and algorithms to enable handling and interpretation of the massive amounts of data. Hundreds of millions of short sequence reads are produced for a single individual in a whole genome sequencing experiment. These are mapped to a reference genome and the positions and regions that differ from the reference are identified or “called” as variants. The variants are annotated with as much relevant information as possible, so that prediction algorithms and humans can determine which variant or small number of variants among the millions identified that are pathogenic in a particular genomic or phenotypic context.

This thesis was created in parallel with the process of establishing a genomics platform in the Stockholm region, to provide the hospitals with state-of-the-art genome analysis. The tools and methods that were developed during these years were implemented and tested in a production setting immediately. In this thesis work I will illustrate the field of Clinical Genomics from different perspectives, from the components of a rare disease analysis pipeline to the integration of whole genome sequencing in a clinical setting via a close-up case study.

List of scientific papers

I. MultiQC: summarize analysis results for multiple tools and samples in a single report. Ewels P, Magnusson M, Lundin S, Käller M. Bioinformatics. 2016 Oct 1;32(19):3047-8.
https://doi.org/10.1093/bioinformatics/btw354

II. Loqusdb: added value of an observations database of local genomic variation. Magnusson M, Eisfeldt J, Nilsson D, Rosenbaum A, Wirta V, Lindstrand A, Wedell A, Stranneheim H. BMC Bioinformatics. 2020 Jul 1;21(1):273.
https://doi.org/10.1186/s12859-020-03609-z

III. Integration of whole genome sequencing into a healthcare setting: high diagnostic rates across multiple clinical entities in 3219 rare disease patients. Stranneheim H*, Lagerstedt-Robinson K*, Magnusson M, Kvarnung M, Nilsson D, Lesko N, Engvall M, Anderlid BM, Arnell H, Johansson CB, Barbaro M, Björck E, Bruhn H, Eisfeldt J, Freyer C, Grigelioniene G, Gustavsson P, Hammarsjö A, Hellström-Pigg M, Iwarsson E, Jemt A, Laaksonen M, Enoksson SL, Malmgren H, Naess K, Nordenskjöld M, Oscarson M, Pettersson M, Rasi C, Rosenbaum A, Sahlin E, Sardh E, Stödberg T, Tesi B, Tham E, Thonberg H, Töhönen V, von Döbeln U, Vassiliou D, Vonlanthen S, Wikström AC, Wincent J, Winqvist O, Wredenberg A, Ygberg S, Zetterström RH, Marits P, Soller MJ, Nordgren A, Wirta V, Lindstrand A†, Wedell A†. Genome Med. 2021 Mar 17;13(1):40. *Shared first authorship, †Shared senior authorship.
https://doi.org/10.1186/s13073-021-00855-5

IV. SLC12A2 mutations cause NKCC1 deficiency with encephalopathy and impaired secretory epithelia. Stödberg T*, Magnusson M*, Lesko N, Wredenberg A, Martin Munoz D, Stranneheim H, Wedell A. Neurol Genet. 2020 Jul 2;6(4):e478. *Shared first authorship.
https://doi.org/10.1212/NXG.0000000000000478

History

Defence date

2021-10-22

Department

  • Department of Molecular Medicine and Surgery

Publisher/Institution

Karolinska Institutet

Main supervisor

Wedell, Anna

Co-supervisors

Stranneheim, Henrik; Nilsson, Daniel

Publication year

2021

Thesis type

  • Doctoral thesis

ISBN

978-91-8016-276-0

Number of supporting papers

4

Language

  • eng

Original publication date

2021-09-22

Author name in thesis

Magnusson, Måns

Original department name

Department of Molecular Medicine and Surgery

Place of publication

Stockholm

Usage metrics

    Theses

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC