Methods for deep examination of DNA
The development of sequencing technology has had a rapid pace during the last years and today, the sequencing instruments harbors enormous capacity. This thesis is about the development of methods to make the most out of this capacity and to use it for various applications.
In paper I, a dual tagging system for sequencing large sample sets was developed. To proof the concept, 4,700 dogs were subjected to amplicon sequencing of the 2nd exon of the gene DLA-DRB1 using a 454 genome sequencer. By using a combination of two tags, 4,992 samples can be analyzed within the same run using only 148 tagging sequences. In our experiment, 95% of the generated PCR-products achieved a sufficient read depth (≥ 20×) for variant calling.
Paper II solved a problem coupled to amplicon sequencing experiments, namely contamination of sequence data from unwanted by-product formation. By hybridizing an oligo nucleotide coupled to a fluorescent dye to the wanted PCR products, fluorescence activated cell sorting (FACS) could be used to enrich the target molecules after emulsion PCR. The resulted in a nearly three-fold increase of quality reads from a sorted library compared to a non-sorted.
Since the cost of sequence data has decreased during the last years, the budget item of library preparation have became more abundant in sequencing experiments. In paper III, library preparation using cheap bulk enzymes was explored regarding various factors affecting process efficiency. Multiplex sequence capture was also evaluated for processing up to eight samples at the cost of one single reaction.
Using circulating tumor DNA for measuring systemic tumor load have been proposed and investigated by several studies. In paper IV, the utility of exome sequencing for this application was assessed. The findings suggests that the ability of detecting low-frequent alleles is insufficient for clinical use where detection levels of < 0.5% is required.
List of scientific papers
I. Neiman M., Lundin S., Savolainen P. and Ahmadian A. Decoding a Substantial Set of Samples in Parallel by Massive Sequencing. PLoS ONE. (2011) 6(3):e17785.
https://doi.org/10.1371/journal.pone.0017785
II. Sandberg J., Neiman M., Ahmadian A. and Lundeberg L. Gene-specific FACS sorting method for target selection in high throughput amplicon sequencing. BMC Genomics. (2010) 11:140.
https://doi.org/10.1186/1471-2164-11-140
III. Neiman M., Sundling S., Grönberg H., Hall P., Czene K., Lindberg J. and Klevebring D. Library Preparation and Multiplex Capture for Massive Parallel Sequencing Applications Made Efficient and Easy. PLoS ONE. (2012) 7(11):e48616.
https://doi.org/10.1371/journal.pone.0048616
IV. Neiman M., Klevebring D., Wiklund F., Sundling S., Wiklund P., Egevad L., Grönberg H. and Lindberg J. Exome sequencing of cell-free plasma DNA in prostate cancer patients. [Manuscript]
History
Defence date
2013-05-30Department
- Department of Medical Epidemiology and Biostatistics
Publisher/Institution
Karolinska InstitutetMain supervisor
Grönberg, HenrikPublication year
2013Thesis type
- Doctoral thesis
ISBN
978-91-7549-173-8Number of supporting papers
4Language
- eng