Genomic feature identification in trypanosomatid parasites
The trypanosomatid parasites cause death and suffering, among humans as well as livestock. Current drugs lack efficacy and cause severe side effects, and no vaccines are available. Increased knowledge of the biology of the parasites is vital for the development of new drugs. Research on these ancient eukaryotes has also already led to the discovery of mechanisms of broader relevance, such as RNA editing, trans splicing and antigenic variation.
Post-transcriptional regulation is an important part of the regulatory networks of most higher organisms, including humans. In the kinetoplastids, only a very limited part of the control of gene expression is exerted at the transcriptional level. Genes are expressed as long polycistronic pre-mRNA, and individual messages are formed by trans splicing and polyadenylation. Even genes that are not coregulated can be on the same polycistronic pre-mRNA. The trypanosomatids can be regarded as models for post-transcriptional regulation, in relation to the more complex eukaryotes.
The progress of the human and other genome projects shows the opportunity provided by a complete genomic sequence to increase the efficiency of traditional molecular biology. Use of computer-aided and fully automated genome sequence analysis tools allows novel feature discovery as well as the direction of hypothesis driven experiments.
We have sequenced the genome of Trypanosoma cruzi as part of a three-centre collaboration, and provided an extensive annotation that identifies biologically interesting features. To this end we have used available informatics tools where possible, and developed some new programs. Focus was on integrating current molecular biology knowledge in large scale analyses, and arriving at experimentally testable hypotheses.
This thesis is based on five papers (I-V). Paper I describes a program for gene-finding and annotation that we constructed for the annotation of the genome, described in paper III. Here we collaborated with experts in several areas to investigate the gene content of T. cruzi. In paper II we present global base skew features in the genome. In paper IV we describe a model of trans splicing in Trypanosoma brucei, and the application of it at the genome level. In paper V, we apply the trans splice model to predict message boundaries in Trypanosoma cruzi, and based on these predictions, we find that upstream open reading frames are common. We hypothesise that these generally repress translation.
List of scientific papers
I. Nilsson D, Andersson B (2004). A graphical tool for parasite genome annotation. Comput Methods Programs Biomed. 73(1): 55-60.
https://pubmed.ncbi.nlm.nih.gov/14715167
II. Nilsson D, Andersson B (2005). Strand asymmetry patterns in trypanosomatid parasites. Exp Parasitol. 109(3): 143-9.
https://pubmed.ncbi.nlm.nih.gov/15713445
III. El-Sayed NM, Myler PJ, Bartholomeu DC, Nilsson D, Aggarwal G, Tran AN, Ghedin E, Worthey EA, Delcher AL, Blandin G, Westenberger SJ, et. al (2005). The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas disease. Science. 309(5733): 409-15.
https://pubmed.ncbi.nlm.nih.gov/16020725
IV. Benz C, Nilsson D, Andersson B, Clayton C, Guilbride DL (2005). Messenger RNA processing sites in Trypanosoma brucei. Mol Biochem Parasitol. 143(2): 125-34.
https://pubmed.ncbi.nlm.nih.gov/15993496
V. Nilsson D, Tran AN, Ferella M, Eklund J, Wang F, Potenza M, Andersson B (2006). Kinetoplastid parasite trans splice site predictions reveal translational control by uAUGs. [Manuscript]
History
Defence date
2006-06-08Department
- Department of Cell and Molecular Biology
Publisher/Institution
Karolinska InstitutetPublication year
2006Thesis type
- Doctoral thesis
ISBN-10
91-7140-789-8Number of supporting papers
5Language
- eng