1
|
Hoyos Sanchez MC, Ospina Zapata HS, Suarez BD, Ospina C, Barbosa HJ, Carranza Martinez JC, Vallejo GA, Urrea Montes D, Duitama J. A phased genome assembly of a Colombian Trypanosoma cruzi TcI strain and the evolution of gene families. Sci Rep 2024; 14:2054. [PMID: 38267502 PMCID: PMC10808112 DOI: 10.1038/s41598-024-52449-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 01/18/2024] [Indexed: 01/26/2024] Open
Abstract
Chagas is an endemic disease in tropical regions of Latin America, caused by the parasite Trypanosoma cruzi. High intraspecies variability and genome complexity have been challenges to assemble high quality genomes needed for studies in evolution, population genomics, diagnosis and drug development. Here we present a chromosome-level phased assembly of a TcI T. cruzi strain (Dm25). While 29 chromosomes show a large collinearity with the assembly of the Brazil A4 strain, three chromosomes show both large heterozygosity and large divergence, compared to previous assemblies of TcI T. cruzi strains. Nucleotide and protein evolution statistics indicate that T. cruzi Marinkellei separated before the diversification of T. cruzi in the known DTUs. Interchromosomal paralogs of dispersed gene families and histones appeared before but at the same time have a more strict purifying selection, compared to other repeat families. Previously unreported large tandem arrays of protein kinases and histones were identified in this assembly. Over one million variants obtained from Illumina reads aligned to the primary assembly clearly separate the main DTUs. We expect that this new assembly will be a valuable resource for further studies on evolution and functional genomics of Trypanosomatids.
Collapse
Affiliation(s)
- Maria Camila Hoyos Sanchez
- Systems and Computing Engineering Department, Universidad de los Andes, Bogotá, Colombia
- School of Veterinary Medicine, Texas Tech University, Amarillo, TX, 79106, USA
| | | | - Brayhan Dario Suarez
- Laboratorio de Investigaciones en Parasitología Tropical (LIPT), Universidad del Tolima, Ibagué, Colombia
| | - Carlos Ospina
- Laboratorio de Investigaciones en Parasitología Tropical (LIPT), Universidad del Tolima, Ibagué, Colombia
| | - Hamilton Julian Barbosa
- Laboratorio de Investigaciones en Parasitología Tropical (LIPT), Universidad del Tolima, Ibagué, Colombia
| | | | - Gustavo Adolfo Vallejo
- Laboratorio de Investigaciones en Parasitología Tropical (LIPT), Universidad del Tolima, Ibagué, Colombia
| | - Daniel Urrea Montes
- Laboratorio de Investigaciones en Parasitología Tropical (LIPT), Universidad del Tolima, Ibagué, Colombia
| | - Jorge Duitama
- Systems and Computing Engineering Department, Universidad de los Andes, Bogotá, Colombia.
| |
Collapse
|
2
|
Berná L, Greif G, Pita S, Faral-Tello P, Díaz-Viraqué F, Souza RDCMD, Vallejo GA, Alvarez-Valin F, Robello C. Maxicircle architecture and evolutionary insights into Trypanosoma cruzi complex. PLoS Negl Trop Dis 2021; 15:e0009719. [PMID: 34437557 PMCID: PMC8425572 DOI: 10.1371/journal.pntd.0009719] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 09/08/2021] [Accepted: 08/10/2021] [Indexed: 12/13/2022] Open
Abstract
We sequenced maxicircles from T. cruzi strains representative of the species evolutionary diversity by using long-read sequencing, which allowed us to uncollapse their repetitive regions, finding that their real lengths range from 35 to 50 kb. T. cruzi maxicircles have a common architecture composed of four regions: coding region (CR), AT-rich region, short (SR) and long repeats (LR). Distribution of genes, both in order and in strand orientation are conserved, being the main differences the presence of deletions affecting genes coding for NADH dehydrogenase subunits, reinforcing biochemical findings that indicate that complex I is not functional in T. cruzi. Moreover, the presence of complete minicircles into maxicircles of some strains lead us to think about the origin of minicircles. Finally, a careful phylogenetic analysis was conducted using coding regions of maxicircles from up to 29 strains, and 1108 single copy nuclear genes from all of the DTUs, clearly establishing that taxonomically T. cruzi is a complex of species composed by group 1 that contains clades A (TcI), B (TcIII) and D (TcIV), and group 2 (1 and 2 do not coincide with groups I and II described decades ago) containing clade C (TcII), being all hybrid strains of the BC type. Three variants of maxicircles exist in T. cruzi: a, b and c, in correspondence with clades A, B, and C from mitochondrial phylogenies. While A and C carry maxicircles a and c respectively, both clades B and D carry b maxicircle variant; hybrid strains also carry the b- variant. We then propose a new nomenclature that is self-descriptive and makes use of both the phylogenetic relationships and the maxicircle variants present in T. cruzi.
Collapse
Affiliation(s)
- Luisa Berná
- Laboratorio de Interacciones Hospedero-Patógeno, Unidad de Biología Molecular, Institut Pasteur de Montevideo, Montevideo, Uruguay
- Sección Biomatemática—Unidad de Genómica Evolutiva, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| | - Gonzalo Greif
- Laboratorio de Interacciones Hospedero-Patógeno, Unidad de Biología Molecular, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Sebastián Pita
- Laboratorio de Interacciones Hospedero-Patógeno, Unidad de Biología Molecular, Institut Pasteur de Montevideo, Montevideo, Uruguay
- Sección Genética, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| | - Paula Faral-Tello
- Laboratorio de Interacciones Hospedero-Patógeno, Unidad de Biología Molecular, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Florencia Díaz-Viraqué
- Laboratorio de Interacciones Hospedero-Patógeno, Unidad de Biología Molecular, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | | | - Gustavo Adolfo Vallejo
- Laboratorio de investigaciones en Parasitología Tropical (LIPT), Facultad de Ciencias, Universidad del Tolima, Tolima, Colombia
| | - Fernando Alvarez-Valin
- Sección Biomatemática—Unidad de Genómica Evolutiva, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| | - Carlos Robello
- Laboratorio de Interacciones Hospedero-Patógeno, Unidad de Biología Molecular, Institut Pasteur de Montevideo, Montevideo, Uruguay
- Departamento de Bioquímica, Facultad de Medicina, Universidad de la República, Montevideo, Uruguay
- * E-mail:
| |
Collapse
|