1
|
Radío S, Fort RS, Garat B, Sotelo-Silveira J, Smircich P. UTRme: A Scoring-Based Tool to Annotate Untranslated Regions in Trypanosomatid Genomes. Front Genet 2018; 9:671. [PMID: 30619487 PMCID: PMC6305552 DOI: 10.3389/fgene.2018.00671] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Accepted: 12/04/2018] [Indexed: 11/23/2022] Open
Abstract
Most signals involved in post-transcriptional regulatory networks are located in the untranslated regions (UTRs) of the mRNAs. Therefore, to deepen our understanding of gene expression regulation, delimitation of these regions with high accuracy is needed. The trypanosomatid lineage includes a variety of parasitic protozoans causing a significant worldwide burden on human health. Given their peculiar mechanisms of gene expression, these organisms depend on post-transcriptional regulation as the main level of gene expression control. In this context, the definition of the UTR regions becomes of key importance. We have developed UTR-mini-exon (UTRme), a graphical user interface (GUI) stand-alone application to identify and annotate 5′ and 3′ UTR regions in a highly accurate way. UTRme implements a multiple scoring system tailored to address the issue of false positive UTR assignment that frequently arise because of the characteristics of the intergenic regions. Even though it was developed for trypanosomatids, the tool can be used to predict 3′ sites in any eukaryote and 5′ UTRs in any organism where trans-splicing occurs (such as the model organism C. elegans). UTRme offers a way for non-bioinformaticians to precisely determine UTRs from transcriptomic data. The tool is freely available via the conda and github repositories.
Collapse
Affiliation(s)
- Santiago Radío
- Department of Genomics, Instituto de Investigaciones Biológicas Clemente Estable, MEC, Montevideo, Uruguay.,Laboratory of Molecular Interactions, Facultad de Ciencias. Universidad de la República, Montevideo, Uruguay
| | - Rafael Sebastián Fort
- Department of Genomics, Instituto de Investigaciones Biológicas Clemente Estable, MEC, Montevideo, Uruguay.,Laboratory of Molecular Interactions, Facultad de Ciencias. Universidad de la República, Montevideo, Uruguay
| | - Beatriz Garat
- Laboratory of Molecular Interactions, Facultad de Ciencias. Universidad de la República, Montevideo, Uruguay
| | - José Sotelo-Silveira
- Department of Genomics, Instituto de Investigaciones Biológicas Clemente Estable, MEC, Montevideo, Uruguay.,Department of Cell and Molecular Biology, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| | - Pablo Smircich
- Department of Genomics, Instituto de Investigaciones Biológicas Clemente Estable, MEC, Montevideo, Uruguay.,Laboratory of Molecular Interactions, Facultad de Ciencias. Universidad de la República, Montevideo, Uruguay
| |
Collapse
|
2
|
Chávez S, Eastman G, Smircich P, Becco LL, Oliveira-Rizzo C, Fort R, Potenza M, Garat B, Sotelo-Silveira JR, Duhagon MA. Transcriptome-wide analysis of the Trypanosoma cruzi proliferative cycle identifies the periodically expressed mRNAs and their multiple levels of control. PLoS One 2017; 12:e0188441. [PMID: 29182646 PMCID: PMC5705152 DOI: 10.1371/journal.pone.0188441] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Accepted: 11/07/2017] [Indexed: 12/02/2022] Open
Abstract
Trypanosoma cruzi is the protozoan parasite causing American trypanosomiasis or Chagas disease, a neglected parasitosis with important human health impact in Latin America. The efficacy of current therapy is limited, and its toxicity is high. Since parasite proliferation is a fundamental target for rational drug design, we sought to progress into its understanding by applying a genome-wide approach. Treating a TcI linage strain with hydroxyurea, we isolated epimastigotes in late G1, S and G2/M cell cycle stages at 70% purity. The sequencing of each phase identified 305 stage-specific transcripts (1.5-fold change, p≤0.01), coding for conserved cell cycle regulated proteins and numerous proteins whose cell cycle dependence has not been recognized before. Comparisons with the parasite T. brucei and the human host reveal important differences. The meta-analysis of T. cruzi transcriptomic and ribonomic data indicates that cell cycle regulated mRNAs are subject to sub-cellular compartmentalization. Compositional and structural biases of these genes- including CAI, GC content, UTR length, and polycistron position- may contribute to their regulation. To discover nucleotide motifs responsible for the co-regulation of cell cycle regulated genes, we looked for overrepresented motifs at their UTRs and found a variant of the cell cycle sequence motif at the 3' UTR of most of the S and G2 stage genes. We additionally identified hairpin structures at the 5' UTRs of a high proportion of the transcripts, suggesting that periodic gene expression might also rely on translation initiation in T. cruzi. In summary, we report a comprehensive list of T. cruzi cell cycle regulated genes, including many previously unstudied proteins, we show evidence favoring a multi-step control of their expression, and we identify mRNA motifs that may mediate their regulation. Our results provide novel information of the T. cruzi proliferative proteins and the integrated levels of their gene expression control.
Collapse
Affiliation(s)
- Santiago Chávez
- Laboratory of Molecular Interactions, School of Sciences, Universidad de la República, Montevideo, Uruguay
- Department of Genetics, School of Medicine, Universidad de la República, Montevideo, Uruguay
| | - Guillermo Eastman
- Department of Genomics, Instituto de Investigaciones Biológicas Clemente Estable, Montevideo, Uruguay
| | - Pablo Smircich
- Laboratory of Molecular Interactions, School of Sciences, Universidad de la República, Montevideo, Uruguay
- Department of Genetics, School of Medicine, Universidad de la República, Montevideo, Uruguay
| | - Lorena Lourdes Becco
- Laboratory of Molecular Interactions, School of Sciences, Universidad de la República, Montevideo, Uruguay
| | - Carolina Oliveira-Rizzo
- Laboratory of Molecular Interactions, School of Sciences, Universidad de la República, Montevideo, Uruguay
- Department of Genetics, School of Medicine, Universidad de la República, Montevideo, Uruguay
| | - Rafael Fort
- Laboratory of Molecular Interactions, School of Sciences, Universidad de la República, Montevideo, Uruguay
- Department of Genetics, School of Medicine, Universidad de la República, Montevideo, Uruguay
| | - Mariana Potenza
- Institute for Research in Genetic Engineering and Molecular Biology 'Dr. N.H. Torres', Buenos Aires, Argentina
| | - Beatriz Garat
- Laboratory of Molecular Interactions, School of Sciences, Universidad de la República, Montevideo, Uruguay
| | - José Roberto Sotelo-Silveira
- Department of Genomics, Instituto de Investigaciones Biológicas Clemente Estable, Montevideo, Uruguay
- Department of Cell and Molecular Biology, School of Sciences, Universidad de la República, Montevideo, Uruguay
| | - María Ana Duhagon
- Laboratory of Molecular Interactions, School of Sciences, Universidad de la República, Montevideo, Uruguay
- Department of Genetics, School of Medicine, Universidad de la República, Montevideo, Uruguay
- * E-mail:
| |
Collapse
|
3
|
Smircich P, El-Sayed NM, Garat B. Intrinsic DNA curvature in trypanosomes. BMC Res Notes 2017; 10:585. [PMID: 29121981 PMCID: PMC5679330 DOI: 10.1186/s13104-017-2908-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2017] [Accepted: 11/01/2017] [Indexed: 12/21/2022] Open
Abstract
Background Trypanosoma cruzi and Trypanosoma brucei are protozoan parasites
causing Chagas disease and African sleeping sickness, displaying unique features of cellular and molecular biology. Remarkably, no canonical signals for RNA polymerase II promoters, which drive protein coding genes transcription, have been identified so far. The secondary structure of DNA has long been recognized as a signal in biological processes and more recently, its involvement in transcription initiation in Leishmania was proposed. In order to study whether this feature is conserved in trypanosomatids, we undertook a genome wide search for intrinsic DNA curvature in T. cruzi and T. brucei. Results Using a region integrated intrinsic curvature (RIIC) scoring that we previously developed, a non-random distribution of sequence-dependent curvature was observed. High RIIC scores were found to be significantly correlated with transcription start sites in T. cruzi, which have been mapped in divergent switch regions, whereas in T. brucei, the high RIIC scores correlated with sites that have been involved not only in RNA polymerase II initiation but also in termination. In addition, we observed regions with high RIIC score presenting in-phase tracts of Adenines, in the subtelomeric regions of the T. brucei chromosomes that harbor the variable surface glycoproteins genes. Conclusions In both T. cruzi and T. brucei genomes, a link between DNA conformational signals and gene expression was found. High sequence dependent curvature is associated with transcriptional regulation regions. High intrinsic curvature also occurs at the T. brucei chromosome subtelomeric regions where the recombination processes involved in the evasion of the immune host system take place. These findings underscore the relevance of indirect DNA readout in these ancient eukaryotes. Electronic supplementary material The online version of this article (10.1186/s13104-017-2908-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Pablo Smircich
- Laboratorio de Interacciones Moleculares, Facultad de Ciencias, Universidad de la Republica, 11400, Montevideo, Uruguay.,Departamento de Genética, Facultad de Medicina, Universidad de la Republica, 11800, Montevideo, Uruguay
| | - Najib M El-Sayed
- Department of Cell Biology and Molecular Genetics and Center for Bioinformatics and Computational Biology, University of Maryland College Park, College Park, MD, 20742, USA
| | - Beatriz Garat
- Laboratorio de Interacciones Moleculares, Facultad de Ciencias, Universidad de la Republica, 11400, Montevideo, Uruguay.
| |
Collapse
|
4
|
Dikhit MR, Moharana KC, Sahoo BR, Sahoo GC, Das P. LeishMicrosatDB: open source database of repeat sequences detected in six fully sequenced Leishmania genomes. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2014; 2014:bau078. [PMID: 25125444 PMCID: PMC4132413 DOI: 10.1093/database/bau078] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
UNLABELLED A Leishmania Microsatellite Database (LeishMicrosatDB) is reported for genome wise mining of microsatellites in six Leishmania species, using in silico techniques. This was created to provide parasitologists a platform to understand the genome characterization, mapping, phylogeny and evolutionary analysis. The present version of the database contains 1,738,669 simple sequence repeats of which 181 s756 repeats are present in compound form. The repeats can be sought in a chromosome using input parameters such as repeat type (mono- hexa), coding status, repeat unit length and repeat sequence motif. The genic repeats have been further hyperlinked with their corresponding locus id, and the database is appended with primer3 plus for primer designing of selected repeats with left and right flanking sequences up to 250 bp. Information on clustering and polymorphic repeats can also be retrieved. This database may also be adopted as a tool to study the relative occurrence and distribution of microsatellites across the parasitic genome. The database can enable a biologist to select markers at desired intervals over the chromosomes, and can be accessed as an open source repository at http://biomedinformri.com/leishmicrosat. DATABASE URL http://biomedinformri.com/leishmicrosat.
Collapse
Affiliation(s)
- Manas R Dikhit
- Biomedical Informatics Center and Department of Molecular Biology, Rajendra Memorial Research Institute of Medical Sciences, Patna 800007, India
| | - Kanhu C Moharana
- Biomedical Informatics Center and Department of Molecular Biology, Rajendra Memorial Research Institute of Medical Sciences, Patna 800007, India
| | - Bikash R Sahoo
- Biomedical Informatics Center and Department of Molecular Biology, Rajendra Memorial Research Institute of Medical Sciences, Patna 800007, India
| | - Ganesh C Sahoo
- Biomedical Informatics Center and Department of Molecular Biology, Rajendra Memorial Research Institute of Medical Sciences, Patna 800007, India
| | - Pradeep Das
- Biomedical Informatics Center and Department of Molecular Biology, Rajendra Memorial Research Institute of Medical Sciences, Patna 800007, IndiaBiomedical Informatics Center and Department of Molecular Biology, Rajendra Memorial Research Institute of Medical Sciences, Patna 800007, India
| |
Collapse
|
5
|
Ferreira KAM, Fajardo EF, Baptista RP, Macedo AM, Lages-Silva E, Ramírez LE, Pedrosa AL. Species-specific markers for the differential diagnosis of Trypanosoma cruzi and Trypanosoma rangeli and polymorphisms detection in Trypanosoma rangeli. Parasitol Res 2014; 113:2199-207. [PMID: 24728520 DOI: 10.1007/s00436-014-3872-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2013] [Accepted: 03/24/2014] [Indexed: 10/25/2022]
Abstract
Trypanosoma cruzi and Trypanosoma rangeli are kinetoplastid parasites which are able to infect humans in Central and South America. Misdiagnosis between these trypanosomes can be avoided by targeting barcoding sequences or genes of each organism. This work aims to analyze the feasibility of using species-specific markers for identification of intraspecific polymorphisms and as target for diagnostic methods by PCR. Accordingly, primers which are able to specifically detect T. cruzi or T. rangeli genomic DNA were characterized. The use of intergenic regions, generally divergent in the trypanosomatids, and the serine carboxypeptidase gene were successful. Using T. rangeli genomic sequences for the identification of group-specific polymorphisms and a polymorphic AT(n) dinucleotide repeat permitted the classification of the strains into two groups, which are entirely coincident with T. rangeli main lineages, KP1 (+) and KP1 (-), previously determined by kinetoplast DNA (kDNA) characterization. The sequences analyzed totalize 622 bp (382 bp represent a hypothetical protein sequence, and 240 bp represent an anonymous sequence), and of these, 581 (93.3%) are conserved sites and 41 bp (6.7%) are polymorphic, with 9 transitions (21.9%), 2 transversions (4.9%), and 30 (73.2%) insertion/deletion events. Taken together, the species-specific markers analyzed may be useful for the development of new strategies for the accurate diagnosis of infections. Furthermore, the identification of T. rangeli polymorphisms has a direct impact in the understanding of the population structure of this parasite.
Collapse
Affiliation(s)
- Keila Adriana Magalhães Ferreira
- Instituto de Ciências Biológicas e Naturais, Universidade Federal do Triângulo Mineiro, Avenida Frei Paulino, 30, Bairro Abadia, Uberaba, Minas Gerais, 38025-180, Brazil
| | | | | | | | | | | | | |
Collapse
|
6
|
De Gaudenzi JG, Carmona SJ, Agüero F, Frasch AC. Genome-wide analysis of 3'-untranslated regions supports the existence of post-transcriptional regulons controlling gene expression in trypanosomes. PeerJ 2013; 1:e118. [PMID: 23904995 PMCID: PMC3728762 DOI: 10.7717/peerj.118] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2013] [Accepted: 07/10/2013] [Indexed: 12/22/2022] Open
Abstract
In eukaryotic cells, a group of messenger ribonucleic acids (mRNAs) encoding functionally interrelated proteins together with the trans-acting factors that coordinately modulate their expression is termed a post-transcriptional regulon, due to their partial analogy to a prokaryotic polycistron. This mRNA clustering is organized by sequence-specific RNA-binding proteins (RBPs) that bind cis-regulatory elements in the noncoding regions of genes, and mediates the synchronized control of their fate. These recognition motifs are often characterized by conserved sequences and/or RNA structures, and it is likely that various classes of cis-elements remain undiscovered. Current evidence suggests that RNA regulons govern gene expression in trypanosomes, unicellular parasites which mainly use post-transcriptional mechanisms to control protein synthesis. In this study, we used motif discovery tools to test whether groups of functionally related trypanosomatid genes contain a common cis-regulatory element. We obtained conserved structured RNA motifs statistically enriched in the noncoding region of 38 out of 53 groups of metabolically related transcripts in comparison with a random control. These motifs have a hairpin loop structure, a preferred sense orientation and are located in close proximity to the open reading frames. We found that 15 out of these 38 groups represent unique motifs in which most 3'-UTR signature elements were group-specific. Two extensively studied Trypanosoma cruzi RBPs, TcUBP1 and TcRBP3 were found associated with a few candidate RNA regulons. Interestingly, 13 motifs showed a strong correlation with clusters of developmentally co-expressed genes and six RNA elements were enriched in gene clusters affected after hyperosmotic stress. Here we report a systematic genome-wide in silico screen to search for novel RNA-binding sites in transcripts, and describe an organized network of several coordinately regulated cohorts of mRNAs in T. cruzi. Moreover, we found that structured RNA elements are also conserved in other human pathogens. These results support a model of regulation of gene expression by multiple post-transcriptional regulons in trypanosomes.
Collapse
Affiliation(s)
- Javier G De Gaudenzi
- Instituto de Investigaciones Biotecnológicas-Instituto Tecnológico de Chascomús, UNSAM-CONICET , Buenos Aires , Argentina
| | | | | | | |
Collapse
|
7
|
Smircich P, Forteza D, El-Sayed NM, Garat B. Genomic analysis of sequence-dependent DNA curvature in Leishmania. PLoS One 2013; 8:e63068. [PMID: 23646176 PMCID: PMC3639952 DOI: 10.1371/journal.pone.0063068] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2012] [Accepted: 03/27/2013] [Indexed: 11/26/2022] Open
Abstract
Leishmania major is a flagellated protozoan parasite of medical importance. Like other members of the Trypanosomatidae family, it possesses unique mechanisms of gene expression such as constitutive polycistronic transcription of directional gene clusters, gene amplification, mRNA trans-splicing, and extensive editing of mitochondrial transcripts. The molecular signals underlying most of these processes remain under investigation. In order to investigate the role of DNA secondary structure signals in gene expression, we carried out a genome-wide in silico analysis of the intrinsic DNA curvature. The L. major genome revealed a lower frequency of high intrinsic curvature regions as well as inter- and intra- chromosomal distribution heterogeneity, when compared to prokaryotic and eukaryotic organisms. Using a novel method aimed at detecting region-integrated intrinsic curvature (RIIC), high DNA curvature was found to be associated with regions implicated in transcription initiation. Those include divergent strand-switch regions between directional gene clusters and regions linked to markers of active transcription initiation such as acetylated H3 histone, TRF4 and SNAP50. These findings suggest a role for DNA curvature in transcription initiation in Leishmania supporting the relevance of DNA secondary structures signals.
Collapse
Affiliation(s)
- Pablo Smircich
- Laboratorio de Interacciones Moleculares, Facultad de Ciencias, Montevideo, Uruguay
- Departamento de Genética, Facultad de Medicina, Montevideo, Uruguay
| | - Diego Forteza
- Laboratorio de Interacciones Moleculares, Facultad de Ciencias, Montevideo, Uruguay
| | - Najib M. El-Sayed
- Department of Cell Biology and Molecular Genetics and Center for Bioinformatics and Computational Biology, University of Maryland College Park, Maryland, United States of America
| | - Beatriz Garat
- Laboratorio de Interacciones Moleculares, Facultad de Ciencias, Montevideo, Uruguay
| |
Collapse
|
8
|
Pastro L, Smircich P, Pérez-Díaz L, Duhagon MA, Garat B. Implication of CA repeated tracts on post-transcriptional regulation in Trypanosoma cruzi. Exp Parasitol 2013; 134:511-8. [PMID: 23631879 DOI: 10.1016/j.exppara.2013.04.004] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2012] [Revised: 04/13/2013] [Accepted: 04/19/2013] [Indexed: 10/26/2022]
Abstract
In Trypanosoma cruzi gene expression regulation mainly relays on post-transcriptional events. Nevertheless, little is known about the signals which control mRNA abundance and functionality. We have previously found that CA repeated tracts (polyCA) are abundant in the vicinity of open reading frames and constitute specific targets for single stranded binding proteins from T. cruzi epimastigote. Given the reported examples of the involvement of polyCA motifs in gene expression regulation, we decided to further study their role in T. cruzi. Using an in silico genome-wide analysis, we identify the genes that contain polyCA within their predicted UTRs. We found that about 10% of T. cruzi genes carry polyCA therein. Strikingly, they are frequently concurrent with GT repeated tracts (polyGT), favoring the formation of a secondary structure exhibiting the complementary polydinucleotides in a double stranded helix. This feature is found in the species-specific family of genes coding for mucine associated proteins (MASPs) and other genes. For those polyCA-containing UTRs that lack polyGT, the polyCA is mainly predicted to adopt a single stranded structure. We further analyzed the functional role of such element using a reporter approach in T. cruzi. We found out that the insertion of polyCA at the 3' UTR of a reporter gene in the pTEX vector modulates its expression along the parasite's life cycle. While no significant change of the mRNA steady state of the reporter gene could be detected at the trypomastigote stage, significant increase in the epimastigote and reduction in the amastigote stage were observed. Altogether, these results suggest the involvement of polyCA as a signal in gene expression regulation in T. cruzi.
Collapse
Affiliation(s)
- Lucía Pastro
- Laboratorio de Interacciones Moleculares, Facultad de Ciencias, 11400 Montevideo, Uruguay
| | | | | | | | | |
Collapse
|
9
|
Meglécz E, Nève G, Biffin E, Gardner MG. Breakdown of phylogenetic signal: a survey of microsatellite densities in 454 shotgun sequences from 154 non model eukaryote species. PLoS One 2012; 7:e40861. [PMID: 22815847 PMCID: PMC3397955 DOI: 10.1371/journal.pone.0040861] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2012] [Accepted: 06/14/2012] [Indexed: 11/19/2022] Open
Abstract
Microsatellites are ubiquitous in Eukaryotic genomes. A more complete understanding of their origin and spread can be gained from a comparison of their distribution within a phylogenetic context. Although information for model species is accumulating rapidly, it is insufficient due to a lack of species depth, thus intragroup variation is necessarily ignored. As such, apparent differences between groups may be overinflated and generalizations cannot be inferred until an analysis of the variation that exists within groups has been conducted. In this study, we examined microsatellite coverage and motif patterns from 454 shotgun sequences of 154 Eukaryote species from eight distantly related phyla (Cnidaria, Arthropoda, Onychophora, Bryozoa, Mollusca, Echinodermata, Chordata and Streptophyta) to test if a consistent phylogenetic pattern emerges from the microsatellite composition of these species. It is clear from our results that data from model species provide incomplete information regarding the existing microsatellite variability within the Eukaryotes. A very strong heterogeneity of microsatellite composition was found within most phyla, classes and even orders. Autocorrelation analyses indicated that while microsatellite contents of species within clades more recent than 200 Mya tend to be similar, the autocorrelation breaks down and becomes negative or non-significant with increasing divergence time. Therefore, the age of the taxon seems to be a primary factor in degrading the phylogenetic pattern present among related groups. The most recent classes or orders of Chordates still retain the pattern of their common ancestor. However, within older groups, such as classes of Arthropods, the phylogenetic pattern has been scrambled by the long independent evolution of the lineages.
Collapse
Affiliation(s)
- Emese Meglécz
- IMBE UMR 7263 CNRS IRD, Aix-Marseille University, Marseille, France.
| | | | | | | |
Collapse
|
10
|
Meglécz E, Nève G, Biffin E, Gardner MG. Breakdown of phylogenetic signal: a survey of microsatellite densities in 454 shotgun sequences from 154 non model eukaryote species. PLoS One 2012. [PMID: 22815847 DOI: 10.1371/journal.pone.004086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023] Open
Abstract
Microsatellites are ubiquitous in Eukaryotic genomes. A more complete understanding of their origin and spread can be gained from a comparison of their distribution within a phylogenetic context. Although information for model species is accumulating rapidly, it is insufficient due to a lack of species depth, thus intragroup variation is necessarily ignored. As such, apparent differences between groups may be overinflated and generalizations cannot be inferred until an analysis of the variation that exists within groups has been conducted. In this study, we examined microsatellite coverage and motif patterns from 454 shotgun sequences of 154 Eukaryote species from eight distantly related phyla (Cnidaria, Arthropoda, Onychophora, Bryozoa, Mollusca, Echinodermata, Chordata and Streptophyta) to test if a consistent phylogenetic pattern emerges from the microsatellite composition of these species. It is clear from our results that data from model species provide incomplete information regarding the existing microsatellite variability within the Eukaryotes. A very strong heterogeneity of microsatellite composition was found within most phyla, classes and even orders. Autocorrelation analyses indicated that while microsatellite contents of species within clades more recent than 200 Mya tend to be similar, the autocorrelation breaks down and becomes negative or non-significant with increasing divergence time. Therefore, the age of the taxon seems to be a primary factor in degrading the phylogenetic pattern present among related groups. The most recent classes or orders of Chordates still retain the pattern of their common ancestor. However, within older groups, such as classes of Arthropods, the phylogenetic pattern has been scrambled by the long independent evolution of the lineages.
Collapse
Affiliation(s)
- Emese Meglécz
- IMBE UMR 7263 CNRS IRD, Aix-Marseille University, Marseille, France.
| | | | | | | |
Collapse
|