1
|
Crossley ER, Fedorova L, Mulyar O, Freeman R, Khuder S, Fedorov A. Computational identification of ultra-conserved elements in the human genome: a hypothesis on homologous DNA pairing. NAR Genom Bioinform 2024; 6:lqae074. [PMID: 38962254 PMCID: PMC11217675 DOI: 10.1093/nargab/lqae074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 05/29/2024] [Accepted: 06/19/2024] [Indexed: 07/05/2024] Open
Abstract
Thousands of prolonged sequences of human ultra-conserved non-coding elements (UCNEs) share only one common feature: peculiarities in the unique composition of their dinucleotides. Here we investigate whether the numerous weak signals emanating from these dinucleotide arrangements can be used for computational identification of UCNEs within the human genome. For this purpose, we analyzed 4272 UCNE sequences, encompassing 1 393 448 nucleotides, alongside equally sized control samples of randomly selected human genomic sequences. Our research identified nine different features of dinucleotide arrangements that enable differentiation of UCNEs from the rest of the genome. We employed these nine features, implementing three Machine Learning techniques - Support Vector Machine, Random Forest, and Artificial Neural Networks - to classify UCNEs, achieving an accuracy rate of 82-84%, with specific conditions allowing for over 90% accuracy. Notably, the strongest feature for UCNE identification was the frequency ratio between GpC dinucleotides and the sum of GpG and CpC dinucleotides. Additionally, we investigated the entire pool of 31 046 SNPs located within UCNEs for their representation in the ClinVar database, which catalogs human SNPs with known phenotypic effects. The presence of UCNE-associated SNPs in ClinVar aligns with the expectation of a random distribution, emphasizing the enigmatic nature of UCNE phenotypic manifestation.
Collapse
Affiliation(s)
- Emily R Crossley
- Program of Bioinformatics and Proteomics/Genomics, University of Toledo, Toledo, OH 43606, USA
| | | | | | | | - Sadik Khuder
- Program of Bioinformatics and Proteomics/Genomics, University of Toledo, Toledo, OH 43606, USA
- Department of Medicine, University of Toledo, Toledo, OH 43606, USA
| | - Alexei Fedorov
- Program of Bioinformatics and Proteomics/Genomics, University of Toledo, Toledo, OH 43606, USA
- CRI Genetics LLC, Santa Monica, CA 90404, USA
- Department of Medicine, University of Toledo, Toledo, OH 43606, USA
| |
Collapse
|
2
|
Liang D, Jiang Y, Zhang Y, Mao C, Ma T, Zhang C. The Comparative Genomics of Botryosphaeriaceae Suggests Gene Families of Botryosphaeria dothidea Related to Pathogenicity on Chinese Hickory Tree. J Fungi (Basel) 2024; 10:299. [PMID: 38667970 PMCID: PMC11051394 DOI: 10.3390/jof10040299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 04/07/2024] [Accepted: 04/07/2024] [Indexed: 04/28/2024] Open
Abstract
Trunk canker poses a major threat to the production of Chinese hickory tree (Carya cathayensis Sarg.), which is primarily determined by Botryosphaeriaceae. In our previous work, we identified Botryosphaeria dothidea as the predominant pathogen of this disease. However, it is still unclear about corresponding gene families and mechanisms associated with B. dothidea's pathogenicity on Chinese hickory tree. Here, we present a comparative analysis of high-quality genome assemblies of Botryosphaeria dothidea and other isolated pathogens, showing highly syntenic relationships between B. dothidea and its closely related species and the conservative evolution of the Botryosphaeriaceae family. Higher GC contents were found in the genomes of B. dothidea and three other isolated pathogens (Botryshaeria cortices, Botryshaeria fabicerciana, and Botryshaeria qingyuanensis) compared to Macrophomina phaseolina, Neofusicoccum parvum, Diplodia corticola, and Lasiodiplodia theobromae. An investigation of genes specific to or expanded in B. dothidea revealed that one secreted glucanase, one orsellinic acid biosynthesis enzyme, and two MFS transporters positively regulated B. dothidea's pathogenicity. We also observed an overrepresentation of viral integrase like gene and heterokaryon incompatibility proteins in the B. dothidea's genome. In addition, we observed one LRR-domain-containing protein and two Sec-domain-containing proteins (Sec_1 and Sec_7) that underwent positive selection. This study will help to understand B. dothidea's pathogenicity and potential influence on the infection of Chinese hickory, which will help in the development of disease control and ensure the security of Chinese hickory production.
Collapse
Affiliation(s)
| | | | | | | | - Tianlin Ma
- College of Advanced Agricultural Sciences, Zhejiang Agriculture and Forest University, Hangzhou 311300, China; (D.L.); (Y.J.); (Y.Z.); (C.M.)
| | - Chuanqing Zhang
- College of Advanced Agricultural Sciences, Zhejiang Agriculture and Forest University, Hangzhou 311300, China; (D.L.); (Y.J.); (Y.Z.); (C.M.)
| |
Collapse
|
3
|
Sgarlata GM, Rasolondraibe E, Salmona J, Le Pors B, Ralantoharijaona T, Rakotonanahary A, Jan F, Manzi S, Iribar A, Zaonarivelo JR, Volasoa Andriaholinirina N, Rasoloharijaona S, Chikhi L. The genomic diversity of the Eliurus genus in northern Madagascar with a putative new species. Mol Phylogenet Evol 2024; 193:107997. [PMID: 38128795 DOI: 10.1016/j.ympev.2023.107997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 12/06/2023] [Accepted: 12/18/2023] [Indexed: 12/23/2023]
Abstract
Madagascar exhibits extraordinarily high level of species richness and endemism, while being severely threatened by habitat loss and fragmentation (HL&F). In front of these threats to biodiversity, conservation effort can be directed, for instance, in the documentation of species that are still unknown to science, or in investigating how species respond to HL&F. The tufted-tail rats genus (Eliurus spp.) is the most speciose genus of endemic rodents in Madagascar, with 13 described species, which occupy two major habitat types: dry or humid forests. The large species diversity and association to specific habitat types make the Eliurus genus a suitable model for investigating species adaptation to new environments, as well as response to HL&F (dry vs humid). In the present study, we investigated Eliurus spp. genomic diversity across northern Madagascar, a region covered by both dry and humid fragmented forests. From the mitochondrial DNA (mtDNA) and nuclear genomic (RAD-seq) data of 124 Eliurus individuals sampled in poorly studied forests of northern Madagascar, we identified an undescribed Eliurus taxon (Eliurus sp. nova). We tested the hypothesis of a new Eliurus species using several approaches: i) DNA barcoding; ii) phylogenetic inferences; iii) species delimitation tests based on the Multi-Species Coalescent (MSC) model, iv) genealogical divergence index (gdi); v) an ad-hoc test of isolation-by-distance within versus between sister-taxa, vi) comparisons of %GC content patterns and vii) morphological analyses. All analyses support the recognition of the undescribed lineage as a putative distinct species. In addition, we show that Eliurus myoxinus, a species known from the dry forests of western Madagascar, is, surprisingly, found mostly in humid forests in northern Madagascar. In conclusion, we discuss the implications of such findings in the context of Eliurus species evolution and diversification, and use the distribution of northern Eliurus species as a proxy for reconstructing past changes in forest cover and vegetation type in northern Madagascar.
Collapse
Affiliation(s)
| | - Emmanuel Rasolondraibe
- Département de Biologie Animale et Ecologie, Faculté des Sciences, Université de Mahajanga, Mahajanga, Madagascar.
| | - Jordi Salmona
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande, 6, 2780-156 Oeiras, Portugal; Centre de Recherche sur la Biodiversité et l'Environnement (CRBE),Université de Toulouse, CNRS, IRD, Toulouse INP, Université Toulouse 3 -Paul Sabatier (UT3), Toulouse, France.
| | - Barbara Le Pors
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande, 6, 2780-156 Oeiras, Portugal
| | - Tantely Ralantoharijaona
- Département de Biologie Animale et Ecologie, Faculté des Sciences, Université de Mahajanga, Mahajanga, Madagascar
| | - Ando Rakotonanahary
- Département de Biologie Animale et Ecologie, Faculté des Sciences, Université de Mahajanga, Mahajanga, Madagascar.
| | - Fabien Jan
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande, 6, 2780-156 Oeiras, Portugal
| | - Sophie Manzi
- Centre de Recherche sur la Biodiversité et l'Environnement (CRBE),Université de Toulouse, CNRS, IRD, Toulouse INP, Université Toulouse 3 -Paul Sabatier (UT3), Toulouse, France.
| | - Amaia Iribar
- Centre de Recherche sur la Biodiversité et l'Environnement (CRBE),Université de Toulouse, CNRS, IRD, Toulouse INP, Université Toulouse 3 -Paul Sabatier (UT3), Toulouse, France.
| | - John Rigobert Zaonarivelo
- Département des Sciences de la Nature et de l'Environnement, Université d'Antsiranana, 201 Antsiranana, Madagascar.
| | | | - Solofonirina Rasoloharijaona
- Département de Biologie Animale et Ecologie, Faculté des Sciences, Université de Mahajanga, Mahajanga, Madagascar
| | - Lounès Chikhi
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande, 6, 2780-156 Oeiras, Portugal; Centre de Recherche sur la Biodiversité et l'Environnement (CRBE),Université de Toulouse, CNRS, IRD, Toulouse INP, Université Toulouse 3 -Paul Sabatier (UT3), Toulouse, France.
| |
Collapse
|
4
|
Vohnoutová M, Sedláková A, Symonová R. Abandoning the Isochore Theory Can Help Explain Genome Compositional Organization in Fish. Int J Mol Sci 2023; 24:13167. [PMID: 37685974 PMCID: PMC10487504 DOI: 10.3390/ijms241713167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 08/16/2023] [Accepted: 08/18/2023] [Indexed: 09/10/2023] Open
Abstract
The organization of the genome nucleotide (AT/GC) composition in vertebrates remains poorly understood despite the numerous genome assemblies available. Particularly, the origin of the AT/GC heterogeneity in amniotes, in comparison to the homogeneity in anamniotes, is controversial. Recently, several exceptions to this dichotomy were confirmed in an ancient fish lineage with mammalian AT/GC heterogeneity. Hence, our current knowledge necessitates a reevaluation considering this fact and utilizing newly available data and tools. We analyzed fish genomes in silico with as low user input as possible to compare previous approaches to assessing genome composition. Our results revealed a disparity between previously used plots of GC% and histograms representing the authentic distribution of GC% values in genomes. Previous plots heavily reduced the range of GC% values in fish to comply with the alleged AT/GC homogeneity and AT-richness of their genomes. We illustrate how the selected sequence size influences the clustering of GC% values. Previous approaches that disregarded chromosome and genome sizes, which are about three times smaller in fish than in mammals, distorted their results and contributed to the persisting confusion about fish genome composition. Chromosome size and their transposons may drive the AT/GC heterogeneity apparent on mammalian chromosomes, whereas far less in fishes.
Collapse
Affiliation(s)
- Marta Vohnoutová
- Department of Computer Science, Faculty of Science, University of South Bohemia, Branišovská 1760, 370-05 České Budějovice, Czech Republic;
| | - Anastázie Sedláková
- Faculty of Science, University of Hradec Králové, Hradecká 1285, 500-03 Hradec Králové, Czech Republic;
| | - Radka Symonová
- Department of Computer Science, Faculty of Science, University of South Bohemia, Branišovská 1760, 370-05 České Budějovice, Czech Republic;
- Institute of Hydrobiology, Biology Centre, Czech Academy of Sciences, Na Sádkách 7, 370-05 České Budějovice, Czech Republic
| |
Collapse
|
5
|
Campbell LI, Nwezeobi J, van Brunschot SL, Kaweesi T, Seal SE, Swamy RAR, Namuddu A, Maslen GL, Mugerwa H, Armean IM, Haggerty L, Martin FJ, Malka O, Santos-Garcia D, Juravel K, Morin S, Stephens ME, Muhindira PV, Kersey PJ, Maruthi MN, Omongo CA, Navas-Castillo J, Fiallo-Olivé E, Mohammed IU, Wang HL, Onyeka J, Alicai T, Colvin J. Comparative evolutionary analyses of eight whitefly Bemisia tabaci sensu lato genomes: cryptic species, agricultural pests and plant-virus vectors. BMC Genomics 2023; 24:408. [PMID: 37468834 DOI: 10.1186/s12864-023-09474-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Accepted: 06/21/2023] [Indexed: 07/21/2023] Open
Abstract
BACKGROUND The group of > 40 cryptic whitefly species called Bemisia tabaci sensu lato are amongst the world's worst agricultural pests and plant-virus vectors. Outbreaks of B. tabaci s.l. and the associated plant-virus diseases continue to contribute to global food insecurity and social instability, particularly in sub-Saharan Africa and Asia. Published B. tabaci s.l. genomes have limited use for studying African cassava B. tabaci SSA1 species, due to the high genetic divergences between them. Genomic annotations presented here were performed using the 'Ensembl gene annotation system', to ensure that comparative analyses and conclusions reflect biological differences, as opposed to arising from different methodologies underpinning transcript model identification. RESULTS We present here six new B. tabaci s.l. genomes from Africa and Asia, and two re-annotated previously published genomes, to provide evolutionary insights into these globally distributed pests. Genome sizes ranged between 616-658 Mb and exhibited some of the highest coverage of transposable elements reported within Arthropoda. Many fewer total protein coding genes (PCG) were recovered compared to the previously published B. tabaci s.l. genomes and structural annotations generated via the uniform methodology strongly supported a repertoire of between 12.8-13.2 × 103 PCG. An integrative systematics approach incorporating phylogenomic analysis of nuclear and mitochondrial markers supported a monophyletic Aleyrodidae and the basal positioning of B. tabaci Uganda-1 to the sub-Saharan group of species. Reciprocal cross-mating data and the co-cladogenesis pattern of the primary obligate endosymbiont 'Candidatus Portiera aleyrodidarum' from 11 Bemisia genomes further supported the phylogenetic reconstruction to show that African cassava B. tabaci populations consist of just three biological species. We include comparative analyses of gene families related to detoxification, sugar metabolism, vector competency and evaluate the presence and function of horizontally transferred genes, essential for understanding the evolution and unique biology of constituent B. tabaci. s.l species. CONCLUSIONS These genomic resources have provided new and critical insights into the genetics underlying B. tabaci s.l. biology. They also provide a rich foundation for post-genomic research, including the selection of candidate gene-targets for innovative whitefly and virus-control strategies.
Collapse
Affiliation(s)
- Lahcen I Campbell
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK.
| | - Joachim Nwezeobi
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK.
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, UK.
| | - Sharon L van Brunschot
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
- CSIRO Health and Biosecurity, Dutton Park, QLD, Australia
- School of Biological Sciences, The University of Queensland, Brisbane, QLD, Australia
| | - Tadeo Kaweesi
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
- Rwebitaba Zonal Agricultural Research and Development Institute, Fort Portal, Uganda
| | - Susan E Seal
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
| | - Rekha A R Swamy
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
| | - Annet Namuddu
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
- National Crops Resources Research Institute, Kampala, Uganda
| | - Gareth L Maslen
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Imperial College London, South Kensington, London, UK
| | - Habibu Mugerwa
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
- Department of Entomology, University of Georgia, Griffin, GA, USA
| | - Irina M Armean
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Leanne Haggerty
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Fergal J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Osnat Malka
- Department of Entomology, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Diego Santos-Garcia
- CNRS, Laboratory of Biometry and Evolutionary Biology UMR 5558, University of Lyon, Villeurbanne, France
- Center for Biology and Management of Populations, INRAe UMR1062, Montferrier-sur-Lez, France
| | - Ksenia Juravel
- Department of Entomology, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Shai Morin
- Department of Entomology, The Hebrew University of Jerusalem, Rehovot, Israel
| | | | - Paul Visendi Muhindira
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
- Institute for Molecular Bioscience, The University of Queensland, St Lucia, QLD, Australia
| | - Paul J Kersey
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Royal Botanic Gardens, Kew, London, UK
| | - M N Maruthi
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
| | | | - Jesús Navas-Castillo
- Instituto de Hortofruticultura Subtropical Y Mediterránea "La Mayora" (IHSM-UMA-CSIC), Consejo Superior de Investigaciones Científicas, Málaga, Algarrobo-Costa, Spain
| | - Elvira Fiallo-Olivé
- Instituto de Hortofruticultura Subtropical Y Mediterránea "La Mayora" (IHSM-UMA-CSIC), Consejo Superior de Investigaciones Científicas, Málaga, Algarrobo-Costa, Spain
| | | | - Hua-Ling Wang
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
- College of Forestry, Hebei Agricultural University, Baoding, Hebei, China
| | - Joseph Onyeka
- National Root Crops Research Institute (NRCRI), Umudike, Nigeria
| | - Titus Alicai
- National Crops Resources Research Institute, Kampala, Uganda
| | - John Colvin
- Natural Resources Institute, University of Greenwich, Chatham, Kent, UK
| |
Collapse
|
6
|
Lamolle G, Simón D, Iriarte A, Musto H. Main Factors Shaping Amino Acid Usage Across Evolution. J Mol Evol 2023:10.1007/s00239-023-10120-5. [PMID: 37264211 DOI: 10.1007/s00239-023-10120-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Accepted: 05/17/2023] [Indexed: 06/03/2023]
Abstract
The standard genetic code determines that in most species, including viruses, there are 20 amino acids that are coded by 61 codons, while the other three codons are stop triplets. Considering the whole proteome each species features its own amino acid frequencies, given the slow rate of change, closely related species display similar GC content and amino acids usage. In contrast, distantly related species display different amino acid frequencies. Furthermore, within certain multicellular species, as mammals, intragenomic differences in the usage of amino acids are evident. In this communication, we shall summarize some of the most prominent and well-established factors that determine the differences found in the amino acid usage, both across evolution and intragenomically.
Collapse
Affiliation(s)
- Guillermo Lamolle
- Laboratorio de Genómica Evolutiva, Facultad de Ciencias, Universidad de La República, Montevideo, Uruguay
| | - Diego Simón
- Laboratorio de Genómica Evolutiva, Facultad de Ciencias, Universidad de La República, Montevideo, Uruguay
- Laboratorio de Virología Molecular, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de La República, Montevideo, Uruguay
- Laboratorio de Evolución Experimental de Virus, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Andrés Iriarte
- Laboratorio de Genómica Evolutiva, Facultad de Ciencias, Universidad de La República, Montevideo, Uruguay
- Laboratorio de Biología Computacional, Departamento de Desarrollo Biotecnológico, Instituto de Higiene, Facultad de Medicina, Universidad de La República, Montevideo, Uruguay
| | - Héctor Musto
- Laboratorio de Genómica Evolutiva, Facultad de Ciencias, Universidad de La República, Montevideo, Uruguay.
| |
Collapse
|
7
|
Zhang Q, Zhou Q, Han S, Li Y, Wang Y, He H. The genome of sheep ked (Melophagus ovinus) reveals potential mechanisms underlying reproduction and narrower ecological niches. BMC Genomics 2023; 24:54. [PMID: 36717784 PMCID: PMC9887928 DOI: 10.1186/s12864-023-09155-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Accepted: 01/27/2023] [Indexed: 01/31/2023] Open
Abstract
BACKGROUND Melophagus ovinus is considered to be of great veterinary health significance. However, little is known about the information on genetic mechanisms of the specific biological characteristics and novel methods for controlling M. ovinus. RESULTS In total, the de novo genome assembly of M. ovinus was 188.421 Mb in size (330 scaffolds, N50 Length: 10.666 Mb), with a mean GC content of 27.74%. A total of 13,372 protein-coding genes were functionally annotated. Phylogenetic analysis indicated that the diversification of M. ovinus and Glossina fuscipes took place 72.76 Mya within the Late Cretaceous. Gene family expansion and contraction analysis revealed that M. ovinus has 65 rapidly-evolving families (26 expansion and 39 contractions) mainly involved DNA metabolic activity, transposases activity, odorant receptor 59a/67d-like, IMD domain-containing protein, and cuticle protein, etc. The universal and tightly conserved list of milk protein orthologues has been assembled from the genome of M. ovinus. Contractions and losses of sensory receptors and vision-associated Rhodopsin genes were significant in M. ovinus, which indicate that the M. ovinus has narrower ecological niches. CONCLUSIONS We sequenced, assembled, and annotated the whole genome sequence of M. ovinus, and launches into the preliminary genetic mechanisms analysis of the adaptive evolution characteristics of M. ovinus. These resources will provide insights to understand the biological underpinnings of this parasite and the disease control strategies.
Collapse
Affiliation(s)
- Qingxun Zhang
- National Research Center for Wildlife-Borne Diseases, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
- Beijing Milu Ecological Research Center, Beijing, 100076, China
| | - Qingsong Zhou
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Shuyi Han
- National Research Center for Wildlife-Borne Diseases, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Ying Li
- State Key Laboratory of Plateau Ecology and Agriculture, Qinghai University, Xining, 810016, China
| | - Ye Wang
- National Research Center for Wildlife-Borne Diseases, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Hongxuan He
- National Research Center for Wildlife-Borne Diseases, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China.
| |
Collapse
|
8
|
Matoulek D, Ježek B, Vohnoutová M, Symonová R. Advances in Vertebrate (Cyto)Genomics Shed New Light on Fish Compositional Genome Evolution. Genes (Basel) 2023; 14:genes14020244. [PMID: 36833171 PMCID: PMC9956151 DOI: 10.3390/genes14020244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Accepted: 01/05/2023] [Indexed: 01/19/2023] Open
Abstract
Cytogenetic and compositional studies considered fish genomes rather poor in guanine-cytosine content (GC%) because of a putative "sharp increase in genic GC% during the evolution of higher vertebrates". However, the available genomic data have not been exploited to confirm this viewpoint. In contrast, further misunderstandings in GC%, mostly of fish genomes, originated from a misapprehension of the current flood of data. Utilizing public databases, we calculated the GC% in animal genomes of three different, technically well-established fractions: DNA (entire genome), cDNA (complementary DNA), and cds (exons). Our results across chordates help set borders of GC% values that are still incorrect in literature and show: (i) fish in their immense diversity possess comparably GC-rich (or even GC-richer) genomes as higher vertebrates, and fish exons are GC-enriched among vertebrates; (ii) animal genomes generally show a GC-enrichment from the DNA, over cDNA, to the cds level (i.e., not only the higher vertebrates); (iii) fish and invertebrates show a broad(er) inter-quartile range in GC%, while avian and mammalian genomes are more constrained in their GC%. These results indicate no sharp increase in the GC% of genes during the transition to higher vertebrates, as stated and numerously repeated before. We present our results in 2D and 3D space to explore the compositional genome landscape and prepared an online platform to explore the AT/GC compositional genome evolution.
Collapse
Affiliation(s)
- Dominik Matoulek
- Department of Physics, Faculty of Science, University of Hradec Králové, 500 03 Hradec Králové, Czech Republic
| | - Bruno Ježek
- Faculty of Informatics and Management, University of Hradec Králové, Rokitanského 62, 500 02 Hradec Králové, Czech Republic
| | - Marta Vohnoutová
- Department of Computer Science, Faculty of Science, University of South Bohemia, Branišovská 1760, 370 05 České Budějovice, Czech Republic
| | - Radka Symonová
- Department of Computer Science, Faculty of Science, University of South Bohemia, Branišovská 1760, 370 05 České Budějovice, Czech Republic
- Department of Bioinformatics, Wissenschaftszentrum Weihenstephan, Technische Universität München, 85354 Freising, Germany
- Institute of Hydrobiology, Biology Centre of the Czech Academy of Sciences, 370 05 České Budějovice, Czech Republic
- Correspondence:
| |
Collapse
|
9
|
Adams AN, Denton RD, Mueller RL. Gigantic genomes of salamanders indicate that body temperature, not genome size, is the driver of global methylation and 5-methylcytosine deamination in vertebrates. Evolution 2022; 76:1052-1061. [PMID: 35275604 DOI: 10.1111/evo.14468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Revised: 12/29/2021] [Accepted: 01/14/2022] [Indexed: 01/21/2023]
Abstract
Transposable elements (TEs) are sequences that replicate and move throughout genomes, and they can be silenced through methylation of cytosines at CpG dinucleotides. TE abundance contributes to genome size, but TE silencing variation across genomes of different sizes remains underexplored. Salamanders include most of the largest C-values - 9 to 120 Gb. We measured CpG methylation levels in salamanders with genomes ranging from 2N = ∼58 Gb to 4N = ∼116 Gb. We compared these levels to results from endo- and ectothermic vertebrates with more typical genomes. Salamander methylation levels are approximately 90%, higher than all endotherms. However, salamander methylation does not differ from other ectotherms, despite an approximately 100-fold difference in nuclear DNA content. Because methylation affects the nucleotide compositional landscape through 5-methylcytosine deamination to thymine, we quantified salamander CpG dinucleotide levels and compared them to other vertebrates. Salamanders and other ectotherms have comparable CpG levels, and ectotherm levels are higher than endotherms. These data show no shift in global methylation at the base of salamanders, despite a dramatic increase in TE load and genome size. This result is reconcilable with previous studies that considered endothermy and ectothermy, which may be more important drivers of methylation in vertebrates than genome size.
Collapse
Affiliation(s)
| | - Robert Daniel Denton
- Department of Biology, Marian University, Indianapolis, IN, 46222.,Division of Science and Math, University of Minnesota Morris, Morris, MN, 56267
| | | |
Collapse
|
10
|
Sun J, Chen T, Zhu M, Wang R, Huang Y, Wei Q, Yang M, Liao Y. Whole-genome sequencing revealed genetic diversity and selection of Guangxi indigenous chickens. PLoS One 2022; 17:e0250392. [PMID: 35290380 PMCID: PMC8923445 DOI: 10.1371/journal.pone.0250392] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 02/20/2022] [Indexed: 11/19/2022] Open
Abstract
Guangxi chickens play a crucial role in promoting the high-quality development of the broiler industry in China, but their value and potential are yet to be discovered. To determine the genetic diversity and population structure of Guangxi indigenous chicken, we analyzed the whole genomes of 185 chickens from 8 phenotypically and geographically representative Guangxi chicken breeds, together with 12 RJFt, 12 BRA and 12 WL genomes available from previous studies. Calculation of heterozygosity (Hp), nucleotide diversity (π), and LD level indicated that Guangxi populations were characterized by higher genetic diversity and lower differentiation than RJFt and commercial breeds except for HGFC. Population structure analysis also confirmed the introgression from commercial broiler breeds. Each population clustered together while the overall differentiation was slight. MA has the richest genetic diversity among all varieties. Selective sweep analysis revealed BCO2, EDN3 and other candidate genes had received strong selection in local breeds. These also provided novel breeding visual and data basis for future breeding.
Collapse
Affiliation(s)
- Junli Sun
- Guangxi Key Laboratory of Livestock Genetic Improvement, Animal Husbandry Research Institute of Guangxi Zhuang Autonomous Region, Nanning, Guangxi, China
| | - Tao Chen
- BGI Institute of Applied Agriculture, BGI-Shenzhen, Shenzhen, China
| | - Min Zhu
- Guangxi Key Laboratory of Livestock Genetic Improvement, Animal Husbandry Research Institute of Guangxi Zhuang Autonomous Region, Nanning, Guangxi, China
| | - Ran Wang
- BGI Institute of Applied Agriculture, BGI-Shenzhen, Shenzhen, China
| | - Yingfei Huang
- Guangxi Key Laboratory of Livestock Genetic Improvement, Animal Husbandry Research Institute of Guangxi Zhuang Autonomous Region, Nanning, Guangxi, China
| | - Qiang Wei
- BGI Institute of Applied Agriculture, BGI-Shenzhen, Shenzhen, China
| | - Manman Yang
- BGI-Shenzhen, Shenzhen, China
- * E-mail: (MY); (YL)
| | - Yuying Liao
- Guangxi Veterinary Research Institute, Nanning, Guangxi, China
- * E-mail: (MY); (YL)
| |
Collapse
|
11
|
Bernardi G. The "Genomic Code": DNA Pervasively Moulds Chromatin Structures Leaving no Room for "Junk". Life (Basel) 2021; 11:342. [PMID: 33924668 PMCID: PMC8070607 DOI: 10.3390/life11040342] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 04/06/2021] [Accepted: 04/07/2021] [Indexed: 02/07/2023] Open
Abstract
The chromatin of the human genome was analyzed at three DNA size levels. At the first, compartment level, two "gene spaces" were found many years ago: A GC-rich, gene-rich "genome core" and a GC-poor, gene-poor "genome desert", the former corresponding to open chromatin centrally located in the interphase nucleus, the latter to closed chromatin located peripherally. This bimodality was later confirmed and extended by the discoveries (1) of LADs, the Lamina-Associated Domains, and InterLADs; (2) of two "spatial compartments", A and B, identified on the basis of chromatin interactions; and (3) of "forests and prairies" characterized by high and low CpG islands densities. Chromatin compartments were shown to be associated with the compositionally different, flat and single- or multi-peak DNA structures of the two, GC-poor and GC-rich, "super-families" of isochores. At the second, sub-compartment, level, chromatin corresponds to flat isochores and to isochore loops (due to compositional DNA gradients) that are susceptible to extrusion. Finally, at the short-sequence level, two sets of sequences, GC-poor and GC-rich, define two different nucleosome spacings, a short one and a long one. In conclusion, chromatin structures are moulded according to a "genomic code" by DNA sequences that pervade the genome and leave no room for "junk".
Collapse
Affiliation(s)
- Giorgio Bernardi
- Science Department, Roma Tre University, Viale Marconi 446, 00146 Rome, Italy; ; Tel.: +39-33-540-5892
- Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Naples, Italy
| |
Collapse
|
12
|
Characterization of microsatellites in the endangered snow leopard based on the chromosome-level genome. MAMMAL RES 2021. [DOI: 10.1007/s13364-021-00563-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
13
|
Matoulek D, Borůvková V, Ocalewicz K, Symonová R. GC and Repeats Profiling along Chromosomes-The Future of Fish Compositional Cytogenomics. Genes (Basel) 2020; 12:50. [PMID: 33396302 PMCID: PMC7823971 DOI: 10.3390/genes12010050] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 12/28/2020] [Accepted: 12/29/2020] [Indexed: 12/14/2022] Open
Abstract
The study of fish cytogenetics has been impeded by the inability to produce G-bands that could assign chromosomes to their homologous pairs. Thus, the majority of karyotypes published have been estimated based on morphological similarities of chromosomes. The reason why chromosome G-banding does not work in fish remains elusive. However, the recent increase in the number of fish genomes assembled to the chromosome level provides a way to analyse this issue. We have developed a Python tool to visualize and quantify GC percentage (GC%) of both repeats and unique DNA along chromosomes using a non-overlapping sliding window approach. Our tool profiles GC% and simultaneously plots the proportion of repeats (rep%) in a color scale (or vice versa). Hence, it is possible to assess the contribution of repeats to the total GC%. The main differences are the GC% of repeats homogenizing the overall GC% along fish chromosomes and a greater range of GC% scattered along fish chromosomes. This may explain the inability to produce G-banding in fish. We also show an occasional banding pattern along the chromosomes in some fish that probably cannot be detected with traditional qualitative cytogenetic methods.
Collapse
Affiliation(s)
- Dominik Matoulek
- Faculty of Science, University of Hradec Kralove, 500 03 Hradec Králové, Czech Republic; (D.M.); (V.B.)
| | - Veronika Borůvková
- Faculty of Science, University of Hradec Kralove, 500 03 Hradec Králové, Czech Republic; (D.M.); (V.B.)
| | - Konrad Ocalewicz
- Department of Marine Biology and Ecology, Institute of Oceanography, Faculty of Oceanography and Geography, University of Gdansk, 80-309 Gdansk, Poland;
| | - Radka Symonová
- Department of Bioinformatics, Wissenschaftszentrum Weihenstephan, Technische Universität München, 80333 Freising, Germany
| |
Collapse
|
14
|
Arhondakis S, Milanesi M, Castrignanò T, Gioiosa S, Valentini A, Chillemi G. Evidence of distinct gene functional patterns in GC-poor and GC-rich isochores in Bos taurus. Anim Genet 2020; 51:358-368. [PMID: 32069522 DOI: 10.1111/age.12917] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/20/2020] [Indexed: 01/10/2023]
Abstract
Vertebrate genomes are mosaics of megabase-size DNA segments with a fairly homogeneous base composition, called isochores. They are divided into five families characterized by different guanine-cytosine (GC) levels and linked to several functional and structural properties. The increased availability of fully sequenced genomes allows the investigation of isochores in several species, assessing their level of conservation across vertebrate genomes. In this work, we characterized the isochores in Bos taurus using the ARS-UCD1.2 genome version. The comparison of our results with the well-studied human isochores and those of other mammals revealed a large conservation in isochore families, in number, average GC levels and gene density. Exceptions to the established increase in gene density with the increase in isochores (GC%) were observed for the following gene biotypes: tRNA, small nuclear RNA, small nucleolar RNA and pseudogenes that have their maximum number in H2 and H1 isochores. Subsequently, we assessed the ontology of all gene biotypes looking for functional classes that are statistically over- or under-represented in each isochore. Receptor activity and sensory perception pathways were significantly over-represented in L1 and L2 (GC-poor) isochores. This was also validated for the horse genome. Our analysis of housekeeping genes confirmed a preferential localization in GC-rich isochores, as reported in other species. Finally, we assessed the SNP distribution of a bovine high-density SNP chip across the isochores, finding a higher density in the GC-rich families, reflecting a potential bias in the chip, widely used for genetic selection and biodiversity studies.
Collapse
Affiliation(s)
- S Arhondakis
- Bioinformatics and Computational Science (BioCoS), Boniali 11-19, Chania, 73134, Crete, Greece
| | - M Milanesi
- Department of Support, Production and Animal Health, School of Veterinary Medicine, São Paulo State University, 16050-680 R. Clóvis Pestana 793 - Dona Amelia, Araçatuba, SP, Brazil.,International Atomic Energy Agency Collaborating Centre on Animal Genomics and Bioinformatics, 16050-680 R. Clóvis Pestana 793 - Dona Amelia, Araçatuba, SP, Brazil
| | - T Castrignanò
- SCAI - Super Computing Applications and Innovation Department, CINECA, Rome, Italy
| | - S Gioiosa
- SCAI - Super Computing Applications and Innovation Department, CINECA, Rome, Italy
| | - A Valentini
- Department for Innovation in Biological, Agro-food and Forest Systems, DIBAF, University of Tuscia, via S. Camillo de Lellis s.n.c, 01100, Viterbo, Italy
| | - G Chillemi
- Department for Innovation in Biological, Agro-food and Forest Systems, DIBAF, University of Tuscia, via S. Camillo de Lellis s.n.c, 01100, Viterbo, Italy.,Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, IBIOM, CNR, Bari, Italy
| |
Collapse
|
15
|
Bernardi G. The Genomic Code: A Pervasive Encoding/Molding of Chromatin Structures and a Solution of the "Non-Coding DNA" Mystery. Bioessays 2019; 41:e1900106. [PMID: 31701567 DOI: 10.1002/bies.201900106] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2019] [Revised: 08/07/2019] [Indexed: 12/15/2022]
Abstract
Recent investigations have revealed 1) that the isochores of the human genome group into two super-families characterized by two different long-range 3D structures, and 2) that these structures, essentially based on the distribution and topology of short sequences, mold primary chromatin domains (and define nucleosome binding). More specifically, GC-poor, gene-poor isochores are low-heterogeneity sequences with oligo-A spikes that mold the lamina-associated domains (LADs), whereas GC-rich, gene-rich isochores are characterized by single or multiple GC peaks that mold the topologically associating domains (TADs). The formation of these "primary TADs" may be followed by extrusion under the action of cohesin and CTCF. Finally, the genomic code, which is responsible for the pervasive encoding and molding of primary chromatin domains (LADs and primary TADs, namely the "gene spaces"/"spatial compartments") resolves the longstanding problems of "non-coding DNA," "junk DNA," and "selfish DNA" leading to a new vision of the genome as shaped by DNA sequences.
Collapse
Affiliation(s)
- Giorgio Bernardi
- Science Department, Roma Tre University, Viale Marconi 446, 00146, Rome, Italy
- Stazione Zoologica Anton Dohrn, Villa Comunale, 80121, Naples, Italy
| |
Collapse
|
16
|
Attardo GM, Abd-Alla AMM, Acosta-Serrano A, Allen JE, Bateta R, Benoit JB, Bourtzis K, Caers J, Caljon G, Christensen MB, Farrow DW, Friedrich M, Hua-Van A, Jennings EC, Larkin DM, Lawson D, Lehane MJ, Lenis VP, Lowy-Gallego E, Macharia RW, Malacrida AR, Marco HG, Masiga D, Maslen GL, Matetovici I, Meisel RP, Meki I, Michalkova V, Miller WJ, Minx P, Mireji PO, Ometto L, Parker AG, Rio R, Rose C, Rosendale AJ, Rota-Stabelli O, Savini G, Schoofs L, Scolari F, Swain MT, Takáč P, Tomlinson C, Tsiamis G, Van Den Abbeele J, Vigneron A, Wang J, Warren WC, Waterhouse RM, Weirauch MT, Weiss BL, Wilson RK, Zhao X, Aksoy S. Comparative genomic analysis of six Glossina genomes, vectors of African trypanosomes. Genome Biol 2019; 20:187. [PMID: 31477173 PMCID: PMC6721284 DOI: 10.1186/s13059-019-1768-2] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 07/22/2019] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Tsetse flies (Glossina sp.) are the vectors of human and animal trypanosomiasis throughout sub-Saharan Africa. Tsetse flies are distinguished from other Diptera by unique adaptations, including lactation and the birthing of live young (obligate viviparity), a vertebrate blood-specific diet by both sexes, and obligate bacterial symbiosis. This work describes the comparative analysis of six Glossina genomes representing three sub-genera: Morsitans (G. morsitans morsitans, G. pallidipes, G. austeni), Palpalis (G. palpalis, G. fuscipes), and Fusca (G. brevipalpis) which represent different habitats, host preferences, and vectorial capacity. RESULTS Genomic analyses validate established evolutionary relationships and sub-genera. Syntenic analysis of Glossina relative to Drosophila melanogaster shows reduced structural conservation across the sex-linked X chromosome. Sex-linked scaffolds show increased rates of female-specific gene expression and lower evolutionary rates relative to autosome associated genes. Tsetse-specific genes are enriched in protease, odorant-binding, and helicase activities. Lactation-associated genes are conserved across all Glossina species while male seminal proteins are rapidly evolving. Olfactory and gustatory genes are reduced across the genus relative to other insects. Vision-associated Rhodopsin genes show conservation of motion detection/tracking functions and variance in the Rhodopsin detecting colors in the blue wavelength ranges. CONCLUSIONS Expanded genomic discoveries reveal the genetics underlying Glossina biology and provide a rich body of knowledge for basic science and disease control. They also provide insight into the evolutionary biology underlying novel adaptations and are relevant to applied aspects of vector control such as trap design and discovery of novel pest and disease control strategies.
Collapse
Affiliation(s)
- Geoffrey M Attardo
- Department of Entomology and Nematology, University of California, Davis, Davis, CA, USA.
| | - Adly M M Abd-Alla
- Insect Pest Control Laboratory, Joint FAO/IAEA Division of Nuclear Techniques in Food & Agriculture, Vienna, Vienna, Austria
| | - Alvaro Acosta-Serrano
- Department of Vector Biology, Liverpool School of Tropical Medicine, Merseyside, Liverpool, UK
| | - James E Allen
- VectorBase, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, Cambridgeshire, UK
| | - Rosemary Bateta
- Department of Biochemistry, Biotechnology Research Institute - Kenya Agricultural and Livestock Research Organization, Kikuyu, Kenya
| | - Joshua B Benoit
- Department of Biological Sciences, University of Cincinnati, Cincinnati, OH, USA
| | - Kostas Bourtzis
- Insect Pest Control Laboratory, Joint FAO/IAEA Division of Nuclear Techniques in Food & Agriculture, Vienna, Vienna, Austria
| | - Jelle Caers
- Department of Biology - Functional Genomics and Proteomics Group, KU Leuven, Leuven, Belgium
| | - Guy Caljon
- Laboratory of Microbiology, Parasitology and Hygiene, University of Antwerp, Antwerp, Belgium
| | - Mikkel B Christensen
- VectorBase, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, Cambridgeshire, UK
| | - David W Farrow
- Department of Biological Sciences, University of Cincinnati, Cincinnati, OH, USA
| | - Markus Friedrich
- Department of Biological Sciences, Wayne State University, Detroit, MI, USA
| | - Aurélie Hua-Van
- Laboratoire Evolution, Genomes, Comportement, Ecologie, CNRS, IRD, Univ. Paris-Sud, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Emily C Jennings
- Department of Biological Sciences, University of Cincinnati, Cincinnati, OH, USA
| | - Denis M Larkin
- Department of Comparative Biomedical Sciences, Royal Veterinary College, London, UK
| | - Daniel Lawson
- Department of Life Sciences, Imperial College London, London, UK
| | - Michael J Lehane
- Department of Vector Biology, Liverpool School of Tropical Medicine, Merseyside, Liverpool, UK
| | - Vasileios P Lenis
- Schools of Medicine and Dentistry, University of Plymouth, Plymouth, UK
| | - Ernesto Lowy-Gallego
- VectorBase, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, Cambridgeshire, UK
| | - Rosaline W Macharia
- Molecular Biology and Bioinformatics Unit, International Center for Insect Physiology and Ecology, Nairobi, Kenya.,Centre for Biotechnology and Bioinformatics, University of Nairobi, Nairobi, Kenya
| | - Anna R Malacrida
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Heather G Marco
- Department of Biological Sciences, University of Cape Town, Rondebosch, South Africa
| | - Daniel Masiga
- Molecular Biology and Bioinformatics Unit, International Center for Insect Physiology and Ecology, Nairobi, Kenya
| | - Gareth L Maslen
- VectorBase, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, Cambridgeshire, UK
| | - Irina Matetovici
- Biomedical Sciences, Institute of Tropical Medicine, Antwerp, Belgium
| | - Richard P Meisel
- Department of Biology and Biochemistry, University of Houston, Houston, TX, USA
| | - Irene Meki
- Insect Pest Control Laboratory, Joint FAO/IAEA Division of Nuclear Techniques in Food & Agriculture, Vienna, Vienna, Austria
| | - Veronika Michalkova
- Department of Biological Sciences, Florida International University, Miami, Florida, USA.,Institute of Zoology, Slovak Academy of Sciences, Bratislava, Slovakia
| | - Wolfgang J Miller
- Department of Cell and Developmental Biology, Medical University of Vienna, Vienna, Austria
| | - Patrick Minx
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Paul O Mireji
- Department of Biochemistry, Biotechnology Research Institute - Kenya Agricultural and Livestock Research Organization, Kikuyu, Kenya.,Centre for Geographic Medicine Research Coast, Kenya Medical Research Institute, Kilifi, Kenya
| | - Lino Ometto
- Department of Sustainable Ecosystems and Bioresources, Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, TN, Italy.,Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Andrew G Parker
- Insect Pest Control Laboratory, Joint FAO/IAEA Division of Nuclear Techniques in Food & Agriculture, Vienna, Vienna, Austria
| | - Rita Rio
- Department of Biology, West Virginia University, Morgantown, WV, USA
| | - Clair Rose
- Department of Vector Biology, Liverpool School of Tropical Medicine, Merseyside, Liverpool, UK
| | - Andrew J Rosendale
- Department of Biology, Mount St. Joseph University, Cincinnati, OH, USA.,Department of Biological Sciences, University of Cincinnati, Cincinnati, OH, USA
| | - Omar Rota-Stabelli
- Department of Sustainable Ecosystems and Bioresources, Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, TN, Italy
| | - Grazia Savini
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Liliane Schoofs
- Department of Biology - Functional Genomics and Proteomics Group, KU Leuven, Leuven, Belgium
| | - Francesca Scolari
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Martin T Swain
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, Ceredigion, UK
| | - Peter Takáč
- Department of Animal Systematics, Ústav zoológie SAV; Scientica, Ltd, Bratislava, Slovakia
| | - Chad Tomlinson
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - George Tsiamis
- Department of Environmental and Natural Resources Management, University of Patras, Agrinio, Etoloakarnania, Greece
| | | | - Aurelien Vigneron
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, CT, USA
| | - Jingwen Wang
- School of Life Sciences, Fudan University, Shanghai, China
| | - Wesley C Warren
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA.,Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Robert M Waterhouse
- Department of Ecology & Evolution, Swiss Institute of Bioinformatics, University of Lausanne, Lausanne, Switzerland
| | - Matthew T Weirauch
- Center for Autoimmune Genomics and Etiology and Divisions of Biomedical Informatics and Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Brian L Weiss
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, CT, USA
| | - Richard K Wilson
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Xin Zhao
- CAS Center for Influenza Research and Early-warning (CASCIRE), Chinese Academy of Sciences, Beijing, China
| | - Serap Aksoy
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, CT, USA.
| |
Collapse
|
17
|
Attardo GM, Abd-Alla AMM, Acosta-Serrano A, Allen JE, Bateta R, Benoit JB, Bourtzis K, Caers J, Caljon G, Christensen MB, Farrow DW, Friedrich M, Hua-Van A, Jennings EC, Larkin DM, Lawson D, Lehane MJ, Lenis VP, Lowy-Gallego E, Macharia RW, Malacrida AR, Marco HG, Masiga D, Maslen GL, Matetovici I, Meisel RP, Meki I, Michalkova V, Miller WJ, Minx P, Mireji PO, Ometto L, Parker AG, Rio R, Rose C, Rosendale AJ, Rota-Stabelli O, Savini G, Schoofs L, Scolari F, Swain MT, Takáč P, Tomlinson C, Tsiamis G, Van Den Abbeele J, Vigneron A, Wang J, Warren WC, Waterhouse RM, Weirauch MT, Weiss BL, Wilson RK, Zhao X, Aksoy S. Comparative genomic analysis of six Glossina genomes, vectors of African trypanosomes. Genome Biol 2019; 20:187. [PMID: 31477173 DOI: 10.1101/531749] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 07/22/2019] [Indexed: 05/24/2023] Open
Abstract
BACKGROUND Tsetse flies (Glossina sp.) are the vectors of human and animal trypanosomiasis throughout sub-Saharan Africa. Tsetse flies are distinguished from other Diptera by unique adaptations, including lactation and the birthing of live young (obligate viviparity), a vertebrate blood-specific diet by both sexes, and obligate bacterial symbiosis. This work describes the comparative analysis of six Glossina genomes representing three sub-genera: Morsitans (G. morsitans morsitans, G. pallidipes, G. austeni), Palpalis (G. palpalis, G. fuscipes), and Fusca (G. brevipalpis) which represent different habitats, host preferences, and vectorial capacity. RESULTS Genomic analyses validate established evolutionary relationships and sub-genera. Syntenic analysis of Glossina relative to Drosophila melanogaster shows reduced structural conservation across the sex-linked X chromosome. Sex-linked scaffolds show increased rates of female-specific gene expression and lower evolutionary rates relative to autosome associated genes. Tsetse-specific genes are enriched in protease, odorant-binding, and helicase activities. Lactation-associated genes are conserved across all Glossina species while male seminal proteins are rapidly evolving. Olfactory and gustatory genes are reduced across the genus relative to other insects. Vision-associated Rhodopsin genes show conservation of motion detection/tracking functions and variance in the Rhodopsin detecting colors in the blue wavelength ranges. CONCLUSIONS Expanded genomic discoveries reveal the genetics underlying Glossina biology and provide a rich body of knowledge for basic science and disease control. They also provide insight into the evolutionary biology underlying novel adaptations and are relevant to applied aspects of vector control such as trap design and discovery of novel pest and disease control strategies.
Collapse
Affiliation(s)
- Geoffrey M Attardo
- Department of Entomology and Nematology, University of California, Davis, Davis, CA, USA.
| | - Adly M M Abd-Alla
- Insect Pest Control Laboratory, Joint FAO/IAEA Division of Nuclear Techniques in Food & Agriculture, Vienna, Vienna, Austria
| | - Alvaro Acosta-Serrano
- Department of Vector Biology, Liverpool School of Tropical Medicine, Merseyside, Liverpool, UK
| | - James E Allen
- VectorBase, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, Cambridgeshire, UK
| | - Rosemary Bateta
- Department of Biochemistry, Biotechnology Research Institute - Kenya Agricultural and Livestock Research Organization, Kikuyu, Kenya
| | - Joshua B Benoit
- Department of Biological Sciences, University of Cincinnati, Cincinnati, OH, USA
| | - Kostas Bourtzis
- Insect Pest Control Laboratory, Joint FAO/IAEA Division of Nuclear Techniques in Food & Agriculture, Vienna, Vienna, Austria
| | - Jelle Caers
- Department of Biology - Functional Genomics and Proteomics Group, KU Leuven, Leuven, Belgium
| | - Guy Caljon
- Laboratory of Microbiology, Parasitology and Hygiene, University of Antwerp, Antwerp, Belgium
| | - Mikkel B Christensen
- VectorBase, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, Cambridgeshire, UK
| | - David W Farrow
- Department of Biological Sciences, University of Cincinnati, Cincinnati, OH, USA
| | - Markus Friedrich
- Department of Biological Sciences, Wayne State University, Detroit, MI, USA
| | - Aurélie Hua-Van
- Laboratoire Evolution, Genomes, Comportement, Ecologie, CNRS, IRD, Univ. Paris-Sud, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Emily C Jennings
- Department of Biological Sciences, University of Cincinnati, Cincinnati, OH, USA
| | - Denis M Larkin
- Department of Comparative Biomedical Sciences, Royal Veterinary College, London, UK
| | - Daniel Lawson
- Department of Life Sciences, Imperial College London, London, UK
| | - Michael J Lehane
- Department of Vector Biology, Liverpool School of Tropical Medicine, Merseyside, Liverpool, UK
| | - Vasileios P Lenis
- Schools of Medicine and Dentistry, University of Plymouth, Plymouth, UK
| | - Ernesto Lowy-Gallego
- VectorBase, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, Cambridgeshire, UK
| | - Rosaline W Macharia
- Molecular Biology and Bioinformatics Unit, International Center for Insect Physiology and Ecology, Nairobi, Kenya
- Centre for Biotechnology and Bioinformatics, University of Nairobi, Nairobi, Kenya
| | - Anna R Malacrida
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Heather G Marco
- Department of Biological Sciences, University of Cape Town, Rondebosch, South Africa
| | - Daniel Masiga
- Molecular Biology and Bioinformatics Unit, International Center for Insect Physiology and Ecology, Nairobi, Kenya
| | - Gareth L Maslen
- VectorBase, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, Cambridgeshire, UK
| | - Irina Matetovici
- Biomedical Sciences, Institute of Tropical Medicine, Antwerp, Belgium
| | - Richard P Meisel
- Department of Biology and Biochemistry, University of Houston, Houston, TX, USA
| | - Irene Meki
- Insect Pest Control Laboratory, Joint FAO/IAEA Division of Nuclear Techniques in Food & Agriculture, Vienna, Vienna, Austria
| | - Veronika Michalkova
- Department of Biological Sciences, Florida International University, Miami, Florida, USA
- Institute of Zoology, Slovak Academy of Sciences, Bratislava, Slovakia
| | - Wolfgang J Miller
- Department of Cell and Developmental Biology, Medical University of Vienna, Vienna, Austria
| | - Patrick Minx
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Paul O Mireji
- Department of Biochemistry, Biotechnology Research Institute - Kenya Agricultural and Livestock Research Organization, Kikuyu, Kenya
- Centre for Geographic Medicine Research Coast, Kenya Medical Research Institute, Kilifi, Kenya
| | - Lino Ometto
- Department of Sustainable Ecosystems and Bioresources, Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, TN, Italy
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Andrew G Parker
- Insect Pest Control Laboratory, Joint FAO/IAEA Division of Nuclear Techniques in Food & Agriculture, Vienna, Vienna, Austria
| | - Rita Rio
- Department of Biology, West Virginia University, Morgantown, WV, USA
| | - Clair Rose
- Department of Vector Biology, Liverpool School of Tropical Medicine, Merseyside, Liverpool, UK
| | - Andrew J Rosendale
- Department of Biology, Mount St. Joseph University, Cincinnati, OH, USA
- Department of Biological Sciences, University of Cincinnati, Cincinnati, OH, USA
| | - Omar Rota-Stabelli
- Department of Sustainable Ecosystems and Bioresources, Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, TN, Italy
| | - Grazia Savini
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Liliane Schoofs
- Department of Biology - Functional Genomics and Proteomics Group, KU Leuven, Leuven, Belgium
| | - Francesca Scolari
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Martin T Swain
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, Ceredigion, UK
| | - Peter Takáč
- Department of Animal Systematics, Ústav zoológie SAV; Scientica, Ltd, Bratislava, Slovakia
| | - Chad Tomlinson
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - George Tsiamis
- Department of Environmental and Natural Resources Management, University of Patras, Agrinio, Etoloakarnania, Greece
| | | | - Aurelien Vigneron
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, CT, USA
| | - Jingwen Wang
- School of Life Sciences, Fudan University, Shanghai, China
| | - Wesley C Warren
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
- Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Robert M Waterhouse
- Department of Ecology & Evolution, Swiss Institute of Bioinformatics, University of Lausanne, Lausanne, Switzerland
| | - Matthew T Weirauch
- Center for Autoimmune Genomics and Etiology and Divisions of Biomedical Informatics and Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Brian L Weiss
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, CT, USA
| | - Richard K Wilson
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Xin Zhao
- CAS Center for Influenza Research and Early-warning (CASCIRE), Chinese Academy of Sciences, Beijing, China
| | - Serap Aksoy
- Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, CT, USA.
| |
Collapse
|
18
|
Li D, Li Y, Li M, Che T, Tian S, Chen B, Zhou X, Zhang G, Gaur U, Luo M, Tian K, He M, He S, Xu Z, Jin L, Tang Q, Dai Y, Xu H, Hu Y, Zhao X, Yin H, Wang Y, Zhou R, Yang C, Du H, Jiang X, Zhu Q, Li M. Population genomics identifies patterns of genetic diversity and selection in chicken. BMC Genomics 2019; 20:263. [PMID: 30940068 PMCID: PMC6446315 DOI: 10.1186/s12864-019-5622-4] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Accepted: 03/19/2019] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND There are hundreds of phenotypically distinguishable domestic chicken breeds or lines with highly specialized traits worldwide, which provide a unique opportunity to illustrate how selection shapes patterns of genetic variation. There are many local chicken breeds in China. RESULTS Here, we provide a population genome landscape of genetic variations in 86 domestic chickens representing 10 phenotypically diverse breeds. Genome-wide analysis indicated that sex chromosomes have less genetic diversity and are under stronger selection than autosomes during domestication and local adaptation. We found an evidence of admixture between Tibetan chickens and other domestic population. We further identified strong signatures of selection affecting genomic regions that harbor genes underlying economic traits (typically related to feathers, skin color, growth, reproduction and aggressiveness) and local adaptation (to high altitude). By comparing the genomes of the Tibetan and lowland fowls, we identified genes associated with high-altitude adaptation in Tibetan chickens were mainly involved in energy metabolism, body size maintenance and available food sources. CONCLUSIONS The work provides crucial insights into the distinct evolutionary scenarios occurring under artificial selection for agricultural production and under natural selection for success at high altitudes in chicken. Several genes were identified as candidates for chicken economic traits and other phenotypic traits.
Collapse
Affiliation(s)
- Diyan Li
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Yan Li
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Miao Li
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Tiandong Che
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Shilin Tian
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China.,Novogene Bioinformatics Institute, Beijing, China
| | - Binlong Chen
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Xuming Zhou
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, USA
| | - Guolong Zhang
- Department of Animal Science, Oklahoma State University, Stillwater, OK, USA
| | - Uma Gaur
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Majing Luo
- Hubei Key Laboratory of Cell Homeostasis, Laboratory of Molecular and Developmental Genetics, College of Life Sciences, Wuhan University, Wuhan, China
| | - Kai Tian
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Mengnan He
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Shen He
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Zhongxian Xu
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Long Jin
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Qianzi Tang
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Yifei Dai
- Novogene Bioinformatics Institute, Beijing, China
| | - Huailiang Xu
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Yaodong Hu
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Xiaoling Zhao
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Huadong Yin
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Yan Wang
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China
| | - Rongjia Zhou
- Hubei Key Laboratory of Cell Homeostasis, Laboratory of Molecular and Developmental Genetics, College of Life Sciences, Wuhan University, Wuhan, China
| | - Chaowu Yang
- Sichuan Animal Science Academy, Chengdu, China
| | - Huarui Du
- Sichuan Animal Science Academy, Chengdu, China
| | | | - Qing Zhu
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China.
| | - Mingzhou Li
- Institute of Animal Genetics and Breeding, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, China.
| |
Collapse
|
19
|
Gul IS, Staal J, Hulpiau P, De Keuckelaere E, Kamm K, Deroo T, Sanders E, Staes K, Driege Y, Saeys Y, Beyaert R, Technau U, Schierwater B, van Roy F. GC Content of Early Metazoan Genes and Its Impact on Gene Expression Levels in Mammalian Cell Lines. Genome Biol Evol 2018; 10:909-917. [PMID: 29608715 PMCID: PMC5952964 DOI: 10.1093/gbe/evy040] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/20/2018] [Indexed: 01/20/2023] Open
Abstract
With the genomes available for many animal clades, including the early-branching metazoans, one can readily study the functional conservation of genes across a diversity of animal lineages. Ectopic expression of an animal protein in, for instance, a mammalian cell line is a generally used strategy in structure–function analysis. However, this might turn out to be problematic in case of distantly related species. Here we analyzed the GC content of the coding sequences of basal animals and show its impact on gene expression levels in human cell lines, and, importantly, how this expression efficiency can be improved. Optimization of the GC3 content in the coding sequences of cadherin, alpha-catenin, and paracaspase of Trichoplax adhaerens dramatically increased the expression of these basal animal genes in human cell lines.
Collapse
Affiliation(s)
- Ismail Sahin Gul
- Center for Inflammation Research, Flanders Institute for Biotechnology (VIB), Ghent, Belgium.,Department of Biomedical Molecular Biology, Ghent University, Belgium
| | - Jens Staal
- Center for Inflammation Research, Flanders Institute for Biotechnology (VIB), Ghent, Belgium.,Department of Biomedical Molecular Biology, Ghent University, Belgium
| | - Paco Hulpiau
- Center for Inflammation Research, Flanders Institute for Biotechnology (VIB), Ghent, Belgium.,Department of Biomedical Molecular Biology, Ghent University, Belgium
| | - Evi De Keuckelaere
- Center for Inflammation Research, Flanders Institute for Biotechnology (VIB), Ghent, Belgium.,Department of Biomedical Molecular Biology, Ghent University, Belgium
| | - Kai Kamm
- Institut für Tierökologie und Zellbiologie (ITZ), Division of Ecology and Evolution, Stiftung Tieraerztliche Hochschule Hannover, Hannover, Germany
| | - Tom Deroo
- Center for Inflammation Research, Flanders Institute for Biotechnology (VIB), Ghent, Belgium.,Department of Biomedical Molecular Biology, Ghent University, Belgium
| | - Ellen Sanders
- Center for Inflammation Research, Flanders Institute for Biotechnology (VIB), Ghent, Belgium.,Department of Biomedical Molecular Biology, Ghent University, Belgium
| | - Katrien Staes
- Center for Inflammation Research, Flanders Institute for Biotechnology (VIB), Ghent, Belgium.,Department of Biomedical Molecular Biology, Ghent University, Belgium
| | - Yasmine Driege
- Center for Inflammation Research, Flanders Institute for Biotechnology (VIB), Ghent, Belgium.,Department of Biomedical Molecular Biology, Ghent University, Belgium
| | - Yvan Saeys
- Center for Inflammation Research, Flanders Institute for Biotechnology (VIB), Ghent, Belgium.,Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Belgium
| | - Rudi Beyaert
- Center for Inflammation Research, Flanders Institute for Biotechnology (VIB), Ghent, Belgium.,Department of Biomedical Molecular Biology, Ghent University, Belgium
| | - Ulrich Technau
- Department of Molecular Evolution and Development, Faculty of Life Sciences, University of Vienna, Austria
| | - Bernd Schierwater
- Institut für Tierökologie und Zellbiologie (ITZ), Division of Ecology and Evolution, Stiftung Tieraerztliche Hochschule Hannover, Hannover, Germany
| | - Frans van Roy
- Center for Inflammation Research, Flanders Institute for Biotechnology (VIB), Ghent, Belgium.,Department of Biomedical Molecular Biology, Ghent University, Belgium
| |
Collapse
|
20
|
Zhang D, Hu P, Liu T, Wang J, Jiang S, Xu Q, Chen L. GC bias lead to increased small amino acids and random coils of proteins in cold-water fishes. BMC Genomics 2018; 19:315. [PMID: 29720106 PMCID: PMC5930961 DOI: 10.1186/s12864-018-4684-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2017] [Accepted: 04/16/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Temperature adaptation of biological molecules is fundamental in evolutionary studies but remains unsolved. Fishes living in cold water are adapted to low temperatures through adaptive modification of their biological molecules, which enables their functioning in extreme cold. To study nucleotide and amino acid preference in cold-water fishes, we investigated the substitution asymmetry of codons and amino acids in protein-coding DNA sequences between cold-water fishes and tropical fishes., The former includes two Antarctic fishes, Dissostichus mawsoni (Antarctic toothfish), Gymnodraco acuticeps (Antarctic dragonfish), and two temperate fishes, Gadus morhua (Atlantic cod) and Gasterosteus aculeatus (stickleback), and the latter includes three tropical fishes, including Danio rerio (zebrafish), Oreochromis niloticus (Nile tilapia) and Xiphophorus maculatus (Platyfish). RESULTS Cold-water fishes showed preference for Guanines and cytosines (GCs) in both synonymous and nonsynonymous codon substitution when compared with tropical fishes. Amino acids coded by GC-rich codons are favored in the temperate fishes, while those coded by AT-rich codons are disfavored. Similar trends were discovered in Antarctic fishes but were statistically weaker. The preference of GC rich codons in nonsynonymous substitution tends to increase ratio of small amino acid in proteins, which was demonstrated by biased small amino acid substitutions in the cold-water species when compared with the tropical species, especially in the temperate species. Prediction and comparison of secondary structure of the proteomes showed that frequency of random coils are significantly larger in the cold-water fish proteomes than those of the tropical fishes. CONCLUSIONS Our results suggested that natural selection in cold temperature might favor biased GC content in the coding DNA sequences, which lead to increased frequency of small amino acids and consequently increased random coils in the proteomes of cold-water fishes.
Collapse
Affiliation(s)
- Dongsheng Zhang
- Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources, Shanghai Ocean University, Ministry of Education, National Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University), Shanghai, People's Republic of China
| | - Peng Hu
- Department of Genetics, University of Pennsylvania, Philadelphia, USA
| | - Taigang Liu
- College of Informatics, Shanghai Ocean University, Shanghai, People's Republic of China
| | - Jian Wang
- Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources, Shanghai Ocean University, Ministry of Education, National Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University), Shanghai, People's Republic of China
| | - Shouwen Jiang
- Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources, Shanghai Ocean University, Ministry of Education, National Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University), Shanghai, People's Republic of China
| | - Qianghua Xu
- College of Marine Sciences, Shanghai Ocean University, Shanghai, People's Republic of China
| | - Liangbiao Chen
- Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources, Shanghai Ocean University, Ministry of Education, National Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University), Shanghai, People's Republic of China.
| |
Collapse
|
21
|
Mehanna P, Gagné V, Lajoie M, Spinella JF, St-Onge P, Sinnett D, Brukner I, Krajinovic M. Characterization of the microDNA through the response to chemotherapeutics in lymphoblastoid cell lines. PLoS One 2017; 12:e0184365. [PMID: 28877255 PMCID: PMC5587290 DOI: 10.1371/journal.pone.0184365] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2017] [Accepted: 08/22/2017] [Indexed: 12/18/2022] Open
Abstract
Recently, a new class of extrachromosomal circular DNA, called microDNA, was identified. They are on average 100 to 400 bp long and are derived from unique non-repetitive genomic regions with high gene density. MicroDNAs are thought to arise from DNA breaks associated with RNA metabolism or replication slippage. Given the paucity of information on this entirely novel phenomenon, we aimed to get an additional insight into microDNA features by performing the microDNA analysis in 20 independent human lymphoblastoid cell lines (LCLs) prior and after treatment with chemotherapeutic drugs. The results showed non-random genesis of microDNA clusters from the active regions of the genome. The size periodicity of 190 bp was observed, which matches DNA fragmentation typical for apoptotic cells. The chemotherapeutic drug-induced apoptosis of LCLs increased both number and size of clusters further suggesting that part of microDNAs could result from the programmed cell death. Interestingly, proportion of identified microDNA sequences has common loci of origin when compared between cell line experiments. While compatible with the original observation that microDNAs originate from a normal physiological process, obtained results imply complementary source of its production. Furthermore, non-random genesis of microDNAs depicted by redundancy between samples makes these entities possible candidates for new biomarker generation.
Collapse
Affiliation(s)
- Pamela Mehanna
- CHU Sainte-Justine Research Center, University of Montreal, Montreal, Qc, Canada
| | - Vincent Gagné
- CHU Sainte-Justine Research Center, University of Montreal, Montreal, Qc, Canada
| | - Mathieu Lajoie
- CHU Sainte-Justine Research Center, University of Montreal, Montreal, Qc, Canada
| | | | - Pascal St-Onge
- CHU Sainte-Justine Research Center, University of Montreal, Montreal, Qc, Canada
| | - Daniel Sinnett
- CHU Sainte-Justine Research Center, University of Montreal, Montreal, Qc, Canada
- Department of Pediatrics, Faculty of Medicine, University of Montreal, Montreal, Qc, Canada
| | - Ivan Brukner
- Molecular Diagnostics Laboratory, Jewish General Hospital, McGill University, Montreal, Montreal, Qc, Canada
| | - Maja Krajinovic
- CHU Sainte-Justine Research Center, University of Montreal, Montreal, Qc, Canada
- Department of Pediatrics, Faculty of Medicine, University of Montreal, Montreal, Qc, Canada
- Department of Pharmacology and Physiology, Faculty of Medicine, University of Montreal, Montreal, Qc, Canada
- * E-mail:
| |
Collapse
|
22
|
Experimental Support That Natural Selection Has Shaped the Latitudinal Distribution of Mitochondrial Haplotypes in Australian Drosophila melanogaster. Mol Biol Evol 2017. [DOI: 10.1093/molbev/msx184] [Citation(s) in RCA: 77] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
|
23
|
Costantini M, Musto H. The Isochores as a Fundamental Level of Genome Structure and Organization: A General Overview. J Mol Evol 2017; 84:93-103. [PMID: 28243687 DOI: 10.1007/s00239-017-9785-9] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Accepted: 02/15/2017] [Indexed: 11/30/2022]
Abstract
The recent availability of a number of fully sequenced genomes (including marine organisms) allowed to map very precisely the isochores, based on DNA sequences, confirming the results obtained before genome sequencing by the ultracentrifugation in CsCl. In fact, the analytical profile of human DNA showed that the vertebrate genome is a mosaic of isochores, typically megabase-size DNA segments that belong to a small number of families characterized by different GC levels. In this review, we will concentrate on some general genome features regarding the compositional organization from different organisms and their evolution, ranging from vertebrates to invertebrates until unicellular organisms. Since isochores are tightly linked to biological properties such as gene density, replication timing, and recombination, the new level of detail provided by the isochore map helped the understanding of genome structure, function, and evolution. All the findings reported here confirm the idea that the isochores can be considered as a "fundamental level of genome structure and organization." We stress that we do not discuss in this review the origin of isochores, which is still a matter of controversy, but we focus on well established structural and physiological aspects.
Collapse
Affiliation(s)
- Maria Costantini
- Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121, Napoli, Italy.
| | - Héctor Musto
- Laboratorio de Organización y Evolución del Genoma, Unidad de Genómica Evolutiva, Facultad de Ciencias, 11400, Montevideo, Uruguay
| |
Collapse
|
24
|
Jabbari K, Bernardi G. An Isochore Framework Underlies Chromatin Architecture. PLoS One 2017; 12:e0168023. [PMID: 28060840 PMCID: PMC5218411 DOI: 10.1371/journal.pone.0168023] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2016] [Accepted: 11/24/2016] [Indexed: 01/03/2023] Open
Abstract
A recent investigation showed the existence of correlations between the architectural features of mammalian interphase chromosomes and the compositional properties of isochores. This result prompted us to compare maps of the Topologically Associating Domains (TADs) and of the Lamina Associated Domains (LADs) with the corresponding isochore maps of mouse and human chromosomes. This approach revealed that: 1) TADs and LADs correspond to isochores, i.e., isochores are the genomic units that underlie chromatin domains; 2) the conservation of TADs and LADs in mammalian genomes is explained by the evolutionary conservation of isochores; 3) chromatin domains corresponding to GC-poor isochores (e.g., LADs) show not only self-interactions but also intrachromosomal interactions with other domains also corresponding to GC-poor isochores even if located far away; in contrast, chromatin domains corresponding to GC-rich isochores (e.g., TADs) show more localized chromosomal interactions, many of which are inter-chromosomal. In conclusion, this investigation establishes a link between DNA sequences and chromatin architecture, explains the evolutionary conservation of TADs and LADs and provides new information on the spatial distribution of GC-poor/gene-poor and GC-rich/gene-rich chromosomal regions in the interphase nucleus.
Collapse
Affiliation(s)
- Kamel Jabbari
- Max Planck Institute for Biology of Ageing, Joseph-Stelzmann-Straße 9B, Köln, Germany
| | - Giorgio Bernardi
- Science Department, Roma Tre University, Viale Marconi, Rome, Italy, and Stazione Zoologica Anton Dohrn, Villa Comunale, Naples, Italy
| |
Collapse
|
25
|
Sizova TV, Karpova OI. The length of chromatin loops in meiotic prophase I of warm-blooded vertebrates depends on the DNA compositional organization. RUSS J GENET+ 2016. [DOI: 10.1134/s1022795416110144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
26
|
Lamolle G, Protasio AV, Iriarte A, Jara E, Simón D, Musto H. An Isochore-Like Structure in the Genome of the Flatworm Schistosoma mansoni. Genome Biol Evol 2016; 8:2312-8. [PMID: 27435793 PMCID: PMC5010904 DOI: 10.1093/gbe/evw170] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Eukaryotic genomes are compositionally heterogeneous, that is, composed by regions that differ in guanine-cytosine (GC) content (isochores). The most well documented case is that of vertebrates (mainly mammals) although it has been also noted among unicellular eukaryotes and invertebrates. In the human genome, regarded as a typical mammal, this heterogeneity is associated with several features. Specifically, genes located in GC-richest regions are the GC3-richest, display CpG islands and have shorter introns. Furthermore, these genes are more heavily expressed and tend to be located at the extremes of the chromosomes. Although the compositional heterogeneity seems to be widespread among eukaryotes, the associated properties noted in the human genome and other mammals have not been investigated in depth in other taxa Here we provide evidence that the genome of the parasitic flatworm Schistosoma mansoni is compositionally heterogeneous and exhibits an isochore-like structure, displaying some features associated, until now, only with the human and other vertebrate genomes, with the exception of gene concentration.
Collapse
Affiliation(s)
- Guillermo Lamolle
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias, Udelar, Montevideo, Uruguay
| | - Anna V Protasio
- Wellcome Trust Genome Campus, Wellcome Trust Sanger Institute, Cambridge, United Kingdom
| | - Andrés Iriarte
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias, Udelar, Montevideo, Uruguay Dpto. de Desarrollo Biotecnológico, Facultad de Medicina, Instituto de Higiene, Udelar, Montevideo, Uruguay
| | - Eugenio Jara
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias, Udelar, Montevideo, Uruguay
| | - Diego Simón
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias, Udelar, Montevideo, Uruguay
| | - Héctor Musto
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias, Udelar, Montevideo, Uruguay
| |
Collapse
|
27
|
Barton C, Iliopoulos CS, Pissis SP, Arhondakis S. Transcriptome activity of isochores during preimplantation process in human and mouse. FEBS Lett 2016; 590:2297-306. [PMID: 27279593 DOI: 10.1002/1873-3468.12245] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Revised: 05/27/2016] [Accepted: 06/03/2016] [Indexed: 12/17/2022]
Abstract
This work investigates the role of isochores during preimplantation process. Using RNA-seq data from human and mouse preimplantation stages, we created the spatio-temporal transcriptional profiles of the isochores during preimplantation. We found that from early to late stages, GC-rich isochores increase their expression while GC-poor ones decrease it. Network analysis revealed that modules with few coexpressed isochores are GC-poorer than medium-large ones, characterized by an opposite expression as preimplantation advances, decreasing and increasing respectively. Our results reveal a functional contribution of the isochores, supporting the presence of structural-functional interactions during maturation and early-embryonic development.
Collapse
Affiliation(s)
- Carl Barton
- The Blizard Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, UK
| | | | | | - Stilianos Arhondakis
- Institute of Molecular Biology and Biotechnology (IMBB), Foundation for Research and Technology - Hellas (FORTH), Heraklion, Crete, Greece
| |
Collapse
|
28
|
Bernardi G. Genome Organization and Chromosome Architecture. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 2016; 80:83-91. [PMID: 26801160 DOI: 10.1101/sqb.2015.80.027318] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
How the same DNA sequences can function in the three-dimensional architecture of interphase nucleus, fold in the very compact structure of metaphase chromosomes, and go precisely back to the original interphase architecture in the following cell cycle remains an unresolved question to this day. The solution to this question presented here rests on the correlations that were found to hold between the isochore organization of the genome and the architecture of chromosomes from interphase to metaphase. The key points are the following: (1) The transition from the looped domains and subdomains of interphase chromatin to the 30-nm fiber loops of early prophase chromosomes goes through their unfolding into an extended chromatin structure (probably a 10-nm "beads-on-a-string" structure); (2) the architectural proteins of interphase chromatin, such as CTCF and cohesin subunits, are retained in mitosis and are part of the discontinuous protein scaffold of mitotic chromosomes; and (3) the conservation of the link between architectural proteins and their binding sites on DNA through the cell cycle explains the reversibility of the interphase to mitosis process and the "mitotic memory" of interphase architecture.
Collapse
Affiliation(s)
- Giorgio Bernardi
- Science Department, Roma Tre University, 00146 Rome, Italy Stazione Zoologica Anton Dohrn, 80121 Naples, Italy
| |
Collapse
|
29
|
Muñoz JF, Gauthier GM, Desjardins CA, Gallo JE, Holder J, Sullivan TD, Marty AJ, Carmen JC, Chen Z, Ding L, Gujja S, Magrini V, Misas E, Mitreva M, Priest M, Saif S, Whiston EA, Young S, Zeng Q, Goldman WE, Mardis ER, Taylor JW, McEwen JG, Clay OK, Klein BS, Cuomo CA. The Dynamic Genome and Transcriptome of the Human Fungal Pathogen Blastomyces and Close Relative Emmonsia. PLoS Genet 2015; 11:e1005493. [PMID: 26439490 PMCID: PMC4595289 DOI: 10.1371/journal.pgen.1005493] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2015] [Accepted: 08/11/2015] [Indexed: 12/05/2022] Open
Abstract
Three closely related thermally dimorphic pathogens are causal agents of major fungal diseases affecting humans in the Americas: blastomycosis, histoplasmosis and paracoccidioidomycosis. Here we report the genome sequence and analysis of four strains of the etiological agent of blastomycosis, Blastomyces, and two species of the related genus Emmonsia, typically pathogens of small mammals. Compared to related species, Blastomyces genomes are highly expanded, with long, often sharply demarcated tracts of low GC-content sequence. These GC-poor isochore-like regions are enriched for gypsy elements, are variable in total size between isolates, and are least expanded in the avirulent B. dermatitidis strain ER-3 as compared with the virulent B. gilchristii strain SLH14081. The lack of similar regions in related species suggests these isochore-like regions originated recently in the ancestor of the Blastomyces lineage. While gene content is highly conserved between Blastomyces and related fungi, we identified changes in copy number of genes potentially involved in host interaction, including proteases and characterized antigens. In addition, we studied gene expression changes of B. dermatitidis during the interaction of the infectious yeast form with macrophages and in a mouse model. Both experiments highlight a strong antioxidant defense response in Blastomyces, and upregulation of dioxygenases in vivo suggests that dioxide produced by antioxidants may be further utilized for amino acid metabolism. We identify a number of functional categories upregulated exclusively in vivo, such as secreted proteins, zinc acquisition proteins, and cysteine and tryptophan metabolism, which may include critical virulence factors missed before in in vitro studies. Across the dimorphic fungi, loss of certain zinc acquisition genes and differences in amino acid metabolism suggest unique adaptations of Blastomyces to its host environment. These results reveal the dynamics of genome evolution and of factors contributing to virulence in Blastomyces. Dimorphic fungal pathogens including Blastomyces are the cause of major fungal diseases in North and South America. The genus Emmonsia includes species infecting small mammals as well as a newly emerging pathogenic species recently reported in HIV-positive patients in South Africa. Here, we synthesize both genome sequencing of four isolates of Blastomyces and two species of Emmonsia as well as deep sequencing of Blastomyces RNA to draw major new insights into the evolution of this group and the pathogen response to infection. We investigate the trajectory of genome evolution of this group, characterizing the phylogenetic relationships of these species, a remarkable genome expansion that formed large isochore-like regions of low GC content in Blastomyces, and variation of gene content, related to host interaction, among the dimorphic fungal pathogens. Using RNA-Seq, we profile the response of Blastomyces to macrophage and mouse pulmonary infection, identifying key pathways and novel virulence factors. The identification of key fungal genes involved in adaptation to the host suggests targets for further study and therapeutic intervention in Blastomyces and related dimorphic fungal pathogens.
Collapse
Affiliation(s)
- José F. Muñoz
- Cellular and Molecular Biology Unit, Corporación para Investigaciones Biológicas, Medellín, Colombia
- Institute of Biology, Universidad de Antioquia, Medellín, Colombia
| | - Gregory M. Gauthier
- Department of Medicine, University of Wisconsin, Madison, Madison, Wisconsin, United States of America
| | | | - Juan E. Gallo
- Cellular and Molecular Biology Unit, Corporación para Investigaciones Biológicas, Medellín, Colombia
- Doctoral Program in Biomedical Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Jason Holder
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Thomas D. Sullivan
- Department of Pediatrics, University of Wisconsin, Madison, Madison, Wisconsin, United States of America
| | - Amber J. Marty
- Department of Medicine, University of Wisconsin, Madison, Madison, Wisconsin, United States of America
| | - John C. Carmen
- Department of Pediatrics, University of Wisconsin, Madison, Madison, Wisconsin, United States of America
| | - Zehua Chen
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Li Ding
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - Sharvari Gujja
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Vincent Magrini
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - Elizabeth Misas
- Cellular and Molecular Biology Unit, Corporación para Investigaciones Biológicas, Medellín, Colombia
- Institute of Biology, Universidad de Antioquia, Medellín, Colombia
| | - Makedonka Mitreva
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - Margaret Priest
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Sakina Saif
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Emily A. Whiston
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, California, United States of America
| | - Sarah Young
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Qiandong Zeng
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - William E. Goldman
- Department of Microbiology and Immunology, School of Medicine, University of North Carolina, Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Elaine R. Mardis
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - John W. Taylor
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, California, United States of America
| | - Juan G. McEwen
- Cellular and Molecular Biology Unit, Corporación para Investigaciones Biológicas, Medellín, Colombia
- School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Oliver K. Clay
- Cellular and Molecular Biology Unit, Corporación para Investigaciones Biológicas, Medellín, Colombia
- School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Bruce S. Klein
- Department of Medicine, University of Wisconsin, Madison, Madison, Wisconsin, United States of America
- Department of Pediatrics, University of Wisconsin, Madison, Madison, Wisconsin, United States of America
- Department of Medical Microbiology & Immunology, University of Wisconsin, Madison, Madison, Wisconsin, United States of America
| | - Christina A. Cuomo
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
30
|
Costantini M. An overview on genome organization of marine organisms. Mar Genomics 2015; 24 Pt 1:3-9. [PMID: 25899406 DOI: 10.1016/j.margen.2015.03.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2015] [Revised: 03/17/2015] [Accepted: 03/17/2015] [Indexed: 11/16/2022]
Abstract
In this review we will concentrate on some general genome features of marine organisms and their evolution, ranging from vertebrate to invertebrates until unicellular organisms. Before genome sequencing, the ultracentrifugation in CsCl led to high resolution of mammalian DNA (without seeing at the sequence). The analytical profile of human DNA showed that the vertebrate genome is a mosaic of isochores, typically megabase-size DNA segments that belong in a small number of families characterized by different GC levels. The recent availability of a number of fully sequenced genomes allowed mapping very precisely the isochores, based on DNA sequences. Since isochores are tightly linked to biological properties such as gene density, replication timing and recombination, the new level of detail provided by the isochore map helped the understanding of genome structure, function and evolution. This led the current level of knowledge and to further insights.
Collapse
Affiliation(s)
- Maria Costantini
- Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Naples, Italy.
| |
Collapse
|
31
|
Carels N, Ponce de Leon M. An Interpretation of the Ancestral Codon from Miller's Amino Acids and Nucleotide Correlations in Modern Coding Sequences. Bioinform Biol Insights 2015; 9:37-47. [PMID: 25922573 PMCID: PMC4401237 DOI: 10.4137/bbi.s24021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2015] [Revised: 03/08/2015] [Accepted: 03/13/2015] [Indexed: 12/31/2022] Open
Abstract
Purine bias, which is usually referred to as an “ancestral codon”, is known to result in short-range correlations between nucleotides in coding sequences, and it is common in all species. We demonstrate that RWY is a more appropriate pattern than the classical RNY, and purine bias (Rrr) is the product of a network of nucleotide compensations induced by functional constraints on the physicochemical properties of proteins. Through deductions from universal correlation properties, we also demonstrate that amino acids from Miller’s spark discharge experiment are compatible with functional primeval proteins at the dawn of living cell radiation on earth. These amino acids match the hydropathy and secondary structures of modern proteins.
Collapse
Affiliation(s)
- Nicolas Carels
- Laboratório de Modelagem de Sistemas Biológicos, National Institute for Science and Technology on Innovation in Neglected Diseases (INCT/IDN), Centro de Desenvolvimento Tecnológico em Saúde (CDTS), Fundação Oswaldo Cruz (FIOCRUZ), Rio de Janeiro, Brazil
| | - Miguel Ponce de Leon
- Departamento de Bioquímica y Biología Molecular I, Facultad de Ciencias Químicas, Universidad Complutense de Madrid, Ciudad Universitaria, Madrid, Spain
| |
Collapse
|
32
|
Evolutionary consequences of DNA methylation on the GC content in vertebrate genomes. G3-GENES GENOMES GENETICS 2015; 5:441-7. [PMID: 25591920 PMCID: PMC4349097 DOI: 10.1534/g3.114.015545] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The genomes of many vertebrates show a characteristic variation in GC content. To explain its origin and evolution, mainly three mechanisms have been proposed: selection for GC content, mutation bias, and GC-biased gene conversion. At present, the mechanism of GC-biased gene conversion, i.e., short-scale, unidirectional exchanges between homologous chromosomes in the neighborhood of recombination-initiating double-strand breaks in favor for GC nucleotides, is the most widely accepted hypothesis. We here suggest that DNA methylation also plays an important role in the evolution of GC content in vertebrate genomes. To test this hypothesis, we investigated one mammalian (human) and one avian (chicken) genome. We used bisulfite sequencing to generate a whole-genome methylation map of chicken sperm and made use of a publicly available whole-genome methylation map of human sperm. Inclusion of these methylation maps into a model of GC content evolution provided significant support for the impact of DNA methylation on the local equilibrium GC content. Moreover, two different estimates of equilibrium GC content, one that neglects and one that incorporates the impact of DNA methylation and the concomitant CpG hypermutability, give estimates that differ by approximately 15% in both genomes, arguing for a strong impact of DNA methylation on the evolution of GC content. Thus, our results put forward that previous estimates of equilibrium GC content, which neglect the hypermutability of CpG dinucleotides, need to be reevaluated.
Collapse
|
33
|
Šmarda P, Bureš P, Horová L, Leitch IJ, Mucina L, Pacini E, Tichý L, Grulich V, Rotreklová O. Ecological and evolutionary significance of genomic GC content diversity in monocots. Proc Natl Acad Sci U S A 2014; 111:E4096-102. [PMID: 25225383 PMCID: PMC4191780 DOI: 10.1073/pnas.1321152111] [Citation(s) in RCA: 172] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
Genomic DNA base composition (GC content) is predicted to significantly affect genome functioning and species ecology. Although several hypotheses have been put forward to address the biological impact of GC content variation in microbial and vertebrate organisms, the biological significance of GC content diversity in plants remains unclear because of a lack of sufficiently robust genomic data. Using flow cytometry, we report genomic GC contents for 239 species representing 70 of 78 monocot families and compare them with genomic characters, a suite of life history traits and climatic niche data using phylogeny-based statistics. GC content of monocots varied between 33.6% and 48.9%, with several groups exceeding the GC content known for any other vascular plant group, highlighting their unusual genome architecture and organization. GC content showed a quadratic relationship with genome size, with the decreases in GC content in larger genomes possibly being a consequence of the higher biochemical costs of GC base synthesis. Dramatic decreases in GC content were observed in species with holocentric chromosomes, whereas increased GC content was documented in species able to grow in seasonally cold and/or dry climates, possibly indicating an advantage of GC-rich DNA during cell freezing and desiccation. We also show that genomic adaptations associated with changing GC content might have played a significant role in the evolution of the Earth's contemporary biota, such as the rise of grass-dominated biomes during the mid-Tertiary. One of the major selective advantages of GC-rich DNA is hypothesized to be facilitating more complex gene regulation.
Collapse
Affiliation(s)
- Petr Šmarda
- Department of Botany and Zoology, Masaryk University, CZ-61137 Brno, Czech Republic;
| | - Petr Bureš
- Department of Botany and Zoology, Masaryk University, CZ-61137 Brno, Czech Republic
| | - Lucie Horová
- Department of Botany and Zoology, Masaryk University, CZ-61137 Brno, Czech Republic
| | - Ilia J Leitch
- Jodrell Laboratory, Royal Botanic Gardens, Kew, Surrey TW93DS, United Kingdom
| | - Ladislav Mucina
- School of Plant Biology, University of Western Australia, Perth, WA 6009, Australia; Centre for Geographic Analysis, Department of Geography and Environmental Studies, Stellenbosch University, Stellenbosch 7600, South Africa; and
| | - Ettore Pacini
- Department of Life Sciences, Siena University, 53100 Siena, Italy
| | - Lubomír Tichý
- Department of Botany and Zoology, Masaryk University, CZ-61137 Brno, Czech Republic
| | - Vít Grulich
- Department of Botany and Zoology, Masaryk University, CZ-61137 Brno, Czech Republic
| | - Olga Rotreklová
- Department of Botany and Zoology, Masaryk University, CZ-61137 Brno, Czech Republic
| |
Collapse
|
34
|
Variation and constraints in species-specific promoter sequences. J Theor Biol 2014; 363:357-66. [PMID: 25149367 DOI: 10.1016/j.jtbi.2014.08.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2014] [Revised: 07/30/2014] [Accepted: 08/04/2014] [Indexed: 11/24/2022]
Abstract
A vast literature is nowadays devoted to the search of correlations between transcription related functions and the composition of sequences upstream the Transcription Start Site. Little is known about the possible functional effects of nucleotide distributions on the conformational landscape of DNA in such regions. We have used suitable statistical indicators for identifying sequences that may play an important role in regulating transcription processes. In particular, we have analyzed base composition, periodicity and information content in sets of aligned promoters clustered according to functional information in order to obtain an insight on the main structural differences between promoters regulating genes with different functions. Our results show that when we select promoters according to some biological information, in a single species, at least in vertebrates, we observe structurally different classes of sequences. The highly variable and differentiated gene expression patterns may explain the great extent of structural differentiation observed in complex organisms. In fact, despite our analysis is focused on Homo sapiens, we provide also a comparison with other species, selected at different positions in the phylogenetic tree.
Collapse
|
35
|
Carbone A. Information measure for long-range correlated sequences: the case of the 24 human chromosomes. Sci Rep 2014; 3:2721. [PMID: 24056670 PMCID: PMC3779848 DOI: 10.1038/srep02721] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2013] [Accepted: 09/04/2013] [Indexed: 01/14/2023] Open
Abstract
A new approach to estimate the Shannon entropy of a long-range correlated sequence is proposed. The entropy is written as the sum of two terms corresponding respectively to power-law (ordered) and exponentially (disordered) distributed blocks (clusters). The approach is illustrated on the 24 human chromosome sequences by taking the nucleotide composition as the relevant information to be encoded/decoded. Interestingly, the nucleotide composition of the ordered clusters is found, on the average, comparable to the one of the whole analyzed sequence, while that of the disordered clusters fluctuates. From the information theory standpoint, this means that the power-law correlated clusters carry the same information of the whole analysed sequence. Furthermore, the fluctuations of the nucleotide composition of the disordered clusters are linked to relevant biological properties, such as segmental duplications and gene density.
Collapse
Affiliation(s)
- A Carbone
- 1] Politecnico di Torino, Italy [2] ISC-CNR, Unità Università 'La Sapienza' di Roma, Italy [3] ETH Zurich, Switzerland
| |
Collapse
|
36
|
Qiu S, McSweeny A, Choulet S, Saha-Mandal A, Fedorova L, Fedorov A. Genome evolution by matrix algorithms: cellular automata approach to population genetics. Genome Biol Evol 2014; 6:988-99. [PMID: 24723728 PMCID: PMC4007542 DOI: 10.1093/gbe/evu075] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Mammalian genomes are replete with millions of polymorphic sites, among which those genetic variants that are colocated on the same chromosome and exist close to one another form blocks of closely linked mutations known as haplotypes. The linkage within haplotypes is constantly disrupted due to meiotic recombination events. Whole ensembles of such numerous haplotypes are subjected to evolutionary pressure, where mutations influence each other and should be considered as a whole entity—a gigantic matrix, unique for each individual genome. This idea was implemented into a computational approach, named Genome Evolution by Matrix Algorithms (GEMA) to model genomic changes taking into account all mutations in a population. GEMA has been tested for modeling of entire human chromosomes. The program can precisely mimic real biological processes that have influence on genome evolution such as: 1) Authentic arrangements of genes and functional genomic elements, 2) frequencies of various types of mutations in different nucleotide contexts, and 3) nonrandom distribution of meiotic recombination events along chromosomes. Computer modeling with GEMA has demonstrated that the number of meiotic recombination events per gamete is among the most crucial factors influencing population fitness. In humans, these recombinations create a gamete genome consisting on an average of 48 pieces of corresponding parental chromosomes. Such highly mosaic gamete structure allows preserving fitness of population under the intense influx of novel mutations (40 per individual) even when the number of mutations with deleterious effects is up to ten times more abundant than those with beneficial effects.
Collapse
Affiliation(s)
- Shuhao Qiu
- Program in Bioinformatics and Proteomics/Genomics, University of Toledo
| | | | | | | | | | | |
Collapse
|
37
|
Morgens DW, Chang CH, Cavalcanti ARO. Ambushing the Ambush Hypothesis: predicting and evaluating off-frame codon frequencies in prokaryotic genomes. BMC Genomics 2013; 14:418. [PMID: 23799949 PMCID: PMC3700767 DOI: 10.1186/1471-2164-14-418] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2013] [Accepted: 06/19/2013] [Indexed: 11/24/2022] Open
Abstract
Background In this paper, we address the evidence for the Ambush Hypothesis. Proposed by Seligmann and Pollock, this hypothesis posits that there exists a selection for off-frame stop codons (OSCs) to counteract the possible deleterious effects of translational frameshifts, including the waste of resources and potential cytotoxicity. Two main types of study have been used to support the hypothesis. Some studies analyzed codon usage and showed that codons with more potential to create OSCs seem to be favored over codons with lower potential; they used this finding to support the Ambush Hypothesis. Another study used 342 bacterial genomes to evaluate the hypothesis directly, finding significant excesses of OSCs in these genomes. Results We repeated both analyses with newer datasets and searched for other factors that could explain the observed trends. In the first case, the relative frequency of codons with the potential to create OSCs is directly correlated with the GC content of organisms, as stop codons are GC-poor. When evaluating the frequency of OSCs directly in 1,976 bacterial genomes we also detected a significant excess. However, when comparing the excess of OSCs with similarly obtained results for the frequency of out-of-frame sense codons, some sense codons have a more significant excess than stop codons. Conclusions Two avenues of study have been used to support the Ambush Hypothesis. Using the same methods as these previous studies, we demonstrate that the evidence in support of the Ambush Hypothesis does not hold up against more rigorous testing.
Collapse
Affiliation(s)
- David W Morgens
- Biology Department, Pomona College, Claremont, CA 91711, USA
| | | | | |
Collapse
|
38
|
Nabiyouni M, Prakash A, Fedorov A. Vertebrate codon bias indicates a highly GC-rich ancestral genome. Gene 2013; 519:113-9. [PMID: 23376453 DOI: 10.1016/j.gene.2013.01.033] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2012] [Revised: 01/10/2013] [Accepted: 01/17/2013] [Indexed: 11/16/2022]
Abstract
Two factors are thought to have contributed to the origin of codon usage bias in eukaryotes: 1) genome-wide mutational forces that shape overall GC-content and create context-dependent nucleotide bias, and 2) positive selection for codons that maximize efficient and accurate translation. Particularly in vertebrates, these two explanations contradict each other and cloud the origin of codon bias in the taxon. On the one hand, mutational forces fail to explain GC-richness (~60%) of third codon positions, given the GC-poor overall genomic composition among vertebrates (~40%). On the other hand, positive selection cannot easily explain strict regularities in codon preferences. Large-scale bioinformatic assessment, of nucleotide composition of coding and non-coding sequences in vertebrates and other taxa, suggests a simple possible resolution for this contradiction. Specifically, we propose that the last common vertebrate ancestor had a GC-rich genome (~65% GC). The data suggest that whole-genome mutational bias is the major driving force for generating codon bias. As the bias becomes prominent, it begins to affect translation and can result in positive selection for optimal codons. The positive selection can, in turn, significantly modulate codon preferences.
Collapse
Affiliation(s)
- Maryam Nabiyouni
- Program in Bioinformatics and Proteomics/Genomics, University of Toledo, Health Science Campus, Toledo, OH 43614, USA.
| | | | | |
Collapse
|
39
|
Agier N, Romano OM, Touzain F, Cosentino Lagomarsino M, Fischer G. The spatiotemporal program of replication in the genome of Lachancea kluyveri. Genome Biol Evol 2013; 5:370-88. [PMID: 23355306 PMCID: PMC3590768 DOI: 10.1093/gbe/evt014] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/18/2013] [Indexed: 12/11/2022] Open
Abstract
We generated a genome-wide replication profile in the genome of Lachancea kluyveri and assessed the relationship between replication and base composition. This species diverged from Saccharomyces cerevisiae before the ancestral whole genome duplication. The genome comprises eight chromosomes among which a chromosomal arm of 1 Mb has a G + C-content much higher than the rest of the genome. We identified 252 active replication origins in L. kluyveri and found considerable divergence in origin location with S. cerevisiae and with Lachancea waltii. Although some global features of S. cerevisiae replication are conserved: Centromeres replicate early, whereas telomeres replicate late, we found that replication origins both in L. kluyveri and L. waltii do not behave as evolutionary fragile sites. In L. kluyveri, replication timing along chromosomes alternates between regions of early and late activating origins, except for the 1 Mb GC-rich chromosomal arm. This chromosomal arm contains an origin consensus motif different from other chromosomes and is replicated early during S-phase. We showed that precocious replication results from the specific absence of late firing origins in this chromosomal arm. In addition, we found a correlation between GC-content and distance from replication origins as well as a lack of replication-associated compositional skew between leading and lagging strands specifically in this GC-rich chromosomal arm. These findings suggest that the unusual base composition in the genome of L. kluyveri could be linked to replication.
Collapse
Affiliation(s)
- Nicolas Agier
- UPMC, UMR7238, Génomique des Microorganismes, Paris, France
- CNRS, UMR7238, Génomique des Microorganismes, Paris, France
| | | | - Fabrice Touzain
- UPMC, UMR7238, Génomique des Microorganismes, Paris, France
- CNRS, UMR7238, Génomique des Microorganismes, Paris, France
- Present address: ANSES, Ploufragan/Plouzané Laboratory Viral Genomics and Biosecurity Unit (GVB), Ploufragan, France
| | - Marco Cosentino Lagomarsino
- UPMC, UMR7238, Génomique des Microorganismes, Paris, France
- CNRS, UMR7238, Génomique des Microorganismes, Paris, France
| | - Gilles Fischer
- UPMC, UMR7238, Génomique des Microorganismes, Paris, France
- CNRS, UMR7238, Génomique des Microorganismes, Paris, France
| |
Collapse
|
40
|
Frousios K, Iliopoulos CS, Tischler G, Kossida S, Pissis SP, Arhondakis S. Transcriptome map of mouse isochores in embryonic and neonatal cortex. Genomics 2012. [PMID: 23195409 DOI: 10.1016/j.ygeno.2012.11.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Several studies on adult tissues agree on the presence of a positive effect of the genomic and genic base composition on mammalian gene expression. Recent literature supports the idea that during developmental processes GC-poor genomic regions are preferentially implicated. We investigate the relationship between the compositional properties of the isochores and of the genes with their respective expression activity during developmental processes. Using RNA-seq data from two distinct developmental stages of the mouse cortex, embryonic day 18 (E18) and postnatal day 7 (P7), we established for the first time a developmental-related transcriptome map of the mouse isochores. Additionally, for each stage we estimated the correlation between isochores' GC level and their expression activity, and the genes' expression patterns for each isochore family. Our analyses add evidence supporting the idea that during development GC-poor isochores are preferentially implicated, and confirm the positive effect of genes' GC level on their expression activity.
Collapse
Affiliation(s)
- Kimon Frousios
- Department of Informatics, King's College London, The Strand, London WC2R 2LS, UK
| | - Costas S Iliopoulos
- Department of Informatics, King's College London, The Strand, London WC2R 2LS, UK; School of Mathematics and Statistics, University of Western Australia, 35 Stirling Highway, Crawley, Perth WA 6009, Australia
| | - German Tischler
- Lehrstuhl für Informatik 2, Universität Würzburg, Am Hubland, 97074 Würzburg, Germany
| | - Sophia Kossida
- Biomedical Research Foundation of the Academy of Athens, 4 Soranou Ephessiou, Athens 115 27, Greece
| | - Solon P Pissis
- Florida Museum of Natural History, University of Florida, 1659 Museum Road, Gainesville, FL 32611, USA; Heidelberg Institute for Theoretical Studies, 35 Schloss-Wolfsbrunnenweg, Heidelberg D-69118, Germany
| | - Stilianos Arhondakis
- Biomedical Research Foundation of the Academy of Athens, 4 Soranou Ephessiou, Athens 115 27, Greece.
| |
Collapse
|
41
|
Abstract
The genomes of eukaryotes are mosaics of isochores. These are long DNA stretches that are fairly homogeneous in base composition and that belong to a small number of families characterized by different ratios of GC to AT and different short-sequence patterns (i.e., different DNA structures that interact with different proteins). This genome organization led to two discoveries: (1) the genomic code, which refers to two correlations, that of the composition of coding and contiguous noncoding sequences, and that of coding sequences and the structural properties of the encoded proteins; and (2) the genome phenotypes, which correspond to the patterns of isochore families in the genomes. These patterns indicate that genome evolution may proceed either according to a conservative mode or to a transitional (isochore shifting) mode, apparently depending upon whether the environment is constant or shifting. According to the neoselectionist theory, natural selection is responsible for both modes.
Collapse
|
42
|
Stoltzfus A. Constructive neutral evolution: exploring evolutionary theory's curious disconnect. Biol Direct 2012; 7:35. [PMID: 23062217 PMCID: PMC3534586 DOI: 10.1186/1745-6150-7-35] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2012] [Accepted: 10/11/2012] [Indexed: 11/20/2022] Open
Abstract
Abstract Constructive neutral evolution (CNE) suggests that neutral evolution may follow a stepwise path to extravagance. Whether or not CNE is common, the mere possibility raises provocative questions about causation: in classical neo-Darwinian thinking, selection is the sole source of creativity and direction, the only force that can cause trends or build complex features. However, much of contemporary evolutionary genetics departs from the conception of evolution underlying neo-Darwinism, resulting in a widening gap between what formal models allow, and what the prevailing view of the causes of evolution suggests. In particular, a mutationist conception of evolution as a 2-step origin-fixation process has been a source of theoretical innovation for 40 years, appearing not only in the Neutral Theory, but also in recent breakthroughs in modeling adaptation (the “mutational landscape” model), and in practical software for sequence analysis. In this conception, mutation is not a source of raw materials, but an agent that introduces novelty, while selection is not an agent that shapes features, but a stochastic sieve. This view, which now lays claim to important theoretical, experimental, and practical results, demands our attention. CNE provides a way to explore its most significant implications about the role of variation in evolution. Reviewers Alex Kondrashov, Eugene Koonin and Johann Peter Gogarten reviewed this article.
Collapse
Affiliation(s)
- Arlin Stoltzfus
- Institute for Bioscience and Biotechnology Research, 9600 Gudelsky Drive, Rockville, MD 20850, USA.
| |
Collapse
|
43
|
Escape from Adaptive Conflict follows from weak functional trade-offs and mutational robustness. Proc Natl Acad Sci U S A 2012; 109:14888-93. [PMID: 22927372 DOI: 10.1073/pnas.1115620109] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A fundamental question in molecular evolution is how proteins can adapt to new functions while being conserved for an existing function at the same time. Several theoretical models have been put forward to explain this apparent paradox. The most popular models include neofunctionalization, subfunctionalization (SUBF) by degenerative mutations, and dosage models. All of these models focus on adaptation after gene duplication. A newly proposed model named "Escape from Adaptive Conflict" (EAC) includes adaptive processes before and after gene duplication that lead to multifunctional proteins, and divergence (SUBF). Support for the importance of multifunctionality for the evolution of new protein functions comes from two experimental observations. First, many enzymes have highly evolvable promiscuous side activities. Second, different structural states of the same protein can be associated with different functions. How these observations may be related to the EAC model, under which conditions EAC is possible, and how the different models relate to each other is still unclear. Here, we present a theoretical framework that uses biophysical principles to infer the roles of functional promiscuity, gene dosage, gene duplication, point mutations, and selection pressures in the evolution of proteins. We find that selection pressures can determine whether neofunctionalization or SUBF is the more likely evolutionary process. Multifunctional proteins, arising during EAC evolution, allow rapid adaptation independent of gene duplication. This becomes a crucial advantage when gene duplications are rare. Finally, we propose that an increase in mutational robustness, not necessarily functional optimization, can be the sole driving force behind SUBF.
Collapse
|
44
|
Berná L, Chaurasia A, Angelini C, Federico C, Saccone S, D'Onofrio G. The footprint of metabolism in the organization of mammalian genomes. BMC Genomics 2012; 13:174. [PMID: 22568857 PMCID: PMC3384468 DOI: 10.1186/1471-2164-13-174] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2011] [Accepted: 05/08/2012] [Indexed: 01/02/2023] Open
Abstract
Background At present five evolutionary hypotheses have been proposed to explain the great variability of the genomic GC content among and within genomes: the mutational bias, the biased gene conversion, the DNA breakpoints distribution, the thermal stability and the metabolic rate. Several studies carried out on bacteria and teleostean fish pointed towards the critical role played by the environment on the metabolic rate in shaping the base composition of genomes. In mammals the debate is still open, and evidences have been produced in favor of each evolutionary hypothesis. Human genes were assigned to three large functional categories (as well as to the corresponding functional classes) according to the KOG database: (i) information storage and processing, (ii) cellular processes and signaling, and (iii) metabolism. The classification was extended to the organisms so far analyzed performing a reciprocal Blastp and selecting the best reciprocal hit. The base composition was calculated for each sequence of the whole CDS dataset. Results The GC3 level of the above functional categories was increasing from (i) to (iii). This specific compositional pattern was found, as footprint, in all mammalian genomes, but not in frog and lizard ones. Comparative analysis of human versus both frog and lizard functional categories showed that genes involved in the metabolic processes underwent the highest GC3 increment. Analyzing the KOG functional classes of genes, again a well defined intra-genomic pattern was found in all mammals. Not only genes of metabolic pathways, but also genes involved in chromatin structure and dynamics, transcription, signal transduction mechanisms and cytoskeleton, showed an average GC3 level higher than that of the whole genome. In the case of the human genome, the genes of the aforementioned functional categories showed a high probability to be associated with the chromosomal bands. Conclusions In the light of different evolutionary hypotheses proposed so far, and contributing with different potential to the genome compositional heterogeneity of mammalian genomes, the one based on the metabolic rate seems to play not a minor role. Keeping in mind similar results reported in bacteria and in teleosts, the specific compositional patterns observed in mammals highlight metabolic rate as unifying factor that fits over a wide range of living organisms.
Collapse
Affiliation(s)
- Luisa Berná
- Genome Evolution and Organization - Department Animal Physiology and Evolution, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Naples, Italy
| | | | | | | | | | | |
Collapse
|
45
|
The nearly neutral and selection theories of molecular evolution under the fisher geometrical framework: substitution rate, population size, and complexity. Genetics 2012; 191:523-34. [PMID: 22426879 DOI: 10.1534/genetics.112.138628] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
The general theories of molecular evolution depend on relatively arbitrary assumptions about the relative distribution and rate of advantageous, deleterious, neutral, and nearly neutral mutations. The Fisher geometrical model (FGM) has been used to make distributions of mutations biologically interpretable. We explored an FGM-based molecular model to represent molecular evolutionary processes typically studied by nearly neutral and selection models, but in which distributions and relative rates of mutations with different selection coefficients are a consequence of biologically interpretable parameters, such as the average size of the phenotypic effect of mutations and the number of traits (complexity) of organisms. A variant of the FGM-based model that we called the static regime (SR) represents evolution as a nearly neutral process in which substitution rates are determined by a dynamic substitution process in which the population's phenotype remains around a suboptimum equilibrium fitness produced by a balance between slightly deleterious and slightly advantageous compensatory substitutions. As in previous nearly neutral models, the SR predicts a negative relationship between molecular evolutionary rate and population size; however, SR does not have the unrealistic properties of previous nearly neutral models such as the narrow window of selection strengths in which they work. In addition, the SR suggests that compensatory mutations cannot explain the high rate of fixations driven by positive selection currently found in DNA sequences, contrary to what has been previously suggested. We also developed a generalization of SR in which the optimum phenotype can change stochastically due to environmental or physiological shifts, which we called the variable regime (VR). VR models evolution as an interplay between adaptive processes and nearly neutral steady-state processes. When strong environmental fluctuations are incorporated, the process becomes a selection model in which evolutionary rate does not depend on population size, but is critically dependent on the complexity of organisms and mutation size. For SR as well as VR we found that key parameters of molecular evolution are linked by biological factors, and we showed that they cannot be fixed independently by arbitrary criteria, as has usually been assumed in previous molecular evolutionary models.
Collapse
|
46
|
Arhondakis S, Auletta F, Bernardi G. Isochores and the regulation of gene expression in the human genome. Genome Biol Evol 2012; 3:1080-9. [PMID: 21979159 PMCID: PMC3227402 DOI: 10.1093/gbe/evr017] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
It is well established that changes in the phenotype depend much more on changes in gene expression than on changes in protein-coding genes, and that cis-regulatory sequences and chromatin structure are two major factors influencing gene expression. Here, we investigated these factors at the genome-wide level by focusing on the trinucleotide patterns in the 0.1- to 25-kb regions flanking the human genes that are present in the GC-poorest L1 and GC-richest H3 isochore families, the other families exhibiting intermediate patterns. We could show 1) that the trinucleotide patterns of the 25-kb gene-flanking regions are representative of the very different patterns already reported for the whole isochores from the L1 and H3 families and, expectedly, identical in upstream and downstream locations; 2) that the patterns of the 0.1- to 0.5-kb regions in the L1 and H3 isochores are remarkably more divergent and more specific when compared with those of the 25-kb regions, as well as different in the upstream and downstream locations; and 3) that these patterns fade into the 25-kb patterns around 5kb in both upstream and downstream locations. The 25-kb findings indicate differences in nucleosome positioning and density in different isochore families, those of the 0.1- to 0.5-kb sequences indicate differences in the transcription factors that bind upstream and downstream of genes. These results indicate differences in the regulation of genes located in different isochore families, a point of functional and evolutionary relevance.
Collapse
Affiliation(s)
- Stilianos Arhondakis
- Bioinformatics and Medical Informatics Team, Biomedical Research Foundation of the Academy of Athens, Athens, Greece
| | | | | |
Collapse
|
47
|
Smarda P, Bureš P, Smerda J, Horová L. Measurements of genomic GC content in plant genomes with flow cytometry: a test for reliability. THE NEW PHYTOLOGIST 2012; 193:513-21. [PMID: 22050640 DOI: 10.1111/j.1469-8137.2011.03942.x] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
• Knowledge of the phylogenetic pattern and biological relevance of the base composition of large eukaryotic genomes (including those of plants) is poor. With the use of flow cytometry (FCM), the amount of available data on the guanine + cytosine (GC) content of plants has nearly doubled in the last decade. However, skepticism exists concerning the reliability of the method because of uncertainty in some input parameters. • Here, we tested the reliability of FCM for estimating GC content by comparison with the biochemical method of DNA temperature melting analysis (TMA). We conducted measurements in 14 plant species with a maximum currently known GC content range (33.6-47.5% as measured by FCM). We also compared the estimations of the GC content by FCM with genomic sequences in 11 Oryza species. • FCM and TMA data exhibited a high degree of correspondence which remained stable over the relatively wide range of binding lengths (3.39-4.09) assumed for the base-specific dye used. A high correlation was also observed between FCM results and the sequence data in Oryza, although the latter GC contents were consistently lower. • Reliable estimates of the genomic base composition in plants by FCM are comparable with estimates obtained using other methods, and so wider application of FCM in future plant genomic research, although it would pose a challenge, would be supported by these findings.
Collapse
Affiliation(s)
- Petr Smarda
- Department of Botany and Zoology, Masaryk University, Brno, Czech Republic.
| | | | | | | |
Collapse
|
48
|
The distribution of GC nucleotides and regulatory sequence motifs in genes and their adjacent sequences. Gene 2011; 492:375-81. [PMID: 22101187 DOI: 10.1016/j.gene.2011.10.050] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2011] [Revised: 10/24/2011] [Accepted: 10/31/2011] [Indexed: 11/20/2022]
Abstract
The genomes of warm-blooded vertebrates are a mosaic of alternating fragments, isochores, with low and high GC contents and embedded genes. The evolutionary mechanisms leading to such structures are not fully understood. We have compared the distributions of GC base pairs in coding sequences and sequences spanning 5 kb upstream and downstream of genes in human and other species annotated in the RefSeq database and in different isochores of the human genome. Using our computer application NucleoSeq (available at www.bioinformatics.aei.polsl.pl), we also compared the average distributions of AT-rich regulatory motifs and transcription factor binding sites (TFBS) for single transcription factors with those in randomized sequences of the human genome, and revealed that some TFBS have a lower average frequency in a gene's promoter than in the randomized sequence, whereas for other transcription factors the opposite is observed. TFBS for some transcription factors show a higher frequency in the coding sequence than in the regulatory and in randomized sequences, suggesting their accumulation during evolution and possible functional roles. On the basis of the GC content in genes and their adjacent sequences which was similar in all species studied here, and the distribution of regulatory motifs, we hypothesize that the first step in evolution of many genes existing today was the joining of a GC-rich coding sequence to a region with a lower GC content and the potential to create regulatory motifs.
Collapse
|
49
|
On parameters of the human genome. J Theor Biol 2011; 288:92-104. [DOI: 10.1016/j.jtbi.2011.07.021] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2011] [Revised: 06/28/2011] [Accepted: 07/21/2011] [Indexed: 02/06/2023]
|
50
|
Arhondakis S, Frousios K, Iliopoulos CS, Pissis SP, Tischler G, Kossida S. Transcriptome map of mouse isochores. BMC Genomics 2011; 12:511. [PMID: 22004510 PMCID: PMC3215772 DOI: 10.1186/1471-2164-12-511] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2011] [Accepted: 10/17/2011] [Indexed: 12/28/2022] Open
Abstract
Background The availability of fully sequenced genomes and the implementation of transcriptome technologies have increased the studies investigating the expression profiles for a variety of tissues, conditions, and species. In this study, using RNA-seq data for three distinct tissues (brain, liver, and muscle), we investigate how base composition affects mammalian gene expression, an issue of prime practical and evolutionary interest. Results We present the transcriptome map of the mouse isochores (DNA segments with a fairly homogeneous base composition) for the three different tissues and the effects of isochores' base composition on their expression activity. Our analyses also cover the relations between the genes' expression activity and their localization in the isochore families. Conclusions This study is the first where next-generation sequencing data are used to associate the effects of both genomic and genic compositional properties to their corresponding expression activity. Our findings confirm previous results, and further support the existence of a relationship between isochores and gene expression. This relationship corroborates that isochores are primarily a product of evolutionary adaptation rather than a simple by-product of neutral evolutionary processes.
Collapse
Affiliation(s)
- Stilianos Arhondakis
- Biomedical Research Foundation of the Academy of Athens, 4 Soranou Ephessiou, 115 27, Athens, Greece.
| | | | | | | | | | | |
Collapse
|