151
|
Simonti CN, Capra JA. The evolution of the human genome. Curr Opin Genet Dev 2015; 35:9-15. [PMID: 26338498 DOI: 10.1016/j.gde.2015.08.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2015] [Revised: 08/08/2015] [Accepted: 08/12/2015] [Indexed: 02/05/2023]
Abstract
Human genomes hold a record of the evolutionary forces that have shaped our species. Advances in DNA sequencing, functional genomics, and population genetic modeling have deepened our understanding of human demographic history, natural selection, and many other long-studied topics. These advances have also revealed many previously underappreciated factors that influence the evolution of the human genome, including functional modifications to DNA and histones, conserved 3D topological chromatin domains, structural variation, and heterogeneous mutation patterns along the genome. Using evolutionary theory as a lens to study these phenomena will lead to significant breakthroughs in understanding what makes us human and why we get sick.
Collapse
Affiliation(s)
- Corinne N Simonti
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37235, USA
| | - John A Capra
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37235, USA; Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA; Department of Biomedical Informatics, Vanderbilt University, Nashville, TN 37235, USA.
| |
Collapse
|
152
|
Thompson D, Regev A, Roy S. Comparative analysis of gene regulatory networks: from network reconstruction to evolution. Annu Rev Cell Dev Biol 2015; 31:399-428. [PMID: 26355593 DOI: 10.1146/annurev-cellbio-100913-012908] [Citation(s) in RCA: 95] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Regulation of gene expression is central to many biological processes. Although reconstruction of regulatory circuits from genomic data alone is therefore desirable, this remains a major computational challenge. Comparative approaches that examine the conservation and divergence of circuits and their components across strains and species can help reconstruct circuits as well as provide insights into the evolution of gene regulatory processes and their adaptive contribution. In recent years, advances in genomic and computational tools have led to a wealth of methods for such analysis at the sequence, expression, pathway, module, and entire network level. Here, we review computational methods developed to study transcriptional regulatory networks using comparative genomics, from sequence to functional data. We highlight how these methods use evolutionary conservation and divergence to reliably detect regulatory components as well as estimate the extent and rate of divergence. Finally, we discuss the promise and open challenges in linking regulatory divergence to phenotypic divergence and adaptation.
Collapse
Affiliation(s)
- Dawn Thompson
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142
| | | | | |
Collapse
|
153
|
Gueroussov S, Gonatopoulos-Pournatzis T, Irimia M, Raj B, Lin ZY, Gingras AC, Blencowe BJ. An alternative splicing event amplifies evolutionary differences between vertebrates. Science 2015; 349:868-873. [DOI: 10.1126/science.aaa8381] [Citation(s) in RCA: 105] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/30/2023]
Abstract
Regulation of splicing regulators
The messenger RNAs of most eukaryotic genes are formed by splicing together a series of exons and removing the intervening introns. The identity and order of the exons can vary between mRNAs for the same gene. The alternatively spliced products can generate an increased diversity of protein products. Gueroussov
et al.
show that the alternative splicing of a mammalian splicing regulatory factor affects, in turn, the alternative splicing of a wide range of target RNAs. This regulation mechanism controls a brain-specific alternative splicing program.
Science
, this issue p.
868
Collapse
Affiliation(s)
- Serge Gueroussov
- Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | | | - Manuel Irimia
- Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Bushra Raj
- Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Zhen-Yuan Lin
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, 600 University Avenue, Toronto, Ontario M5G 1X5, Canada
| | - Anne-Claude Gingras
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, 600 University Avenue, Toronto, Ontario M5G 1X5, Canada
| | - Benjamin J. Blencowe
- Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| |
Collapse
|
154
|
Lu HC, Chung SS, Fornili A, Fraternali F. Anatomy of protein disorder, flexibility and disease-related mutations. Front Mol Biosci 2015; 2:47. [PMID: 26322316 PMCID: PMC4532925 DOI: 10.3389/fmolb.2015.00047] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2015] [Accepted: 07/29/2015] [Indexed: 01/23/2023] Open
Abstract
Integration of protein structural information with human genetic variation and pathogenic mutations is essential to understand molecular mechanisms associated with the effects of polymorphisms on protein interactions and cellular processes. We investigate occurrences of non-synonymous SNPs in ordered and disordered protein regions by systematic mapping of common variants and disease-related SNPs onto these regions. We show that common variants accumulate in disordered regions; conversely pathogenic variants are significantly depleted in disordered regions. These different occurrences of pathogenic and common SNPs can be attributed to a negative selection on random mutations in structurally highly constrained regions. New approaches in the study of quantitative effects of pathogenic-related mutations should effectively account for all the possible contexts and relative functional constraints in which the sequence variation occurs.
Collapse
Affiliation(s)
- Hui-Chun Lu
- Randall Division of Cell and Molecular Biophysics, King's College London London, UK
| | - Sun Sook Chung
- Randall Division of Cell and Molecular Biophysics, King's College London London, UK ; Department of Haematological Medicine, King's College London London, UK
| | - Arianna Fornili
- Randall Division of Cell and Molecular Biophysics, King's College London London, UK ; School of Biological and Chemical Sciences, Queen Mary University of London London, UK
| | - Franca Fraternali
- Randall Division of Cell and Molecular Biophysics, King's College London London, UK
| |
Collapse
|
155
|
Mueller JC, Kuhl H, Timmermann B, Kempenaers B. Characterization of the genome and transcriptome of the blue tit Cyanistes caeruleus: polymorphisms, sex-biased expression and selection signals. Mol Ecol Resour 2015. [PMID: 26220359 DOI: 10.1111/1755-0998.12450] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Decoding genomic sequences and determining their variation within populations has potential to reveal adaptive processes and unravel the genetic basis of ecologically relevant trait variation within a species. The blue tit Cyanistes caeruleus--a long-time ecological model species--has been used to investigate fitness consequences of variation in mating and reproductive behaviour. However, very little is known about the underlying genetic changes due to natural and sexual selection in the genome of this songbird. As a step to bridge this gap, we assembled the first draft genome of a single blue tit, mapped the transcriptome of five females and five males to this reference, identified genomewide variants and performed sex-differential expression analysis in the gonads, brain and other tissues. In the gonads, we found a high number of sex-biased genes, and of those, a similar proportion were sex-limited (genes only expressed in one sex) in males and females. However, in the brain, the proportion of female-limited genes within the female-biased gene category (82%) was substantially higher than the proportion of male-limited genes within the male-biased category (6%). This suggests a predominant on-off switching mechanism for the female-limited genes. In addition, most male-biased genes were located on the Z-chromosome, indicating incomplete dosage compensation for the male-biased genes. We called more than 500,000 SNPs from the RNA-seq data. Heterozygote detection in the single reference individual was highly congruent between DNA-seq and RNA-seq calling. Using information from these polymorphisms, we identified potential selection signals in the genome. We list candidate genes which can be used for further sequencing and detailed selection studies, including genes potentially related to meiotic drive evolution. A public genome browser of the blue tit with the described information is available at http://public-genomes-ngs.molgen.mpg.de.
Collapse
Affiliation(s)
- Jakob C Mueller
- Department of Behavioural Ecology & Evolutionary Genetics, Max Planck Institute for Ornithology, 82319 Seewiesen, Germany
| | - Heiner Kuhl
- Sequencing Core Facility, Max Planck Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin, Germany
| | - Bernd Timmermann
- Sequencing Core Facility, Max Planck Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin, Germany
| | - Bart Kempenaers
- Department of Behavioural Ecology & Evolutionary Genetics, Max Planck Institute for Ornithology, 82319 Seewiesen, Germany
| |
Collapse
|
156
|
Gao Q, Sun W, Ballegeer M, Libert C, Chen W. Predominant contribution of cis-regulatory divergence in the evolution of mouse alternative splicing. Mol Syst Biol 2015; 11:816. [PMID: 26134616 PMCID: PMC4547845 DOI: 10.15252/msb.20145970] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Divergence of alternative splicing represents one of the major driving forces to shape phenotypic diversity during evolution. However, the extent to which these divergences could be explained by the evolving cis-regulatory versus trans-acting factors remains unresolved. To globally investigate the relative contributions of the two factors for the first time in mammals, we measured splicing difference between C57BL/6J and SPRET/EiJ mouse strains and allele-specific splicing pattern in their F1 hybrid. Out of 11,818 alternative splicing events expressed in the cultured fibroblast cells, we identified 796 with significant difference between the parental strains. After integrating allele-specific data from F1 hybrid, we demonstrated that these events could be predominately attributed to cis-regulatory variants, including those residing at and beyond canonical splicing sites. Contrary to previous observations in Drosophila, such predominant contribution was consistently observed across different types of alternative splicing. Further analysis of liver tissues from the same mouse strains and reanalysis of published datasets on other strains showed similar trends, implying in general the predominant contribution of cis-regulatory changes in the evolution of mouse alternative splicing.
Collapse
Affiliation(s)
- Qingsong Gao
- Laboratory for Systems Biology and Functional Genomics, Berlin Institute for Medical Systems Biology, Max-Delbrück-Centrum für Molekulare Medizin, Berlin, Germany
| | - Wei Sun
- Laboratory for Systems Biology and Functional Genomics, Berlin Institute for Medical Systems Biology, Max-Delbrück-Centrum für Molekulare Medizin, Berlin, Germany
| | - Marlies Ballegeer
- Inflammation Research Center, VIB, Ghent, Belgium Department of Biomedical Molecular Biology, University Ghent, Ghent, Belgium
| | - Claude Libert
- Inflammation Research Center, VIB, Ghent, Belgium Department of Biomedical Molecular Biology, University Ghent, Ghent, Belgium
| | - Wei Chen
- Laboratory for Systems Biology and Functional Genomics, Berlin Institute for Medical Systems Biology, Max-Delbrück-Centrum für Molekulare Medizin, Berlin, Germany
| |
Collapse
|
157
|
Kryuchkova-Mostacci N, Robinson-Rechavi M. Tissue-Specific Evolution of Protein Coding Genes in Human and Mouse. PLoS One 2015; 10:e0131673. [PMID: 26121354 PMCID: PMC4488272 DOI: 10.1371/journal.pone.0131673] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2015] [Accepted: 06/04/2015] [Indexed: 12/23/2022] Open
Abstract
Protein-coding genes evolve at different rates, and the influence of different parameters, from gene size to expression level, has been extensively studied. While in yeast gene expression level is the major causal factor of gene evolutionary rate, the situation is more complex in animals. Here we investigate these relations further, especially taking in account gene expression in different organs as well as indirect correlations between parameters. We used RNA-seq data from two large datasets, covering 22 mouse tissues and 27 human tissues. Over all tissues, evolutionary rate only correlates weakly with levels and breadth of expression. The strongest explanatory factors of purifying selection are GC content, expression in many developmental stages, and expression in brain tissues. While the main component of evolutionary rate is purifying selection, we also find tissue-specific patterns for sites under neutral evolution and for positive selection. We observe fast evolution of genes expressed in testis, but also in other tissues, notably liver, which are explained by weak purifying selection rather than by positive selection.
Collapse
Affiliation(s)
- Nadezda Kryuchkova-Mostacci
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Marc Robinson-Rechavi
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
158
|
Jenkins AM, Waterhouse RM, Muskavitch MAT. Long non-coding RNA discovery across the genus anopheles reveals conserved secondary structures within and beyond the Gambiae complex. BMC Genomics 2015; 16:337. [PMID: 25903279 PMCID: PMC4409983 DOI: 10.1186/s12864-015-1507-3] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2015] [Accepted: 03/30/2015] [Indexed: 12/14/2022] Open
Abstract
Background Long non-coding RNAs (lncRNAs) have been defined as mRNA-like transcripts longer than 200 nucleotides that lack significant protein-coding potential, and many of them constitute scaffolds for ribonucleoprotein complexes with critical roles in epigenetic regulation. Various lncRNAs have been implicated in the modulation of chromatin structure, transcriptional and post-transcriptional gene regulation, and regulation of genomic stability in mammals, Caenorhabditis elegans, and Drosophila melanogaster. The purpose of this study is to identify the lncRNA landscape in the malaria vector An. gambiae and assess the evolutionary conservation of lncRNAs and their secondary structures across the Anopheles genus. Results Using deep RNA sequencing of multiple Anopheles gambiae life stages, we have identified 2,949 lncRNAs and more than 300 previously unannotated putative protein-coding genes. The lncRNAs exhibit differential expression profiles across life stages and adult genders. We find that across the genus Anopheles, lncRNAs display much lower sequence conservation than protein-coding genes. Additionally, we find that lncRNA secondary structure is highly conserved within the Gambiae complex, but diverges rapidly across the rest of the genus Anopheles. Conclusions This study offers one of the first lncRNA secondary structure analyses in vector insects. Our description of lncRNAs in An. gambiae offers the most comprehensive genome-wide insights to date into lncRNAs in this vector mosquito, and defines a set of potential targets for the development of vector-based interventions that may further curb the human malaria burden in disease-endemic countries. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1507-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Robert M Waterhouse
- Department of Genetic Medicine and Development, University of Geneva Medical School, rue Michel-Servet 1, 1211, Geneva, Switzerland. .,Swiss Institute of Bioinformatics, rue Michel-Servet 1, 1211, Geneva, Switzerland. .,Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA, 02139, USA. .,The Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, 02142, USA.
| | - Marc A T Muskavitch
- Boston College, Chestnut Hill, MA, 02467, USA. .,Biogen Idec, 14 Cambridge Center, Cambridge, MA, 02142, USA.
| |
Collapse
|
159
|
Panzeri I, Rossetti G, Abrignani S, Pagani M. Long Intergenic Non-Coding RNAs: Novel Drivers of Human Lymphocyte Differentiation. Front Immunol 2015; 6:175. [PMID: 25926836 PMCID: PMC4397839 DOI: 10.3389/fimmu.2015.00175] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2015] [Accepted: 03/28/2015] [Indexed: 12/29/2022] Open
Abstract
Upon recognition of a foreign antigen, CD4(+) naïve T lymphocytes proliferate and differentiate into subsets with distinct functions. This process is fundamental for the effective immune system function, as CD4(+) T cells orchestrate both the innate and adaptive immune response. Traditionally, this differentiation event has been regarded as the acquisition of an irreversible cell fate so that memory and effector CD4(+) T subsets were considered terminally differentiated cells or lineages. Consequently, these lineages are conventionally defined thanks to their prototypical set of cytokines and transcription factors. However, recent findings suggest that CD4(+) T lymphocytes possess a remarkable phenotypic plasticity, as they can often re-direct their functional program depending on the milieu they encounter. Therefore, new questions are now compelling such as which are the molecular determinants underlying plasticity and stability and how the balance between these two opposite forces drives the cell fate. As already mentioned, in some cases, the mere expression of cytokines and master regulators could not fully explain lymphocytes plasticity. We should consider other layers of regulation, including epigenetic factors such as the modulation of chromatin state or the transcription of non-coding RNAs, whose high cell-specificity give a hint on their involvement in cell fate determination. In this review, we will focus on the recent advances in understanding CD4(+) T lymphocytes subsets specification from an epigenetic point of view. In particular, we will emphasize the emerging importance of non-coding RNAs as key players in these differentiation events. We will also present here new data from our laboratory highlighting the contribution of long non-coding RNAs in driving human CD4(+) T lymphocytes differentiation.
Collapse
Affiliation(s)
- Ilaria Panzeri
- Integrative Biology Unit, Istituto Nazionale Genetica Molecolare "Romeo ed Enrica Invernizzi", IRCCS Ospedale Maggiore Policlinico , Milano , Italy
| | - Grazisa Rossetti
- Integrative Biology Unit, Istituto Nazionale Genetica Molecolare "Romeo ed Enrica Invernizzi", IRCCS Ospedale Maggiore Policlinico , Milano , Italy
| | - Sergio Abrignani
- Integrative Biology Unit, Istituto Nazionale Genetica Molecolare "Romeo ed Enrica Invernizzi", IRCCS Ospedale Maggiore Policlinico , Milano , Italy
| | - Massimiliano Pagani
- Integrative Biology Unit, Istituto Nazionale Genetica Molecolare "Romeo ed Enrica Invernizzi", IRCCS Ospedale Maggiore Policlinico , Milano , Italy ; Department of Medical Biotechnology and Translational Medicine, Università degli Studi di Milano , Milano , Italy
| |
Collapse
|
160
|
Roux J, Rosikiewicz M, Robinson-Rechavi M. What to compare and how: Comparative transcriptomics for Evo-Devo. JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION 2015; 324:372-82. [PMID: 25864439 PMCID: PMC4949521 DOI: 10.1002/jez.b.22618] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 11/07/2014] [Accepted: 02/19/2015] [Indexed: 12/30/2022]
Abstract
Evolutionary developmental biology has grown historically from the capacity to relate patterns of evolution in anatomy to patterns of evolution of expression of specific genes, whether between very distantly related species, or very closely related species or populations. Scaling up such studies by taking advantage of modern transcriptomics brings promising improvements, allowing us to estimate the overall impact and molecular mechanisms of convergence, constraint or innovation in anatomy and development. But it also presents major challenges, including the computational definitions of anatomical homology and of organ function, the criteria for the comparison of developmental stages, the annotation of transcriptomics data to proper anatomical and developmental terms, and the statistical methods to compare transcriptomic data between species to highlight significant conservation or changes. In this article, we review these challenges, and the ongoing efforts to address them, which are emerging from bioinformatics work on ontologies, evolutionary statistics, and data curation, with a focus on their implementation in the context of the development of our database Bgee (http://bgee.org). J. Exp. Zool. (Mol. Dev. Evol.) 324B: 372–382, 2015. © 2015 The Authors. J. Exp. Zool. (Mol. Dev. Evol.) published by Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Julien Roux
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Department of Human Genetics, University of Chicago, Chicago, Illinois
| | - Marta Rosikiewicz
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Marc Robinson-Rechavi
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|