1
|
Karner H, Webb CH, Carmona S, Liu Y, Lin B, Erhard M, Chan D, Baldi P, Spitale RC, Sun S. Functional Conservation of LncRNA JPX Despite Sequence and Structural Divergence. J Mol Biol 2019; 432:283-300. [PMID: 31518612 DOI: 10.1016/j.jmb.2019.09.002] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2019] [Revised: 08/29/2019] [Accepted: 09/02/2019] [Indexed: 02/02/2023]
Abstract
Long noncoding RNAs (lncRNAs) have been identified in all eukaryotes and are most abundant in the human genome. However, the functional importance and mechanisms of action for human lncRNAs are largely unknown. Using comparative sequence, structural, and functional analyses, we characterize the evolution and molecular function of human lncRNA JPX. We find that human JPX and its mouse homolog, lncRNA Jpx, have deep divergence in their nucleotide sequences and RNA secondary structures. Despite such differences, both lncRNAs demonstrate robust binding to CTCF, a protein that is central to Jpx's role in X chromosome inactivation. In addition, our functional rescue experiment using Jpx-deletion mutant cells shows that human JPX can functionally complement the loss of Jpx in mouse embryonic stem cells. Our findings support a model for functional conservation of lncRNAs independent from sequence and structural divergence. This study provides mechanistic insight into the evolution of lncRNA function.
Collapse
Affiliation(s)
- Heather Karner
- Department of Developmental and Cell Biology, School of Biological Sciences, University of California Irvine, Irvine, CA 92697, USA
| | - Chiu-Ho Webb
- Department of Developmental and Cell Biology, School of Biological Sciences, University of California Irvine, Irvine, CA 92697, USA
| | - Sarah Carmona
- Department of Developmental and Cell Biology, School of Biological Sciences, University of California Irvine, Irvine, CA 92697, USA
| | - Yu Liu
- Department of Computer Science, Institute for Genomics and Bioinformatics, University of California Irvine, Irvine, CA 92697, USA
| | - Benjamin Lin
- Department of Developmental and Cell Biology, School of Biological Sciences, University of California Irvine, Irvine, CA 92697, USA
| | - Micaela Erhard
- Department of Developmental and Cell Biology, School of Biological Sciences, University of California Irvine, Irvine, CA 92697, USA
| | - Dalen Chan
- Department of Pharmaceutical Sciences, College of Health Sciences, University of California Irvine, Irvine, CA 92697, USA
| | - Pierre Baldi
- Department of Computer Science, Institute for Genomics and Bioinformatics, University of California Irvine, Irvine, CA 92697, USA
| | - Robert C Spitale
- Department of Pharmaceutical Sciences, College of Health Sciences, University of California Irvine, Irvine, CA 92697, USA
| | - Sha Sun
- Department of Developmental and Cell Biology, School of Biological Sciences, University of California Irvine, Irvine, CA 92697, USA.
| |
Collapse
|
2
|
Pértille F, Da Silva VH, Johansson AM, Lindström T, Wright D, Coutinho LL, Jensen P, Guerrero-Bosagna C. Mutation dynamics of CpG dinucleotides during a recent event of vertebrate diversification. Epigenetics 2019; 14:685-707. [PMID: 31070073 PMCID: PMC6557589 DOI: 10.1080/15592294.2019.1609868] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
DNA methylation in CpGs dinucleotides is associated with high mutability and disappearance of CpG sites during evolution. Although the high mutability of CpGs is thought to be relevant for vertebrate evolution, very little is known on the role of CpG-related mutations in the genomic diversification of vertebrates. Our study analysed genetic differences in chickens, between Red Junglefowl (RJF; the living closest relative to the ancestor of domesticated chickens) and domesticated breeds, to identify genomic dynamics that have occurred during the process of their domestication, focusing particularly on CpG-related mutations. Single nucleotide polymorphisms (SNPs) and copy number variations (CNVs) between RJF and these domesticated breeds were assessed in a reduced fraction of their genome. Additionally, DNA methylation in the same fraction of the genome was measured in the sperm of RJF individuals to identify possible correlations with the mutations found between RJF and the domesticated breeds. Our study shows that although the vast majority of CpG-related mutations found relate to CNVs, CpGs disproportionally associate to SNPs in comparison to CNVs, where they are indeed substantially under-represented. Moreover, CpGs seem to be hotspots of mutations related to speciation. We suggest that, on the one hand, CpG-related mutations in CNV regions would promote genomic ‘flexibility’ in evolution, i.e., the ability of the genome to expand its functional possibilities; on the other hand, CpG-related mutations in SNPs would relate to genomic ‘specificity’ in evolution, thus, representing mutations that would associate with phenotypic traits relevant for speciation.
Collapse
Affiliation(s)
- Fábio Pértille
- a Avian Behavioral Genomics and Physiology Group, IFM Biology , Linköping University , Linköping , Sweden.,b Animal Biotechnology Laboratory, Animal Science Department , University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ) , Piracicaba , São Paulo , Brazil
| | - Vinicius H Da Silva
- c Animal Breeding and Genomics Centre , Wageningen University & Research , Wageningen , The Netherlands.,d Department of Animal Ecology (AnE) , Netherlands Institute of Ecology (NIOO-KNAW) , Wageningen , The Netherlands.,e Department of Animal Breeding and Genetics , Swedish University of Agricultural Sciences , Uppsala , Sweden
| | - Anna M Johansson
- e Department of Animal Breeding and Genetics , Swedish University of Agricultural Sciences , Uppsala , Sweden
| | - Tom Lindström
- f Division of Theoretical Biology, IFM , Linköping University , Linköping , Sweden
| | - Dominic Wright
- a Avian Behavioral Genomics and Physiology Group, IFM Biology , Linköping University , Linköping , Sweden
| | - Luiz L Coutinho
- b Animal Biotechnology Laboratory, Animal Science Department , University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ) , Piracicaba , São Paulo , Brazil
| | - Per Jensen
- a Avian Behavioral Genomics and Physiology Group, IFM Biology , Linköping University , Linköping , Sweden
| | - Carlos Guerrero-Bosagna
- a Avian Behavioral Genomics and Physiology Group, IFM Biology , Linköping University , Linköping , Sweden
| |
Collapse
|
3
|
Daub JT, Dupanloup I, Robinson-Rechavi M, Excoffier L. Inference of Evolutionary Forces Acting on Human Biological Pathways. Genome Biol Evol 2015; 7:1546-58. [PMID: 25971280 PMCID: PMC4494071 DOI: 10.1093/gbe/evv083] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/09/2015] [Indexed: 12/15/2022] Open
Abstract
Because natural selection is likely to act on multiple genes underlying a given phenotypic trait, we study here the potential effect of ongoing and past selection on the genetic diversity of human biological pathways. We first show that genes included in gene sets are generally under stronger selective constraints than other genes and that their evolutionary response is correlated. We then introduce a new procedure to detect selection at the pathway level based on a decomposition of the classical McDonald-Kreitman test extended to multiple genes. This new test, called 2DNS, detects outlier gene sets and takes into account past demographic effects and evolutionary constraints specific to gene sets. Selective forces acting on gene sets can be easily identified by a mere visual inspection of the position of the gene sets relative to their two-dimensional null distribution. We thus find several outlier gene sets that show signals of positive, balancing, or purifying selection but also others showing an ancient relaxation of selective constraints. The principle of the 2DNS test can also be applied to other genomic contrasts. For instance, the comparison of patterns of polymorphisms private to African and non-African populations reveals that most pathways show a higher proportion of nonsynonymous mutations in non-Africans than in Africans, potentially due to different demographic histories and selective pressures.
Collapse
Affiliation(s)
- Josephine T Daub
- CMPG, Institute of Ecology and Evolution, University of Berne, Switzerland Swiss Institute of Bioinformatics SIB, Lausanne, Switzerland Present address: Institute of Evolutionary Biology (UPF-CSIC), Barcelona, Spain
| | - Isabelle Dupanloup
- CMPG, Institute of Ecology and Evolution, University of Berne, Switzerland Swiss Institute of Bioinformatics SIB, Lausanne, Switzerland
| | - Marc Robinson-Rechavi
- Swiss Institute of Bioinformatics SIB, Lausanne, Switzerland Department of Ecology and Evolution, University of Lausanne, Switzerland
| | - Laurent Excoffier
- CMPG, Institute of Ecology and Evolution, University of Berne, Switzerland Swiss Institute of Bioinformatics SIB, Lausanne, Switzerland
| |
Collapse
|
4
|
Abstract
Evolutionary conservation has been an accurate predictor of functional elements across the first decade of metazoan genomics. More recently, there has been a move to define functional elements instead from biochemical annotations. Evolutionary methods are, however, more comprehensive than biochemical approaches can be and can assess quantitatively, especially for subtle effects, how biologically important--how injurious after mutation--different types of elements are. Evolutionary methods are thus critical for understanding the large fraction (up to 10%) of the human genome that does not encode proteins and yet might convey function. These methods can also capture the ephemeral nature of much noncoding functional sequence, with large numbers of functional elements having been gained and lost rapidly along each mammalian lineage. Here, we review how different strengths of purifying selection have impacted on protein-coding and non-protein-coding loci and on transcription factor binding sites in mammalian and fruit fly genomes.
Collapse
Affiliation(s)
- Wilfried Haerty
- MRC Functional Genomics Unit, Department of Physiology, Anatomy, and Genetics, University of Oxford, Oxford OX1 3PT, United Kingdom; ,
| | | |
Collapse
|
5
|
Gotea V, Elnitski L. Ascertaining regions affected by GC-biased gene conversion through weak-to-strong mutational hotspots. Genomics 2014; 103:349-56. [PMID: 24727706 DOI: 10.1016/j.ygeno.2014.04.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2013] [Revised: 03/31/2014] [Accepted: 04/04/2014] [Indexed: 12/24/2022]
Abstract
A major objective for evolutionary biology is to identify regions affected by positive selection. High dN/dS values for proteins and accelerated lineage-specific substitution rates for non-coding regions are considered classic signatures of positive selection. However, these could also be the result of non-adaptive phenomena, such as GC-biased gene conversion (gBGC), which favors the fixation of strong (C/G) over weak (A/T) nucleotides. Recent estimates indicate that gBGC affected up to 20% of regions with signatures of positive selection. Here we evaluate the impact of gBGC through its molecular signature of weak-to-strong mutational hotspots. We implemented specific modifications to the test proposed by Tang and Lewontin (1999) for identifying regions of differential variability and applied it to regions previously investigated for the influence of gBGC. While we found significant agreement with previous reports, our results suggest a smaller influence of gBGC than previously estimated, warranting further development of methods for its detection.
Collapse
Affiliation(s)
- Valer Gotea
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.
| | - Laura Elnitski
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
6
|
Jex AR, Koehler AV, Ansell BR, Baker L, Karunajeewa H, Gasser RB. Getting to the guts of the matter: The status and potential of ‘omics’ research of parasitic protists of the human gastrointestinal system. Int J Parasitol 2013; 43:971-82. [DOI: 10.1016/j.ijpara.2013.06.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2013] [Revised: 06/07/2013] [Accepted: 06/07/2013] [Indexed: 11/17/2022]
|
7
|
Dani SU, März W, Neves PMS, Walter GF. Pairomics, the omics way to mate choice. J Hum Genet 2013; 58:643-56. [PMID: 23945982 DOI: 10.1038/jhg.2013.86] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2013] [Revised: 06/17/2013] [Accepted: 07/03/2013] [Indexed: 11/09/2022]
Abstract
The core aspects of the biology and evolution of sexual reproduction are reviewed with a focus on the diploid, sexually reproducing, outbreeding, polymorphic, unspecialized, altricial and cultural human species. Human mate choice and pair bonding are viewed as central to individuals' lives and to the evolution of the species, and genetic assistance in reproduction is viewed as a universal human right. Pairomics is defined as an emerging branch of the omics science devoted to the study of mate choice at the genomic level and its consequences for present and future generations. In pairomics, comprehensive genetic information of individual genomes is stored in a database. Computational tools are employed to analyze the mating schemes and rules that govern mating among the members of the database. Mating models and algorithms simulate the outcomes of mating any given genome with each of a number of genomes represented in the database. The analyses and simulations may help to understand mating schemes and their outcomes, and also contribute a new cue to the multicued schemes of mate choice. The scientific, medical, evolutionary, ethical, legal and social implications of pairomics are far reaching. The use of genetic information as a search tool in mate choice may influence our health, lifestyle, behavior and culture. As knowledge on genomics, population genetics and gene-environment interactions, as well as the size of genomic databases expand, so does the ability of pairomics to investigate and predict the consequences of mate choice for the present and future generations.
Collapse
Affiliation(s)
- Sergio Ulhoa Dani
- Medawar Institute for Medical and Environmental Research, Acangau Foundation, Paracatu, Brazil
| | | | | | | |
Collapse
|
8
|
Clarke AJ, Cooper DN, Krawczak M, Tyler-Smith C, Wallace HM, Wilkie AOM, Raymond FL, Chadwick R, Craddock N, John R, Gallacher J, Chiano M. 'Sifting the significance from the data' - the impact of high-throughput genomic technologies on human genetics and health care. Hum Genomics 2012; 6:11. [PMID: 23244462 PMCID: PMC3500243 DOI: 10.1186/1479-7364-6-11] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2011] [Accepted: 05/18/2012] [Indexed: 01/01/2023] Open
Abstract
This report is of a round-table discussion held in Cardiff in September 2009 for Cesagen, a research centre within the Genomics Network of the UK’s Economic and Social Research Council. The meeting was arranged to explore ideas as to the likely future course of human genomics. The achievements of genomics research were reviewed, and the likely constraints on the pace of future progress were explored. New knowledge is transforming biology and our understanding of evolution and human disease. The difficulties we face now concern the interpretation rather than the generation of new sequence data. Our understanding of gene-environment interaction is held back by our current primitive tools for measuring environmental factors, and in addition, there may be fundamental constraints on what can be known about these complex interactions.
Collapse
Affiliation(s)
- Angus J Clarke
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, Wales CF14 4XN, UK.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Naidoo N, Pawitan Y, Soong R, Cooper DN, Ku CS. Human genetics and genomics a decade after the release of the draft sequence of the human genome. Hum Genomics 2012; 5:577-622. [PMID: 22155605 PMCID: PMC3525251 DOI: 10.1186/1479-7364-5-6-577] [Citation(s) in RCA: 77] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Substantial progress has been made in human genetics and genomics research over the past ten years since the publication of the draft sequence of the human genome in 2001. Findings emanating directly from the Human Genome Project, together with those from follow-on studies, have had an enormous impact on our understanding of the architecture and function of the human genome. Major developments have been made in cataloguing genetic variation, the International HapMap Project, and with respect to advances in genotyping technologies. These developments are vital for the emergence of genome-wide association studies in the investigation of complex diseases and traits. In parallel, the advent of high-throughput sequencing technologies has ushered in the 'personal genome sequencing' era for both normal and cancer genomes, and made possible large-scale genome sequencing studies such as the 1000 Genomes Project and the International Cancer Genome Consortium. The high-throughput sequencing and sequence-capture technologies are also providing new opportunities to study Mendelian disorders through exome sequencing and whole-genome sequencing. This paper reviews these major developments in human genetics and genomics over the past decade.
Collapse
Affiliation(s)
- Nasheen Naidoo
- Centre for Molecular Epidemiology, Department of Epidemiology and Public Health, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
| | | | | | | | | |
Collapse
|
10
|
Hofer T, Foll M, Excoffier L. Evolutionary forces shaping genomic islands of population differentiation in humans. BMC Genomics 2012; 13:107. [PMID: 22439654 PMCID: PMC3317871 DOI: 10.1186/1471-2164-13-107] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2011] [Accepted: 03/22/2012] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Levels of differentiation among populations depend both on demographic and selective factors: genetic drift and local adaptation increase population differentiation, which is eroded by gene flow and balancing selection. We describe here the genomic distribution and the properties of genomic regions with unusually high and low levels of population differentiation in humans to assess the influence of selective and neutral processes on human genetic structure. METHODS Individual SNPs of the Human Genome Diversity Panel (HGDP) showing significantly high or low levels of population differentiation were detected under a hierarchical-island model (HIM). A Hidden Markov Model allowed us to detect genomic regions or islands of high or low population differentiation. RESULTS Under the HIM, only 1.5% of all SNPs are significant at the 1% level, but their genomic spatial distribution is significantly non-random. We find evidence that local adaptation shaped high-differentiation islands, as they are enriched for non-synonymous SNPs and overlap with previously identified candidate regions for positive selection. Moreover there is a negative relationship between the size of islands and recombination rate, which is stronger for islands overlapping with genes. Gene ontology analysis supports the role of diet as a major selective pressure in those highly differentiated islands. Low-differentiation islands are also enriched for non-synonymous SNPs, and contain an overly high proportion of genes belonging to the 'Oncogenesis' biological process. CONCLUSIONS Even though selection seems to be acting in shaping islands of high population differentiation, neutral demographic processes might have promoted the appearance of some genomic islands since i) as much as 20% of islands are in non-genic regions ii) these non-genic islands are on average two times shorter than genic islands, suggesting a more rapid erosion by recombination, and iii) most loci are strongly differentiated between Africans and non-Africans, a result consistent with known human demographic history.
Collapse
Affiliation(s)
- Tamara Hofer
- Computational and Molecular Population Genetics Lab, Institute of Ecology and Evolution, University of Bern, 3012 Bern, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Matthieu Foll
- Computational and Molecular Population Genetics Lab, Institute of Ecology and Evolution, University of Bern, 3012 Bern, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Laurent Excoffier
- Computational and Molecular Population Genetics Lab, Institute of Ecology and Evolution, University of Bern, 3012 Bern, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| |
Collapse
|
11
|
Finalism in Darwinian and Lamarckian Evolution: Lessons from Epigenetics and Developmental Biology. Evol Biol 2012. [DOI: 10.1007/s11692-012-9163-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
|
12
|
Frenkel S, Kirzhner V, Korol A. Organizational heterogeneity of vertebrate genomes. PLoS One 2012; 7:e32076. [PMID: 22384143 PMCID: PMC3288070 DOI: 10.1371/journal.pone.0032076] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2011] [Accepted: 01/23/2012] [Indexed: 01/06/2023] Open
Abstract
Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.
Collapse
Affiliation(s)
| | | | - Abraham Korol
- Department of Evolutionary and Environmental Biology and Institute of Evolution, University of Haifa, Mount Carmel, Haifa, Israel
| |
Collapse
|
13
|
Gu M, Dong X, Shi L, Shi L, Lin K, Huang X, Chu J. Differences in mtDNA whole sequence between Tibetan and Han populations suggesting adaptive selection to high altitude. Gene 2011; 496:37-44. [PMID: 22233893 DOI: 10.1016/j.gene.2011.12.016] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2011] [Revised: 12/02/2011] [Accepted: 12/06/2011] [Indexed: 10/14/2022]
Abstract
We performed a mitochondrial whole-genome comparison study in 40 Tibetan and 50 Han Chinese. All subjects could be classified into 13 haplogroups pertained to the Macrohaplogroup M and N that pitched different quadrants by principal component analysis. We observed a difference in the M9 haplogroup and identified 18 significant variants by comparing whole sequences between Tibetan and Han populations. Variants in ND2, COX2, tRNA alanine and 12S rRNA were predicted to confer increased protein stability in Tibetans. We compared the base substitutions of nonsynonymous (NS) versus synonymous (S) of 13 protein-encoding genes and found the NS/S values of the ATP6, ATP8, and Cyt b genes were larger (>1) in Tibetans than that in Han population. Our findings provide clues for the existence of adaptive selection for the ATP6, ATP8, Cyt b, ND2, COX2, tRNA alanine and 12S rRNA genes in Tibetans which likely contributed to adaptation to their specific geographic environment, such as high altitude.
Collapse
Affiliation(s)
- Mingliang Gu
- Department of Medical Genetics, Institute of Medical Biology, Chinese Academy of Medical Sciences & Peking Union Medical College, Kunming, China
| | | | | | | | | | | | | |
Collapse
|
14
|
Harris EE. Nonadaptive processes in primate and human evolution. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2011; 143 Suppl 51:13-45. [PMID: 21086525 DOI: 10.1002/ajpa.21439] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Evolutionary biology has tended to focus on adaptive evolution by positive selection as the primum mobile of evolutionary trajectories in species while underestimating the importance of nonadaptive evolutionary processes. In this review, I describe evidence that suggests that primate and human evolution has been strongly influenced by nonadaptive processes, particularly random genetic drift and mutation. This is evidenced by three fundamental effects: a relative relaxation of selective constraints (i.e., purifying selection), a relative increase in the fixation of slightly deleterious mutations, and a general reduction in the efficacy of positive selection. These effects are observed in protein-coding, regulatory regions, and in gene expression data, as well as in an augmentation of fixation of large-scale mutations, including duplicated genes, mobile genetic elements, and nuclear mitochondrial DNA. The evidence suggests a general population-level explanation such as a reduction in effective population size (N(e)). This would have tipped the balance between the evolutionary forces of natural selection and random genetic drift toward genetic drift for variants having small selective effects. After describing these proximate effects, I describe the potential consequences of these effects for primate and human evolution. For example, an increase in the fixation of slightly deleterious mutations could potentially have led to an increase in the fixation rate of compensatory mutations that act to suppress the effects of slightly deleterious substitutions. The potential consequences of compensatory evolution for the evolution of novel gene functions and in potentially confounding the detection of positively selected genes are explored. The consequences of the passive accumulation of large-scale genomic mutations by genetic drift are unclear, though evidence suggests that new gene copies as well as insertions of transposable elements into genes can potentially lead to adaptive phenotypes. Finally, because a decrease in selective constraint at the genetic level is expected to have effects at the morphological level, I review studies that compare rates of morphological change in various mammalian and island populations where N(e) is reduced. Furthermore, I discuss evidence that suggests that craniofacial morphology in the Homo lineage has shifted from an evolutionary rate constrained by purifying selection toward a neutral evolutionary rate.
Collapse
Affiliation(s)
- Eugene E Harris
- Department of Biological Sciences and Geology, Queensborough Community College, City University of New York, Bayside, NY 10364, USA.
| |
Collapse
|
15
|
Cooper DN, Chen JM, Ball EV, Howells K, Mort M, Phillips AD, Chuzhanova N, Krawczak M, Kehrer-Sawatzki H, Stenson PD. Genes, mutations, and human inherited disease at the dawn of the age of personalized genomics. Hum Mutat 2010; 31:631-55. [PMID: 20506564 DOI: 10.1002/humu.21260] [Citation(s) in RCA: 117] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The number of reported germline mutations in human nuclear genes, either underlying or associated with inherited disease, has now exceeded 100,000 in more than 3,700 different genes. The availability of these data has both revolutionized the study of the morbid anatomy of the human genome and facilitated "personalized genomics." With approximately 300 new "inherited disease genes" (and approximately 10,000 new mutations) being identified annually, it is pertinent to ask how many "inherited disease genes" there are in the human genome, how many mutations reside within them, and where such lesions are likely to be located? To address these questions, it is necessary not only to reconsider how we define human genes but also to explore notions of gene "essentiality" and "dispensability."Answers to these questions are now emerging from recent novel insights into genome structure and function and through complete genome sequence information derived from multiple individual human genomes. However, a change in focus toward screening functional genomic elements as opposed to genes sensu stricto will be required if we are to capitalize fully on recent technical and conceptual advances and identify new types of disease-associated mutation within noncoding regions remote from the genes whose function they disrupt.
Collapse
Affiliation(s)
- David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, United Kingdom.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Mitterauer BJ. Many Realities: Outline of a Brain Philosophy Based on Glial-Neuronal Interactions. JOURNAL OF INTELLIGENT SYSTEMS 2010. [DOI: 10.1515/jisys.2010.19.4.337] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
17
|
Marques AC, Ponting CP. Catalogues of mammalian long noncoding RNAs: modest conservation and incompleteness. Genome Biol 2009; 10:R124. [PMID: 19895688 PMCID: PMC3091318 DOI: 10.1186/gb-2009-10-11-r124] [Citation(s) in RCA: 209] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2009] [Revised: 10/21/2009] [Accepted: 11/06/2009] [Indexed: 12/12/2022] Open
Abstract
A comparative evolutionary analysis of two mouse long noncoding RNA libraries reveals a much larger pool of noncoding RNAs remains yet to be discovered. Background Despite increasing interest in the noncoding fraction of transcriptomes, the number, species-conservation and functions, if any, of many non-protein-coding transcripts remain to be discovered. Two extensive long intergenic noncoding RNA (ncRNA) transcript catalogues are now available for mouse: over 3,000 macroRNAs identified by cDNA sequencing, and 1,600 long intergenic noncoding RNA (lincRNA) intervals that are predicted from chromatin-state maps. Previously we showed that macroRNAs tend to be more highly conserved than putatively neutral sequence, although only 5% of bases are predicted as constrained. By contrast, over a thousand lincRNAs were reported as being highly conserved. This apparent difference may account for the surprisingly small fraction (11%) of transcripts that are represented in both catalogues. Here we sought to resolve the reported discrepancy between the evolutionary rates for these two sets. Results Our analyses reveal lincRNA and macroRNA exon sequences to be subject to the same relatively low degree of sequence constraint. Nonetheless, our observations are consistent with the functionality of a fraction of ncRNA in these sets, with up to a quarter of ncRNA exons having evolved significantly slower than neighboring neutral sequence. The more tissue-specific macroRNAs are enriched in predicted RNA secondary structures and thus may often act in trans, whereas the more highly and broadly expressed lincRNAs appear more likely to act in the cis-regulation of adjacent transcription factor genes. Conclusions Taken together, our results indicate that each of the two ncRNA catalogues unevenly and lightly samples the true, much larger, ncRNA repertoire of the mouse.
Collapse
Affiliation(s)
- Ana C Marques
- MRC Functional Genomics Unit, University of Oxford, Department of Physiology, Anatomy and Genetics, Oxford OX1 3QX, UK.
| | | |
Collapse
|
18
|
Bamshad M, Stephens JC. Assessing human variation data for signatures of natural selection. Cold Spring Harb Protoc 2009; 2009:pdb.top61. [PMID: 20150073 DOI: 10.1101/pdb.top61] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
In this article, we highlight some of the different types of natural selection, their effects on patterns of DNA variation, and some of the statistical tests that are commonly used to detect such effects. We also explain some of the relative strengths and weaknesses of different strategies that can be used to detect signatures of natural selection at individual loci. These strategies are illustrated by their application to empirical data from gene variants that are often associated with differences in disease susceptibility. We briefly outline some of the methods proposed to scan the genome for evidence of selection. Finally, we discuss some of the problems associated with identifying signatures of selection and with making inferences about the nature of the selective process.
Collapse
|
19
|
Morris KV. Long antisense non-coding RNAs function to direct epigenetic complexes that regulate transcription in human cells. Epigenetics 2009; 4:296-301. [PMID: 19633414 DOI: 10.4161/epi.4.5.9282] [Citation(s) in RCA: 93] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Epigenetic silencing of tumor suppressor gene promoters is one of the most common observations found in cancer. Despite the plethora of observed epigenetically silenced cancer related genes little is known about what is guiding the silencing to these particular loci. Two recent articles suggest that long antisense non-coding RNAs function as epigenetic regulators of transcription in human cells. These reports, along with previous observations that small antisense non-coding RNAs can epigenetically regulate transcription, imply that long antisense non-coding RNAs function as endogenous transcriptional regulatory RNAs in humans. Mechanistically, these long antisense non-coding RNAs may be involved in maintaining balanced transcription at bidirectionally transcribed loci as a method to modulate gene expression according to the selective pressures placed on the cell. The loss of this intricate bidirectional RNA based regulatory network can result in overt epigenetic silencing of gene expression. In the case of tumor suppressor genes this silencing can lead to the loss of cellular regulation and be a contributing factor in cancer. This perspective will highlight the endogenous effector RNAs and mechanism of action whereby long antisense non-coding RNAs transcriptionally regulate gene expression in human cells.
Collapse
Affiliation(s)
- Kevin V Morris
- Department of Molecular and Experimental Medicine, The Scripps Research Institute, La Jolla, CA 92037, USA.
| |
Collapse
|
20
|
|
21
|
Zhao Y, Epstein RJ. Programmed genetic instability: a tumor-permissive mechanism for maintaining the evolvability of higher species through methylation-dependent mutation of DNA repair genes in the male germ line. Mol Biol Evol 2008; 25:1737-49. [PMID: 18535014 PMCID: PMC2464741 DOI: 10.1093/molbev/msn126] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Tumor suppressor genes are classified by their somatic behavior either as caretakers (CTs) that maintain DNA integrity or as gatekeepers (GKs) that regulate cell survival, but the germ line role of these disease-related gene subgroups may differ. To test this hypothesis, we have used genomic data mining to compare the features of human CTs (n = 38), GKs (n = 36), DNA repair genes (n = 165), apoptosis genes (n = 622), and their orthologs. This analysis reveals that repair genes are numerically less common than apoptosis genes in the genomes of multicellular organisms (P < 0.01), whereas CT orthologs are commoner than GK orthologs in unicellular organisms (P < 0.05). Gene targeting data show that CTs are less essential than GKs for survival of multicellular organisms (P < 0.0005) and that CT knockouts often permit offspring viability at the cost of male sterility. Patterns of human familial oncogenic mutations confirm that isolated CT loss is commoner than is isolated GK loss (P < 0.00001). In sexually reproducing species, CTs appear subject to less efficient purifying selection (i.e., higher Ka/Ks) than GKs (P = 0.000003); the faster evolution of CTs seems likely to be mediated by gene methylation and reduced transcription-coupled repair, based on differences in dinucleotide patterns (P = 0.001). These data suggest that germ line CT/repair gene function is relatively dispensable for survival, and imply that milder (e.g., epimutational) male prezygotic repair defects could enhance sperm variation—and hence environmental adaptation and speciation—while sparing fertility. We submit that CTs and repair genes are general targets for epigenetically initiated adaptive evolution, and propose a model in which human cancers arise in part as an evolutionarily programmed side effect of age- and damage-inducible genetic instability affecting both somatic and germ line lineages.
Collapse
Affiliation(s)
- Yongzhong Zhao
- Laboratory of Computational Oncology, Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong
| | | |
Collapse
|
22
|
Ropers HH. Genetics of intellectual disability. Curr Opin Genet Dev 2008; 18:241-50. [DOI: 10.1016/j.gde.2008.07.008] [Citation(s) in RCA: 143] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2008] [Accepted: 07/15/2008] [Indexed: 11/16/2022]
|
23
|
Riley BM, Murray JC. Sequence evaluation of FGF and FGFR gene conserved non-coding elements in non-syndromic cleft lip and palate cases. Am J Med Genet A 2008; 143A:3228-34. [PMID: 17963255 DOI: 10.1002/ajmg.a.31965] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Non-syndromic cleft lip and palate (NS CLP) is a complex birth defect resulting from multiple genetic and environmental factors. We have previously reported the sequencing of the coding region of genes in the fibroblast growth factor (FGF) signaling pathway, in which missense and non-sense mutations contribute to approximately 5%-6% NS CLP cases. In this article we report the sequencing of conserved non-coding elements (CNEs) in and around 11 of the FGF and FGFR genes, which identified 55 novel variants. Seven of variants are highly conserved among >/=8 species and 31 variants alter transcription factor binding sites, 8 of which are important for craniofacial development. Additionally, 15 NS CLP patients had a combination of coding mutations and CNE variants, suggesting that an accumulation of variants in the FGF signaling pathway may contribute to clefting.
Collapse
Affiliation(s)
- Bridget M Riley
- Department of Pediatrics, University of Iowa, Iowa City, Iowa, USA
| | | |
Collapse
|
24
|
Levasseur A, Orlando L, Bailly X, Milinkovitch MC, Danchin EGJ, Pontarotti P. Conceptual bases for quantifying the role of the environment on gene evolution: the participation of positive selection and neutral evolution. Biol Rev Camb Philos Soc 2007; 82:551-72. [PMID: 17944617 DOI: 10.1111/j.1469-185x.2007.00024.x] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
To demonstrate that a given change in the environment has contributed to the emergence of a given genotypic and phenotypic shift during the course of evolution, one should ask to what extent such shifts would have occurred without environmental change. Of course, such tests are rarely practical but phenotypic novelties can still be correlated to genomic shifts in response to environmental changes if enough information is available. We surveyed and re-evaluated the published data in order to estimate the role of environmental changes on the course of species and genomic evolution. Only a few published examples clearly demonstrate a causal link between a given environmental change and the fixation of a genomic variant resulting in functional modification (gain, loss or alteration of function). Many others suggested a link between a given phenotypic shift and a given environmental change but failed to identify the underlying genomic determinant(s) and/or the associated functional consequence(s). The proportion of genotypic and phenotypic variation that is fixed concomitantly with environmental changes is often considered adaptive and hence, the result of positive selection, even though alternative causes, such as genetic drift, are rarely investigated. Therefore, the second aim herein is to review evidence for the mechanisms leading to fixation.
Collapse
Affiliation(s)
- Anthony Levasseur
- Phylogenomics Laboratory, EA 3781 Evolution Biologique Université de Provence, Case 19, Pl. V. Hugo, 13331 Marseille Cedex 03, France.
| | | | | | | | | | | |
Collapse
|
25
|
Abnizova I, Subhankulova T, Gilks WR. Recent computational approaches to understand gene regulation: mining gene regulation in silico. Curr Genomics 2007; 8:79-91. [PMID: 18660846 PMCID: PMC2435357 DOI: 10.2174/138920207780368150] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2006] [Revised: 12/13/2006] [Accepted: 12/15/2006] [Indexed: 01/03/2023] Open
Abstract
This paper reviews recent computational approaches to the understanding of gene regulation in eukaryotes. Cis-regulation of gene expression by the binding of transcription factors is a critical component of cellular physiology. In eukaryotes, a number of transcription factors often work together in a combinatorial fashion to enable cells to respond to a wide spectrum of environmental and developmental signals. Integration of genome sequences and/or Chromatin Immunoprecipitation on chip data with gene-expression data has facilitated in silico discovery of how the combinatorics and positioning of transcription factors binding sites underlie gene activation in a variety of cellular processes.The process of gene regulation is extremely complex and intriguing, therefore all possible points of view and related links should be carefully considered. Here we attempt to collect an inventory, not claiming it to be comprehensive and complete, of related computational biological topics covering gene regulation, which may en-lighten the process, and briefly review what is currently occurring in these areas.We will consider the following computational areas:o gene regulatory network construction;o evolution of regulatory DNA;o studies of its structural and statistical informational properties;o and finally, regulatory RNA.
Collapse
Affiliation(s)
| | - T Subhankulova
- Wellcome Trust/Cancer Research UK Gurdon Institute of Cancer and Developmental Biology, Cambridge, UK
| | | |
Collapse
|
26
|
Ponjavic J, Ponting CP, Lunter G. Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs. Genome Res 2007; 17:556-65. [PMID: 17387145 PMCID: PMC1855172 DOI: 10.1101/gr.6036807] [Citation(s) in RCA: 537] [Impact Index Per Article: 31.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Long transcripts that do not encode protein have only rarely been the subject of experimental scrutiny. Presumably, this is owing to the current lack of evidence of their functionality, thereby leaving an impression that, instead, they represent "transcriptional noise." Here, we describe an analysis of 3122 long and full-length, noncoding RNAs ("macroRNAs") from the mouse, and compare their sequences and their promoters with orthologous sequence from human and from rat. We considered three independent signatures of purifying selection related to substitutions, sequence insertions and deletions, and splicing. We find that the evolution of the set of noncoding RNAs is not consistent with neutralist explanations. Rather, our results indicate that purifying selection has acted on the macroRNAs' promoters, primary sequence, and consensus splice site motifs. Promoters have experienced the greatest elimination of nucleotide substitutions, insertions, and deletions. The proportion of conserved sequence (4.1%-5.5%) in these macroRNAs is comparable to the density of exons within protein-coding transcripts (5.2%). These macroRNAs, taken together, thus possess the imprint of purifying selection, thereby indicating their functionality. Our findings should now provide an incentive for the experimental investigation of these macroRNAs' functions.
Collapse
Affiliation(s)
- Jasmina Ponjavic
- MRC Functional Genetics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford OX1 3QX, United Kingdom
| | - Chris P. Ponting
- MRC Functional Genetics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford OX1 3QX, United Kingdom
- Corresponding authors.E-mail ; fax 44-1865-282651.E-mail ; fax 44-1865-282651
| | - Gerton Lunter
- MRC Functional Genetics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford OX1 3QX, United Kingdom
- Corresponding authors.E-mail ; fax 44-1865-282651.E-mail ; fax 44-1865-282651
| |
Collapse
|
27
|
|