1
|
Wang D. DLGP: A database for lineage-conserved and lineage-specific gene pairs in animal and plant genomes. Biochem Biophys Res Commun 2015; 469:542-5. [PMID: 26697753 DOI: 10.1016/j.bbrc.2015.12.039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Accepted: 12/10/2015] [Indexed: 10/22/2022]
Abstract
The conservation of gene organization in the genome with lineage-specificity is an invaluable resource to decipher their potential functionality with diverse selective constraints, especially in higher animals and plants. Gene pairs appear to be the minimal structure for such kind of gene clusters that tend to reside in their preferred locations, representing the distinctive genomic characteristics in single species or a given lineage. Despite gene families having been investigated in a widespread manner, the definition of gene pair families in various taxa still lacks adequate attention. To address this issue, we report DLGP (http://lcgbase.big.ac.cn/DLGP/) that stores the pre-calculated lineage-based gene pairs in currently available 134 animal and plant genomes and inspect them under the same analytical framework, bringing out a set of innovational features. First, the taxonomy or lineage has been classified into four levels such as Kingdom, Phylum, Class and Order. It adopts all-to-all comparison strategy to identify the possible conserved gene pairs in all species for each gene pair in certain species and reckon those that are conserved in over a significant proportion of species in a given lineage (e.g. Primates, Diptera or Poales) as the lineage-conserved gene pairs. Furthermore, it predicts the lineage-specific gene pairs by retaining the above-mentioned lineage-conserved gene pairs that are not conserved in any other lineages. Second, it carries out pairwise comparison for the gene pairs between two compared species and creates the table including all the conserved gene pairs and the image elucidating the conservation degree of gene pairs in chromosomal level. Third, it supplies gene order browser to extend gene pairs to gene clusters, allowing users to view the evolution dynamics in the gene context in an intuitive manner. This database will be able to facilitate the particular comparison between animals and plants, between vertebrates and arthropods, and between monocots and eudicots, accounting for the significant contribution of gene pairs to speciation and diversification in specific lineages.
Collapse
Affiliation(s)
- Dapeng Wang
- Stem Cell Laboratory, UCL Cancer Institute, University College London, London WC1E 6BT, UK; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, PR China.
| |
Collapse
|
2
|
Pourkheirandish M, Hensel G, Kilian B, Senthil N, Chen G, Sameri M, Azhaguvel P, Sakuma S, Dhanagond S, Sharma R, Mascher M, Himmelbach A, Gottwald S, Nair SK, Tagiri A, Yukuhiro F, Nagamura Y, Kanamori H, Matsumoto T, Willcox G, Middleton CP, Wicker T, Walther A, Waugh R, Fincher GB, Stein N, Kumlehn J, Sato K, Komatsuda T. Evolution of the Grain Dispersal System in Barley. Cell 2015; 162:527-39. [PMID: 26232223 DOI: 10.1016/j.cell.2015.07.002] [Citation(s) in RCA: 182] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2014] [Revised: 03/13/2015] [Accepted: 06/10/2015] [Indexed: 10/23/2022]
Abstract
About 12,000 years ago in the Near East, humans began the transition from hunter-gathering to agriculture-based societies. Barley was a founder crop in this process, and the most important steps in its domestication were mutations in two adjacent, dominant, and complementary genes, through which grains were retained on the inflorescence at maturity, enabling effective harvesting. Independent recessive mutations in each of these genes caused cell wall thickening in a highly specific grain "disarticulation zone," converting the brittle floral axis (the rachis) of the wild-type into a tough, non-brittle form that promoted grain retention. By tracing the evolutionary history of allelic variation in both genes, we conclude that spatially and temporally independent selections of germplasm with a non-brittle rachis were made during the domestication of barley by farmers in the southern and northern regions of the Levant, actions that made a major contribution to the emergence of early agrarian societies.
Collapse
Affiliation(s)
| | - Goetz Hensel
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, 06466 Stadt Seeland, Germany
| | - Benjamin Kilian
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, 06466 Stadt Seeland, Germany
| | - Natesan Senthil
- National Institute of Agrobiological Sciences, 305-8602 Tsukuba, Japan
| | - Guoxiong Chen
- National Institute of Agrobiological Sciences, 305-8602 Tsukuba, Japan
| | - Mohammad Sameri
- National Institute of Agrobiological Sciences, 305-8602 Tsukuba, Japan
| | - Perumal Azhaguvel
- National Institute of Agrobiological Sciences, 305-8602 Tsukuba, Japan
| | - Shun Sakuma
- National Institute of Agrobiological Sciences, 305-8602 Tsukuba, Japan
| | - Sidram Dhanagond
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, 06466 Stadt Seeland, Germany
| | - Rajiv Sharma
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, 06466 Stadt Seeland, Germany
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, 06466 Stadt Seeland, Germany
| | - Axel Himmelbach
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, 06466 Stadt Seeland, Germany
| | - Sven Gottwald
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, 06466 Stadt Seeland, Germany
| | - Sudha K Nair
- National Institute of Agrobiological Sciences, 305-8602 Tsukuba, Japan
| | - Akemi Tagiri
- National Institute of Agrobiological Sciences, 305-8602 Tsukuba, Japan
| | - Fumiko Yukuhiro
- National Institute of Agrobiological Sciences, 305-8602 Tsukuba, Japan
| | - Yoshiaki Nagamura
- National Institute of Agrobiological Sciences, 305-8602 Tsukuba, Japan
| | - Hiroyuki Kanamori
- National Institute of Agrobiological Sciences, 305-8602 Tsukuba, Japan
| | - Takashi Matsumoto
- National Institute of Agrobiological Sciences, 305-8602 Tsukuba, Japan
| | - George Willcox
- Archéorient CNRS UMR 5133, Université de Lyon II, Jalés, Berrias 07460, France
| | | | - Thomas Wicker
- Institute of Plant Biology, University of Zürich, 8008 Zürich, Switzerland
| | - Alexander Walther
- Department of Earth Sciences, University of Gothenburg, 405 30 Gothenburg, Sweden
| | - Robbie Waugh
- University of Dundee, The James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Geoffrey B Fincher
- ARC Centre of Excellence in Plant Cell Walls, School of Agriculture, Food and Wine, University of Adelaide, Waite Campus Glen Osmond, SA 5066, Australia
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, 06466 Stadt Seeland, Germany
| | - Jochen Kumlehn
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, 06466 Stadt Seeland, Germany
| | - Kazuhiro Sato
- Institute of Plant Science and Resources, Okayama University, 710-0046 Kurashiki, Japan
| | - Takao Komatsuda
- National Institute of Agrobiological Sciences, 305-8602 Tsukuba, Japan.
| |
Collapse
|
3
|
Yegorov S, Good S. Using paleogenomics to study the evolution of gene families: origin and duplication history of the relaxin family hormones and their receptors. PLoS One 2012; 7:e32923. [PMID: 22470432 PMCID: PMC3310001 DOI: 10.1371/journal.pone.0032923] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2011] [Accepted: 02/05/2012] [Indexed: 11/28/2022] Open
Abstract
Recent progress in the analysis of whole genome sequencing data has resulted in the emergence of paleogenomics, a field devoted to the reconstruction of ancestral genomes. Ancestral karyotype reconstructions have been used primarily to illustrate the dynamic nature of genome evolution. In this paper, we demonstrate how they can also be used to study individual gene families by examining the evolutionary history of relaxin hormones (RLN/INSL) and relaxin family peptide receptors (RXFP). Relaxin family hormones are members of the insulin superfamily, and are implicated in the regulation of a variety of primarily reproductive and neuroendocrine processes. Their receptors are G-protein coupled receptors (GPCR's) and include members of two distinct evolutionary groups, an unusual characteristic. Although several studies have tried to elucidate the origins of the relaxin peptide family, the evolutionary origin of their receptors and the mechanisms driving the diversification of the RLN/INSL-RXFP signaling systems in non-placental vertebrates has remained elusive. Here we show that the numerous vertebrate RLN/INSL and RXFP genes are products of an ancestral receptor-ligand system that originally consisted of three genes, two of which apparently trace their origins to invertebrates. Subsequently, diversification of the system was driven primarily by whole genome duplications (WGD, 2R and 3R) followed by almost complete retention of the ligand duplicates in most vertebrates but massive loss of receptor genes in tetrapods. Interestingly, the majority of 3R duplicates retained in teleosts are potentially involved in neuroendocrine regulation. Furthermore, we infer that the ancestral AncRxfp3/4 receptor may have been syntenically linked to the AncRln-like ligand in the pre-2R genome, and show that syntenic linkages among ligands and receptors have changed dynamically in different lineages. This study ultimately shows the broad utility, with some caveats, of incorporating paleogenomics data into understanding the evolution of gene families.
Collapse
Affiliation(s)
- Sergey Yegorov
- Department of Biology, University of Winnipeg, Winnipeg, Manitoba, Canada
| | - Sara Good
- Department of Biology, University of Winnipeg, Winnipeg, Manitoba, Canada
- * E-mail:
| |
Collapse
|
4
|
Al-Shahrour F, Minguez P, Marqués-Bonet T, Gazave E, Navarro A, Dopazo J. Selection upon genome architecture: conservation of functional neighborhoods with changing genes. PLoS Comput Biol 2010; 6:e1000953. [PMID: 20949098 PMCID: PMC2951340 DOI: 10.1371/journal.pcbi.1000953] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2009] [Accepted: 09/08/2010] [Indexed: 11/19/2022] Open
Abstract
An increasing number of evidences show that genes are not distributed randomly across eukaryotic chromosomes, but rather in functional neighborhoods. Nevertheless, the driving force that originated and maintains such neighborhoods is still a matter of controversy. We present the first detailed multispecies cartography of genome regions enriched in genes with related functions and study the evolutionary implications of such clustering. Our results indicate that the chromosomes of higher eukaryotic genomes contain up to 12% of genes arranged in functional neighborhoods, with a high level of gene co-expression, which are consistently distributed in phylogenies. Unexpectedly, neighborhoods with homologous functions are formed by different (non-orthologous) genes in different species. Actually, instead of being conserved, functional neighborhoods present a higher degree of synteny breaks than the genome average. This scenario is compatible with the existence of selective pressures optimizing the coordinated transcription of blocks of functionally related genes. If these neighborhoods were broken by chromosomal rearrangements, selection would favor further rearrangements reconstructing other neighborhoods of similar function. The picture arising from this study is a dynamic genomic landscape with a high level of functional organization. We describe here the most extensive functional cartography of the genomes of multiple species carried out to date. Our study shows, for the first time, how neighborhoods of functionally related genes arise and how they are maintained through evolution following a pattern that is fully consistent with the evolutionary trees of the analyzed species. Contrary to what would be expected, such neighborhoods are not composed of the same genes in different species but rather by genes unrelated, annotated, however, with the same function. Our analysis also reveals that such neighborhoods are dynamically rebuilt in a way that, while the particular genes often change, it is the function of the genes present in the neighborhood, as the ultimate target of selection, that is preserved.
Collapse
Affiliation(s)
- Fátima Al-Shahrour
- Department of Bioinformatics and Genomics, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
| | - Pablo Minguez
- Department of Bioinformatics and Genomics, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
| | - Tomás Marqués-Bonet
- Institut de Biologia Evolutiva, Universitat Pompeu Fabra (UPF) and Consejo Superior de Investigaciones Científicas (CSIC), Barcelona, Spain
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, University of Washington, Seattle, Washington, United States of America
| | - Elodie Gazave
- Institut de Biologia Evolutiva, Universitat Pompeu Fabra (UPF) and Consejo Superior de Investigaciones Científicas (CSIC), Barcelona, Spain
| | - Arcadi Navarro
- Institut de Biologia Evolutiva, Universitat Pompeu Fabra (UPF) and Consejo Superior de Investigaciones Científicas (CSIC), Barcelona, Spain
- Population Genomics Node (National Institute for Bioinformatics, INB), Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| | - Joaquín Dopazo
- Department of Bioinformatics and Genomics, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
- CIBER de Enfermedades Raras (CIBERER), Valencia, Spain
- Functional Genomics Node (National Institute for Bioinformatics, INB), CIPF, Valencia, Spain
- * E-mail:
| |
Collapse
|
5
|
Sundström G, Dreborg S, Larhammar D. Concomitant duplications of opioid peptide and receptor genes before the origin of jawed vertebrates. PLoS One 2010; 5:e10512. [PMID: 20463905 PMCID: PMC2865548 DOI: 10.1371/journal.pone.0010512] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2010] [Accepted: 04/13/2010] [Indexed: 12/22/2022] Open
Abstract
Background The opioid system is involved in reward and pain mechanisms and consists in mammals of four receptors and several peptides. The peptides are derived from four prepropeptide genes, PENK, PDYN, PNOC and POMC, encoding enkephalins, dynorphins, orphanin/nociceptin and beta-endorphin, respectively. Previously we have described how two rounds of genome doubling (2R) before the origin of jawed vertebrates formed the receptor family. Methodology/Principal Findings Opioid peptide gene family members were investigated using a combination of sequence-based phylogeny and chromosomal locations of the peptide genes in various vertebrates. Several adjacent gene families were investigated similarly. The results show that the ancestral peptide gene gave rise to two additional copies in the genome doublings. The fourth member was generated by a local gene duplication, as the genes encoding POMC and PNOC are located on the same chromosome in the chicken genome and all three teleost genomes that we have studied. A translocation has disrupted this synteny in mammals. The PDYN gene seems to have been lost in chicken, but not in zebra finch. Duplicates of some peptide genes have arisen in the teleost fishes. Within the prepropeptide precursors, peptides have been lost or gained in different lineages. Conclusions/Significance The ancestral peptide and receptor genes were located on the same chromosome and were thus duplicated concomitantly. However, subsequently genetic linkage has been lost. In conclusion, the system of opioid peptides and receptors was largely formed by the genome doublings that took place early in vertebrate evolution.
Collapse
Affiliation(s)
- Görel Sundström
- Department of Neuroscience, Uppsala University, Uppsala, Sweden
| | - Susanne Dreborg
- Department of Neuroscience, Uppsala University, Uppsala, Sweden
| | - Dan Larhammar
- Department of Neuroscience, Uppsala University, Uppsala, Sweden
- * E-mail:
| |
Collapse
|
6
|
|
7
|
Larsson TA, Olsson F, Sundstrom G, Lundin LG, Brenner S, Venkatesh B, Larhammar D. Early vertebrate chromosome duplications and the evolution of the neuropeptide Y receptor gene regions. BMC Evol Biol 2008; 8:184. [PMID: 18578868 PMCID: PMC2453138 DOI: 10.1186/1471-2148-8-184] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2008] [Accepted: 06/25/2008] [Indexed: 12/31/2022] Open
Abstract
Background One of the many gene families that expanded in early vertebrate evolution is the neuropeptide (NPY) receptor family of G-protein coupled receptors. Earlier work by our lab suggested that several of the NPY receptor genes found in extant vertebrates resulted from two genome duplications before the origin of jawed vertebrates (gnathostomes) and one additional genome duplication in the actinopterygian lineage, based on their location on chromosomes sharing several gene families. In this study we have investigated, in five vertebrate genomes, 45 gene families with members close to the NPY receptor genes in the compact genomes of the teleost fishes Tetraodon nigroviridis and Takifugu rubripes. These correspond to Homo sapiens chromosomes 4, 5, 8 and 10. Results Chromosome regions with conserved synteny were identified and confirmed by phylogenetic analyses in H. sapiens, M. musculus, D. rerio, T. rubripes and T. nigroviridis. 26 gene families, including the NPY receptor genes, (plus 3 described recently by other labs) showed a tree topology consistent with duplications in early vertebrate evolution and in the actinopterygian lineage, thereby supporting expansion through block duplications. Eight gene families had complications that precluded analysis (such as short sequence length or variable number of repeated domains) and another eight families did not support block duplications (because the paralogs in these families seem to have originated in another time window than the proposed genome duplication events). RT-PCR carried out with several tissues in T. rubripes revealed that all five NPY receptors were expressed in the brain and subtypes Y2, Y4 and Y8 were also expressed in peripheral organs. Conclusion We conclude that the phylogenetic analyses and chromosomal locations of these gene families support duplications of large blocks of genes or even entire chromosomes. Thus, these results are consistent with two early vertebrate tetraploidizations forming a paralogon comprising human chromosomes 4, 5, 8 and 10 and one teleost tetraploidization. The combination of positional and phylogenetic data further strengthens the identification of orthologs and paralogs in the NPY receptor family.
Collapse
Affiliation(s)
- Tomas A Larsson
- Department of Neuroscience, Uppsala University, Box 593, 75124 Uppsala, Sweden.
| | | | | | | | | | | | | |
Collapse
|
8
|
Makino T, McLysaght A. Interacting gene clusters and the evolution of the vertebrate immune system. Mol Biol Evol 2008; 25:1855-62. [PMID: 18573844 DOI: 10.1093/molbev/msn137] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Unraveling the "code" of genome structure is an important goal of genomics research. Colocalization of genes in eukaryotic genomes may facilitate preservation of favorable allele combinations between epistasic loci or coregulation of functionally related genes. However, the presence of interacting gene clusters in the human genome has remained unclear. We systematically searched the human genome for evidence of closely linked genes whose protein products interact. We find 83 pairs of interacting genes that are located within 1 Mbp in the human genome or 37 if we exclude hub proteins. This number of interacting gene clusters is significantly more than expected by chance and is not the result of tandem duplications. Furthermore, we find that these clusters are significantly more conserved across vertebrate (but not chordate) genomes than other pairs of genes located within 1 Mbp in the human genome. In many cases, the genes are both present but not clustered in older vertebrate lineages. These results suggest gene cluster creation along the human lineage. These clusters are not enriched for housekeeping genes, but we find a significant contribution from genes involved in "response to stimulus." Many of these genes are involved in the immune response, including, but not limited to, known clusters such as the major histocompatibility complex. That these clusters were formed contemporaneously with the origin of adaptive immunity within the vertebrate lineage suggests that novel evolutionary and regulatory constraints were associated with the operation of the immune system.
Collapse
Affiliation(s)
- Takashi Makino
- Smurfit Institute of Genetics, University of Dublin, Trinity College, Dublin, Ireland
| | | |
Collapse
|
9
|
Yerushalmi U, Teicher M. Examining emergence of functional gene clustering in a simulated evolution. Bull Math Biol 2007; 69:2261-80. [PMID: 17554587 DOI: 10.1007/s11538-007-9219-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2006] [Accepted: 03/29/2007] [Indexed: 11/30/2022]
Abstract
Recent research suggests that rather than being random, gene order may be coupled with gene functionality. These findings may be explained by mechanisms that require physical proximity such as co-expression and co-regulation. Alternatively, they may be due to evolutionary-dynamics forces, as expressed in genetic drift or linkage disequilibrium. This paper proposes a biologically plausible model for evolutionary development. Using the model, which includes natural selection and the development of gene networks and cellular organisms, the co-evolution of recombination rate and gene functionality is examined. The results presented here are compatible with previous biological findings showing that functionally related genes are clustered. These results imply that evolutionary pressure in a complex environment is sufficient for the emergence of gene order that is coupled with functionality. They shed further light on the mechanisms that may cause such gene clusters.
Collapse
Affiliation(s)
- Uri Yerushalmi
- The Leslie and Susan Gonda Interdisciplinary Brain Research Center, Bar-Ilan University, Ramat-Gan, Israel.
| | | |
Collapse
|
10
|
Hallböök F, Wilson K, Thorndyke M, Olinski RP. Formation and evolution of the chordate neurotrophin and Trk receptor genes. BRAIN, BEHAVIOR AND EVOLUTION 2006; 68:133-44. [PMID: 16912467 DOI: 10.1159/000094083] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Neurotrophins are structurally related neurotrophic polypeptide factors that regulate neuronal differentiation and are essential for neuronal survival, neurite growth and plasticity. It has until very recently been thought that the neurotrophin system appeared with the vertebrate species, but identification of a cephalochordate neurotrophin receptor (Trk), and more recently neurotrophin sequences in several genomes of deuterostome invertebrates, show that the system already existed at the stem of the deuterostome group. Comparative genomics supports the hypothesis that two whole genome duplications produced many of the vertebrate gene families, among those the neurotrophin and Trk families. It remains to be proven to what extent the whole genome duplications have driven macroevolutionary change, but it appears certain that the formation of the multi-gene copy neurotrophin and Trk receptor families at the stem of vertebrates has provided a foundation from which the various functions and pleiotropic effects produced by each of the four extant neurotrophins have evolved.
Collapse
Affiliation(s)
- Finn Hallböök
- Department of Neuroscience, Unit for Developmental Neuroscience, Uppsala University, Uppsala, Sweden.
| | | | | | | |
Collapse
|