1
|
Standage DS, Lai T, Brendel VP. iLoci: robust evaluation of genome content and organization for provisional and mature genome assemblies. NAR Genom Bioinform 2022; 4:lqac013. [PMID: 35211671 PMCID: PMC8862717 DOI: 10.1093/nargab/lqac013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 12/23/2021] [Accepted: 02/10/2022] [Indexed: 11/23/2022] Open
Abstract
We introduce a new framework for genome analyses based on parsing an annotated genome assembly into distinct interval loci (iLoci), available as open-source software as part of the AEGeAn Toolkit (https://github.com/BrendelGroup/AEGeAn). We demonstrate that iLoci provide an alternative coordinate system that is robust to changes in assembly and annotation versions and facilitates granular quality control of genome data. We discuss how statistics computed on iLoci reflect various characteristics of genome content and organization and illustrate how these statistics can be used to establish a baseline for assessment of the completeness and accuracy of the data. We also introduce a well-defined measure of relative genome compactness and compute other iLocus statistics that reveal genome-wide characteristics of gene arrangements in the whole genome context. Given the fast pace of assembly/annotation updates, our AEGeAn Toolkit fills a niche in computational genomics based on deriving persistent and species-specific genome statistics. Gene structure model-centric iLoci provide a precisely defined coordinate system that can be used to store assembly/annotation updates that reflect either stable or changed assessments. Large-scale application of the approach revealed species- and clade-specific genome organization in precisely defined computational terms, promising intriguing forays into the forces of shaping genome structure as more and more genome assemblies are being deposited.
Collapse
Affiliation(s)
- Daniel S Standage
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
| | - Tim Lai
- Department of Mathematics, Indiana University, Bloomington, IN 47405, USA
| | - Volker P Brendel
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
- Department of Computer Science, Indiana University, Bloomington, IN 47405, USA
| |
Collapse
|
2
|
Khan N, Pelletier D, McAlear TS, Croteau N, Veyron S, Bayne AN, Black C, Ichikawa M, Khalifa AAZ, Chaaban S, Kurinov I, Brouhard G, Bechstedt S, Bui KH, Trempe JF. Crystal structure of human PACRG in complex with MEIG1 reveals roles in axoneme formation and tubulin binding. Structure 2021; 29:572-586.e6. [PMID: 33529594 DOI: 10.1016/j.str.2021.01.001] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Revised: 12/16/2020] [Accepted: 01/08/2021] [Indexed: 02/07/2023]
Abstract
The Parkin co-regulated gene protein (PACRG) binds at the inner junction between doublet microtubules of the axoneme, a structure found in flagella and cilia. PACRG binds to the adaptor protein meiosis expressed gene 1 (MEIG1), but how they bind to microtubules is unknown. Here, we report the crystal structure of human PACRG in complex with MEIG1. PACRG adopts a helical repeat fold with a loop that interacts with MEIG1. Using the structure of the axonemal doublet microtubule from the protozoan Chlamydomonas reinhardtii and single-molecule fluorescence microscopy, we propose that PACRG binds to microtubules while simultaneously recruiting free tubulin to catalyze formation of the inner junction. We show that the homologous PACRG-like protein also mediates dual tubulin interactions but does not bind MEIG1. Our findings establish a framework to assess the function of the PACRG family of proteins and MEIG1 in regulating axoneme assembly.
Collapse
Affiliation(s)
- Nimra Khan
- Department of Pharmacology & Therapeutics, McGill University, Montreal, QC, Canada; Centre de Recherche en Biologie Structurale - FRQS, McGill University, Montreal, QC, Canada
| | - Dylan Pelletier
- Department of Pharmacology & Therapeutics, McGill University, Montreal, QC, Canada; Centre de Recherche en Biologie Structurale - FRQS, McGill University, Montreal, QC, Canada
| | - Thomas S McAlear
- Department of Anatomy & Cell Biology, McGill University, Montreal, QC, Canada; Centre de Recherche en Biologie Structurale - FRQS, McGill University, Montreal, QC, Canada
| | - Nathalie Croteau
- Department of Pharmacology & Therapeutics, McGill University, Montreal, QC, Canada; Centre de Recherche en Biologie Structurale - FRQS, McGill University, Montreal, QC, Canada
| | - Simon Veyron
- Department of Pharmacology & Therapeutics, McGill University, Montreal, QC, Canada; Centre de Recherche en Biologie Structurale - FRQS, McGill University, Montreal, QC, Canada
| | - Andrew N Bayne
- Department of Pharmacology & Therapeutics, McGill University, Montreal, QC, Canada; Centre de Recherche en Biologie Structurale - FRQS, McGill University, Montreal, QC, Canada
| | - Corbin Black
- Department of Anatomy & Cell Biology, McGill University, Montreal, QC, Canada; Centre de Recherche en Biologie Structurale - FRQS, McGill University, Montreal, QC, Canada
| | - Muneyoshi Ichikawa
- Department of Anatomy & Cell Biology, McGill University, Montreal, QC, Canada
| | - Ahmad Abdelzaher Zaki Khalifa
- Department of Anatomy & Cell Biology, McGill University, Montreal, QC, Canada; Centre de Recherche en Biologie Structurale - FRQS, McGill University, Montreal, QC, Canada
| | - Sami Chaaban
- Department of Biology, McGill University, Montreal, QC, Canada; Centre de Recherche en Biologie Structurale - FRQS, McGill University, Montreal, QC, Canada
| | - Igor Kurinov
- NECAT, Cornell University, Department of Chemistry and Chemical Biology, Argonne, IL, USA
| | - Gary Brouhard
- Department of Biology, McGill University, Montreal, QC, Canada; Centre de Recherche en Biologie Structurale - FRQS, McGill University, Montreal, QC, Canada
| | - Susanne Bechstedt
- Department of Anatomy & Cell Biology, McGill University, Montreal, QC, Canada; Centre de Recherche en Biologie Structurale - FRQS, McGill University, Montreal, QC, Canada
| | - Khanh Huy Bui
- Department of Anatomy & Cell Biology, McGill University, Montreal, QC, Canada; Centre de Recherche en Biologie Structurale - FRQS, McGill University, Montreal, QC, Canada
| | - Jean-François Trempe
- Department of Pharmacology & Therapeutics, McGill University, Montreal, QC, Canada; Centre de Recherche en Biologie Structurale - FRQS, McGill University, Montreal, QC, Canada.
| |
Collapse
|
3
|
Wang D. DLGP: A database for lineage-conserved and lineage-specific gene pairs in animal and plant genomes. Biochem Biophys Res Commun 2015; 469:542-5. [PMID: 26697753 DOI: 10.1016/j.bbrc.2015.12.039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Accepted: 12/10/2015] [Indexed: 10/22/2022]
Abstract
The conservation of gene organization in the genome with lineage-specificity is an invaluable resource to decipher their potential functionality with diverse selective constraints, especially in higher animals and plants. Gene pairs appear to be the minimal structure for such kind of gene clusters that tend to reside in their preferred locations, representing the distinctive genomic characteristics in single species or a given lineage. Despite gene families having been investigated in a widespread manner, the definition of gene pair families in various taxa still lacks adequate attention. To address this issue, we report DLGP (http://lcgbase.big.ac.cn/DLGP/) that stores the pre-calculated lineage-based gene pairs in currently available 134 animal and plant genomes and inspect them under the same analytical framework, bringing out a set of innovational features. First, the taxonomy or lineage has been classified into four levels such as Kingdom, Phylum, Class and Order. It adopts all-to-all comparison strategy to identify the possible conserved gene pairs in all species for each gene pair in certain species and reckon those that are conserved in over a significant proportion of species in a given lineage (e.g. Primates, Diptera or Poales) as the lineage-conserved gene pairs. Furthermore, it predicts the lineage-specific gene pairs by retaining the above-mentioned lineage-conserved gene pairs that are not conserved in any other lineages. Second, it carries out pairwise comparison for the gene pairs between two compared species and creates the table including all the conserved gene pairs and the image elucidating the conservation degree of gene pairs in chromosomal level. Third, it supplies gene order browser to extend gene pairs to gene clusters, allowing users to view the evolution dynamics in the gene context in an intuitive manner. This database will be able to facilitate the particular comparison between animals and plants, between vertebrates and arthropods, and between monocots and eudicots, accounting for the significant contribution of gene pairs to speciation and diversification in specific lineages.
Collapse
Affiliation(s)
- Dapeng Wang
- Stem Cell Laboratory, UCL Cancer Institute, University College London, London WC1E 6BT, UK; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, PR China.
| |
Collapse
|
4
|
Abstract
When considering the evolution of a gene’s expression profile, we commonly assume that this is unaffected by its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between neighboring genes in gene expression profiles in extant taxa. Indeed, in all eukaryotic genomes genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their expression or is gene expression evolution autonomous? To address this here we consider evolution of human gene expression since the human-chimp common ancestor, allowing for both variation in estimation of current expression level and error in Bayesian estimation of the ancestral state. We find that in all tissues and both sexes, the change in gene expression of a focal gene on average predicts the change in gene expression of neighbors. The effect is highly pronounced in the immediate vicinity (<100 kb) but extends much further. Sex-specific expression change is also genomically clustered. As genes increasing their expression in humans tend to avoid nuclear lamina domains and be enriched for the gene activator 5-hydroxymethylcytosine, we conclude that, most probably owing to chromatin level control of gene expression, a change in gene expression of one gene likely affects the expression evolution of neighbors, what we term expression piggybacking, an analog of hitchhiking.
Collapse
Affiliation(s)
- Avazeh T Ghanbarian
- Department of Biology and Biochemisty, University of Bath, Bath, United Kingdom
| | - Laurence D Hurst
- Department of Biology and Biochemisty, University of Bath, Bath, United Kingdom
| |
Collapse
|
5
|
Orekhova AS, Rubtsov PM. Bidirectional promoters in the transcription of mammalian genomes. BIOCHEMISTRY (MOSCOW) 2014; 78:335-41. [PMID: 23590436 DOI: 10.1134/s0006297913040020] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
In the genomes of humans and other mammals a large number of closely spaced pairs of genes that are transcribed in opposite directions were revealed. Their transcription is directed by so-called bidirectional promoters. This review is devoted to the characteristics of bidirectional promoters and features of their structure. The composition of "core" promoter elements in conventional unidirectional and bidirectional promoters is compared. Data on binding sites of transcription factors that are primarily specific for bidirectional promoters are discussed. The examples of promoters that share protein-coding genes transcribed by RNA polymerase II and the non-coding RNA genes transcribed by RNA polymerase III are described. Data obtained from global transcriptome analysis about the existence of short noncoding antisense RNA associated with the promoters in the context of the hypothesis of bidirectional transcription initiation as an inherent property of eukaryotic promoters are discussed.
Collapse
Affiliation(s)
- A S Orekhova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 119991, Russia.
| | | |
Collapse
|
6
|
Didych DA, Shamsutdinov MF, Smirnov NA, Akopov SB, Monastyrskaya GS, Uspenskaya NY, Nikolaev LG, Sverdlov ED. Human PSENEN and U2AF1L4 genes are concertedly regulated by a genuine bidirectional promoter. Gene 2012; 515:34-41. [PMID: 23246698 DOI: 10.1016/j.gene.2012.11.058] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2012] [Revised: 10/31/2012] [Accepted: 11/29/2012] [Indexed: 11/16/2022]
Abstract
Head-to-head genes with a short distance between their transcription start sites may constitute up to 10% of all genes in the genomes of various species. It was hypothesized that this intergenic space may represent bidirectional promoters which are able to initiate transcription of both genes, but the true bidirectionality was proved only for a few of them. We present experimental evidence that, according to several criteria, a 269 bp region located between the PSENEN and U2AF1L4 human genes is a genuine bidirectional promoter regulating a concerted divergent transcription of these genes. Concerted transcription of PSENEN and U2AF1L4 can be necessary for regulation of T-cell activity.
Collapse
Affiliation(s)
- D A Didych
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 117997, Moscow, Russia
| | | | | | | | | | | | | | | |
Collapse
|
7
|
Krom N, Ramakrishna W. Retrotransposon insertions in rice gene pairs associated with reduced conservation of gene pairs in grass genomes. Genomics 2012; 99:308-14. [PMID: 22414560 DOI: 10.1016/j.ygeno.2012.02.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2012] [Revised: 02/15/2012] [Accepted: 02/24/2012] [Indexed: 12/26/2022]
Abstract
Small-scale changes in gene order and orientation are common in plant genomes, even across relatively short evolutionary distances. We investigated the association of retrotransposons in and near rice gene pairs with gene pair conservation, inversion, rearrangement, and deletion in sorghum, maize, and Brachypodium. Copia and Gypsy LTR-retrotransposon insertions were found to be primarily associated with reduced frequency of gene pair conservation and an increase in both gene pair rearrangement and gene deletions. SINEs are associated with gene pair rearrangement, while LINEs are associated with gene deletions. Despite being more frequently associated with retrotransposons than convergent and tandem pairs, divergent gene pairs showed the least effects from that association. In contrast, convergent pairs were least frequently associated with retrotransposons yet showed the greatest effects. Insertions between genes were associated with the greatest effects on gene pair arrangement, while insertions flanking gene pairs had significant effects only on divergent pairs.
Collapse
Affiliation(s)
- Nicholas Krom
- Department of Biological Sciences, Michigan Technological University, Houghton, MI 49931, USA
| | | |
Collapse
|
8
|
Weber CC, Hurst LD. Support for multiple classes of local expression clusters in Drosophila melanogaster, but no evidence for gene order conservation. Genome Biol 2011; 12:R23. [PMID: 21414197 PMCID: PMC3129673 DOI: 10.1186/gb-2011-12-3-r23] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2011] [Revised: 03/04/2011] [Accepted: 03/17/2011] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Gene order in eukaryotic genomes is not random, with genes with similar expression profiles tending to cluster. In yeasts, the model taxon for gene order analysis, such syntenic clusters of non-homologous genes tend to be conserved over evolutionary time. Whether similar clusters show gene order conservation in other lineages is, however, undecided. Here, we examine this issue in Drosophila melanogaster using high-resolution chromosome rearrangement data. RESULTS We show that D. melanogaster has at least three classes of expression clusters: first, as observed in mammals, large clusters of functionally unrelated housekeeping genes; second, small clusters of functionally related highly co-expressed genes; and finally, as previously defined by Spellman and Rubin, larger domains of co-expressed but functionally unrelated genes. The latter are, however, not independent of the small co-expression clusters and likely reflect a methodological artifact. While the small co-expression and housekeeping/essential gene clusters resemble those observed in yeast, in contrast to yeast, we see no evidence that any of the three cluster types are preserved as synteny blocks. If anything, adjacent co-expressed genes are more likely to become rearranged than expected. Again in contrast to yeast, in D. melanogaster, gene pairs with short intergene distance or in divergent orientations tend to have higher rearrangement rates. These findings are consistent with co-expression being partly due to shared chromatin environment. CONCLUSIONS We conclude that, while similar in terms of cluster types, gene order evolution has strikingly different patterns in yeasts and in D. melanogaster, although recombination is associated with gene order rearrangement in both.
Collapse
Affiliation(s)
- Claudia C Weber
- Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK
| | | |
Collapse
|
9
|
Chen YQ, Yu H, Li YX, Li YY. Sorting out inherent features of head-to-head gene pairs by evolutionary conservation. BMC Bioinformatics 2010; 11 Suppl 11:S16. [PMID: 21172051 PMCID: PMC3024869 DOI: 10.1186/1471-2105-11-s11-s16] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background A ‘head-to-head’ (h2h) gene pair is defined as a genomic locus in which two adjacent genes are divergently transcribed from opposite strands of DNA. In our previous work, this gene organization was found to be ancient and conserved, which subjects functionally related genes to transcriptional co-regulation. However, some of the biological features of h2h pairs still need further clarification. Results In this work, we assorted human h2h pairs into four sequentially inclusive sets of gradually incremental conservation, and examined whether those previously asserted features were conserved or sharpened in the more conserved h2h pair sets in order to identify the inherent features of the h2h gene organization. The features of TSS distance, expression correlation within h2h pairs and among h2h genes, transcription factor association and functional similarities of h2h genes were examined. Our conservation-based analyses found that the bi-directional promoters of h2h gene pairs are most likely shorter than 100 bp; h2h gene pairs generally have only significant positive expression correlation but not negative correlation, and remarkably high positive expression correlations exist among h2h genes, as well as between h2h pairs observed in our previous study; h2h paired genes tend to share transcription factors. In addition, expression correlation of h2h pairs is positively related with the TF-sharing and functional coordination, while not related with TSS distance. Conclusions Our findings remove the uncertainties of h2h genes about TSS distance, expression correlation and functional coordination, which provide insights into the study on the molecular mechanisms and functional consequences of the transcriptional regulation based on this special gene organization.
Collapse
Affiliation(s)
- Yun-Qin Chen
- School of Life Science and Technology, Tongji University, Shanghai 200092, PR China.
| | | | | | | |
Collapse
|
10
|
|
11
|
Yu H, Yu FD, Zhang GQ, Shen X, Chen YQ, Li YY, Li YX. DBH2H: vertebrate head-to-head gene pairs annotated at genomic and post-genomic levels. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2009; 2009:bap006. [PMID: 20157479 PMCID: PMC2790303 DOI: 10.1093/database/bap006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/23/2008] [Revised: 03/25/2009] [Accepted: 04/24/2009] [Indexed: 11/30/2022]
Abstract
DBH2H collects head-to-head (h2h) gene pairs identified from human, mouse, rat, chicken and fugu genomes, and distinguishes the ortholog mapping relationship among them. The gene pairs in DBH2H are annotated with sequential features including single nucleotide polymorphisms, CpG islands and transcription factor binding sites, as well as functional terms and genetic disorders. In addition, the expression correlation information based on 117 microarray datasets is included. By providing user-friendly access to these data, DBH2H represents a valuable resource for further analyses of this important gene arrangement in terms of transcriptional regulation mechanisms, evolutionary conservation, disease relevance, etc. Database URL:http://lifecenter.sgst.cn/h2h/
Collapse
Affiliation(s)
- Hui Yu
- Bioinformatics Center, Key Laboratory of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, Shanghai Center for Bioinformation Technology, Shanghai 200235 and College of Life Science and Technology, Tongji University, Shanghai 200092, China
| | | | | | | | | | | | | |
Collapse
|
12
|
Liu X, Han B. Evolutionary conservation of neighbouring gene pairs in plants. Gene 2009; 437:71-9. [PMID: 19264115 DOI: 10.1016/j.gene.2009.02.012] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2008] [Revised: 02/12/2009] [Accepted: 02/16/2009] [Indexed: 12/12/2022]
Abstract
Evolutionary conservation of neighbouring gene pairs has been widely explored in many species, but remains poorly understood in plants. The availability of several plant genome sequences allows for an in-depth investigation of this problem in plants. Here, we analyzed the phylogenetic conservation of physically linked gene pairs in nine plant genomes and compared the conservation in different orientations. We also examined several potential determinants to detect whether they affect the conservation of neighbouring gene pairs. Our results suggested that among the three types of neighbouring gene pairs, closely linked parallel pairs might be the least conserved. Intergenic distance was shown to be a major determinant of linkage conservation, suggesting that the conservation of gene order in plants was determined primarily by chance. The enrichment of housekeeping genes was identified to contribute to the conservation of all three types and the enrichment of genes involved in protein metabolism might contribute to the conservation of parallel pairs. Moreover, a co-expressed signal was detected in conserved divergent pairs, which might be determined by intergenic distance.
Collapse
Affiliation(s)
- Xiling Liu
- National Center for Gene Research/Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 500 Caobao Road, Shanghai 200233, China
| | | |
Collapse
|