1
|
Zheng X, Duan Y, Zheng H, Tang H, Zheng L, Yu X. Genome-Wide Identification and Characterization of the RWP-RK Proteins in Zanthoxylum armatum. Genes (Basel) 2024; 15:665. [PMID: 38927601 PMCID: PMC11202622 DOI: 10.3390/genes15060665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 05/13/2024] [Accepted: 05/21/2024] [Indexed: 06/28/2024] Open
Abstract
Apomixis is a common reproductive characteristic of Zanthoxylum plants, and RWP-RKs are plant-specific transcription factors known to regulate embryonic development. However, the genome-wide analysis and function prediction of RWP-RK family genes in Z. armatum are unclear. In this study, 36 ZaRWP-RK transcription factors were identified in the genome of Z. armatum, among which 15 genes belonged to the RKD subfamily and 21 belonged to the NLP subfamily. Duplication events of ZaRWP-RK genes were mainly segmental duplication, and synteny analysis revealed a close phylogenetic relationship between Z. armatum and Arabidopsis. The analysis of cis-elements indicated that ZaRWP-RK genes may be involved in the regulation of the embryonic development of Z. armatum by responding to plant hormones such as abscisic acid, auxin, and gibberellin. Results of a real-time PCR showed that the expression levels of most ZaRWP-RK genes were significantly increased from flowers to young fruits. Protein-protein interaction network analysis further revealed the potential roles of the ZaRWP-RK proteins in apomixis. Collectively, this study is expected to improve our understanding of ZaRWP-RK transcription factors and provide a theoretical basis for future investigations into the ZaRWP-RK genes and their regulatory mechanisms in the apomixis process of Z. armatum.
Collapse
Affiliation(s)
| | | | | | | | | | - Xiaobo Yu
- Southwest Research Center for Cross Breeding of Special Economic Plants, School of Life Science, Leshan Normal University, Leshan 614000, China; (X.Z.); (Y.D.); (H.Z.); (H.T.); (L.Z.)
| |
Collapse
|
2
|
Xie L, Wu S, Guo X. An important resource and analytic platform for human and mouse cardiovascular-related cis-regulatory elements. MOLECULAR THERAPY. NUCLEIC ACIDS 2023; 34:102033. [PMID: 37808923 PMCID: PMC10556829 DOI: 10.1016/j.omtn.2023.102033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/10/2023]
Affiliation(s)
- Longxiang Xie
- Institute of Biomedical Informatics, Bioinformatics Center, Henan Provincial Engineering Center for Tumor Molecular Medicine, Academy for Advanced Interdisciplinary Studies, School of Basic Medical Sciences, Henan University, Kaifeng, Henan 475004, China
| | - Shengnan Wu
- Institute of Biomedical Informatics, Bioinformatics Center, Henan Provincial Engineering Center for Tumor Molecular Medicine, Academy for Advanced Interdisciplinary Studies, School of Basic Medical Sciences, Henan University, Kaifeng, Henan 475004, China
| | - Xiangqian Guo
- Institute of Biomedical Informatics, Bioinformatics Center, Henan Provincial Engineering Center for Tumor Molecular Medicine, Academy for Advanced Interdisciplinary Studies, School of Basic Medical Sciences, Henan University, Kaifeng, Henan 475004, China
| |
Collapse
|
3
|
Zhang X, Fang B, Huang YF. Transcription factor binding sites are frequently under accelerated evolution in primates. Nat Commun 2023; 14:783. [PMID: 36774380 PMCID: PMC9922303 DOI: 10.1038/s41467-023-36421-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Accepted: 01/31/2023] [Indexed: 02/13/2023] Open
Abstract
Recent comparative genomic studies have identified many human accelerated elements (HARs) with elevated substitution rates in the human lineage. However, it remains unknown to what extent transcription factor binding sites (TFBSs) are under accelerated evolution in humans and other primates. Here, we introduce two pooling-based phylogenetic methods with dramatically enhanced sensitivity to examine accelerated evolution in TFBSs. Using these new methods, we show that more than 6000 TFBSs annotated in the human genome have experienced accelerated evolution in Hominini, apes, and Old World monkeys. Although these TFBSs individually show relatively weak signals of accelerated evolution, they collectively are more abundant than HARs. Also, we show that accelerated evolution in Pol III binding sites may be driven by lineage-specific positive selection, whereas accelerated evolution in other TFBSs might be driven by nonadaptive evolutionary forces. Finally, the accelerated TFBSs are enriched around developmental genes, suggesting that accelerated evolution in TFBSs may drive the divergence of developmental processes between primates.
Collapse
Affiliation(s)
- Xinru Zhang
- Department of Biology, Pennsylvania State University, University Park, PA, 16802, USA. .,Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, 16802, USA. .,Bioinformatics and Genomics Graduate Program, Pennsylvania State University, University Park, PA, 16802, USA.
| | - Bohao Fang
- Department of Organismic and Evolutionary Biology and the Museum of Comparative Zoology, Harvard University, Boston, MA, 02135, USA
| | - Yi-Fei Huang
- Department of Biology, Pennsylvania State University, University Park, PA, 16802, USA. .,Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, 16802, USA.
| |
Collapse
|
4
|
Ni P, Wilson D, Su Z. A map of cis-regulatory modules and constituent transcription factor binding sites in 80% of the mouse genome. BMC Genomics 2022; 23:714. [PMID: 36261804 PMCID: PMC9583556 DOI: 10.1186/s12864-022-08933-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Accepted: 10/11/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Mouse is probably the most important model organism to study mammal biology and human diseases. A better understanding of the mouse genome will help understand the human genome, biology and diseases. However, despite the recent progress, the characterization of the regulatory sequences in the mouse genome is still far from complete, limiting its use to understand the regulatory sequences in the human genome. RESULTS Here, by integrating binding peaks in ~ 9,000 transcription factor (TF) ChIP-seq datasets that cover 79.9% of the mouse mappable genome using an efficient pipeline, we were able to partition these binding peak-covered genome regions into a cis-regulatory module (CRM) candidate (CRMC) set and a non-CRMC set. The CRMCs contain 912,197 putative CRMs and 38,554,729 TF binding sites (TFBSs) islands, covering 55.5% and 24.4% of the mappable genome, respectively. The CRMCs tend to be under strong evolutionary constraints, indicating that they are likely cis-regulatory; while the non-CRMCs are largely selectively neutral, indicating that they are unlikely cis-regulatory. Based on evolutionary profiles of the genome positions, we further estimated that 63.8% and 27.4% of the mouse genome might code for CRMs and TFBSs, respectively. CONCLUSIONS Validation using experimental data suggests that at least most of the CRMCs are authentic. Thus, this unprecedentedly comprehensive map of CRMs and TFBSs can be a good resource to guide experimental studies of regulatory genomes in mice and humans.
Collapse
Affiliation(s)
- Pengyu Ni
- Department of Bioinformatics and Genomics, the University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - David Wilson
- Department of Bioinformatics and Genomics, the University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - Zhengchang Su
- Department of Bioinformatics and Genomics, the University of North Carolina at Charlotte, Charlotte, NC, 28223, USA.
| |
Collapse
|
5
|
Dubois‐Mignon T, Monget P. Gene essentiality and variability: What is the link? A within‐ and between‐species perspective. Bioessays 2022; 44:e2200132. [DOI: 10.1002/bies.202200132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Revised: 08/17/2022] [Accepted: 08/30/2022] [Indexed: 11/07/2022]
Affiliation(s)
- Tania Dubois‐Mignon
- Institut de Biologie de l’École Normale Supérieure Université PSL 46 rue d'Ulm Paris 75005 France
| | - Philippe Monget
- Physiologie de la Reproduction et des Comportements, Centre Val de Loire – UMR INRAE, CNRS, IFCE Université de Tours Nouzilly France
| |
Collapse
|
6
|
Controlling gene expression with deep generative design of regulatory DNA. Nat Commun 2022; 13:5099. [PMID: 36042233 PMCID: PMC9427793 DOI: 10.1038/s41467-022-32818-8] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Accepted: 08/18/2022] [Indexed: 11/25/2022] Open
Abstract
Design of de novo synthetic regulatory DNA is a promising avenue to control gene expression in biotechnology and medicine. Using mutagenesis typically requires screening sizable random DNA libraries, which limits the designs to span merely a short section of the promoter and restricts their control of gene expression. Here, we prototype a deep learning strategy based on generative adversarial networks (GAN) by learning directly from genomic and transcriptomic data. Our ExpressionGAN can traverse the entire regulatory sequence-expression landscape in a gene-specific manner, generating regulatory DNA with prespecified target mRNA levels spanning the whole gene regulatory structure including coding and adjacent non-coding regions. Despite high sequence divergence from natural DNA, in vivo measurements show that 57% of the highly-expressed synthetic sequences surpass the expression levels of highly-expressed natural controls. This demonstrates the applicability and relevance of deep generative design to expand our knowledge and control of gene expression regulation in any desired organism, condition or tissue. Design of de novo synthetic regulatory DNA is a promising avenue to control gene expression in biotechnology and medicine. Here the authors present EspressionGAN, a generative adversarial network that uses genomic and transcriptomic data to generate regulatory sequences.
Collapse
|
7
|
Ni P, Su Z. PCRMS: a database of predicted cis-regulatory modules and constituent transcription factor binding sites in genomes. Database (Oxford) 2022; 2022:6572594. [PMID: 35452518 PMCID: PMC9216522 DOI: 10.1093/database/baac024] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2021] [Revised: 02/20/2022] [Accepted: 04/12/2022] [Indexed: 01/13/2023]
Abstract
More accurate and more complete predictions of cis-regulatory modules (CRMs) and constituent transcription factor (TF) binding sites (TFBSs) in genomes can facilitate characterizing functions of regulatory sequences. Here, we developed a database predicted cis-regulatory modules (PCRMS) (https://cci-bioinfo.uncc.edu) that stores highly accurate and unprecedentedly complete maps of predicted CRMs and TFBSs in the human and mouse genomes. The web interface allows the user to browse CRMs and TFBSs in an organism, find the closest CRMs to a gene, search CRMs around a gene and find all TFBSs of a TF. PCRMS can be a useful resource for the research community to characterize regulatory genomes. Database URL: https://cci-bioinfo.uncc.edu/.
Collapse
Affiliation(s)
- Pengyu Ni
- Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, 9201 University City Boulevard, Charlotte, NC 28223, USA
| | - Zhengchang Su
- Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, 9201 University City Boulevard, Charlotte, NC 28223, USA
| |
Collapse
|
8
|
Li J, Zhang Y, Xu L, Wang C, Luo Y, Feng S, Yuan Y, Yang Q, Feng B. Genome-Wide Identification of DNA Binding with One Finger ( Dof) Gene Family in Tartary Buckwheat ( Fagopyrum tataricum) and Analysis of Its Expression Pattern after Exogenous Hormone Stimulation. BIOLOGY 2022; 11:biology11020173. [PMID: 35205040 PMCID: PMC8869700 DOI: 10.3390/biology11020173] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/28/2021] [Revised: 01/14/2022] [Accepted: 01/18/2022] [Indexed: 01/11/2023]
Abstract
Simple Summary A number of studies have demonstrated that DNA binding with one finger (Dof) proteins are involved in multiple biological processes. In the present study, Dof genes or proteins in Tartary buckwheat (FtDofs) were systematically analysed, including their physical properties, phylogenetic relationships, structure, motif composition, cis-acting elements present in promoter regions, chromosomal distribution, gene duplication events, syntenic relationships, expression patterns in different tissues and different fruit developmental stages and responses to exogenous hormone stimulation. The results indicated that the expansion of FtDofs was mainly due to segmental duplication. The tissue-specific expression patterns of FtDofs and their positive responses to exogenous hormone stimulation suggest that they play important roles in the growth and development of Tartary buckwheat as well as in the adaptation to environmental changes. Collectively, this study lays a foundation for further exploration of the function of FtDof genes in Tartary buckwheat. Abstract DNA binding with one finger (Dof) proteins have been proven to be involved in multiple biological processes. However, genome-wide identification of the Dof gene family has not been reported for Tartary buckwheat (Fagopyrum tataricum). In this study, 35 FtDof proteins were identified, and they could be divided into nine phylogenetic subgroups. Proteins within the same subgroup had similar gene structure and motif composition. Moreover, abundant cis-acting elements were present in the promoter regions of FtDof genes. Segmental duplication was the primary driving force for the evolution of the FtDof gene family. Synteny analysis indicated that Tartary buckwheat was closer to dicotyledons, and more orthologous Dof genes existed among them. The expression pattern of FtDofs in different tissues and at different fruit developmental stages varied. Different tissues contained several genes that were specifically expressed. FtDof expression was mainly upregulated under methyl jasmonate treatment and downregulated under other hormone treatments. Taken together, FtDofs may play important roles in the growth and development of Tartary buckwheat and in response to abiotic and biotic stresses. Therefore, the genome-wide identification and expression pattern analysis of the Tartary buckwheat Dof gene family lays a foundation for further exploration of the functional characteristics of FtDofs in the future.
Collapse
Affiliation(s)
- Jing Li
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Xianyang 712000, China; (J.L.); (Y.Z.); (L.X.); (C.W.); (Y.L.); (Y.Y.); (Q.Y.)
| | - Yuchuan Zhang
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Xianyang 712000, China; (J.L.); (Y.Z.); (L.X.); (C.W.); (Y.L.); (Y.Y.); (Q.Y.)
| | - Lei Xu
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Xianyang 712000, China; (J.L.); (Y.Z.); (L.X.); (C.W.); (Y.L.); (Y.Y.); (Q.Y.)
| | - Chenyang Wang
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Xianyang 712000, China; (J.L.); (Y.Z.); (L.X.); (C.W.); (Y.L.); (Y.Y.); (Q.Y.)
| | - Yan Luo
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Xianyang 712000, China; (J.L.); (Y.Z.); (L.X.); (C.W.); (Y.L.); (Y.Y.); (Q.Y.)
| | - Shan Feng
- School of Mathematics and Statistics, Northwestern Polytechnical University, Xi’an 710129, China;
| | - Yuhao Yuan
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Xianyang 712000, China; (J.L.); (Y.Z.); (L.X.); (C.W.); (Y.L.); (Y.Y.); (Q.Y.)
| | - Qinghua Yang
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Xianyang 712000, China; (J.L.); (Y.Z.); (L.X.); (C.W.); (Y.L.); (Y.Y.); (Q.Y.)
| | - Baili Feng
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Xianyang 712000, China; (J.L.); (Y.Z.); (L.X.); (C.W.); (Y.L.); (Y.Y.); (Q.Y.)
- Correspondence:
| |
Collapse
|
9
|
Ni P, Su Z. Accurate prediction of cis-regulatory modules reveals a prevalent regulatory genome of humans. NAR Genom Bioinform 2021; 3:lqab052. [PMID: 34159315 PMCID: PMC8210889 DOI: 10.1093/nargab/lqab052] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Revised: 05/01/2021] [Accepted: 06/14/2021] [Indexed: 02/07/2023] Open
Abstract
cis-regulatory modules(CRMs) formed by clusters of transcription factor (TF) binding sites (TFBSs) are as important as coding sequences in specifying phenotypes of humans. It is essential to categorize all CRMs and constituent TFBSs in the genome. In contrast to most existing methods that predict CRMs in specific cell types using epigenetic marks, we predict a largely cell type agonistic but more comprehensive map of CRMs and constituent TFBSs in the gnome by integrating all available TF ChIP-seq datasets. Our method is able to partition 77.47% of genome regions covered by available 6092 datasets into a CRM candidate (CRMC) set (56.84%) and a non-CRMC set (43.16%). Intriguingly, the predicted CRMCs are under strong evolutionary constraints, while the non-CRMCs are largely selectively neutral, strongly suggesting that the CRMCs are likely cis-regulatory, while the non-CRMCs are not. Our predicted CRMs are under stronger evolutionary constraints than three state-of-the-art predictions (GeneHancer, EnhancerAtlas and ENCODE phase 3) and substantially outperform them for recalling VISTA enhancers and non-coding ClinVar variants. We estimated that the human genome might encode about 1.47M CRMs and 68M TFBSs, comprising about 55% and 22% of the genome, respectively; for both of which, we predicted 80%. Therefore, the cis-regulatory genome appears to be more prevalent than originally thought.
Collapse
Affiliation(s)
- Pengyu Ni
- Department of Bioinformatics and Genomics, the University of North Carolina at Charlotte, 9201 University City Boulevard, Charlotte, NC 28223, USA
| | - Zhengchang Su
- Department of Bioinformatics and Genomics, the University of North Carolina at Charlotte, 9201 University City Boulevard, Charlotte, NC 28223, USA
| |
Collapse
|
10
|
Jovanovic VM, Sarfert M, Reyna-Blanco CS, Indrischek H, Valdivia DI, Shelest E, Nowick K. Positive Selection in Gene Regulatory Factors Suggests Adaptive Pleiotropic Changes During Human Evolution. Front Genet 2021; 12:662239. [PMID: 34079582 PMCID: PMC8166252 DOI: 10.3389/fgene.2021.662239] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2021] [Accepted: 04/19/2021] [Indexed: 01/09/2023] Open
Abstract
Gene regulatory factors (GRFs), such as transcription factors, co-factors and histone-modifying enzymes, play many important roles in modifying gene expression in biological processes. They have also been proposed to underlie speciation and adaptation. To investigate potential contributions of GRFs to primate evolution, we analyzed GRF genes in 27 publicly available primate genomes. Genes coding for zinc finger (ZNF) proteins, especially ZNFs with a Krüppel-associated box (KRAB) domain were the most abundant TFs in all genomes. Gene numbers per TF family differed between all species. To detect signs of positive selection in GRF genes we investigated more than 3,000 human GRFs with their more than 70,000 orthologs in 26 non-human primates. We implemented two independent tests for positive selection, the branch-site-model of the PAML suite and aBSREL of the HyPhy suite, focusing on the human and great ape branch. Our workflow included rigorous procedures to reduce the number of false positives: excluding distantly similar orthologs, manual corrections of alignments, and considering only genes and sites detected by both tests for positive selection. Furthermore, we verified the candidate sites for selection by investigating their variation within human and non-human great ape population data. In order to approximately assign a date to positively selected sites in the human lineage, we analyzed archaic human genomes. Our work revealed with high confidence five GRFs that have been positively selected on the human lineage and one GRF that has been positively selected on the great ape lineage. These GRFs are scattered on different chromosomes and have been previously linked to diverse functions. For some of them a role in speciation and/or adaptation can be proposed based on the expression pattern or association with human diseases, but it seems that they all contributed independently to human evolution. Four of the positively selected GRFs are KRAB-ZNF proteins, that induce changes in target genes co-expression and/or through arms race with transposable elements. Since each positively selected GRF contains several sites with evidence for positive selection, we suggest that these GRFs participated pleiotropically to phenotypic adaptations in humans.
Collapse
Affiliation(s)
- Vladimir M Jovanovic
- Human Biology and Primate Evolution, Freie Universität Berlin, Berlin, Germany.,Bioinformatics Solution Center, Freie Universität Berlin, Berlin, Germany
| | - Melanie Sarfert
- Human Biology and Primate Evolution, Freie Universität Berlin, Berlin, Germany
| | - Carlos S Reyna-Blanco
- Department of Biology, University of Fribourg, Fribourg, Switzerland.,Swiss Institute of Bioinformatics, Fribourg, Switzerland
| | - Henrike Indrischek
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.,Max Planck Institute for the Physics of Complex Systems, Dresden, Germany.,Center for Systems Biology Dresden, Dresden, Germany
| | - Dulce I Valdivia
- Evolutionary Genomics Laboratory and Genome Topology and Regulation Laboratory, Genetic Engineering Department, Center for Research and Advanced Studies of the National Polytechnic Institute (CINVESTAV-Irapuato), Irapuato, Mexico
| | - Ekaterina Shelest
- Centre for Enzyme Innovation, University of Portsmouth, Portsmouth, United Kingdom
| | - Katja Nowick
- Human Biology and Primate Evolution, Freie Universität Berlin, Berlin, Germany
| |
Collapse
|
11
|
Comprehensive Analysis and Expression Profiling of PIN, AUX/LAX, and ABCB Auxin Transporter Gene Families in Solanum tuberosum under Phytohormone Stimuli and Abiotic Stresses. BIOLOGY 2021; 10:biology10020127. [PMID: 33562678 PMCID: PMC7915614 DOI: 10.3390/biology10020127] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Revised: 02/01/2021] [Accepted: 02/02/2021] [Indexed: 11/16/2022]
Abstract
Simple Summary In this study, we provide comprehensive information on auxin transporter gene families in potato, including basic parameters, chromosomal distribution, phylogeny, co-expression network analysis, gene structure, tissue-specific expression patterns, subcellular localization, transcription analysis under exogenous hormone stimuli and abiotic stresses, and cis-regulatory element prediction. The responsiveness of auxin transporter family genes to auxin and polar auxin transport inhibitors implied their possible roles in auxin homoeostasis and redistribution. Additionally, the differential expression levels of auxin transporter family genes in response to abscisic acid and abiotic stresses suggested their specific adaptive mechanisms on tolerance to various environmental stimuli. Promoter cis-regulatory element description analyses indicated that a number of cis-regulatory elements within the promoters of auxin transporter genes in potato were targeted by relevant transcription factors to respond to diverse stresses. We are confident that our results provide a foundation for a better understanding of auxin transporters in potato, as we have demonstrated the biological significance of this family of genes in hormone signaling and adaption to environmental stresses. Abstract Auxin is the only plant hormone that exhibits transport polarity mediated by three families: auxin resistant (AUX) 1/like AUX1 (LAX) influx carriers, pin-formed (PIN) efflux carriers, and ATP-binding cassette B (ABCB) influx/efflux carriers. Extensive studies about the biological functions of auxin transporter genes have been reported in model plants. Information regarding these genes in potato remains scarce. Here, we conducted a comprehensive analysis of auxin transporter gene families in potato to examine genomic distributions, phylogeny, co-expression analysis, gene structure and subcellular localization, and expression profiling using bioinformatics tools and qRT-PCR analysis. From these analyses, 5 StLAXs, 10 StPINs, and 22 StABCBs were identified in the potato genome and distributed in 10 of 18 gene modules correlating to the development of various tissues. Transient expression experiments indicated that three representative auxin transporters showed plasma membrane localizations. The responsiveness to auxin and auxin transport inhibitors implied their possible roles in mediating intercellular auxin homoeostasis and redistribution. The differential expression under abscisic acid and abiotic stresses indicated their specific adaptive mechanisms regulating tolerance to environmental stimuli. A large number of auxin-responsive and stress-related cis-elements within their promoters could account for their responsiveness to diverse stresses. Our study aimed to understand the biological significance of potato auxin transporters in hormone signaling and tolerance to environmental stresses.
Collapse
|
12
|
Canver MC, Tripathi P, Bullen MJ, Olshansky M, Kumar Y, Wong LH, Turner SJ, Lessard S, Pinello L, Orkin SH, Das PP. A saturating mutagenesis CRISPR-Cas9-mediated functional genomic screen identifies cis- and trans-regulatory elements of Oct4 in murine ESCs. J Biol Chem 2020; 295:15797-15809. [PMID: 32994224 DOI: 10.1074/jbc.ra120.013772] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Revised: 09/15/2020] [Indexed: 11/06/2022] Open
Abstract
Regulatory elements (REs) consist of enhancers and promoters that occupy a significant portion of the noncoding genome and control gene expression programs either in cis or in trans Putative REs have been identified largely based on their regulatory features (co-occupancy of ESC-specific transcription factors, enhancer histone marks, and DNase hypersensitivity) in mouse embryonic stem cells (mESCs). However, less has been established regarding their regulatory functions in their native context. We deployed cis- and trans-regulatory elements scanning through saturating mutagenesis and sequencing (ctSCAN-SMS) to target elements within the ∼12-kb cis-region (cis-REs; CREs) of the Oct4 gene locus, as well as genome-wide 2,613 high-confidence trans-REs (TREs), in mESCs. ctSCAN-SMS identified 10 CREs and 12 TREs as novel candidate REs of the Oct4 gene in mESCs. Furthermore, deletions of these candidate REs confirmed that the majority of the REs are functionally active, and CREs are more active than TREs in controlling Oct4 gene expression. A subset of active CREs and TREs physically interact with the Oct4 promoter to varying degrees; specifically, a greater number of active CREs, compared with active TREs, physically interact with the Oct4 promoter. Moreover, comparative genomics analysis reveals that a greater number of active CREs than active TREs are evolutionarily conserved between mice and primates, including humans. Taken together, our study demonstrates the reliability and robustness of ctSCAN-SMS screening to identify critical REs and investigate their roles in the regulation of transcriptional output of a target gene (in this case Oct4) in their native context.
Collapse
Affiliation(s)
- Matthew C Canver
- Division of Hematology/Oncology, Boston Children's Hospital and Department of Pediatric Oncology, Dana-Farber Cancer Institute (DFCI), Harvard Stem Cell Institute, Harvard Medical School, Boston, Massachusetts, USA
| | - Pratibha Tripathi
- Department of Anatomy and Developmental Biology, Monash University, Clayton, Victoria, Australia; Development and Stem Cells Program, Monash Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia
| | - Michael J Bullen
- Department of Anatomy and Developmental Biology, Monash University, Clayton, Victoria, Australia; Development and Stem Cells Program, Monash Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia
| | - Moshe Olshansky
- Department of Microbiology, Monash Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia; Computational Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
| | - Yogesh Kumar
- Department of Anatomy and Developmental Biology, Monash University, Clayton, Victoria, Australia; Development and Stem Cells Program, Monash Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia
| | - Lee H Wong
- Department of Biochemistry and Molecular Biology, Cancer Program, Monash Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia
| | - Stephen J Turner
- Department of Microbiology, Monash Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia; Department of Microbiology and Immunology, University of Melbourne, Parkville, Victoria, Australia
| | - Samuel Lessard
- Research Center, Montreal Heart Institute, Montréal, Quebec, Canada; Department of Medicine, Faculty of Medicine, Université de Montréal, Montréal, Quebec, Canada
| | - Luca Pinello
- Molecular Pathology and Cancer Center, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, USA
| | - Stuart H Orkin
- Division of Hematology/Oncology, Boston Children's Hospital and Department of Pediatric Oncology, Dana-Farber Cancer Institute (DFCI), Harvard Stem Cell Institute, Harvard Medical School, Boston, Massachusetts, USA; Howard Hughes Medical Institute, Boston, Massachusetts, USA.
| | - Partha Pratim Das
- Department of Anatomy and Developmental Biology, Monash University, Clayton, Victoria, Australia; Development and Stem Cells Program, Monash Biomedicine Discovery Institute, Monash University, Clayton, Victoria, Australia.
| |
Collapse
|
13
|
Dukler N, Huang YF, Siepel A. Phylogenetic Modeling of Regulatory Element Turnover Based on Epigenomic Data. Mol Biol Evol 2020; 37:2137-2152. [PMID: 32176292 PMCID: PMC7306682 DOI: 10.1093/molbev/msaa073] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Evolutionary changes in gene expression are often driven by gains and losses of cis-regulatory elements (CREs). The dynamics of CRE evolution can be examined using multispecies epigenomic data, but so far such analyses have generally been descriptive and model-free. Here, we introduce a probabilistic modeling framework for the evolution of CREs that operates directly on raw chromatin immunoprecipitation and sequencing (ChIP-seq) data and fully considers the phylogenetic relationships among species. Our framework includes a phylogenetic hidden Markov model, called epiPhyloHMM, for identifying the locations of multiply aligned CREs, and a combined phylogenetic and generalized linear model, called phyloGLM, for accounting for the influence of a rich set of genomic features in describing their evolutionary dynamics. We apply these methods to previously published ChIP-seq data for the H3K4me3 and H3K27ac histone modifications in liver tissue from nine mammals. We find that enhancers are gained and lost during mammalian evolution at about twice the rate of promoters, and that turnover rates are negatively correlated with DNA sequence conservation, expression level, and tissue breadth, and positively correlated with distance from the transcription start site, consistent with previous findings. In addition, we find that the predicted dosage sensitivity of target genes positively correlates with DNA sequence constraint in CREs but not with turnover rates, perhaps owing to differences in the effect sizes of the relevant mutations. Altogether, our probabilistic modeling framework enables a variety of powerful new analyses.
Collapse
Affiliation(s)
- Noah Dukler
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
- Physiology, Biophysics, and Systems Biology, Weill Cornell Medical College, New York, NY
| | - Yi-Fei Huang
- Department of Biology and Huck Institute of Life Sciences, Pennsylvania State University, University Park, PA
| | - Adam Siepel
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
| |
Collapse
|
14
|
Price N, Lopez L, Platts AE, Lasky JR. In the presence of population structure: From genomics to candidate genes underlying local adaptation. Ecol Evol 2020; 10:1889-1904. [PMID: 32128123 DOI: 10.1101/642306] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2019] [Revised: 12/19/2019] [Accepted: 12/23/2019] [Indexed: 05/26/2023] Open
Abstract
Understanding the genomic signatures, genes, and traits underlying local adaptation of organisms to heterogeneous environments is of central importance to the field evolutionary biology. To identify loci underlying local adaptation, models that combine allelic and environmental variation while controlling for the effects of population structure have emerged as the method of choice. Despite being evaluated in simulation studies, there has not been a thorough investigation of empirical evidence supporting local adaptation across these alleles. To evaluate these methods, we use 875 Arabidopsis thaliana Eurasian accessions and two mixed models (GEMMA and LFMM) to identify candidate SNPs underlying local adaptation to climate. Subsequently, to assess evidence of local adaptation and function among significant SNPs, we examine allele frequency differentiation and recent selection across Eurasian populations, in addition to their distribution along quantitative trait loci (QTL) explaining fitness variation between Italy and Sweden populations and cis-regulatory/nonsynonymous sites showing significant selective constraint. Our results indicate that significant LFMM/GEMMA SNPs show low allele frequency differentiation and linkage disequilibrium across locally adapted Italy and Sweden populations, in addition to a poor association with fitness QTL peaks (highest logarithm of odds score). Furthermore, when examining derived allele frequencies across the Eurasian range, we find that these SNPs are enriched in low-frequency variants that show very large climatic differentiation but low levels of linkage disequilibrium. These results suggest that their enrichment along putative functional sites most likely represents deleterious variation that is independent of local adaptation. Among all the genomic signatures examined, only SNPs showing high absolute allele frequency differentiation (AFD) and linkage disequilibrium (LD) between Italy and Sweden populations showed a strong association with fitness QTL peaks and were enriched along selectively constrained cis-regulatory/nonsynonymous sites. Using these SNPs, we find strong evidence linking flowering time, freezing tolerance, and the abscisic-acid pathway to local adaptation.
Collapse
Affiliation(s)
- Nicholas Price
- Department of Bioagricultural Sciences & Pest Management Colorado State University Fort Collins CO USA
- Department of Biological Sciences University of Cyprus Nicosia Cyprus
| | - Lua Lopez
- Department of Biology Binghamton University (State University of New York) Binghamton NY USA
| | - Adrian E Platts
- Simons Center for Quantitative Biology Cold Spring Harbor Laboratory Cold Spring Harbor NY USA
- Department of Biology Center for Genomics and Systems Biology New York University New York NY USA
| | - Jesse R Lasky
- Department of Biology Pennsylvania State University University Park PA USA
| |
Collapse
|
15
|
Peng PC, Khoueiry P, Girardot C, Reddington JP, Garfield DA, Furlong EEM, Sinha S. The Role of Chromatin Accessibility in cis-Regulatory Evolution. Genome Biol Evol 2020; 11:1813-1828. [PMID: 31114856 PMCID: PMC6601868 DOI: 10.1093/gbe/evz103] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/13/2019] [Indexed: 02/07/2023] Open
Abstract
Transcription factor (TF) binding is determined by sequence as well as chromatin accessibility. Although the role of accessibility in shaping TF-binding landscapes is well recorded, its role in evolutionary divergence of TF binding, which in turn can alter cis-regulatory activities, is not well understood. In this work, we studied the evolution of genome-wide binding landscapes of five major TFs in the core network of mesoderm specification, between Drosophila melanogaster and Drosophila virilis, and examined its relationship to accessibility and sequence-level changes. We generated chromatin accessibility data from three important stages of embryogenesis in both Drosophila melanogaster and Drosophila virilis and recorded conservation and divergence patterns. We then used multivariable models to correlate accessibility and sequence changes to TF-binding divergence. We found that accessibility changes can in some cases, for example, for the master regulator Twist and for earlier developmental stages, more accurately predict binding change than is possible using TF-binding motif changes between orthologous enhancers. Accessibility changes also explain a significant portion of the codivergence of TF pairs. We noted that accessibility and motif changes offer complementary views of the evolution of TF binding and developed a combined model that captures the evolutionary data much more accurately than either view alone. Finally, we trained machine learning models to predict enhancer activity from TF binding and used these functional models to argue that motif and accessibility-based predictors of TF-binding change can substitute for experimentally measured binding change, for the purpose of predicting evolutionary changes in enhancer activity.
Collapse
Affiliation(s)
- Pei-Chen Peng
- Department of Computer Science, University of Illinois at Urbana-Champaign.,Center for Bioinformatics and Functional Genomics, Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA
| | - Pierre Khoueiry
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.,American University of Beirut (AUB), Department of Biochemistry and Molecular Genetics, Beirut, Lebanon
| | - Charles Girardot
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - James P Reddington
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - David A Garfield
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.,IRI-Life Sciences, Humboldt Universität zu Berlin, Berlin, Germany
| | - Eileen E M Furlong
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois at Urbana-Champaign.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign
| |
Collapse
|
16
|
Arai Y, Cwetsch AW, Coppola E, Cipriani S, Nishihara H, Kanki H, Saillour Y, Freret-Hodara B, Dutriaux A, Okada N, Okano H, Dehay C, Nardelli J, Gressens P, Shimogori T, D’Onofrio G, Pierani A. Evolutionary Gain of Dbx1 Expression Drives Subplate Identity in the Cerebral Cortex. Cell Rep 2019; 29:645-658.e5. [DOI: 10.1016/j.celrep.2019.09.007] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Revised: 07/12/2019] [Accepted: 09/04/2019] [Indexed: 10/25/2022] Open
|
17
|
Doan RN, Shin T, Walsh CA. Evolutionary Changes in Transcriptional Regulation: Insights into Human Behavior and Neurological Conditions. Annu Rev Neurosci 2019; 41:185-206. [PMID: 29986162 DOI: 10.1146/annurev-neuro-080317-062104] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Understanding the biological basis for human-specific cognitive traits presents both immense challenges and unique opportunities. Although the question of what makes us human has been investigated with several different methods, the rise of comparative genomics, epigenomics, and medical genetics has provided tools to help narrow down and functionally assess the regions of the genome that seem evolutionarily relevant along the human lineage. In this review, we focus on how medical genetic cases have provided compelling functional evidence for genes and loci that appear to have interesting evolutionary signatures in humans. Furthermore, we examine a special class of noncoding regions, human accelerated regions (HARs), that have been suggested to show human-lineage-specific divergence, and how the use of clinical and population data has started to provide functional information to examine these regions. Finally, we outline methods that provide new insights into functional noncoding sequences in evolution.
Collapse
Affiliation(s)
- Ryan N Doan
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, Massachusetts 02115, USA; .,Allen Discovery Center for Human Brain Evolution, Boston Children's Hospital, Boston, Massachusetts 02115, USA.,Howard Hughes Medical Institute, Boston Children's Hospital, Boston, Massachusetts 02115, USA
| | - Taehwan Shin
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, Massachusetts 02115, USA; .,Allen Discovery Center for Human Brain Evolution, Boston Children's Hospital, Boston, Massachusetts 02115, USA
| | - Christopher A Walsh
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, Massachusetts 02115, USA; .,Allen Discovery Center for Human Brain Evolution, Boston Children's Hospital, Boston, Massachusetts 02115, USA.,Howard Hughes Medical Institute, Boston Children's Hospital, Boston, Massachusetts 02115, USA.,Departments of Pediatrics and Neurology, Harvard Medical School, Boston, Massachusetts 02138, USA
| |
Collapse
|
18
|
Zhong Z, Lin L, Chen M, Lin L, Chen X, Lin Y, Chen X, Wang Z, Norvienyeku J, Zheng H. Expression Divergence as an Evolutionary Alternative Mechanism Adopted by Two Rice Subspecies Against Rice Blast Infection. RICE (NEW YORK, N.Y.) 2019; 12:12. [PMID: 30825020 PMCID: PMC6397267 DOI: 10.1186/s12284-019-0270-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Accepted: 02/18/2019] [Indexed: 05/30/2023]
Abstract
BACKGROUND Rice (Oryza sativa L.) is one of the most important crops that serves as staple food for ~ 50% of the human population worldwide. Some important agronomic traits that allow rice to cope with numerous abiotic and biotic stresses have been selected and fixed during domestication. Knowledge on how expression divergence of genes gradually contributes to phenotypic differentiation in response to biotic stress and their contribution to rice population speciation is still limited. RESULTS Here, we explored gene expression divergence between a japonica rice cultivar Nipponbare and an indica rice cultivar 93-11 in response to invasion by the filamentous ascomycete fungus Magnaporthe oryzae (Pyricularia oryzae), a plant pathogen that causes significant loss to rice production worldwide. We investigated differentially expressed genes in the two cultivars and observed that evolutionarily conserved orthologous genes showed highly variable expression patterns under rice blast infection. Analysis of promoter region of these differentially expressed orthologous genes revealed the existence of cis-regulatory elements associated with the differentiated expression pattern of these genes in the two rice cultivars. Further comparison of these regions in global rice population indicated their fixation and close relationship with rice population divergence. CONCLUSION We proposed that variation in the expression patterns of these orthologous genes mediated by cis-regulatory elements in the two rice cultivars, may constitute an alternative evolutionary mechanism that distinguishes these two genetically and ecologically divergent rice cultivars in response to M. oryzae infection.
Collapse
Affiliation(s)
- Zhenhui Zhong
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
- Fujian-Taiwan Joint Center for Ecological Control of Crop Pests, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
| | - Lianyu Lin
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
- Fujian-Taiwan Joint Center for Ecological Control of Crop Pests, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
- College of Life Science, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
| | - Meilian Chen
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
- Fujian-Taiwan Joint Center for Ecological Control of Crop Pests, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
| | - Lili Lin
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
- Fujian-Taiwan Joint Center for Ecological Control of Crop Pests, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
| | - Xiaofeng Chen
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
- Fujian-Taiwan Joint Center for Ecological Control of Crop Pests, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
| | - Yahong Lin
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
- Fujian-Taiwan Joint Center for Ecological Control of Crop Pests, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
| | - Xi Chen
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
- Fujian-Taiwan Joint Center for Ecological Control of Crop Pests, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
| | - Zonghua Wang
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
- Fujian-Taiwan Joint Center for Ecological Control of Crop Pests, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
- Institute of Oceanography, Minjiang University, Fuzhou, 350108 China
| | - Justice Norvienyeku
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
- Fujian-Taiwan Joint Center for Ecological Control of Crop Pests, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
| | - Huakun Zheng
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Plant Protection, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
- Fujian-Taiwan Joint Center for Ecological Control of Crop Pests, Fujian Agriculture and Forestry University, Fuzhou, 350002 China
| |
Collapse
|
19
|
Niu M, Tabari E, Ni P, Su Z. Towards a map of cis-regulatory sequences in the human genome. Nucleic Acids Res 2018; 46:5395-5409. [PMID: 29733395 PMCID: PMC6009671 DOI: 10.1093/nar/gky338] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2018] [Revised: 04/14/2018] [Accepted: 04/19/2018] [Indexed: 01/10/2023] Open
Abstract
Accumulating evidence indicates that transcription factor (TF) binding sites, or cis-regulatory elements (CREs), and their clusters termed cis-regulatory modules (CRMs) play a more important role than do gene-coding sequences in specifying complex traits in humans, including the susceptibility to common complex diseases. To fully characterize their roles in deriving the complex traits/diseases, it is necessary to annotate all CREs and CRMs encoded in the human genome. However, the current annotations of CREs and CRMs in the human genome are still very limited and mostly coarse-grained, as they often lack the detailed information of CREs in CRMs. Here, we integrated 620 TF ChIP-seq datasets produced by the ENCODE project for 168 TFs in 79 different cell/tissue types and predicted an unprecedentedly completely map of CREs in CRMs in the human genome at single nucleotide resolution. The map includes 305 912 CRMs containing a total of 1 178 913 CREs belonging to 736 unique TF binding motifs. The predicted CREs and CRMs tend to be subject to either purifying selection or positive selection, thus are likely to be functional. Based on the results, we also examined the status of available ChIP-seq datasets for predicting the entire regulatory genome of humans.
Collapse
Affiliation(s)
- Meng Niu
- Department of Bioinformatics and Genomics, College of Computing and Informatics, The University of North Carolina at Charlotte, 9201 University City Blvd., Charlotte, NC 28223, USA
| | - Ehsan Tabari
- Department of Bioinformatics and Genomics, College of Computing and Informatics, The University of North Carolina at Charlotte, 9201 University City Blvd., Charlotte, NC 28223, USA
| | - Pengyu Ni
- Department of Bioinformatics and Genomics, College of Computing and Informatics, The University of North Carolina at Charlotte, 9201 University City Blvd., Charlotte, NC 28223, USA
| | - Zhengchang Su
- Department of Bioinformatics and Genomics, College of Computing and Informatics, The University of North Carolina at Charlotte, 9201 University City Blvd., Charlotte, NC 28223, USA
| |
Collapse
|
20
|
Li J, Zhang S, Erdenee S, Sun X, Dang R, Huang Y, Lei C, Chen H, Xu H, Cai Y, Lan X. Nucleotide variants in prion-related protein (testis-specific) gene (PRNT) and effects on Chinese and Mongolian sheep phenotypes. Prion 2018; 12:185-196. [PMID: 29695200 DOI: 10.1080/19336896.2018.1467193] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022] Open
Abstract
Studies of the ovine prion-related protein (testis-specific) gene (PRNT), including studies of genetic diversity, have highlighted its potential relationship to scrapie infection and economically important ovine traits. PRNT was previously reported to be highly polymorphic in Portuguese sheep. To characterize genetic polymorphisms in this gene in Asian sheep, a direct sequencing method was used to detect polymorphic loci in PRNT in 285 individual sheep from four Chinese and one Mongolian breeds. Seven SNP variants in PRNT were identified, including three novel variants (g.93G>A, g.162G>T, and g.190A>G) and four previously reported variants (g.17 C>T, g.112G>C, g.129C>T, and g.144A>G). In the five breeds that we analyzed, the mutation frequencies of g.190A>G in Lanzhou Fat-tail sheep (LFTS) and g.129C>T in the other four varieties were high (F>0.5). Moreover, thirteen different haplotypes that had a comparable distribution in the tested breeds were also identified; 'C-G-G-C-A-G-A' occurred at the highest frequency in the five sheep breeds. Additionally, we previously explored the significance of relationships between polymorphisms in PRNP or PRND and ovine growth performance. Here, we also performed correlation analysis in all tested loci. These loci polymorphisms were significantly associated with ten different growth traits (P<0.05), except for g.93G>A. Meanwhile, in contrast to a previous study, there was no significant association between the seven SNP loci analyzed and our previously reported sheep PRND or PRNP insertion/deletion mutations. Our findings may provide new insights into polymorphic variation in ovine PRNT, which may contribute to genetic improvements in economic traits that are important for sheep breeding.
Collapse
Affiliation(s)
- Jie Li
- a College of Animal Science and Technology, Northwest A&F University , Xi'an , Shaanxi , China.,b College of Animal Science and Technology, Innovation Experimental College, Northwest A&F University , Xi'an , Shaanxi , China
| | - Shaoli Zhang
- a College of Animal Science and Technology, Northwest A&F University , Xi'an , Shaanxi , China.,b College of Animal Science and Technology, Innovation Experimental College, Northwest A&F University , Xi'an , Shaanxi , China
| | - Sarantsetseg Erdenee
- a College of Animal Science and Technology, Northwest A&F University , Xi'an , Shaanxi , China
| | - Xiuzhu Sun
- a College of Animal Science and Technology, Northwest A&F University , Xi'an , Shaanxi , China
| | - Ruihua Dang
- a College of Animal Science and Technology, Northwest A&F University , Xi'an , Shaanxi , China
| | - Yongzhen Huang
- a College of Animal Science and Technology, Northwest A&F University , Xi'an , Shaanxi , China
| | - Chuzhao Lei
- a College of Animal Science and Technology, Northwest A&F University , Xi'an , Shaanxi , China
| | - Hong Chen
- a College of Animal Science and Technology, Northwest A&F University , Xi'an , Shaanxi , China
| | - Hongwei Xu
- c Science Experimental Center, College of Life Science and Engineering, Northwest University for Nationalities , Chengguan District, Lanzhou City in northwest, Lanzhou , Gansu , China
| | - Yong Cai
- c Science Experimental Center, College of Life Science and Engineering, Northwest University for Nationalities , Chengguan District, Lanzhou City in northwest, Lanzhou , Gansu , China
| | - Xianyong Lan
- a College of Animal Science and Technology, Northwest A&F University , Xi'an , Shaanxi , China
| |
Collapse
|
21
|
Liu J, Chen J, Perrone-Bizzozero NI, Turner JA, Calhoun VD. Regional enrichment analyses on genetic profiles for schizophrenia and bipolar disorder. Schizophr Res 2018; 192:240-246. [PMID: 28442247 PMCID: PMC5651209 DOI: 10.1016/j.schres.2017.04.033] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/03/2016] [Revised: 04/14/2017] [Accepted: 04/16/2017] [Indexed: 01/28/2023]
Abstract
Both schizophrenia (SZ) and bipolar disorder (BD) are highly heritable psychiatric disorders. The significant genomic risk loci are of great importance but with no guarantee of known functional impact and they cannot totally explain the genetic inheritance. In this study we present regional enrichment analyses across the genome, aiming to strike a balance between individual risk loci and integrated regional effects. Chromosomes were partitioned into 2 million base-pair regions (indicated by an underscore sign in the cytogenetic bands) on which enrichment tests are performed. In the discovery phase, we leverage the Psychiatric Genomics Consortium SZ and BD initial association test results for European Ancestry (EA) population and dbGAP SNP data for African Ancestry (AA) population. 78 and 48 regions show significantly enriched associations with SZ and BD respectively in the EA population, and nine are in common including MHC, 3p21.1, 7p22.3_2, 2q32.3_2, 8q24.3_4, and 19q13.33_1. The most unique SZ associated region is 1p21.3_3, while the most unique BD associated region is 6q25.2_1. For the AA population fewer regions are discovered with only 10% overlapping with that of EA population. A replication test using Wellcome Trust Case Control Consortium data for EA population verified 9% of the SZ enriched regions and 40% of the BD enriched regions. In summary, we showed that regional enrichment analyses produce reliable genetic association profiles using about one tenth of samples compared to single base-pair genome wide association approach. The identified association regions will be useful for further genetic functional studies.
Collapse
Affiliation(s)
- Jingyu Liu
- The Mind Research Network, Albuquerque, NM, USA; Dept. of Electrical Engineering, University of New Mexico, Albuquerque, NM, USA.
| | - Jiayu Chen
- The Mind Research Network, Albuquerque, NM, USA
| | | | - Jessica A Turner
- The Mind Research Network, Albuquerque, NM, USA; Psychology Department and Neuroscience Institute, Georgia State University, Atlanta, GA, USA
| | - Vince D Calhoun
- The Mind Research Network, Albuquerque, NM, USA; Dept. of Electrical Engineering, University of New Mexico, Albuquerque, NM, USA; Dept. of Neurosciences, University of New Mexico School of Medicine, Albuquerque, NM, USA
| |
Collapse
|
22
|
Redundant regulation. Nat Ecol Evol 2018; 2:418-419. [PMID: 29379186 DOI: 10.1038/s41559-018-0479-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
23
|
Nikitin D, Penzar D, Garazha A, Sorokin M, Tkachev V, Borisov N, Poltorak A, Prassolov V, Buzdin AA. Profiling of Human Molecular Pathways Affected by Retrotransposons at the Level of Regulation by Transcription Factor Proteins. Front Immunol 2018; 9:30. [PMID: 29441061 PMCID: PMC5797644 DOI: 10.3389/fimmu.2018.00030] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2017] [Accepted: 01/04/2018] [Indexed: 12/22/2022] Open
Abstract
Endogenous retroviruses and retrotransposons also termed retroelements (REs) are mobile genetic elements that were active until recently in human genome evolution. REs regulate gene expression by actively reshaping chromatin structure or by directly providing transcription factor binding sites (TFBSs). We aimed to identify molecular processes most deeply impacted by the REs in human cells at the level of TFBS regulation. By using ENCODE data, we identified ~2 million TFBS overlapping with putatively regulation-competent human REs located in 5-kb gene promoter neighborhood (~17% of all TFBS in promoter neighborhoods; ~9% of all RE-linked TFBS). Most of REs hosting TFBS were highly diverged repeats, and for the evolutionary young (0–8% diverged) elements we identified only ~7% of all RE-linked TFBS. The gene-specific distributions of RE-linked TFBS generally correlated with the distributions for all TFBS. However, several groups of molecular processes were highly enriched in the RE-linked TFBS regulation. They were strongly connected with the immunity and response to pathogens, with the negative regulation of gene transcription, ubiquitination, and protein degradation, extracellular matrix organization, regulation of STAT signaling, fatty acids metabolism, regulation of GTPase activity, protein targeting to Golgi, regulation of cell division and differentiation, development and functioning of perception organs and reproductive system. By contrast, the processes most weakly affected by the REs were linked with the conservative aspects of embryo development. We also identified differences in the regulation features by the younger and older fractions of the REs. The regulation by the older fraction of the REs was linked mainly with the immunity, cell adhesion, cAMP, IGF1R, Notch, Wnt, and integrin signaling, neuronal development, chondroitin sulfate and heparin metabolism, and endocytosis. The younger REs regulate other aspects of immunity, cell cycle progression and apoptosis, PDGF, TGF beta, EGFR, and p38 signaling, transcriptional repression, structure of nuclear lumen, catabolism of phospholipids, and heterocyclic molecules, insulin and AMPK signaling, retrograde Golgi-ER transport, and estrogen signaling. The immunity-linked pathways were highly represented in both categories, but their functional roles were different and did not overlap. Our results point to the most quickly evolving molecular pathways in the recent and ancient evolution of human genome.
Collapse
Affiliation(s)
- Daniil Nikitin
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia.,D. Rogachev Federal Research Center of Pediatric Hematology, Oncology and Immunology, Moscow, Russia
| | - Dmitry Penzar
- The Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
| | - Andrew Garazha
- D. Rogachev Federal Research Center of Pediatric Hematology, Oncology and Immunology, Moscow, Russia.,OmicsWay Corp., Walnut, CA, United States
| | - Maxim Sorokin
- OmicsWay Corp., Walnut, CA, United States.,National Research Centre Kurchatov Institute, Centre for Convergence of Nano-, Bio-, Information and Cognitive Sciences and Technologies, Moscow, Russia.,Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia
| | | | - Nicolas Borisov
- OmicsWay Corp., Walnut, CA, United States.,National Research Centre Kurchatov Institute, Centre for Convergence of Nano-, Bio-, Information and Cognitive Sciences and Technologies, Moscow, Russia
| | - Alexander Poltorak
- Program in Immunology, Sackler Graduate School, Tufts University, Boston, MA, United States
| | - Vladimir Prassolov
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Anton A Buzdin
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia.,D. Rogachev Federal Research Center of Pediatric Hematology, Oncology and Immunology, Moscow, Russia.,OmicsWay Corp., Walnut, CA, United States.,National Research Centre Kurchatov Institute, Centre for Convergence of Nano-, Bio-, Information and Cognitive Sciences and Technologies, Moscow, Russia
| |
Collapse
|
24
|
An Essential Regulatory System Originating from Polygenic Transcriptional Rewiring of PhoP-PhoQ of Xanthomonas campestris. Genetics 2017; 206:2207-2223. [PMID: 28550013 DOI: 10.1534/genetics.117.200204] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Accepted: 05/22/2017] [Indexed: 01/06/2023] Open
Abstract
How essential, regulatory genes originate and evolve is intriguing because mutations of these genes not only lead to lethality in organisms, but also have pleiotropic effects since they control the expression of multiple downstream genes. Therefore, the evolution of essential, regulatory genes is not only determined by genetic variations of their own sequences, but also by the biological function of downstream genes and molecular mechanisms of regulation. To understand the origin of essential, regulatory genes, experimental dissection of the complete regulatory cascade is needed. Here, we provide genetic evidences to reveal that PhoP-PhoQ is an essential two-component signal transduction system in the gram-negative bacterium Xanthomonas campestris, but that its orthologs in other bacteria belonging to Proteobacteria are nonessential. Mutational, biochemical, and chromatin immunoprecipitation together with high-throughput sequencing analyses revealed that phoP and phoQ of X. campestris and its close relative Pseudomonas aeruginosa are replaceable, and that the consensus binding motifs of the transcription factor PhoP are also highly conserved. PhoP Xcc in X. campestris regulates the transcription of a number of essential, structural genes by directly binding to cis-regulatory elements (CREs); however, these CREs are lacking in the orthologous essential, structural genes in P. aeruginosa, and thus the regulatory relationships between PhoP Pae and these downstream essential genes are disassociated. Our findings suggested that the recruitment of regulatory proteins by critical structural genes via transcription factor-CRE rewiring is a driving force in the origin and functional divergence of essential, regulatory genes.
Collapse
|
25
|
Abstract
Chromatin immunoprecipitation followed by sequencing is an invaluable assay for identifying the genomic binding sites of transcription factors. However, transcription factors rarely bind chromatin alone but often bind together with other cofactors, forming protein complexes. Here, we describe a computational method that integrates multiple ChIP-seq and RNA-seq datasets to discover protein complexes and determine their role as activators or repressors. This chapter outlines a detailed computational pipeline for discovering and predicting binding partners from ChIP-seq data and inferring their role in regulating gene expression. This work aims at developing hypotheses about gene regulation via binding partners and deciphering the combinatorial nature of DNA-binding proteins.
Collapse
|
26
|
Chai C, Subudhi PK. Comprehensive Analysis and Expression Profiling of the OsLAX and OsABCB Auxin Transporter Gene Families in Rice (Oryza sativa) under Phytohormone Stimuli and Abiotic Stresses. FRONTIERS IN PLANT SCIENCE 2016; 7:593. [PMID: 27200061 PMCID: PMC4853607 DOI: 10.3389/fpls.2016.00593] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/15/2016] [Accepted: 04/18/2016] [Indexed: 05/20/2023]
Abstract
The plant hormone auxin regulates many aspects of plant growth and developmental processes. Auxin gradient is formed in plant as a result of polar auxin transportation by three types of auxin transporters such as OsLAX, OsPIN, and OsABCB. We report here the analysis of two rice auxin transporter gene families, OsLAX and OsABCB, using bioinformatics tools, publicly accessible microarray data, and quantitative RT-PCR. There are 5 putative OsLAXs and 22 putative OsABCBs in rice genome, which were mapped on 8 chromosomes. The exon-intron structure of OsLAX genes and properties of deduced proteins were relatively conserved within grass family, while that of OsABCB genes varied greatly. Both constitutive and organ/tissue specific expression patterns were observed in OsLAXs and OsABCBs. Analysis of evolutionarily closely related "gene pairs" together with organ/tissue specific expression revealed possible "function gaining" and "function losing" events during rice evolution. Most OsLAX and OsABCB genes were regulated by drought and salt stress, as well as hormonal stimuli [auxin and Abscisic Acid (ABA)], which suggests extensive crosstalk between abiotic stresses and hormone signaling pathways. The existence of large number of auxin and stress related cis-regulatory elements in promoter regions might account for their massive responsiveness of these genes to these environmental stimuli, indicating complexity of regulatory networks involved in various developmental and physiological processes. The comprehensive analysis of OsLAX and OsABCB auxin transporter genes in this study would be helpful for understanding the biological significance of these gene families in hormone signaling and adaptation of rice plants to unfavorable environments.
Collapse
|
27
|
Simonti CN, Capra JA. The evolution of the human genome. Curr Opin Genet Dev 2015; 35:9-15. [PMID: 26338498 DOI: 10.1016/j.gde.2015.08.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2015] [Revised: 08/08/2015] [Accepted: 08/12/2015] [Indexed: 02/05/2023]
Abstract
Human genomes hold a record of the evolutionary forces that have shaped our species. Advances in DNA sequencing, functional genomics, and population genetic modeling have deepened our understanding of human demographic history, natural selection, and many other long-studied topics. These advances have also revealed many previously underappreciated factors that influence the evolution of the human genome, including functional modifications to DNA and histones, conserved 3D topological chromatin domains, structural variation, and heterogeneous mutation patterns along the genome. Using evolutionary theory as a lens to study these phenomena will lead to significant breakthroughs in understanding what makes us human and why we get sick.
Collapse
Affiliation(s)
- Corinne N Simonti
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37235, USA
| | - John A Capra
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37235, USA; Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA; Department of Biomedical Informatics, Vanderbilt University, Nashville, TN 37235, USA.
| |
Collapse
|
28
|
Approximation to the distribution of fitness effects across functional categories in human segregating polymorphisms. PLoS Genet 2014; 10:e1004697. [PMID: 25375159 PMCID: PMC4222666 DOI: 10.1371/journal.pgen.1004697] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2014] [Accepted: 08/22/2014] [Indexed: 02/03/2023] Open
Abstract
Quantifying the proportion of polymorphic mutations that are deleterious or neutral is of fundamental importance to our understanding of evolution, disease genetics and the maintenance of variation genome-wide. Here, we develop an approximation to the distribution of fitness effects (DFE) of segregating single-nucleotide mutations in humans. Unlike previous methods, we do not assume that synonymous mutations are neutral or not strongly selected, and we do not rely on fitting the DFE of all new nonsynonymous mutations to a single probability distribution, which is poorly motivated on a biological level. We rely on a previously developed method that utilizes a variety of published annotations (including conservation scores, protein deleteriousness estimates and regulatory data) to score all mutations in the human genome based on how likely they are to be affected by negative selection, controlling for mutation rate. We map this and other conservation scores to a scale of fitness coefficients via maximum likelihood using diffusion theory and a Poisson random field model on SNP data. Our method serves to approximate the deleterious DFE of mutations that are segregating, regardless of their genomic consequence. We can then compare the proportion of mutations that are negatively selected or neutral across various categories, including different types of regulatory sites. We observe that the distribution of intergenic polymorphisms is highly peaked at neutrality, while the distribution of nonsynonymous polymorphisms has a second peak at [Formula: see text]. Other types of polymorphisms have shapes that fall roughly in between these two. We find that transcriptional start sites, strong CTCF-enriched elements and enhancers are the regulatory categories with the largest proportion of deleterious polymorphisms.
Collapse
|