1
|
Kuraz Abebe B, Wang J, Guo J, Wang H, Li A, Zan L. A review of the role of epigenetic studies for intramuscular fat deposition in beef cattle. Gene 2024; 908:148295. [PMID: 38387707 DOI: 10.1016/j.gene.2024.148295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 01/23/2024] [Accepted: 02/15/2024] [Indexed: 02/24/2024]
Abstract
Intramuscular fat (IMF) deposition profoundly influences meat quality and economic value in beef cattle production. Meanwhile, contemporary developments in epigenetics have opened new outlooks for understanding the molecular basics of IMF regulation, and it has become a key area of research for world scholars. Therefore, the aim of this paper was to provide insight and synthesis into the intricate relationship between epigenetic mechanisms and IMF deposition in beef cattle. The methodology involves a thorough analysis of existing literature, including pertinent books, academic journals, and online resources, to provide a comprehensive overview of the role of epigenetic studies in IMF deposition in beef cattle. This review summarizes the contemporary studies in epigenetic mechanisms in IMF regulation, high-resolution epigenomic mapping, single-cell epigenomics, multi-omics integration, epigenome editing approaches, longitudinal studies in cattle growth, environmental epigenetics, machine learning in epigenetics, ethical and regulatory considerations, and translation to industry practices from perspectives of IMF deposition in beef cattle. Moreover, this paper highlights DNA methylation, histone modifications, acetylation, phosphorylation, ubiquitylation, non-coding RNAs, DNA hydroxymethylation, epigenetic readers, writers, and erasers, chromatin immunoprecipitation followed by sequencing, whole genome bisulfite sequencing, epigenome-wide association studies, and their profound impact on the expression of crucial genes governing adipogenesis and lipid metabolism. Nutrition and stress also have significant influences on epigenetic modifications and IMF deposition. The key findings underscore the pivotal role of epigenetic studies in understanding and enhancing IMF deposition in beef cattle, with implications for precision livestock farming and ethical livestock management. In conclusion, this review highlights the crucial significance of epigenetic pathways and environmental factors in affecting IMF deposition in beef cattle, providing insightful information for improving the economics and meat quality of cattle production.
Collapse
Affiliation(s)
- Belete Kuraz Abebe
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China; Department of Animal Science, Werabe University, P.O. Box 46, Werabe, Ethiopia
| | - Jianfang Wang
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China
| | - Juntao Guo
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China
| | - Hongbao Wang
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China
| | - Anning Li
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China
| | - Linsen Zan
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China; National Beef Cattle Improvement Center, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China.
| |
Collapse
|
2
|
Fulghum B, Tanker SH, White RA. DeGenPrime provides robust primer design and optimization unlocking the biosphere. BIOINFORMATICS ADVANCES 2024; 4:vbae044. [PMID: 38590916 PMCID: PMC11001487 DOI: 10.1093/bioadv/vbae044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/02/2023] [Revised: 02/19/2024] [Accepted: 03/12/2024] [Indexed: 04/10/2024]
Abstract
Motivation Polymerase chain reaction (PCR) is the world's most important molecular diagnostic with applications ranging from medicine to ecology. PCR can fail because of poor primer design. The nearest-neighbor thermodynamic properties, picking conserved regions, and filtration via penalty of oligonucleotides form the basis for good primer design. Results DeGenPrime is a console-based high-quality PCR primer design tool that can utilize MSA formats and degenerate bases expanding the target range for a single primer set. Our software utilizes thermodynamic properties, filtration metrics, penalty scoring, and conserved region finding of any proposed primer. It has degeneracy, repeated k-mers, relative GC content, and temperature range filters. Minimal penalty scoring is included according to secondary structure self-dimerization metrics, GC clamping, tri- and tetra-loop hairpins, and internal repetition. We compared PrimerDesign-M, DegePrime, ConsensusPrimer, and DeGenPrime on acceptable primer yield. PrimerDesign-M, DegePrime, and ConsensusPrimer provided 0%, 11%, and 17% yield, respectively, for the alternative iron nitrogenase (anfD) gene target. DeGenPrime successfully identified quality primers within the conserved regions of the T4-like phage major capsid protein (g23), conserved regions of molybdenum-based nitrogenase (nif), and its alternatives vanadium (vnf) and iron (anf) nitrogenase. DeGenPrime provides a universal and scalable primer design tool for the entire tree of life. Availability and implementation DeGenPrime is written in C++ and distributed under a BSD-3-Clause license. The source code for DeGenPrime is freely available on www.github.com/raw-lab/degenprime.
Collapse
Affiliation(s)
- Bryan Fulghum
- Department of Bioinformatics and Genomics, North Carolina Research Campus (NCRC), The University of North Carolina at Charlotte, Kannapolis, NC 28081, United States
- Department of Bioinformatics and Genomics, Computational Intelligence to Predict Health and Environmental Risks (CIPHER) Research Center, The University of North Carolina at Charlotte, Charlotte, NC 28223, United States
| | - Sophie H Tanker
- Department of Bioinformatics and Genomics, North Carolina Research Campus (NCRC), The University of North Carolina at Charlotte, Kannapolis, NC 28081, United States
- Department of Bioinformatics and Genomics, Computational Intelligence to Predict Health and Environmental Risks (CIPHER) Research Center, The University of North Carolina at Charlotte, Charlotte, NC 28223, United States
| | - Richard Allen White
- Department of Bioinformatics and Genomics, North Carolina Research Campus (NCRC), The University of North Carolina at Charlotte, Kannapolis, NC 28081, United States
- Department of Bioinformatics and Genomics, Computational Intelligence to Predict Health and Environmental Risks (CIPHER) Research Center, The University of North Carolina at Charlotte, Charlotte, NC 28223, United States
| |
Collapse
|
3
|
Jiménez-Gaona Y, Vivanco-Galván O, Cruz D, Armijos-Carrión A, Suárez JP. Compensatory Base Changes in ITS2 Secondary Structure Alignment, Modelling, and Molecular Phylogeny: An Integrated Approach to Improve Species Delimitation in Tulasnella (Basidiomycota). J Fungi (Basel) 2023; 9:894. [PMID: 37755002 PMCID: PMC10532482 DOI: 10.3390/jof9090894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 08/08/2023] [Accepted: 08/17/2023] [Indexed: 09/28/2023] Open
Abstract
BACKGROUND The delimitation of species of Tulasnella has been extensively studied, mainly at the morphological (sexual and asexual states) and molecular levels-showing ambiguity between them. An integrative species concept that includes characteristics such as molecular, ecology, morphology, and other information is crucial for species delimitation in complex groups such as Tulasnella. OBJECTIVES The aim of this study is to test evolutionary relationships using a combination of alignment-based and alignment-free distance matrices as an alternative molecular tool to traditional methods, and to consider the secondary structures and CBCs from ITS2 (internal transcribed spacer) sequences for species delimitation in Tulasnella. METHODOLOGY Three phylogenetic approaches were plotted: (i) alignment-based, (ii) alignment-free, and (iii) a combination of both distance matrices using the DISTATIS and pvclust libraries from an R package. Finally, the secondary structure consensus was modeled by Mfold, and a CBC analysis was obtained to complement the species delimitation using 4Sale. RESULTS AND CONCLUSIONS The phylogenetic tree results showed delimited monophyletic clades in Tulasnella spp., where all 142 Tulasnella sequences were divided into two main clades A and B and assigned to seven species (T. asymmetrica, T. andina, T. eichleriana ECU6, T. eichleriana ECU4 T. pinicola, T. violea), supported by bootstrap values from 72% to 100%. From the 2D secondary structure alignment, three types of consensus models with helices and loops were obtained. Thus, T. albida belongs to type I; T. eichleriana, T. tomaculum, and T. violea belong to type II; and T. asymmetrica, T. andina, T. pinicola, and T. spp. (GER) belong to type III; each type contains four to six domains, with nine CBCs among these that corroborate different species.
Collapse
Affiliation(s)
- Yuliana Jiménez-Gaona
- Departamento de Química, Universidad Técnica Particular de Loja (UTPL), San Cayetano Alto s/n, Loja 1101608, Ecuador
| | - Oscar Vivanco-Galván
- Departamento de Ciencias Biológicas, Universidad Técnica Particular de Loja (UTPL), San Cayetano Alto s/n, Loja 1101608, Ecuador; (O.V.-G.); (D.C.); (J.P.S.)
| | - Darío Cruz
- Departamento de Ciencias Biológicas, Universidad Técnica Particular de Loja (UTPL), San Cayetano Alto s/n, Loja 1101608, Ecuador; (O.V.-G.); (D.C.); (J.P.S.)
| | - Angelo Armijos-Carrión
- Department of Biology, Memorial University of Newfoundland, St. John’s, NL A1B 3X9, Canada;
| | - Juan Pablo Suárez
- Departamento de Ciencias Biológicas, Universidad Técnica Particular de Loja (UTPL), San Cayetano Alto s/n, Loja 1101608, Ecuador; (O.V.-G.); (D.C.); (J.P.S.)
| |
Collapse
|
4
|
Zhou X, Zhu T, Fang W, Yu R, He Z, Chen D. Systematic annotation of conservation states provides insights into regulatory regions in rice. J Genet Genomics 2022; 49:1127-1137. [PMID: 35470092 DOI: 10.1016/j.jgg.2022.04.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Revised: 04/08/2022] [Accepted: 04/12/2022] [Indexed: 01/14/2023]
Abstract
Plant genomes contain a large fraction of noncoding sequences. The discovery and annotation of conserved noncoding sequences (CNSs) in plants is an ongoing challenge. Here we report the application of comparative genomics to systematically identify CNSs in 50 well-annotated Gramineae genomes using rice (Oryza sativa) as the reference. We conduct multiple-way whole-genome alignments to the rice genome. The rice genome is annotated as 20 conservation states (CSs) at single-nucleotide resolution using a multivariate hidden Markov model (ConsHMM) based on the multiple-genome alignments. Different states show distinct enrichments for various genomic features, and the conservation scores of CSs are highly correlated with the level of associated chromatin accessibility. We find that at least 33.5% of the rice genome is highly under selection, with more than 70% of the sequence lying outside of coding regions. A catalog of 855,366 regulatory CNSs is generated, and they significantly overlapped with putative active regulatory elements such as promoters, enhancers, and transcription factor binding sites. Collectively, our study provides a resource for elucidating functional noncoding regions of the rice genome and an evolutionary aspect of regulatory sequences in higher plants.
Collapse
Affiliation(s)
- Xinkai Zhou
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu 210023, China
| | - Tao Zhu
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu 210023, China
| | - Wen Fang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu 210023, China
| | - Ranran Yu
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu 210023, China
| | - Zhaohui He
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu 210023, China
| | - Dijun Chen
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu 210023, China.
| |
Collapse
|
5
|
Flaga J, Przybyło M, Górka P. Gene Expression Analysis with No Sequence Data: Study on Reeves's Muntjac ( Muntiacus reevesi). Curr Issues Mol Biol 2021; 43:1576-1582. [PMID: 34698093 PMCID: PMC8929141 DOI: 10.3390/cimb43030111] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2021] [Revised: 10/07/2021] [Accepted: 10/08/2021] [Indexed: 11/16/2022] Open
Abstract
Despite scientific progress, the gene sequences for many species not commonly used in research have not yet been analyzed. This makes it difficult to carry out molecular studies on such animals, as the sequence of genes is the basic information used in many techniques. In this study, we attempt to design primers for a real-time PCR analysis, basing on a comparative analysis of selected gene sequences of species related to Reeves's muntjac (Muntiacus reevesi) and by identifying highly conservative regions. Results of PCR products sequencing and their alignment with the GenBank collection show that all selected primers gave products highly similar (> 90%) to the intended target (among compared species), which led us to the conclusion that our primers may be used for further analyses of gene expression.
Collapse
|
6
|
Mehravar M, Ghaemimanesh F, Poursani EM. Exon and intron sharing in opposite direction-an undocumented phenomenon in human genome-between Pou5f1 and Tcf19 genes. BMC Genomics 2021; 22:718. [PMID: 34610795 PMCID: PMC8493703 DOI: 10.1186/s12864-021-08039-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Accepted: 09/24/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Overlapping genes share same genomic regions in parallel (sense) or anti-parallel (anti-sense) orientations. These gene pairs seem to occur in all domains of life and are best known from viruses. However, the advantage and biological significance of overlapping genes is still unclear. Expressed sequence tags (ESTs) analysis enabled us to uncover an overlapping gene pair in the human genome. RESULTS By using in silico analysis of previous experimental documentations, we reveal a new form of overlapping genes in the human genome, in which two genes found on opposite strands (Pou5f1 and Tcf19), share two exons and one intron enclosed, at the same positions, between OCT4B3 and TCF19-D splice variants. CONCLUSIONS This new form of overlapping gene expands our previous perception of splicing events and may shed more light on the complexity of gene regulation in higher organisms. Additional such genes might be detected by ESTs analysis also of other organisms.
Collapse
Affiliation(s)
- Majid Mehravar
- Department of Anatomy and Developmental Biology, Development and Stem Cells Program, Biomedicine Discovery Institute, Monash University, Melbourne, Australia
| | - Fatemeh Ghaemimanesh
- Monoclonal Antibody Research Center, Avicenna Research Institute, ACECR, Tehran, Iran
| | - Ensieh M Poursani
- Hematology, Oncology and Stem Cell Transplantation Research Center, Tehran University of Medical Sciences, Tehran, Iran.
| |
Collapse
|
7
|
Quick and efficient approach to develop genomic resources in orphan species: Application in Lavandula angustifolia. PLoS One 2020; 15:e0243853. [PMID: 33306734 PMCID: PMC7732122 DOI: 10.1371/journal.pone.0243853] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Accepted: 11/27/2020] [Indexed: 12/24/2022] Open
Abstract
Next-Generation Sequencing (NGS) technologies, by reducing the cost and increasing the throughput of sequencing, have opened doors to generate genomic data in a range of previously poorly studied species. In this study, we propose a method for the rapid development of a large-scale molecular resources for orphan species. We studied as an example the true lavender (Lavandula angustifolia Mill.), a perennial sub-shrub plant native from the Mediterranean region and whose essential oil have numerous applications in cosmetics, pharmaceuticals, and alternative medicines. The heterozygous clone “Maillette” was used as a reference for DNA and RNA sequencing. We first built a reference Unigene, compound of coding sequences, thanks to de novo RNA-seq assembly. Then, we reconstructed the complete genes sequences (with introns and exons) using an Unigene-guided DNA-seq assembly approach. This aimed to maximize the possibilities of finding polymorphism between genetically close individuals despite the lack of a reference genome. Finally, we used these resources for SNP mining within a collection of 16 commercial lavender clones and tested the SNP within the scope of a genetic distance analysis. We obtained a cleaned reference of 8, 030 functionally in silico annotated genes. We found 359K polymorphic sites and observed a high SNP frequency (mean of 1 SNP per 90 bp) and a high level of heterozygosity (more than 60% of heterozygous SNP per genotype). On overall, we found similar genetic distances between pairs of clones, which is probably related to the out-crossing nature of the species and the restricted area of cultivation. The proposed method is transferable to other orphan species, requires little bioinformatics resources and can be realized within a year. This is also the first reported large-scale SNP development on Lavandula angustifolia. All the genomics resources developed herein are publicly available and provide a rich pool of molecular resources to explore and exploit lavender genetic diversity in breeding programs.
Collapse
|
8
|
Srivastava AK, Srivastava R, Sharma A, Bharati AP, Tiwari PK, Singh AK, Srivastava AK, Chakdar H, Kashyap PL, Saxena AK. Pan-genome analysis of Exiguobacterium reveals species delineation and genomic similarity with Exiguobacterium profundum PHM 11. ENVIRONMENTAL MICROBIOLOGY REPORTS 2020; 12:639-650. [PMID: 32996243 DOI: 10.1111/1758-2229.12890] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/24/2020] [Accepted: 09/25/2020] [Indexed: 06/11/2023]
Abstract
The stint of the bacterial species is convoluting, but the new algorithms to calculate genome-to-genome distance (GGD) and DNA-DNA hybridization (DDH) for comparative genome analysis have rejuvenated the exploration of species and sub-species characterization. The present study reports the first whole genome sequence of Exiguobacterium profundum PHM11. PHM11 genome consist of ~ 2.92 Mb comprising 48 contigs, 47.93% G + C content. Functional annotations revealed a total of 3033 protein coding genes and 33 non-protein coding genes. Out of these, only 2316 could be characterized and others reported as hypothetical proteins. The comparative analysis of predicted proteome of PHM11 with five other Exiguobacterium sp. identified 3806 clusters, out of which the PHM11 shared a total of 2723 clusters having 1664 common clusters, 131 singletons and 928 distributed between five species. The pan-genome analysis of 70 different genomic sequences of Exigubacterium strains devoid of a species taxon was done on the basis of GGD and the DDH which identified eight genomes analogous to the PHM11 at species level and may be characterized as E. profundum. The ANI value and phylogenetic tree analysis also support the same. The results regarding pan-genome analysis provide a convincing insight for delineation of these eight strains to species.
Collapse
Affiliation(s)
- Alok Kumar Srivastava
- ICAR-National Bureau of Agriculturally Important Microorganisms (NBAIM), Maunath Bhanjan, UP, 275103, India
| | - Ruchi Srivastava
- ICAR-National Bureau of Agriculturally Important Microorganisms (NBAIM), Maunath Bhanjan, UP, 275103, India
| | - Anjney Sharma
- ICAR-National Bureau of Agriculturally Important Microorganisms (NBAIM), Maunath Bhanjan, UP, 275103, India
| | - Akhilendra Pratap Bharati
- ICAR-National Bureau of Agriculturally Important Microorganisms (NBAIM), Maunath Bhanjan, UP, 275103, India
| | - Praveen Kumar Tiwari
- ICAR-National Bureau of Agriculturally Important Microorganisms (NBAIM), Maunath Bhanjan, UP, 275103, India
| | - Alok Kumar Singh
- ICAR-National Bureau of Agriculturally Important Microorganisms (NBAIM), Maunath Bhanjan, UP, 275103, India
| | - Anchal Kumar Srivastava
- ICAR-National Bureau of Agriculturally Important Microorganisms (NBAIM), Maunath Bhanjan, UP, 275103, India
| | - Hillol Chakdar
- ICAR-National Bureau of Agriculturally Important Microorganisms (NBAIM), Maunath Bhanjan, UP, 275103, India
| | - Prem Lal Kashyap
- ICAR-Indian Institute of Wheat and Barley Research, Karnal, Haryana, 132001, India
| | - Anil Kumar Saxena
- ICAR-National Bureau of Agriculturally Important Microorganisms (NBAIM), Maunath Bhanjan, UP, 275103, India
| |
Collapse
|
9
|
Abstract
The study of chromosome evolution is undergoing a resurgence of interest owing to advances in DNA sequencing technology that facilitate the production of chromosome-scale whole-genome assemblies de novo. This review focuses on the history, methods, discoveries, and current challenges facing the field, with an emphasis on vertebrate genomes. A detailed examination of the literature on the biology of chromosome rearrangements is presented, specifically the relationship between chromosome rearrangements and phenotypic evolution, adaptation, and speciation. A critical review of the methods for identifying, characterizing, and visualizing chromosome rearrangements and computationally reconstructing ancestral karyotypes is presented. We conclude by looking to the future, identifying the enormous technical and scientific challenges presented by the accumulation of hundreds and eventually thousands of chromosome-scale assemblies.
Collapse
Affiliation(s)
- Joana Damas
- The Genome Center, University of California, Davis, California 95616, USA; , ,
| | - Marco Corbo
- The Genome Center, University of California, Davis, California 95616, USA; , ,
| | - Harris A Lewin
- The Genome Center, University of California, Davis, California 95616, USA; , , .,Department of Evolution and Ecology, College of Biological Sciences, University of California, Davis, California 95616, USA
| |
Collapse
|
10
|
Mendrick DL, Mattes WB. Translational toxicology: An overview. CURRENT OPINION IN TOXICOLOGY 2020. [DOI: 10.1016/j.cotox.2020.08.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
11
|
Bayoumi M, Rohaim MA, Munir M. Structural and Virus Regulatory Insights Into Avian N6-Methyladenosine (m6A) Machinery. Front Cell Dev Biol 2020; 8:543. [PMID: 32760718 PMCID: PMC7373739 DOI: 10.3389/fcell.2020.00543] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2020] [Accepted: 06/09/2020] [Indexed: 12/20/2022] Open
Abstract
The addition of a methyl group to the N6 position of adenosine (m6A) is the most common posttranscriptional RNA modification, and it regulates most steps of RNA metabolism including splicing, stability, translation, nuclear-export, and RNA structures. Besides cellular RNA, m6A modifications have also been detected on viral RNA. A range of recent studies have demonstrated the crucial roles of m6A in the virus–host interactions; however, m6A cellular machineries are only characterized in limited mammalian species. Herein, we aim to present comprehensive evolutionary insights into major m6A writers, erasers, and readers and draw a comparative structural analysis between avian and mammalian m6A-associated machineries. The comparative collinearity on the chromosomal scale revealed that the majority of m6A-related genes were found less syntenic even among avian species. Genetic analysis of avian m6A erasers revealed a distinct phylogenetic clustering compared to mammalian orthologs and shared a weak percent (55%) identity with mammalian species with low identity percentage (55%). The overall comparative three-dimensional (3D) structure analyses among different mammalian species were maintained through synonymous structural mutations. Unlike erasers, the putative 3D structures in the active sites as for the aromatic cage in YTH-domain of YTHDC1 and two pivotal loops in MTD-domains in METTL3 exhibited structural alterations in chicken. In conjunction with in silico investigations, influenza viruses significantly downregulated gene the transcription of m6A writers and erasers, whereas m6A readers were moderately regulated in chicken fibroblasts. In light of these findings, future detailed biochemical and crystallographic studies are warranted to define the roles of m6A machinery in regulating both viral and cellular RNA metabolism in avian species.
Collapse
Affiliation(s)
- Mahmoud Bayoumi
- Division of Biomedical and Life Sciences, Lancaster University, Lancaster, United Kingdom
| | - Mohammed A Rohaim
- Division of Biomedical and Life Sciences, Lancaster University, Lancaster, United Kingdom
| | - Muhammad Munir
- Division of Biomedical and Life Sciences, Lancaster University, Lancaster, United Kingdom
| |
Collapse
|
12
|
Muktar MS, Teshome A, Hanson J, Negawo AT, Habte E, Domelevo Entfellner JB, Lee KW, Jones CS. Genotyping by sequencing provides new insights into the diversity of Napier grass (Cenchrus purpureus) and reveals variation in genome-wide LD patterns between collections. Sci Rep 2019; 9:6936. [PMID: 31061417 PMCID: PMC6502793 DOI: 10.1038/s41598-019-43406-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2019] [Accepted: 04/24/2019] [Indexed: 12/12/2022] Open
Abstract
Napier grass is an important tropical forage-grass and of growing potential as an energy crop. One-hundred-five Napier grass accessions, encompassing two independent collections, were subjected to genotyping by sequencing which generated a set of high-density genome-wide markers together with short sequence reads. The reads, averaging 54 nucleotides, were mapped to the pearl millet genome and the closest genes and annotation information were used to select candidate genes linked to key forage traits. 980 highly polymorphic SNP markers, distributed across the genome, were used to assess population structure and diversity with seven-subgroups identified. A few representative accessions were selected with the objective of distributing subsets of a manageable size for further evaluation. Genome-wide linkage disequilibrium (LD) analyses revealed a fast LD-decay, on average 2.54 kbp, in the combined population with a slower LD-decay in the ILRI collection compared with the EMBRAPA collection, the significance of which is discussed. This initiative generated high-density markers with a good distribution across the genome. The diversity analysis revealed the existence of a substantial amount of variation in the ILRI collection and identified some unique materials from the EMBRAPA collection, demonstrating the potential of the overall population for further genetic and marker-trait-association studies.
Collapse
Affiliation(s)
- Meki S Muktar
- Feed and Forage Development, International Livestock Research Institute, Addis Ababa, Ethiopia
| | - Abel Teshome
- Teagasc
- CELUP Crop Research, Oak Park, Carlow, R93 XE12, Ireland
| | - Jean Hanson
- Feed and Forage Development, International Livestock Research Institute, Addis Ababa, Ethiopia
| | - Alemayehu T Negawo
- Feed and Forage Development, International Livestock Research Institute, Addis Ababa, Ethiopia
| | - Ermias Habte
- Feed and Forage Development, International Livestock Research Institute, Addis Ababa, Ethiopia
| | | | - Ki-Won Lee
- Grassland and Forages Division, National Institute of Animal Science, Rural Development Administration, Cheonan, 31000, Republic of Korea
| | - Chris S Jones
- Feed and Forage Development, International Livestock Research Institute, Nairobi, Kenya.
| |
Collapse
|
13
|
Li J, Levitan B, Gomez-Jimenez S, Kültz D. Development of a Gill Assay Library for Ecological Proteomics of Threespine Sticklebacks ( Gasterosteus aculeatus). Mol Cell Proteomics 2018; 17:2146-2163. [PMID: 30093419 PMCID: PMC6210217 DOI: 10.1074/mcp.ra118.000973] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Revised: 08/07/2018] [Indexed: 12/31/2022] Open
Abstract
A data-independent acquisition (DIA) assay library for quantitative analyses of proteome dynamics has been developed for gills of threespine sticklebacks (Gasterosteus aculeatus). A raw spectral library was generated by data-dependent acquisition (DDA) and annotation of tryptic peptides to MSMS spectra and protein database identifiers. The assay library was constructed from the raw spectral library by removal of low-quality, ambiguous, and low-signal peptides. Only unique proteins represented by at least two peptides are included in the assay library, which consists of 1506 proteins, 5074 peptides, 5104 precursors, and 25,322 transitions. This assay library was used with DIA data to identify biochemical differences in gill proteomes of four populations representing different eco- and morpho-types of threespine sticklebacks. The assay library revealed unique and reproducible proteome signatures. Warm-adapted, low-plated, brackish-water fish from Laguna de la Bocana del Rosario (Mexico) show elevated HSP47, extracellular matrix, and innate immunity proteins whereas several immunoglobulins, interferon-induced proteins, ubiquitins, proteolytic enzymes, and nucleic acid remodeling proteins are reduced. Fully-plated, brackish-water fish from Westchester Lagoon (Alaska) display elevated ion regulation, GTPase signaling, and contractile cytoskeleton proteins, altered abundances of many ribosomal, calcium signaling and immunity proteins, and depleted transcriptional regulators and metabolic enzymes. Low-plated freshwater fish from Lake Solano (California) have elevated inflammasomes and proteolytic proteins whereas several iron containing and ion regulatory proteins are reduced. Gills of fully-plated, marine fish from Bodega Harbor (California) have elevated oxidative metabolism enzymes and reduced transglutaminase 2, collagens, and clathrin heavy chains. These distinct proteome signatures represent targets for testing ecological and evolutionary influences on molecular mechanisms of gill function in threespine sticklebacks. Furthermore, the gill assay library represents a model for other tissues and paves the way for accurate and reproducible network analyses of environmental context-dependent proteome dynamics in complex organisms.
Collapse
Affiliation(s)
- Johnathon Li
- From the ‡Department of Animal Sciences, University of California Davis, Meyer Hall, One Shields Ave., Davis, CA 95616
| | - Bryn Levitan
- From the ‡Department of Animal Sciences, University of California Davis, Meyer Hall, One Shields Ave., Davis, CA 95616
| | - Silvia Gomez-Jimenez
- §Centro de Investigación en Alimentación y Desarrollo, Carretera a la Victoria Km. 0.6, Apartado, Hermosillo, Sonora, México C.P. 83000
| | - Dietmar Kültz
- From the ‡Department of Animal Sciences, University of California Davis, Meyer Hall, One Shields Ave., Davis, CA 95616;
- ¶Coastal Marine Sciences Institute, University of California, Davis
| |
Collapse
|
14
|
Pirih N, Kunej T. An Updated Taxonomy and a Graphical Summary Tool for Optimal Classification and Comprehension of Omics Research. ACTA ACUST UNITED AC 2018; 22:337-353. [DOI: 10.1089/omi.2017.0186] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Affiliation(s)
- Nina Pirih
- Department of Animal Science, Biotechnical Faculty, University of Ljubljana, Domzale, Slovenia
| | - Tanja Kunej
- Department of Animal Science, Biotechnical Faculty, University of Ljubljana, Domzale, Slovenia
| |
Collapse
|
15
|
Rybina OY, Rozovsky YM, Veselkina ER, Pasyukova EG. Polycomb/Trithorax group-dependent regulation of the neuronal gene Lim3 involved in Drosophila lifespan control. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2018; 1861:451-462. [PMID: 29555581 DOI: 10.1016/j.bbagrm.2018.03.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2017] [Revised: 02/18/2018] [Accepted: 03/14/2018] [Indexed: 12/14/2022]
Abstract
Molecular mechanisms governing gene expression and defining complex phenotypes are central to understanding the basics of development and aging. Here, we demonstrate that naturally occurring polymorphisms of the Lim3 regulatory region that are associated with variation in gene expression and Drosophila lifespan control are located exclusively in the Polycomb response element (PRE). We find that the Polycomb group (PcG) protein Polycomb (PC) is bound to the PRE only in embryos where Lim3 is present in both repressed and active states. In contrast, the Trithorax group (TrxG) protein absent, small, or homeotic discs 1 (ASH1) is bound downstream of the PRE, to a region adjacent to the Lim3 transcription start site in embryos and adult flies, in which Lim3 is in an active state. Furthermore, mutations in Pc and ash1 genes affect Lim3 expression depending on the structural integrity of the Lim3 PRE, thus confirming functional interactions between these proteins and Lim3 regulatory region. In addition, we demonstrate that the evolutionary conserved Lim3 core promoter provides basic Lim3 expression, whereas structural changes in the Lim3 PRE of distal promoter provide stage-, and tissue-specific Lim3 expression. Therefore, we hypothesize that PcG/TrxG proteins, which are directly involved in Lim3 transcription regulation, participate in lifespan control.
Collapse
Affiliation(s)
- Olga Y Rybina
- Institute of Molecular Genetics of Russian Academy of Sciences, Kurtchatov Sq. 2, Moscow 123182, Russia; Federal State-Financed Educational Institution of Higher Professional Education, Moscow State Pedagogical University, M. Pirogovskaya Str. 1/1, Moscow 119991, Russia.
| | - Yakov M Rozovsky
- Institute of Molecular Genetics of Russian Academy of Sciences, Kurtchatov Sq. 2, Moscow 123182, Russia
| | - Ekaterina R Veselkina
- Institute of Molecular Genetics of Russian Academy of Sciences, Kurtchatov Sq. 2, Moscow 123182, Russia
| | - Elena G Pasyukova
- Institute of Molecular Genetics of Russian Academy of Sciences, Kurtchatov Sq. 2, Moscow 123182, Russia
| |
Collapse
|
16
|
Bourgard C, Albrecht L, Kayano ACAV, Sunnerhagen P, Costa FTM. Plasmodium vivax Biology: Insights Provided by Genomics, Transcriptomics and Proteomics. Front Cell Infect Microbiol 2018; 8:34. [PMID: 29473024 PMCID: PMC5809496 DOI: 10.3389/fcimb.2018.00034] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2017] [Accepted: 01/25/2018] [Indexed: 12/17/2022] Open
Abstract
During the last decade, the vast omics field has revolutionized biological research, especially the genomics, transcriptomics and proteomics branches, as technological tools become available to the field researcher and allow difficult question-driven studies to be addressed. Parasitology has greatly benefited from next generation sequencing (NGS) projects, which have resulted in a broadened comprehension of basic parasite molecular biology, ecology and epidemiology. Malariology is one example where application of this technology has greatly contributed to a better understanding of Plasmodium spp. biology and host-parasite interactions. Among the several parasite species that cause human malaria, the neglected Plasmodium vivax presents great research challenges, as in vitro culturing is not yet feasible and functional assays are heavily limited. Therefore, there are gaps in our P. vivax biology knowledge that affect decisions for control policies aiming to eradicate vivax malaria in the near future. In this review, we provide a snapshot of key discoveries already achieved in P. vivax sequencing projects, focusing on developments, hurdles, and limitations currently faced by the research community, as well as perspectives on future vivax malaria research.
Collapse
Affiliation(s)
- Catarina Bourgard
- Laboratory of Tropical Diseases, Department of Genetics, Evolution, Microbiology and Immunology, University of Campinas - UNICAMP, Campinas, Brazil
| | - Letusa Albrecht
- Laboratory of Tropical Diseases, Department of Genetics, Evolution, Microbiology and Immunology, University of Campinas - UNICAMP, Campinas, Brazil.,Laboratory of Regulation of Gene Expression, Instituto Carlos Chagas, Curitiba, Brazil
| | - Ana C A V Kayano
- Laboratory of Tropical Diseases, Department of Genetics, Evolution, Microbiology and Immunology, University of Campinas - UNICAMP, Campinas, Brazil
| | - Per Sunnerhagen
- Department of Chemistry and Molecular Biology, University of Gothenburg, Gothenburg, Sweden
| | - Fabio T M Costa
- Laboratory of Tropical Diseases, Department of Genetics, Evolution, Microbiology and Immunology, University of Campinas - UNICAMP, Campinas, Brazil
| |
Collapse
|
17
|
Lee C, Moroldo M, Perdomo-Sabogal A, Mach N, Marthey S, Lecardonnel J, Wahlberg P, Chong AY, Estellé J, Ho SYW, Rogel-Gaillard C, Gongora J. Inferring the evolution of the major histocompatibility complex of wild pigs and peccaries using hybridisation DNA capture-based sequencing. Immunogenetics 2017; 70:401-417. [PMID: 29256177 DOI: 10.1007/s00251-017-1048-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Accepted: 11/25/2017] [Indexed: 12/20/2022]
Abstract
The major histocompatibility complex (MHC) is a key genomic model region for understanding the evolution of gene families and the co-evolution between host and pathogen. To date, MHC studies have mostly focused on species from major vertebrate lineages. The evolution of MHC classical (Ia) and non-classical (Ib) genes in pigs has attracted interest because of their antigen presentation roles as part of the adaptive immune system. The pig family Suidae comprises over 18 extant species (mostly wild), but only the domestic pig has been extensively sequenced and annotated. To address this, we used a DNA-capture approach, with probes designed from the domestic pig genome, to generate MHC data for 11 wild species of pigs and their closest living family, Tayassuidae. The approach showed good efficiency for wild pigs (~80% reads mapped, ~87× coverage), compared to tayassuids (~12% reads mapped, ~4× coverage). We retrieved 145 MHC loci across both families. Phylogenetic analyses show that the class Ia and Ib genes underwent multiple duplications and diversifications before suids and tayassuids diverged from their common ancestor. The histocompatibility genes mostly form orthologous groups and there is genetic differentiation for most of these genes between Eurasian and sub-Saharan African wild pigs. Tests of selection showed that the peptide-binding region of class Ib genes was under positive selection. These findings contribute to better understanding of the evolutionary history of the MHC, specifically, the class I genes, and provide useful data for investigating the immune response of wild populations against pathogens.
Collapse
Affiliation(s)
- Carol Lee
- Sydney School of Veterinary Science, Faculty of Science, The University of Sydney, Sydney, Australia
| | - Marco Moroldo
- GABI, INRA, AgroParisTech, Université Paris-Saclay, Jouy-en-Josas, France
| | - Alvaro Perdomo-Sabogal
- Sydney School of Veterinary Science, Faculty of Science, The University of Sydney, Sydney, Australia.,Institute of Animal Science (460i), Department of Bioinformatics, University of Hohenheim, Stuttgart, Germany
| | - Núria Mach
- GABI, INRA, AgroParisTech, Université Paris-Saclay, Jouy-en-Josas, France
| | - Sylvain Marthey
- GABI, INRA, AgroParisTech, Université Paris-Saclay, Jouy-en-Josas, France
| | - Jérôme Lecardonnel
- GABI, INRA, AgroParisTech, Université Paris-Saclay, Jouy-en-Josas, France
| | - Per Wahlberg
- GABI, INRA, AgroParisTech, Université Paris-Saclay, Jouy-en-Josas, France
| | - Amanda Y Chong
- Sydney School of Veterinary Science, Faculty of Science, The University of Sydney, Sydney, Australia.,Earlham Institute, Norwich Research Park, Norwich, UK
| | - Jordi Estellé
- GABI, INRA, AgroParisTech, Université Paris-Saclay, Jouy-en-Josas, France
| | - Simon Y W Ho
- School of Life and Environmental Sciences, Faculty of Science, The University of Sydney, Sydney, Australia
| | | | - Jaime Gongora
- Sydney School of Veterinary Science, Faculty of Science, The University of Sydney, Sydney, Australia.
| |
Collapse
|
18
|
Gupta S, Tewatia P, Misri J, Singh R. Molecular Modeling of Cloned Bacillus subtilis Keratinase and Its Insinuation in Psoriasis Treatment Using Docking Studies. Indian J Microbiol 2017; 57:485-491. [PMID: 29151650 DOI: 10.1007/s12088-017-0677-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2017] [Accepted: 09/19/2017] [Indexed: 12/13/2022] Open
Abstract
Present study demonstrated the expression of cloned Bacillus subtilis RSE163 keratinase gene and in silico binding affinities of deduced protein with psoriasis topical drugs for systemic absorption and permeation through skin. The ker gene expressed in E. coli showed significantly higher keratinase activity 450 ± 10.43 U representing 1342 bp nucleotides encoding 447 amino acids with molecular weight of 46 kDa. The modeled structure was validated using ramachandran's plot showing 305 residues (84.3%) in most favoured region. Docking studies using extra precision (XP) method of Glide showed optimum binding affinities with the drugs Acitretin (- 39.62 kcal/mol), Clobetasol propionate (- 37.90 kcal/mol), Fluticasone (- 38.53 kcal/mol), Desonide (- 32.23 kcal/mol), Anthralin (- 38.04 kcal/mol), Calcipotreine (- 21.55 kcal/mol) and Mometasone (- 28.40 kcal/mol) in comparison to other psoriasis drugs. The results can further be correlated with in vitro enzymatic experiments using keratinase as an effective drug mediator through skin to serve the unmet need of industries.
Collapse
Affiliation(s)
- Sonali Gupta
- Amity Institute of Microbial Biotechnology, Amity University Uttar Pradesh, Sector-125, Noida, U.P. India
| | - Parul Tewatia
- School of Computer Science and Communication, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Jyoti Misri
- Division of Animal Science, Indian Council of Agricultural Research, Krishi Bhavan, New Delhi, 110114 India
| | - Rajni Singh
- Amity Institute of Microbial Biotechnology, Amity University Uttar Pradesh, Sector-125, Noida, U.P. India
| |
Collapse
|
19
|
Analysis of the first Taraxacum kok-saghyz transcriptome reveals potential rubber yield related SNPs. Sci Rep 2017; 7:9939. [PMID: 28855528 PMCID: PMC5577190 DOI: 10.1038/s41598-017-09034-2] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2017] [Accepted: 07/20/2017] [Indexed: 12/16/2022] Open
Abstract
Taraxacum kok-saghyz (TK) is a potential alternative crop for natural rubber (NR) production, due to its high molecular weight rubber, short breeding cycle, and diverse environmental adaptation. However, improvements in rubber yield and agronomically relevant traits are still required before it can become a commercially-viable crop. An RNA-Seq based transcriptome was developed from a pool of roots from genotypes with high and low rubber yield. A total of 55,532 transcripts with lengths over 200 bp were de novo assembled. As many as 472 transcripts were significantly homologous to 49 out of 50 known plant putative rubber biosynthesis related genes. 158 transcripts were significantly differentially expressed between high rubber and low rubber genotypes. 21,036 SNPs were different in high and low rubber TK genotypes. Among these, 50 SNPs were found within 39 transcripts highly homologous to 49 publically-searched rubber biosynthesis related genes. 117 SNPs were located within 36 of the differentially expressed gene sequences. This comprehensive TK transcriptomic reference, and large set of SNPs including putative exonic markers associated with rubber related gene homologues and differentially expressed genes, provides a solid foundation for further genetic dissection of rubber related traits, comparative genomics and marker-assisted selection for the breeding of TK.
Collapse
|
20
|
Saensuk C, Wanchana S, Choowongkomon K, Wongpornchai S, Kraithong T, Imsabai W, Chaichoompu E, Ruanjaichon V, Toojinda T, Vanavichit A, Arikit S. De novo transcriptome assembly and identification of the gene conferring a "pandan-like" aroma in coconut (Cocos nucifera L.). PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2016; 252:324-334. [PMID: 27717469 DOI: 10.1016/j.plantsci.2016.08.014] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/15/2016] [Revised: 08/26/2016] [Accepted: 08/27/2016] [Indexed: 06/06/2023]
Abstract
Thailand's aromatic coconut (Cocos nucifera L.) is a special type of green dwarf coconut, the liquid endosperm of which is characterized by a pleasant "pandan-like" aroma due to the presence of 2-acetyl-1-pyrroline (2AP). The aim of this study was to perform a de novo assembly of transriptome from C. nucifera endosperm and to identify the gene responsible for 2AP biosynthesis. CnAMADH2 was identified as an ortholog of the rice aromatic gene and a G-to-C substitution found in exon 14 was associated with 2AP content in the aromatic green dwarf coconut accessions. The base substitution caused an amino-acid change, alanine-to-proline, at position 442 (P442A). The presence of P at this position might alter the steric conformation at the loop region and subsequently result in an unstabilized dimer conformation that could lower AMADH enzyme activity. Among AMADH/BADH protein sequences in different plant species, the P442A mutation was found exclusively in aromatic coconut. The PCR marker developed based on this sequence variation can perfectly detect the aromatic and non-aromatic alleles of the gene. This study confirms the hypothesis that plants may share a mechanism of 2AP biosynthesis. This is the first identification of the gene associated with 2AP biosynthesis in a tree plant.
Collapse
Affiliation(s)
- Chatree Saensuk
- Rice Science Center, Kasetsart University Kamphaeng Saen Campus, Nakhon Pathom, 73140, Thailand; Interdisciplinary Graduate Program in Genetic Engineering, Kasetsart University, Bangkok, 10900, Thailand
| | - Samart Wanchana
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Khlong Luang, Pathum Thani, 12120, Thailand
| | - Kiattawee Choowongkomon
- Department of Biochemistry, Faculty of Science, Kasetsart University, Bangkok, 10900, Thailand
| | - Sugunya Wongpornchai
- Department of Chemistry, Faculty of Science, Chiang Mai University, Chiang Mai, 50200, Thailand
| | - Tippaya Kraithong
- Chumphon Horticultural Research Center, Department of Agriculture, Bangkok, 10900, Thailand
| | - Wachiraya Imsabai
- Department of Horticulture, Faculty of Agriculture at Kamphaeng Saen, Kasetsart University Kamphaeng Saen Campus, Nakhon Pathom, 73140, Thailand
| | - Ekawat Chaichoompu
- Rice Science Center, Kasetsart University Kamphaeng Saen Campus, Nakhon Pathom, 73140, Thailand
| | - Vinitchan Ruanjaichon
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Khlong Luang, Pathum Thani, 12120, Thailand
| | - Theerayut Toojinda
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Khlong Luang, Pathum Thani, 12120, Thailand
| | - Apichart Vanavichit
- Rice Science Center, Kasetsart University Kamphaeng Saen Campus, Nakhon Pathom, 73140, Thailand; Department of Agronomy, Faculty of Agriculture at Kamphaeng Saen, Kasetsart University Kamphaeng Saen Campus, Nakhon Pathom, 73140, Thailand
| | - Siwaret Arikit
- Rice Science Center, Kasetsart University Kamphaeng Saen Campus, Nakhon Pathom, 73140, Thailand; Department of Agronomy, Faculty of Agriculture at Kamphaeng Saen, Kasetsart University Kamphaeng Saen Campus, Nakhon Pathom, 73140, Thailand.
| |
Collapse
|
21
|
Baek JH, Kim J, Kim CK, Sohn SH, Choi D, Ratnaparkhe MB, Kim DW, Lee TH. MultiSyn: A Webtool for Multiple Synteny Detection and Visualization of User's Sequence of Interest Compared to Public Plant Species. Evol Bioinform Online 2016; 12:193-9. [PMID: 27594782 PMCID: PMC5003123 DOI: 10.4137/ebo.s40009] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2016] [Revised: 07/12/2016] [Accepted: 07/14/2016] [Indexed: 12/19/2022] Open
Abstract
Information on multiple synteny between plants and/or within a plant is key information to understand genome evolution. In addition, visualization of multiple synteny is helpful in interpreting evolution. So far, some web applications have been developed to determine and visualize multiple homology regions at once. However, the applications are not fully convenient for biologists because some of them do not include the function of synteny determination but visualize the multiple synteny plots by allowing users to upload their synteny data by determining the synteny based only on BLAST similarity information, with some algorithms not designed for synteny determination. Here, we introduce a web application that determines and visualizes multiple synteny from two types of files, simplified browser extensible data and protein sequence file by MCScanX algorithm, which have been used in many synteny studies.
Collapse
Affiliation(s)
- Jeong-Ho Baek
- Genomics Division, National Institute of Agricultural Sciences, Jeonju, Korea
| | - Junah Kim
- Genomics Division, National Institute of Agricultural Sciences, Jeonju, Korea
| | - Chang-Kug Kim
- Genomics Division, National Institute of Agricultural Sciences, Jeonju, Korea
| | - Seong-Han Sohn
- Genomics Division, National Institute of Agricultural Sciences, Jeonju, Korea
| | - Dongsu Choi
- Department of Biology, Kunsan National University, Gunsan-si, Jeollabuk-do, Korea
| | - Milind B Ratnaparkhe
- Directorate of Soybean Research, Indian Council of Agriculture Research (ICAR), Indore, Madhya Pradesh, India
| | - Do-Wan Kim
- Genomics Division, National Institute of Agricultural Sciences, Jeonju, Korea
| | - Tae-Ho Lee
- Genomics Division, National Institute of Agricultural Sciences, Jeonju, Korea
| |
Collapse
|
22
|
Franco-Zorrilla JM, Solano R. Identification of plant transcription factor target sequences. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2016; 1860:21-30. [PMID: 27155066 DOI: 10.1016/j.bbagrm.2016.05.001] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2016] [Revised: 05/01/2016] [Accepted: 05/02/2016] [Indexed: 12/15/2022]
Abstract
Regulation of gene expression depends on specific cis-regulatory sequences located in the gene promoter regions. These DNA sequences are recognized by transcription factors (TFs) in a sequence-specific manner, and their identification could help to elucidate the regulatory networks that underlie plant physiological responses to developmental programs or to environmental adaptation. Here we review recent advances in high throughput methodologies for the identification of plant TF binding sites. Several approaches offer a map of the TF binding locations in vivo and of the dynamics of the gene regulatory networks. As an alternative, high throughput in vitro methods provide comprehensive determination of the DNA sequences recognized by TFs. These advances are helping to decipher the regulatory lexicon and to elucidate transcriptional network hierarchies in plants in response to internal or external cues. This article is part of a Special Issue entitled: Plant Gene Regulatory Mechanisms and Networks, edited by Dr. Erich Grotewold and Dr. Nathan Springer.
Collapse
Affiliation(s)
- José M Franco-Zorrilla
- Genomics Unit, Centro Nacional de Biotecnología, Consejo Superior de Investigaciones Científicas, 28049 Madrid, Spain.
| | - Roberto Solano
- Department of Plant Molecular Genetics, Centro Nacional de Biotecnología, Consejo Superior de Investigaciones Científicas, 28049 Madrid, Spain
| |
Collapse
|
23
|
Yao Y, Minor PJ, Zhao YT, Jeong Y, Pani AM, King AN, Symmons O, Gan L, Cardoso WV, Spitz F, Lowe CJ, Epstein DJ. Cis-regulatory architecture of a brain signaling center predates the origin of chordates. Nat Genet 2016; 48:575-80. [PMID: 27064252 PMCID: PMC4848136 DOI: 10.1038/ng.3542] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2015] [Accepted: 03/11/2016] [Indexed: 12/13/2022]
Abstract
Genomic approaches have predicted hundreds of thousands of tissue-specific cis-regulatory sequences, but the determinants critical to their function and evolutionary history are mostly unknown. Here we systematically decode a set of brain enhancers active in the zona limitans intrathalamica (zli), a signaling center essential for vertebrate forebrain development via the secreted morphogen Sonic hedgehog (Shh). We apply a de novo motif analysis tool to identify six position-independent sequence motifs together with their cognate transcription factors that are essential for zli enhancer activity and Shh expression in the mouse embryo. Using knowledge of this regulatory lexicon, we discover new Shh zli enhancers in mice and a functionally equivalent element in hemichordates, indicating an ancient origin of the Shh zli regulatory network that predates the chordate phylum. These findings support a strategy for delineating functionally conserved enhancers in the absence of overt sequence homologies and over extensive evolutionary distances.
Collapse
Affiliation(s)
- Yao Yao
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, 415 Curie Blvd, Clinical Research Building 470, Philadelphia, PA 19104, USA
| | - Paul J. Minor
- Hopkins Marine Station, Department of Biology, Stanford University, 120 Oceanview Blvd. Pacific Grove, CA 93950, USA
| | - Ying-Tao Zhao
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, 415 Curie Blvd, Clinical Research Building 470, Philadelphia, PA 19104, USA
| | - Yongsu Jeong
- Department of Genetic Engineering, College of Life Sciences and Graduate School of Biotechnology, Kyung Hee University, Yongin-si 446-701, Republic of Korea
| | - Ariel M. Pani
- Hopkins Marine Station, Department of Biology, Stanford University, 120 Oceanview Blvd. Pacific Grove, CA 93950, USA
| | - Anna N. King
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, 415 Curie Blvd, Clinical Research Building 470, Philadelphia, PA 19104, USA
| | - Orsolya Symmons
- Developmental Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Lin Gan
- Department of Ophthalmology, University of Rochester Medical Center, Rochester, NY 14642, USA
| | - Wellington V. Cardoso
- Columbia Center for Human Development, Department of Medicine, Pulmonary Allergy Critical Care, Columbia University Medical Center, New York, NY 10032, USA
| | - François Spitz
- Developmental Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Christopher J. Lowe
- Hopkins Marine Station, Department of Biology, Stanford University, 120 Oceanview Blvd. Pacific Grove, CA 93950, USA
| | - Douglas J. Epstein
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, 415 Curie Blvd, Clinical Research Building 470, Philadelphia, PA 19104, USA
| |
Collapse
|
24
|
Thompson D, Regev A, Roy S. Comparative analysis of gene regulatory networks: from network reconstruction to evolution. Annu Rev Cell Dev Biol 2015; 31:399-428. [PMID: 26355593 DOI: 10.1146/annurev-cellbio-100913-012908] [Citation(s) in RCA: 95] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Regulation of gene expression is central to many biological processes. Although reconstruction of regulatory circuits from genomic data alone is therefore desirable, this remains a major computational challenge. Comparative approaches that examine the conservation and divergence of circuits and their components across strains and species can help reconstruct circuits as well as provide insights into the evolution of gene regulatory processes and their adaptive contribution. In recent years, advances in genomic and computational tools have led to a wealth of methods for such analysis at the sequence, expression, pathway, module, and entire network level. Here, we review computational methods developed to study transcriptional regulatory networks using comparative genomics, from sequence to functional data. We highlight how these methods use evolutionary conservation and divergence to reliably detect regulatory components as well as estimate the extent and rate of divergence. Finally, we discuss the promise and open challenges in linking regulatory divergence to phenotypic divergence and adaptation.
Collapse
Affiliation(s)
- Dawn Thompson
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142
| | | | | |
Collapse
|
25
|
Kaimala S, Kumar S. An evolutionarily conserved non-coding element in casein locus acts as transcriptional repressor. Gene 2015; 554:75-80. [PMID: 25455101 DOI: 10.1016/j.gene.2014.10.026] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2014] [Revised: 10/07/2014] [Accepted: 10/10/2014] [Indexed: 12/31/2022]
Abstract
In mammals, the casein locus consists of stretches of non-coding DNA, the functions of most of which are unknown. These regions are believed to harbour elements responsible for spatio-temporally regulated expression of genes in this locus and so far, only a few such elements have been identified. In this study, we report a novel regulatory element in the casein locus. Comparative analysis of genomic DNA sequences of casein loci from different mammals identified a 147bp long evolutionarily conserved region (ECR) upstream of Odam, a gene in this locus. The ECR was found in close proximity of Odam gene in all the mammals examined. In-silico analysis predicted the ECR as a potential regulatory element. Functional analysis in different cell lines identified it as a unidirectional repressor element. From our findings we speculate that the ECR may be involved in the repression of the Odam expression in the mammary gland during lactation.
Collapse
Affiliation(s)
- Suneesh Kaimala
- Centre for Cellular and Molecular Biology, Council of Scientific and Industrial Research (CSIR), Hyderabad, India.
| | - Satish Kumar
- Centre for Cellular and Molecular Biology, Council of Scientific and Industrial Research (CSIR), Hyderabad, India.
| |
Collapse
|
26
|
Gutierrez-Triana JA, Herget U, Lichtner P, Castillo-Ramírez LA, Ryu S. A vertebrate-conserved cis-regulatory module for targeted expression in the main hypothalamic regulatory region for the stress response. BMC DEVELOPMENTAL BIOLOGY 2014; 14:41. [PMID: 25427861 PMCID: PMC4248439 DOI: 10.1186/s12861-014-0041-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2014] [Accepted: 11/11/2014] [Indexed: 01/30/2023]
Abstract
Background The homeodomain transcription factor orthopedia (Otp) is an evolutionarily conserved regulator of neuronal fates. In vertebrates, Otp is necessary for the proper development of different regions of the brain and is required in the diencephalon to specify several hypothalamic cell types, including the cells that control the stress response. To understand how this widely expressed transcription factor accomplishes hypothalamus-specific functions, we performed a comprehensive screening of otp cis-regulatory regions in zebrafish. Results Here, we report the identification of an evolutionarily conserved vertebrate enhancer module with activity in a restricted area of the forebrain, which includes the region of the hypothalamus that controls the stress response. This region includes neurosecretory cells producing Corticotropin-releasing hormone (Crh), Oxytocin (Oxt) and Arginine vasopressin (Avp), which are key components of the stress axis. Lastly, expression of the bacterial nitroreductase gene under this specific enhancer allowed pharmacological attenuation of the stress response in zebrafish larvae. Conclusion Vertebrates share many cellular and molecular components of the stress response and our work identified a striking conservation at the cis-regulatory level of a key hypothalamic developmental gene. In addition, this enhancer provides a useful tool to manipulate and visualize stress-regulatory hypothalamic cells in vivo with the long-term goal of understanding the ontogeny of the stress axis in vertebrates. Electronic supplementary material The online version of this article (doi:10.1186/s12861-014-0041-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jose Arturo Gutierrez-Triana
- Developmental Genetics of the Nervous System, Max Planck Institute for Medical Research, Jahnstrasse 29, D-69120, Heidelberg, Germany. .,Current address: Centre for Organismal Studies (COS), University of Heidelberg, Im Neuenheimer Feld 230, D-69120, Heidelberg, Germany.
| | - Ulrich Herget
- Developmental Genetics of the Nervous System, Max Planck Institute for Medical Research, Jahnstrasse 29, D-69120, Heidelberg, Germany. .,The Hartmut Hoffmann-Berling International Graduate School of Molecular and Cellular Biology, University of Heidelberg, Heidelberg, Germany.
| | - Patrick Lichtner
- Developmental Genetics of the Nervous System, Max Planck Institute for Medical Research, Jahnstrasse 29, D-69120, Heidelberg, Germany.
| | - Luis A Castillo-Ramírez
- Developmental Genetics of the Nervous System, Max Planck Institute for Medical Research, Jahnstrasse 29, D-69120, Heidelberg, Germany. .,The Hartmut Hoffmann-Berling International Graduate School of Molecular and Cellular Biology, University of Heidelberg, Heidelberg, Germany.
| | - Soojin Ryu
- Developmental Genetics of the Nervous System, Max Planck Institute for Medical Research, Jahnstrasse 29, D-69120, Heidelberg, Germany.
| |
Collapse
|
27
|
Lashermes P, Combes MC, Hueber Y, Severac D, Dereeper A. Genome rearrangements derived from homoeologous recombination following allopolyploidy speciation in coffee. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2014; 78:674-85. [PMID: 24628823 DOI: 10.1111/tpj.12505] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Revised: 02/26/2014] [Accepted: 03/04/2014] [Indexed: 05/27/2023]
Abstract
Allopolyploidization is widespread and has played a major role in flowering plant diversification. Genomic changes are common consequences of allopolyploidization, but their mechanisms of occurrence and dynamics over time are still poorly understood. Coffea arabica, a recently formed allotetraploid, was chosen as a model to investigate genetic changes in allopolyploid using an approach that exploits next-generation sequencing technologies. Genes affected by putative homoeolog loss were inferred by comparing the numbers of single-nucleotide polymorphisms detected using RNA-seq in individual accessions of C. arabica, and between accessions of its two diploid progenitor species for common sequence positions. Their physical locations were investigated and clusters of genes exhibiting homoeolog loss were identified. To validate these results, genome sequencing data were generated from one accession of C. arabica and further analyzed. Genomic rearrangements involving homoeologous exchanges appear to occur in C. arabica and to be a major source of genetic diversity. At least 5% of the C. arabica genes were inferred to have undergone homoeolog loss. The detection of a large number of homoeologous exchange events (HEEs) shared by all accessions of C. arabica strongly reinforces the assumption of a single allopolyploidization event. Furthermore, HEEs were specific to one or a few accessions, suggesting that HEE accumulates gradually. Our results provide evidence for the important role of HEE in allopolyploid genome evolution.
Collapse
Affiliation(s)
- Philippe Lashermes
- IRD, UMR RPB (IRD, CIRAD, Université Montpellier II), 911 avenue Agropolis, BP 64501, 34394, Montpellier Cedex 5, France
| | | | | | | | | |
Collapse
|
28
|
Abstract
DIALIGN is a software tool for multiple sequence alignment by combining global and local alignment features. It composes multiple alignments from local pairwise sequence similarities. This approach is particularly useful to discover conserved functional regions in sequences that share only local homologies but are otherwise unrelated. An anchoring option allows to use external information and expert knowledge in addition to primary-sequence similarity alone. The latest version of DIALIGN optionally uses matches to the PFAM database to detect weak homologies. Various versions of the program are available through Göttingen Bioinformatics Compute Server (GOBICS) at http://www.gobics.de/department/software.
Collapse
|
29
|
Song X, Wang Y, Tang Y. Rapid diversification of FoxP2 in teleosts through gene duplication in the teleost-specific whole genome duplication event. PLoS One 2013; 8:e83858. [PMID: 24349554 PMCID: PMC3857310 DOI: 10.1371/journal.pone.0083858] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2013] [Accepted: 11/18/2013] [Indexed: 12/26/2022] Open
Abstract
As one of the most conserved genes in vertebrates, FoxP2 is widely involved in a number of important physiological and developmental processes. We systematically studied the evolutionary history and functional adaptations of FoxP2 in teleosts. The duplicated FoxP2 genes (FoxP2a and FoxP2b), which were identified in teleosts using synteny and paralogon analysis on genome databases of eight organisms, were probably generated in the teleost-specific whole genome duplication event. A credible classification with FoxP2, FoxP2a and FoxP2b in phylogenetic reconstructions confirmed the teleost-specific FoxP2 duplication. The unavailability of FoxP2b in Danio rerio suggests that the gene was deleted through nonfunctionalization of the redundant copy after the Otocephala-Euteleostei split. Heterogeneity in evolutionary rates among clusters consisting of FoxP2 in Sarcopterygii (Cluster 1), FoxP2a in Teleostei (Cluster 2) and FoxP2b in Teleostei (Cluster 3), particularly between Clusters 2 and 3, reveals asymmetric functional divergence after the gene duplication. Hierarchical cluster analyses of hydrophobicity profiles demonstrated significant structural divergence among the three clusters with verification of subsequent stepwise discriminant analysis, in which FoxP2 of Leucoraja erinacea and Lepisosteus oculatus were classified into Cluster 1, whereas FoxP2b of Salmo salar was grouped into Cluster 2 rather than Cluster 3. The simulated thermodynamic stability variations of the forkhead box domain (monomer and homodimer) showed remarkable divergence in FoxP2, FoxP2a and FoxP2b clusters. Relaxed purifying selection and positive Darwinian selection probably were complementary driving forces for the accelerated evolution of FoxP2 in ray-finned fishes, especially for the adaptive evolution of FoxP2a and FoxP2b in teleosts subsequent to the teleost-specific gene duplication.
Collapse
Affiliation(s)
- Xiaowei Song
- Department of Herpetology, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, Sichuan, China
- College of Life Science, Sichuan University, Chengdu, Sichuan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yajun Wang
- College of Life Science, Sichuan University, Chengdu, Sichuan, China
- * (YW); (YT)
| | - Yezhong Tang
- Department of Herpetology, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, Sichuan, China
- * (YW); (YT)
| |
Collapse
|
30
|
Rieswijk L, Lizarraga D, Brauers KJJ, Kleinjans JCS, van Delft JHM. Characterisation of cisplatin-induced transcriptomics responses in primary mouse hepatocytes, HepG2 cells and mouse embryonic stem cells shows conservation of regulating transcription factor networks. Mutagenesis 2013; 29:17-26. [DOI: 10.1093/mutage/get055] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
31
|
Identification of putative ortholog gene blocks involved in gestant and lactating mammary gland development: a rodent cross-species microarray transcriptomics approach. Int J Genomics 2013; 2013:624681. [PMID: 24288657 PMCID: PMC3830774 DOI: 10.1155/2013/624681] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2013] [Revised: 09/03/2013] [Accepted: 09/04/2013] [Indexed: 01/23/2023] Open
Abstract
The mammary gland (MG) undergoes functional and metabolic changes during the transition from pregnancy to lactation, possibly by regulation of conserved genes. The objective was to elucidate orthologous genes, chromosome clusters and putative conserved transcriptional modules during MG development. We analyzed expression of 22,000 transcripts using murine microarrays and RNA samples of MG from virgin, pregnant, and lactating rats by cross-species hybridization. We identified 521 transcripts differentially expressed; upregulated in early (78%) and midpregnancy (89%) and early lactation (64%), but downregulated in mid-lactation (61%). Putative orthologous genes were identified. We mapped the altered genes to orthologous chromosomal locations in human and mouse. Eighteen sets of conserved genes associated with key cellular functions were revealed and conserved transcription factor binding site search entailed possible coregulation among all eight block sets of genes. This study demonstrates that the use of heterologous array hybridization for screening of orthologous gene expression from rat revealed sets of conserved genes arranged in chromosomal order implicated in signaling pathways and functional ontology. Results demonstrate the utilization power of comparative genomics and prove the feasibility of using rodent microarrays to identification of putative coexpressed orthologous genes involved in the control of human mammary gland development.
Collapse
|
32
|
Hasan S, Wheat CW. CEPiNS: Conserved Exon Prediction in Novel Species. Bioinformation 2013; 9:210-1. [PMID: 23519394 PMCID: PMC3602892 DOI: 10.6026/97320630009210] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2013] [Accepted: 01/29/2013] [Indexed: 11/23/2022] Open
Abstract
Exon structure is relatively well conserved among orthologs in several large clades of species (e.g. Mammalia, Diptera, Lepidoptera) across evolutionary distances of up to 80 million years. Thus, it should be straightforward to predict the exon structures in novel species based upon the known exon structures of species that have had their genomes sequenced and well assembled. Being able to predict the exon boundaries in the genes of novel species is important given the quickly growing numbers of transcriptome sequencing projects. CEPiNS is a new pipeline for mining exon boundaries of predicted gene sets from model species and then using this information to identify the exon boundaries in a novel species through codon based alignment. The pipeline uses the freeware SPIDEY, an exon boundary prediction tool, and BLAST (BLASTN, BLASTP, TBLASTX), both of which are part of NCBI's toolkit. CEPiNS provides an important tool to analyze the transcriptome of novel species.
Collapse
Affiliation(s)
- Shihab Hasan
- Bioinformatics, Department of Information Technology, 20014, University of Turku, Finland
- Department of Biological and Environmental Sciences, PL 65, Viikinkaari 1, 00014 University of Helsinki, Finland
| | - Christopher W Wheat
- Department of Biological and Environmental Sciences, PL 65, Viikinkaari 1, 00014 University of Helsinki, Finland
- Centre for Ecology and Conservation, School of Biosciences, University of Exeter, Cornwall Campus, Penryn, Cornwall TR10 9EZ, United Kingdom
- Department of Zoology, Stockholm University, SE-106 91 Stockholm, Sweden
| |
Collapse
|
33
|
Baxter L, Jironkin A, Hickman R, Moore J, Barrington C, Krusche P, Dyer NP, Buchanan-Wollaston V, Tiskin A, Beynon J, Denby K, Ott S. Conserved noncoding sequences highlight shared components of regulatory networks in dicotyledonous plants. THE PLANT CELL 2012; 24:3949-65. [PMID: 23110901 PMCID: PMC3517229 DOI: 10.1105/tpc.112.103010] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Conserved noncoding sequences (CNSs) in DNA are reliable pointers to regulatory elements controlling gene expression. Using a comparative genomics approach with four dicotyledonous plant species (Arabidopsis thaliana, papaya [Carica papaya], poplar [Populus trichocarpa], and grape [Vitis vinifera]), we detected hundreds of CNSs upstream of Arabidopsis genes. Distinct positioning, length, and enrichment for transcription factor binding sites suggest these CNSs play a functional role in transcriptional regulation. The enrichment of transcription factors within the set of genes associated with CNS is consistent with the hypothesis that together they form part of a conserved transcriptional network whose function is to regulate other transcription factors and control development. We identified a set of promoters where regulatory mechanisms are likely to be shared between the model organism Arabidopsis and other dicots, providing areas of focus for further research.
Collapse
Affiliation(s)
- Laura Baxter
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Aleksey Jironkin
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Richard Hickman
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Jay Moore
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
| | | | - Peter Krusche
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Nigel P. Dyer
- Molecular Organisation and Assembly in Cells Doctoral Training Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Vicky Buchanan-Wollaston
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Alexander Tiskin
- Department of Computer Science, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Jim Beynon
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Katherine Denby
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Sascha Ott
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- Address correspondence to
| |
Collapse
|
34
|
Revanna KV, Munro D, Gao A, Chiu CC, Pathak A, Dong Q. A web-based multi-genome synteny viewer for customized data. BMC Bioinformatics 2012; 13:190. [PMID: 22856879 PMCID: PMC3430549 DOI: 10.1186/1471-2105-13-190] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2012] [Accepted: 07/18/2012] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Web-based synteny visualization tools are important for sharing data and revealing patterns of complicated genome conservation and rearrangements. Such tools should allow biologists to upload genomic data for their own analysis. This requirement is critical because individual biologists are generating large amounts of genomic sequences that quickly overwhelm any centralized web resources to collect and display all those data. Recently, we published a web-based synteny viewer, GSV, which was designed to satisfy the above requirement. However, GSV can only compare two genomes at a given time. Extending the functionality of GSV to visualize multiple genomes is important to meet the increasing demand of the research community. RESULTS We have developed a multi-Genome Synteny Viewer (mGSV). Similar to GSV, mGSV is a web-based tool that allows users to upload their own genomic data files for visualization. Multiple genomes can be presented in a single integrated view with an enhanced user interface. Users can navigate through all the selected genomes in either pairwise or multiple viewing mode to examine conserved genomic regions as well as the accompanying genome annotations. Besides serving users who manually interact with the web server, mGSV also provides Web Services for machine-to-machine communication to accept data sent by other remote resources. The entire mGSV package can also be downloaded for easy local installation. CONCLUSIONS mGSV significantly enhances the original functionalities of GSV. A web server hosting mGSV is provided at http://cas-bioinfo.cas.unt.edu/mgsv.
Collapse
Affiliation(s)
- Kashi V Revanna
- Department of Biological Sciences, University of North Texas, Denton, Texas, 76203, USA
| | - Daniel Munro
- Department of Biological Sciences, University of North Texas, Denton, Texas, 76203, USA
| | - Alvin Gao
- The Texas Academy of Mathematics and Science, University of North Texas, Denton, Texas, 76203, USA
| | - Chi-Chen Chiu
- Department of Computer Science and Engineering, University of North Texas, Denton, Texas, 76203, USA
| | - Anil Pathak
- Department of Computer Science and Engineering, University of North Texas, Denton, Texas, 76203, USA
| | - Qunfeng Dong
- Department of Biological Sciences, University of North Texas, Denton, Texas, 76203, USA
- Department of Computer Science and Engineering, University of North Texas, Denton, Texas, 76203, USA
| |
Collapse
|
35
|
Abstract
Differential gene expression is the fundamental mechanism underlying animal development and cell differentiation. However, it is a challenge to identify comprehensively and accurately the DNA sequences that are required to regulate gene expression: namely, cis-regulatory modules (CRMs). Three major features, either singly or in combination, are used to predict CRMs: clusters of transcription factor binding site motifs, non-coding DNA that is under evolutionary constraint and biochemical marks associated with CRMs, such as histone modifications and protein occupancy. The validation rates for predictions indicate that identifying diagnostic biochemical marks is the most reliable method, and understanding is enhanced by the analysis of motifs and conservation patterns within those predicted CRMs.
Collapse
|
36
|
Abstract
In the present study, a comprehensive analysis of the arginase gene family in metazoans was performed. A total of 126 arginase genes have been identified in 44 species. Phylogenetic analyses indicate that arginase genes consist of four groups. Conservative and divergent gene structures are found among the groups. The syntenies also exist in distantly related genomes among multiple species. Adaptive evolution shows that, while purifying selection may have been the main force driving the evolution of the arginases, some of critical sites responsible for the functional divergence may have been under positive selection. Overall, the data obtained from our investigation contribute to a better understanding of the complexity of the arginase gene family and of the function and evolution of this family in metazoans.
Collapse
Affiliation(s)
- Jun Cao
- Institute of Life Science, Jiangsu University, Zhenjiang, Jiangsu, 212013, P.R. China.
| | | |
Collapse
|
37
|
Ihuegbu NE, Stormo GD, Buhler J. Fast, sensitive discovery of conserved genome-wide motifs. J Comput Biol 2012; 19:139-47. [PMID: 22300316 DOI: 10.1089/cmb.2011.0249] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Regulatory sites that control gene expression are essential to the proper functioning of cells, and identifying them is critical for modeling regulatory networks. We have developed Magma (Multiple Aligner of Genomic Multiple Alignments), a software tool for multiple species, multiple gene motif discovery. Magma identifies putative regulatory sites that are conserved across multiple species and occur near multiple genes throughout a reference genome. Magma takes as input multiple alignments that can include gaps. It uses efficient clustering methods that make it about 70 times faster than PhyloNet, a previous program for this task, with slightly greater sensitivity. We ran Magma on all non-coding DNA conserved between Caenorhabditis elegans and five additional species, about 70 Mbp in total, in <4 h. We obtained 2,309 motifs with lengths of 6-20 bp, each occurring at least 10 times throughout the genome, which collectively covered about 566 kbp of the genomes, approximately 0.8% of the input. Predicted sites occurred in all types of non-coding sequence but were especially enriched in the promoter regions. Comparisons to several experimental datasets show that Magma motifs correspond to a variety of known regulatory motifs.
Collapse
Affiliation(s)
- Nnamdi E Ihuegbu
- Department of Genetics, Washington University School of Medicine, Saint Louis, Missouri 63108, USA
| | | | | |
Collapse
|
38
|
Chatterjee S, Lufkin T. Regulatory genomics: Insights from the zebrafish. CURRENT TOPICS IN GENETICS 2012; 5:1-10. [PMID: 23440612 PMCID: PMC3577074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
The sequencing of many vertebrate species over the last decade has opened up the possibility of using comparative genomics as a powerful tool to elucidate regulatory elements in the vertebrate genome. The zebrafish has played a pivotal role in this process. Its genome has been used in large-scale genome comparisons to locate vertebrate specific regulatory elements and also it has been an excellent model system to test out the predicted DNA sequences for their ability to drive reporter gene expression in vivo. In spite of all the successes there have still been some issues in using the zebrafish as a model system for these kinds of assays. This review will shed some light on the successes and failures of the zebrafish in pushing forward regulatory genomics.
Collapse
Affiliation(s)
- Sumantra Chatterjee
- Stem Cell and Developmental Biology, Genome Institute of Singapore, 60 Biopolis Street, 138672, Singapore
| | | |
Collapse
|
39
|
Bessa J, Gómez-Skarmeta JL. Making reporter gene constructs to analyze cis-regulatory elements. Methods Mol Biol 2012; 772:397-408. [PMID: 22065451 DOI: 10.1007/978-1-61779-228-1_23] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Cis-regulatory sequences control when, where, and how much genes are transcribed. A better understanding on these elements is a fundamental keystone to better understand development, cell differentiation, and morphogenesis. Several methods based on in silico analysis or ChIP-seq experiments have been developed to detect cis-acting sequences. Here, we describe a protocol to isolate such sequences from genomic DNA and to clone them into expression vectors for functional assays using the Gateway cloning technology.
Collapse
Affiliation(s)
- José Bessa
- Centro Andaluz de Biología del Desarrollo (CABD), CSIC-Universidad Pablo de Olavide, Seville, Spain.
| | | |
Collapse
|
40
|
Use of comparative genomics approaches to characterize interspecies differences in response to environmental chemicals: challenges, opportunities, and research needs. Toxicol Appl Pharmacol 2011; 271:372-85. [PMID: 22142766 DOI: 10.1016/j.taap.2011.11.011] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2011] [Revised: 11/11/2011] [Accepted: 11/16/2011] [Indexed: 01/12/2023]
Abstract
A critical challenge for environmental chemical risk assessment is the characterization and reduction of uncertainties introduced when extrapolating inferences from one species to another. The purpose of this article is to explore the challenges, opportunities, and research needs surrounding the issue of how genomics data and computational and systems level approaches can be applied to inform differences in response to environmental chemical exposure across species. We propose that the data, tools, and evolutionary framework of comparative genomics be adapted to inform interspecies differences in chemical mechanisms of action. We compare and contrast existing approaches, from disciplines as varied as evolutionary biology, systems biology, mathematics, and computer science, that can be used, modified, and combined in new ways to discover and characterize interspecies differences in chemical mechanism of action which, in turn, can be explored for application to risk assessment. We consider how genetic, protein, pathway, and network information can be interrogated from an evolutionary biology perspective to effectively characterize variations in biological processes of toxicological relevance among organisms. We conclude that comparative genomics approaches show promise for characterizing interspecies differences in mechanisms of action, and further, for improving our understanding of the uncertainties inherent in extrapolating inferences across species in both ecological and human health risk assessment. To achieve long-term relevance and consistent use in environmental chemical risk assessment, improved bioinformatics tools, computational methods robust to data gaps, and quantitative approaches for conducting extrapolations across species are critically needed. Specific areas ripe for research to address these needs are recommended.
Collapse
|
41
|
Orth JD, Conrad TM, Na J, Lerman JA, Nam H, Feist AM, Palsson BØ. A comprehensive genome-scale reconstruction of Escherichia coli metabolism--2011. Mol Syst Biol 2011; 7:535. [PMID: 21988831 PMCID: PMC3261703 DOI: 10.1038/msb.2011.65] [Citation(s) in RCA: 692] [Impact Index Per Article: 53.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2011] [Accepted: 08/17/2011] [Indexed: 12/16/2022] Open
Abstract
The initial genome-scale reconstruction of the metabolic network of Escherichia coli K-12 MG1655 was assembled in 2000. It has been updated and periodically released since then based on new and curated genomic and biochemical knowledge. An update has now been built, named iJO1366, which accounts for 1366 genes, 2251 metabolic reactions, and 1136 unique metabolites. iJO1366 was (1) updated in part using a new experimental screen of 1075 gene knockout strains, illuminating cases where alternative pathways and isozymes are yet to be discovered, (2) compared with its predecessor and to experimental data sets to confirm that it continues to make accurate phenotypic predictions of growth on different substrates and for gene knockout strains, and (3) mapped to the genomes of all available sequenced E. coli strains, including pathogens, leading to the identification of hundreds of unannotated genes in these organisms. Like its predecessors, the iJO1366 reconstruction is expected to be widely deployed for studying the systems biology of E. coli and for metabolic engineering applications.
Collapse
Affiliation(s)
- Jeffrey D Orth
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Tom M Conrad
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Jessica Na
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Joshua A Lerman
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA, USA
| | - Hojung Nam
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Adam M Feist
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Bernhard Ø Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| |
Collapse
|
42
|
Revanna KV, Chiu CC, Bierschank E, Dong Q. GSV: a web-based genome synteny viewer for customized data. BMC Bioinformatics 2011; 12:316. [PMID: 21810250 PMCID: PMC3199762 DOI: 10.1186/1471-2105-12-316] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2011] [Accepted: 08/02/2011] [Indexed: 11/12/2022] Open
Abstract
Background The analysis of genome synteny is a common practice in comparative genomics. With the advent of DNA sequencing technologies, individual biologists can rapidly produce their genomic sequences of interest. Although web-based synteny visualization tools are convenient for biologists to use, none of the existing ones allow biologists to upload their own data for analysis. Results We have developed the web-based Genome Synteny Viewer (GSV) that allows users to upload two data files for synteny visualization, the mandatory synteny file for specifying genomic positions of conserved regions and the optional genome annotation file. GSV presents two selected genomes in a single integrated view while still retaining the browsing flexibility necessary for exploring individual genomes. Users can browse and filter for genomic regions of interest, change the color or shape of each annotation track as well as re-order, hide or show the tracks dynamically. Additional features include downloadable images, immediate email notification and tracking of usage history. The entire GSV package is also light-weighted which enables easy local installation. Conclusions GSV provides a unique option for biologists to analyze genome synteny by uploading their own data set to a web-based comparative genome browser. A web server hosting GSV is provided at http://cas-bioinfo.cas.unt.edu/gsv, and the software is also freely available for local installations.
Collapse
Affiliation(s)
- Kashi V Revanna
- Department of Biological Sciences, University of North Texas, Denton, Texas 76203, USA
| | | | | | | |
Collapse
|
43
|
Te Velthuis AJW, Bagowski CP. Linking fold, function and phylogeny: a comparative genomics view on protein (domain) evolution. Curr Genomics 2011; 9:88-96. [PMID: 19440449 PMCID: PMC2674803 DOI: 10.2174/138920208784139537] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2008] [Revised: 03/20/2008] [Accepted: 03/25/2008] [Indexed: 11/22/2022] Open
Abstract
Domains are the building blocks of all globular proteins and present one of the most useful levels at which protein function can be understood. Through recombination and duplication of a limited set of domains, proteomes evolved and the collection of protein superfamilies in an organism formed. As such, the presence of a shared domain can be regarded as an indicator of similar function and evolutionary history, but it does not necessarily imply it since convergent evolution may give rise to similar gene functions as well as architectures.Through the wealth of sequences and annotation data brought about by genomics, evolutionary links can be sought for via homology relationships and comparative genomics, structural modeling and phylogenetics. The goal hereby is not only to predict the function of newly discovered proteins, but also to spell out their pathway of evolution and, possibly, identify their most likely origin. This can ultimately help to understand protein function and functional relationships of protein families. Additionally, through comparison with transcriptional data, evolutionary data can be linked to gene (and genome) activity and thus allow for the identification of common principles behind fast evolving proteins and relatively stable ones.In this review, we describe the basic principles of studying protein (domain) evolution and illustrate recent developments in molecular evolution and give valuable new insights in the field of comparative genomics. As an example, we include here molecular models of the multiple PDZ domain protein MUPP-1 and present a simple comparative genomic view on its structural course of evolution.
Collapse
Affiliation(s)
- Aartjan J W Te Velthuis
- Institute of Biology, Department of Molecular Virology, Leiden University Medical Centre, Albinusdreef 2, 2333 ZA Leiden, The Netherlands
| | | |
Collapse
|
44
|
Romanov MN, Dodgson JB, Gonser RA, Tuttle EM. Comparative BAC-based mapping in the white-throated sparrow, a novel behavioral genomics model, using interspecies overgo hybridization. BMC Res Notes 2011; 4:211. [PMID: 21693052 PMCID: PMC3155834 DOI: 10.1186/1756-0500-4-211] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2011] [Accepted: 06/21/2011] [Indexed: 12/23/2022] Open
Abstract
Background The genomics era has produced an arsenal of resources from sequenced organisms allowing researchers to target species that do not have comparable mapping and sequence information. These new "non-model" organisms offer unique opportunities to examine environmental effects on genomic patterns and processes. Here we use comparative mapping as a first step in characterizing the genome organization of a novel animal model, the white-throated sparrow (Zonotrichia albicollis), which occurs as white or tan morphs that exhibit alternative behaviors and physiology. Morph is determined by the presence or absence of a complex chromosomal rearrangement. This species is an ideal model for behavioral genomics because the association between genotype and phenotype is absolute, making it possible to identify the genomic bases of phenotypic variation. Findings We initiated a genomic study in this species by characterizing the white-throated sparrow BAC library via filter hybridization with overgo probes designed for the chicken, turkey, and zebra finch. Cross-species hybridization resulted in 640 positive sparrow BACs assigned to 77 chicken loci across almost all macro-and microchromosomes, with a focus on the chromosomes associated with morph. Out of 216 overgos, 36% of the probes hybridized successfully, with an average number of 3.0 positive sparrow BACs per overgo. Conclusions These data will be utilized for determining chromosomal architecture and for fine-scale mapping of candidate genes associated with phenotypic differences. Our research confirms the utility of interspecies hybridization for developing comparative maps in other non-model organisms.
Collapse
Affiliation(s)
- Michael N Romanov
- Dept, of Biology, Indiana State University, Terre Haute, Indiana 47809, USA.
| | | | | | | |
Collapse
|
45
|
Teng H, Cai W, Zhou L, Zhang J, Liu Q, Wang Y, Dai W, Zhao M, Sun Z. Evolutionary mode and functional divergence of vertebrate NMDA receptor subunit 2 genes. PLoS One 2010; 5:e13342. [PMID: 20976280 PMCID: PMC2954789 DOI: 10.1371/journal.pone.0013342] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2010] [Accepted: 09/21/2010] [Indexed: 11/19/2022] Open
Abstract
Background Ionotropic glutamate receptors in the central nervous system play a major role in numerous brain functions including learning and memory in many vertebrate species. NR2 subunits have been regarded as rate-limiting molecules in controlling the optimal N-methyl-D-aspartate (NMDA) receptor's coincidence-detection property and subsequent learning and memory function across multi-species. However, its evolutionary mode among vertebrate species remains unclear. Results With extensive analysis of phylogeny, exon structure, protein domain, paralogon and synteny, we demonstrated that two-round genome duplication generated quartet GRIN2 genes and the third-round fish-specific genome duplication generated extra copies of fish GRIN2 genes. In addition, in-depth investigation has enabled the identification of three novel genes, GRIN2C_Gg, GRIN2D-1_Ol and GRIN2D-2_Tr in the chicken, medaka and fugu genome, respectively. Furthermore, we showed functional divergence of NR2 genes mostly occurred at the first-round duplication, amino acid residues located at the N-terminal Lig_chan domain were responsible for type I functional divergence between these GRIN2 subfamilies and purifying selection has been the prominent natural pressure operating on these diversified GRIN2 genes. Conclusion and Significance These findings provide intriguing subjects for testing the 2R and 3R hypothesis and we expect it could provide new insights into the underlying evolution mechanisms of cognition in vertebrate.
Collapse
Affiliation(s)
- Huajing Teng
- Behavioral Genetics Center, Institute of Psychology, Chinese Academy of Science, Beijing, China
| | - Wanshi Cai
- Institute of Genomic Medicine, Wenzhou Medical College, Wenzhou, China
| | - LingLin Zhou
- Institute of Genomic Medicine, Wenzhou Medical College, Wenzhou, China
| | - Jing Zhang
- Behavioral Genetics Center, Institute of Psychology, Chinese Academy of Science, Beijing, China
| | - Qi Liu
- Institute of Genomic Medicine, Wenzhou Medical College, Wenzhou, China
| | - Yongqing Wang
- Behavioral Genetics Center, Institute of Psychology, Chinese Academy of Science, Beijing, China
| | - Wei Dai
- Behavioral Genetics Center, Institute of Psychology, Chinese Academy of Science, Beijing, China
| | - Mei Zhao
- Behavioral Genetics Center, Institute of Psychology, Chinese Academy of Science, Beijing, China
| | - Zhongsheng Sun
- Behavioral Genetics Center, Institute of Psychology, Chinese Academy of Science, Beijing, China
- * E-mail:
| |
Collapse
|
46
|
Picot E, Krusche P, Tiskin A, Carré I, Ott S. Evolutionary analysis of regulatory sequences (EARS) in plants. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2010; 64:165-176. [PMID: 20659275 DOI: 10.1111/j.1365-313x.2010.04314.x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
Identification of regulatory sequences within non-coding regions of DNA is an essential step towards elucidation of gene networks. This approach constitutes a major challenge, however, as only a very small fraction of non-coding DNA is thought to contribute to gene regulation. The mapping of regulatory regions traditionally involves the laborious construction of promoter deletion series which are then fused to reporter genes and assayed in transgenic organisms. Bioinformatic methods can be used to scan sequences for matches for known regulatory motifs, however these methods are currently hampered by the relatively small amount of such motifs and by a high false-discovery rate. Here, we demonstrate a robust and highly sensitive, in silico method to identify evolutionarily conserved regions within non-coding DNA. Sequence conservation within these regions is taken as evidence for evolutionary pressure against mutations, which is suggestive of functional importance. We test this method on a small set of well characterised promoters, and show that it successfully identifies known regulatory regions. We further show that these evolutionarily conserved sequences contain clusters of transcription binding sites, often described as regulatory modules. A version of the tool optimised for the analysis of plant promoters is available online at http://wsbc.warwick.ac.uk/ears/main.php.
Collapse
Affiliation(s)
- Emma Picot
- Systems Biology Doctoral Training Centre, University of Warwick, Coventry CV47AL, UK
| | | | | | | | | |
Collapse
|
47
|
Functional characterization of a haplotype in the AKT1 gene associated with glucose homeostasis and metabolic syndrome. Hum Genet 2010; 128:635-45. [PMID: 20872231 DOI: 10.1007/s00439-010-0891-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2010] [Accepted: 09/12/2010] [Indexed: 10/19/2022]
Abstract
A small 12-kb haplotype upstream of the AKT1 gene has been found to be associated with insulin resistance phenotypes. We sought to define the functional consequences of the three component polymorphic loci (rs1130214, rs10141867, rs33925946) on AKT1 and the upstream ZBTB42 gene. 5' RACE analysis of AKT1 transcripts in human skeletal muscle biopsies showed the predominant promoter to be 2.5 kb upstream of exon 2, and distinct from those promoters previously reported in rat. We then studied the effect of each of the three haplotype polymorphisms in transcriptional reporter assays in muscle, bone, and fat cell culture models, and found that each modulated enhancer and repressor activity are in a cell-specific and differentiation-specific manner. Our results in promoter assays are consistent with the human phenotype data; we found an anabolic effect on muscle and bone with increased mRNA expression of AKT1, and catabolic effect on fat with decreased expression. To test the hypothesis that rs10141867 affects transcription levels of the novel zinc finger protein ZBTB42 in vivo, we developed the allele-specific expression assay using Taqman technology to test for allelic differences within heterozygotes. The allele containing the derived polymorphism (haplotype H2) showed a 1.75-fold increase in expression in human skeletal muscle. Our data show a particularly complex effect of the component polymorphisms of a single haplotype on cells and tissues, suggesting that the coordination of different tissue-specific effects may have driven selection for the H2 haplotype. In light of the recent abundance of SNP association studies, our approach can serve as a method for exploring the biological function of polymorphisms that show significant genotype/phenotype associations.
Collapse
|
48
|
Elyashiv E, Bullaughey K, Sattath S, Rinott Y, Przeworski M, Sella G. Shifts in the intensity of purifying selection: an analysis of genome-wide polymorphism data from two closely related yeast species. Genome Res 2010; 20:1558-73. [PMID: 20817943 DOI: 10.1101/gr.108993.110] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
How much does the intensity of purifying selection vary among populations and species? How uniform are the shifts in selective pressures across the genome? To address these questions, we took advantage of a recent, whole-genome polymorphism data set from two closely related species of yeast, Saccharomyces cerevisiae and S. paradoxus, paying close attention to the population structure within these species. We found that the average intensity of purifying selection on amino acid sites varies markedly among populations and between species. As expected in the presence of extensive weakly deleterious mutations, the effect of purifying selection is substantially weaker on single nucleotide polymorphisms (SNPs) segregating within populations than on SNPs fixed between population samples. Also in accordance with a Nearly Neutral model, the variation in the intensity of purifying selection across populations corresponds almost perfectly to simple measures of their effective size. As a first step toward understanding the processes generating these patterns, we sought to tease apart the relative importance of systematic, genome-wide changes in the efficacy of selection, such as those expected from demographic processes and of gene-specific changes, which may be expected after a shift in selective pressures. For that purpose, we developed a new model for the evolution of purifying selection between populations and inferred its parameters from the genome-wide data using a likelihood approach. We found that most, but not all changes seem to be explained by systematic shifts in the efficacy of selection. One population, the sake-derived strains of S. cerevisiae, however, also shows extensive gene-specific changes, plausibly associated with domestication. These findings have important implications for our understanding of purifying selection as well as for estimates of the rate of molecular adaptation in yeast and in other species.
Collapse
Affiliation(s)
- Eyal Elyashiv
- Department of Evolution, Systematics, and Ecology, Hebrew University of Jerusalem, Jerusalem 91905, Israel
| | | | | | | | | | | |
Collapse
|
49
|
Jenkins RH, Bennagi R, Martin J, Phillips AO, Redman JE, Fraser DJ. A conserved stem loop motif in the 5'untranslated region regulates transforming growth factor-β(1) translation. PLoS One 2010; 5:e12283. [PMID: 20865036 PMCID: PMC2928724 DOI: 10.1371/journal.pone.0012283] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2010] [Accepted: 07/16/2010] [Indexed: 01/10/2023] Open
Abstract
Transforming growth factor-β1 (TGF-β1) regulates cellular proliferation, differentiation, migration, and survival. The human TGF-β1 transcript is inherently poorly translated, and translational activation has been documented in relation to several stimuli. In this paper, we have sought to identify in cis regulatory elements within the TGF-β1 5′Untranslated Region (5′UTR). In silico analysis predicted formation of stable secondary structure in a G/C-rich element between nucleotides +77 to +106, and demonstrated that this element is highly conserved across species. Circular dichroism spectroscopy confirmed the presence of secondary structure in this region. The proximal 5′UTR was inhibitory to translation in reporter gene experiments, and mutation of the secondary structure motif increased translational efficiency. Translational regulation of TGF-β1 mRNA is linked to altered binding of YB-1 protein to its 5′UTR. Immunoprecipitation-RT-qPCR demonstrated a high basal association of YB-1 with TGF-β1 mRNA. However, mutation of the secondary structure motif did not prevent interaction of YB-1 with the 5′UTR, suggesting that YB-1 binds to this region due to its G/C-rich composition, rather than a specific, sequence-dependent, binding site. These data identify a highly conserved element within the TGF-β1 5′UTR that forms stable secondary structure, and is responsible for the inherent low translation efficiency of this cytokine.
Collapse
Affiliation(s)
- Robert H. Jenkins
- Institute of Nephrology, School of Medicine, Cardiff University, Heath Park, Cardiff, Wales, United Kingdom
| | - Rasha Bennagi
- Institute of Nephrology, School of Medicine, Cardiff University, Heath Park, Cardiff, Wales, United Kingdom
| | - John Martin
- Institute of Nephrology, School of Medicine, Cardiff University, Heath Park, Cardiff, Wales, United Kingdom
| | - Aled O. Phillips
- Institute of Nephrology, School of Medicine, Cardiff University, Heath Park, Cardiff, Wales, United Kingdom
| | - James E. Redman
- School of Chemistry, Cardiff University, Park Place, Cardiff, Wales, United Kingdom
| | - Donald J. Fraser
- Institute of Nephrology, School of Medicine, Cardiff University, Heath Park, Cardiff, Wales, United Kingdom
- * E-mail:
| |
Collapse
|
50
|
Bagowski CP, Bruins W, te Velthuis AJ. The nature of protein domain evolution: shaping the interaction network. Curr Genomics 2010; 11:368-76. [PMID: 21286315 PMCID: PMC2945003 DOI: 10.2174/138920210791616725] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2010] [Revised: 06/04/2010] [Accepted: 06/13/2010] [Indexed: 11/30/2022] Open
Abstract
The proteomes that make up the collection of proteins in contemporary organisms evolved through recombination and duplication of a limited set of domains. These protein domains are essentially the main components of globular proteins and are the most principal level at which protein function and protein interactions can be understood. An important aspect of domain evolution is their atomic structure and biochemical function, which are both specified by the information in the amino acid sequence. Changes in this information may bring about new folds, functions and protein architectures. With the present and still increasing wealth of sequences and annotation data brought about by genomics, new evolutionary relationships are constantly being revealed, unknown structures modeled and phylogenies inferred. Such investigations not only help predict the function of newly discovered proteins, but also assist in mapping unforeseen pathways of evolution and reveal crucial, co-evolving inter- and intra-molecular interactions. In turn this will help us describe how protein domains shaped cellular interaction networks and the dynamics with which they are regulated in the cell. Additionally, these studies can be used for the design of new and optimized protein domains for therapy. In this review, we aim to describe the basic concepts of protein domain evolution and illustrate recent developments in molecular evolution that have provided valuable new insights in the field of comparative genomics and protein interaction networks.
Collapse
Affiliation(s)
- Christoph P Bagowski
- German University Cairo, Faculty of Pharmacy and Biotechnology, New Cairo City, Egypt
| | - Wouter Bruins
- Institute of Biology, Leiden University, 2333 AL Leiden, The Netherlands
| | - Aartjan J.W te Velthuis
- Department of Medical Microbiology, Molecular Virology Laboratory, Leiden University Medical Center, Albinusdreef 2, 2333 ZA Leiden, The Netherlands
- Department of Bionanoscience, Delft University of Technology, Lorentzweg 1, 2628 CJ, Delft, The Netherlands
| |
Collapse
|