1
|
Dong Z, Bai Y, Liu S, Yu H, Kong L, Du S, Li Q. A chromosome-level genome assembly of Ostrea denselamellosa provides initial insights into its evolution. Genomics 2023; 115:110582. [PMID: 36796653 DOI: 10.1016/j.ygeno.2023.110582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 02/09/2023] [Accepted: 02/11/2023] [Indexed: 02/16/2023]
Abstract
The oyster Ostrea denselamellosa is a live-bearing species with a sharp decline in the natural population. Despite recent breakthroughs in long-read sequencing, high quality genomic data are very limited in O. denselamellosa. Here, we carried out the first whole genome sequencing at the chromosome-level in O. denselamellosa. Our studies yielded a 636 Mb assembly with scaffold N50 around 71.80 Mb. 608.3 Mb (95.6% of the assembly) were anchored to 10 chromosomes. A total of 26,412 protein-coding genes were predicted, of which 22,636 (85.7%) were functionally annotated. By comparative genomics, we found that long interspersed nuclear element (LINE) and short interspersed nuclear element (SINE) made up a larger proportion in O. denselamellosa genome than in other oysters'. Moreover, gene family analysis showed some initial insight into its evolution. This high-quality genome of O. denselamellosa provides a valuable genomic resource for studies of evolution, adaption and conservation in oysters.
Collapse
Affiliation(s)
- Zhen Dong
- Key Laboratory of Mariculture, Ministry of Education, Ocean University of China, Qingdao 266003, China
| | - Yitian Bai
- Key Laboratory of Mariculture, Ministry of Education, Ocean University of China, Qingdao 266003, China
| | - Shikai Liu
- Key Laboratory of Mariculture, Ministry of Education, Ocean University of China, Qingdao 266003, China
| | - Hong Yu
- Key Laboratory of Mariculture, Ministry of Education, Ocean University of China, Qingdao 266003, China
| | - Lingfeng Kong
- Key Laboratory of Mariculture, Ministry of Education, Ocean University of China, Qingdao 266003, China
| | - Shaojun Du
- Institute of Marine and Environmental Technology, Department of Biochemistry and Molecular Biology, University of Maryland School of Medicine, Baltimore, MD, United States
| | - Qi Li
- Key Laboratory of Mariculture, Ministry of Education, Ocean University of China, Qingdao 266003, China; Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266237, China.
| |
Collapse
|
2
|
Ko BJ, Lee C, Kim J, Rhie A, Yoo DA, Howe K, Wood J, Cho S, Brown S, Formenti G, Jarvis ED, Kim H. Widespread false gene gains caused by duplication errors in genome assemblies. Genome Biol 2022; 23:205. [PMID: 36167596 PMCID: PMC9516828 DOI: 10.1186/s13059-022-02764-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Accepted: 09/02/2022] [Indexed: 12/22/2022] Open
Abstract
Background False duplications in genome assemblies lead to false biological conclusions. We quantified false duplications in popularly used previous genome assemblies for platypus, zebra finch, and Anna’s Hummingbird, and their new counterparts of the same species generated by the Vertebrate Genomes Project, of which the Vertebrate Genomes Project pipeline attempted to eliminate false duplications through haplotype phasing and purging. These assemblies are among the first generated by the Vertebrate Genomes Project where there was a prior chromosomal level reference assembly to compare with. Results Whole genome alignments revealed that 4 to 16% of the sequences are falsely duplicated in the previous assemblies, impacting hundreds to thousands of genes. These lead to overestimated gene family expansions. The main source of the false duplications is heterotype duplications, where the haplotype sequences were relatively more divergent than other parts of the genome leading the assembly algorithms to classify them as separate genes or genomic regions. A minor source is sequencing errors. Ancient ATP nucleotide binding gene families have a higher prevalence of false duplications compared to other gene families. Although present in a smaller proportion, we observe false duplications remaining in the Vertebrate Genomes Project assemblies that can be identified and purged. Conclusions This study highlights the need for more advanced assembly methods that better separate haplotypes and sequence errors, and the need for cautious analyses on gene gains. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02764-1.
Collapse
Affiliation(s)
- Byung June Ko
- Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
| | - Chul Lee
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
| | - Juwan Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, USA
| | - Dong Ahn Yoo
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
| | | | | | - Seoae Cho
- eGnome, Inc, Seoul, Republic of Korea
| | - Samara Brown
- Laboratory of the Neurogenetics of Language, The Rockefeller University, New York, NY, USA.,Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Giulio Formenti
- Laboratory of the Neurogenetics of Language, The Rockefeller University, New York, NY, USA
| | - Erich D Jarvis
- Laboratory of the Neurogenetics of Language, The Rockefeller University, New York, NY, USA. .,Howard Hughes Medical Institute, Chevy Chase, MD, USA.
| | - Heebal Kim
- Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea. .,Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea. .,eGnome, Inc, Seoul, Republic of Korea.
| |
Collapse
|
3
|
Ning J, Zhou J, Wang H, Liu Y, Ahmad F, Feng X, Fu Y, Gu X, Zhao L. Parallel Evolution of C-Type Lectin Domain Gene Family Sizes in Insect-Vectored Nematodes. FRONTIERS IN PLANT SCIENCE 2022; 13:856826. [PMID: 35557736 PMCID: PMC9085898 DOI: 10.3389/fpls.2022.856826] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Accepted: 03/11/2022] [Indexed: 05/12/2023]
Abstract
The dispersal stage of pathogens is crucial for the successful spread and infection of their hosts. Some plant-parasitic nematodes (PPNs) have evolved specialized dispersal stages to reach healthy hosts by being carried out by insect vectors. Because gene gain and loss is a major factor contributing to the evolution of novel characteristics, it is essential to clarify the gene family characteristics among nematodes with different dispersal modes to disentangle the evolution of insect-mediated dispersal. Here, the size of the C-type lectin (CTL) family genes of insect-vectored nematodes was found to be drastically reduced compared with those of self-dispersing nematodes, whereas the diversity of their functional domains was significantly higher. The gene family sizes of vector-dispersed nematodes were only a twentieth of the size of that of a self-dispersing (i.e., without a biotic vector) nematode model Caenorhabditis elegans, and these genes were inactive during the dispersal stage. Phylogenetic analysis showed that some CTL genes of vector-borne PPNs shared higher homology to the animal parasitic nematodes compared with other PPNs. Moreover, homology modeling predicted that the CTLs of insect-vectored nematodes bear remarkable structural similarity to the lectin genes of their vector's immune system. Because CTL genes are important sugar-binding proteins for the innate immune response of C. elegans, the loss of some CTL genes of vector-transmitted PPNs might be responsible for their parallel adaptations to a mutualistic relationship with their vector. These results expand our understanding of the evolutionary benefits of vector-mediated transmission for the nematode and vector-nematode co-evolution.
Collapse
Affiliation(s)
- Jing Ning
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Jiao Zhou
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Haixiang Wang
- College of Forestry, Shanxi Agricultural University, Taigu, China
| | - Yaning Liu
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Faheem Ahmad
- Department of Biosciences, COMSATS University Islamabad (CUI), Islamabad, Pakistan
| | - Xiaohui Feng
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Yu Fu
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
- School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, China
| | - Xiaoting Gu
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Lilin Zhao
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
- *Correspondence: Lilin Zhao
| |
Collapse
|
4
|
Hao S, Han K, Meng L, Huang X, Cao W, Shi C, Zhang M, Wang Y, Liu Q, Zhang Y, Sun H, Seim I, Xu X, Liu X, Fan G. African Arowana Genome Provides Insights on Ancient Teleost Evolution. iScience 2020; 23:101662. [PMID: 33134892 PMCID: PMC7586111 DOI: 10.1016/j.isci.2020.101662] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Revised: 08/27/2020] [Accepted: 10/07/2020] [Indexed: 12/11/2022] Open
Abstract
Osteoglossiformes is a basal clade of teleost, evolving since the Jurassic period. The genomes of Osteoglossiformes species would shed light on the evolution and adaptation of teleost. Here, we established a chromosome-level genome of African arowana. Together with the genomes of pirarucu and Asian arowana, we found that they diverged at ∼106.1 million years ago (MYA) and ∼59.2 MYA, respectively, which are coincident with continental separation. Interestingly, we identified a dynamic genome evolution characterized by a fast evolutionary rate and a high pseudogenization rate in African arowana and pirarucu. Additionally, more transposable elements were found in Asian arowana which confer more gene duplications. Moreover, we found the contraction of olfactory receptor and the expansion of UGT in African arowana might be related to its transformation from carnivore to be omnivore. Taken together, we provided valuable genomic resource of Osteoglossidae and revealed the correlation of biogeography and teleost evolution. An evolutionary model of Osteoglossidae along the continental drift is provided A faster evolving rate of African arowana than Asian arowana is revealed The gene duplications of Asian arowana are related to more class I TE insertions A mechanism of African arowana’s feeding habits transition is proposed.
Collapse
Affiliation(s)
- Shijie Hao
- BGI Education Center, University of Chinese Academic of Sciences, Shenzhen 518083, China.,BGI-Qingqao, BGI-Shenzhen, Qingdao, 266555, China
| | - Kai Han
- BGI-Qingqao, BGI-Shenzhen, Qingdao, 266555, China
| | - Lingfeng Meng
- BGI Education Center, University of Chinese Academic of Sciences, Shenzhen 518083, China.,BGI-Qingqao, BGI-Shenzhen, Qingdao, 266555, China
| | | | - Wei Cao
- BGI-Shenzhen, Shenzhen 518083, China
| | - Chengcheng Shi
- BGI Education Center, University of Chinese Academic of Sciences, Shenzhen 518083, China.,BGI-Qingqao, BGI-Shenzhen, Qingdao, 266555, China
| | - Mengqi Zhang
- BGI-Qingqao, BGI-Shenzhen, Qingdao, 266555, China
| | - Yilin Wang
- BGI-Qingqao, BGI-Shenzhen, Qingdao, 266555, China
| | - Qun Liu
- BGI-Qingqao, BGI-Shenzhen, Qingdao, 266555, China
| | - Yaolei Zhang
- BGI-Qingqao, BGI-Shenzhen, Qingdao, 266555, China.,Department of Biotechnology and Biomedicine, Technical University of Denmark, Lyngby, 2800, Denmark
| | - Haixi Sun
- BGI-Shenzhen, Shenzhen 518083, China
| | - Inge Seim
- Integrative Biology Laboratory, College of Life Sciences, Nanjing Normal University, Nanjing, 210046, China.,School of Biology and Environmental Science, Queensland University of Technology, Brisbane 4102, QLD, Australia
| | - Xun Xu
- BGI-Qingqao, BGI-Shenzhen, Qingdao, 266555, China.,BGI-Shenzhen, Shenzhen 518083, China.,Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
| | - Xin Liu
- BGI-Qingqao, BGI-Shenzhen, Qingdao, 266555, China.,BGI-Shenzhen, Shenzhen 518083, China.,State Key Laboratory of Agricultural Genomics, BGI-Shenzhen, Shenzhen 518083, China
| | - Guangyi Fan
- BGI-Qingqao, BGI-Shenzhen, Qingdao, 266555, China.,BGI-Shenzhen, Shenzhen 518083, China.,State Key Laboratory of Agricultural Genomics, BGI-Shenzhen, Shenzhen 518083, China
| |
Collapse
|
5
|
Xiao G, Tang G, Wang C. Congruence Amidst Discordance between Sequence and Protein-Content Based Phylogenies of Fungi. J Fungi (Basel) 2020; 6:jof6030134. [PMID: 32823730 PMCID: PMC7559059 DOI: 10.3390/jof6030134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2020] [Revised: 08/05/2020] [Accepted: 08/11/2020] [Indexed: 11/16/2022] Open
Abstract
Amid the genomic data explosion, phylogenomic analysis has resolved the tree of life of different organisms, including fungi. Genome-wide clustering has also been conducted based on gene content data that can lighten the issue of the unequal evolutionary rate of genes. In this study, using different fungal species as models, we performed phylogenomic and protein-content (PC)-based clustering analysis. The obtained sequence tree reflects the phylogenetic trajectory of examined fungal species. However, 15 PC-based trees constructed from the Pfam matrices of the whole genomes, four protein families, and ten subcellular locations largely failed to resolve the speciation relationship of cross-phylum fungal species. However, lifestyle and taxonomic associations were more or less evident between closely related fungal species from PC-based trees. Pairwise congruence tests indicated that a varied level of congruent or discordant relationships were observed between sequence- and PC-based trees, and among PC-based trees. It was intriguing to find that a few protein family and subcellular PC-based trees were more topologically similar to the phylogenomic tree than was the whole genome PC-based phylogeny. In particular, a most significant level of congruence was observed between sequence- and cell wall PC-based trees. Cophylogenetic analysis conducted in this study may benefit the prediction of the magnitude of evolutionary conservation, interactive associations, or networking between different family or subcellular proteins.
Collapse
Affiliation(s)
- Guohua Xiao
- School of Computer Science, Fudan University, Shanghai 200433, China;
- CAS Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai 200032, China;
| | - Guirong Tang
- CAS Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai 200032, China;
| | - Chengshu Wang
- CAS Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai 200032, China;
- CAS Center for Excellence in Biotic interactions, University of Chinese Academy of Sciences, Beijing 100049, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
- Correspondence:
| |
Collapse
|
6
|
Li L, Liu D, Liu A, Li J, Wang H, Zhou J. Genomic Survey of Tyrosine Kinases Repertoire in Electrophorus electricus With an Emphasis on Evolutionary Conservation and Diversification. Evol Bioinform Online 2020; 16:1176934320922519. [PMID: 32546936 PMCID: PMC7249569 DOI: 10.1177/1176934320922519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Accepted: 04/07/2020] [Indexed: 12/05/2022] Open
Abstract
Tyrosine kinases (TKs) play key roles in the regulation of multicellularity in
organisms and involved primarily in cell growth, differentiation, and
cell-to-cell communication. Genome-wide characterization of TKs has been
conducted in many metazoans; however, systematic information regarding this
superfamily in Electrophorus electricus (electric eel) is still
lacking. In this study, we identified 114 TK genes in the E
electricus genome and investigated their evolution, molecular
features, and domain architecture using phylogenetic profiling to gain a better
understanding of their similarities and specificity. Our results suggested that
the electric eel TK (EeTK) repertoire was shaped by whole-genome duplications
(WGDs) and tandem duplication events. Compared with other vertebrate TKs, gene
members in Jak, Src, and EGFR subfamily duplicated specifically, but with
members lost in Eph, Axl, and Ack subfamily in electric eel. We also conducted
an exhaustive survey of TK genes in genomic databases, identifying 1674 TK
proteins in 31 representative species covering all the main metazoan lineages.
Extensive evolutionary analysis indicated that TK repertoire in vertebrates
tended to be remarkably conserved, but the gene members in each subfamily were
very variable. Comparative expression profile analysis showed that electric
organ tissues and muscle shared a similar pattern with specific highly expressed
TKs (ie, epha7, musk, jak1, and pdgfra), suggesting that regulation of TKs might
play an important role in specifying an electric organ identity from its muscle
precursor. We further identified TK genes exhibiting tissue-specific expression
patterns, indicating that members in TKs participated in subfunctionalization
representing an evolutionary divergence required for the performance of
different tissues. This work generates valuable information for further gene
function analysis and identifying candidate TK genes reflecting their unique
tissue-function specializations in electric eel.
Collapse
Affiliation(s)
- Ling Li
- School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, P.R. China
| | - Dangyun Liu
- Department of Central Laboratory, The Affiliated Huaian No.1 People's Hospital, Nanjing Medical University, Huai'an, P.R. China
| | - Ake Liu
- Faculty of Biological Science and Technology, Changzhi University, Changzhi, P.R. China
| | - Jingquan Li
- School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, P.R. China
| | - Hui Wang
- School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, P.R. China
| | - Jingqi Zhou
- School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, P.R. China
| |
Collapse
|
7
|
Global Evolutionary Analysis of 11 Gene Families Part of Reactive Oxygen Species (ROS) Gene Network in Four Eucalyptus Species. Antioxidants (Basel) 2020; 9:antiox9030257. [PMID: 32245199 PMCID: PMC7139577 DOI: 10.3390/antiox9030257] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 03/17/2020] [Accepted: 03/18/2020] [Indexed: 12/30/2022] Open
Abstract
Eucalyptus is a worldwide hard-wood species which increasingly focused on. To adapt to various biotic and abiotic stresses, Eucalyptus have evolved complex mechanisms, increasing the cellular concentration of reactive oxygen species (ROS) by numerous ROS controlling enzymes. To better analyse the ROS gene network and discuss the differences between four Eucalyptus species, ROS gene network including 11 proteins families (1CysPrx, 2CysPrx, APx, APx-R, CIII Prx, Diox, GPx, Kat, PrxII, PrxQ and Rboh) were annotated and compared in an expert and exhaustive manner from the genomic data available from E. camaldulensis, E. globulus, E. grandis, and E. gunnii. In addition, a specific sequencing strategy was performed in order to determine if the missed sequences in at least one organism are the results of gain/loss events or only sequencing gaps. We observed that the automatic annotation applied to multigenic families is the source of miss-annotation. Base on the family size, the 11 families can be categorized into duplicated gene families (CIII Prx, Kat, 1CysPrx, and GPx), which contain a lot of gene duplication events and non-duplicated families (APx, APx-R, Rboh, DiOx, 2CysPrx, PrxII, and PrxQ). The gene family sizes are much larger in Eucalyptus than most of other angiosperms due to recent gene duplications, which could give higher adaptability to environmental changes and stresses. The cross-species comparative analysis shows gene gain and loss events during the evolutionary process. The 11 families possess different expression patterns, while in the Eucalyptus genus, the ROS families present similar expression patterns. Overall, the comparative analysis might be a good criterion to evaluate the adaptation of different species with different characters, but only if data mining is as exhaustive as possible. It is also a good indicator to explore the evolutionary process.
Collapse
|
8
|
Feyereisen R. Origin and evolution of the CYP4G subfamily in insects, cytochrome P450 enzymes involved in cuticular hydrocarbon synthesis. Mol Phylogenet Evol 2020; 143:106695. [DOI: 10.1016/j.ympev.2019.106695] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2019] [Revised: 11/23/2019] [Accepted: 11/26/2019] [Indexed: 11/27/2022]
|
9
|
Hunnicutt KE, Tiley GP, Williams RC, Larsen PA, Blanco MB, Rasoloarison RM, Campbell CR, Zhu K, Weisrock DW, Matsunami H, Yoder AD. Comparative Genomic Analysis of the Pheromone Receptor Class 1 Family (V1R) Reveals Extreme Complexity in Mouse Lemurs (Genus, Microcebus) and a Chromosomal Hotspot across Mammals. Genome Biol Evol 2020; 12:3562-3579. [PMID: 31555816 PMCID: PMC6944220 DOI: 10.1093/gbe/evz200] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/08/2019] [Indexed: 12/14/2022] Open
Abstract
Sensory gene families are of special interest for both what they can tell us about molecular evolution and what they imply as mediators of social communication. The vomeronasal type-1 receptors (V1Rs) have often been hypothesized as playing a fundamental role in driving or maintaining species boundaries given their likely function as mediators of intraspecific mate choice, particularly in nocturnal mammals. Here, we employ a comparative genomic approach for revealing patterns of V1R evolution within primates, with a special focus on the small-bodied nocturnal mouse and dwarf lemurs of Madagascar (genera Microcebus and Cheirogaleus, respectively). By doubling the existing genomic resources for strepsirrhine primates (i.e. the lemurs and lorises), we find that the highly speciose and morphologically cryptic mouse lemurs have experienced an elaborate proliferation of V1Rs that we argue is functionally related to their capacity for rapid lineage diversification. Contrary to a previous study that found equivalent degrees of V1R diversity in diurnal and nocturnal lemurs, our study finds a strong correlation between nocturnality and V1R elaboration, with nocturnal lemurs showing elaborate V1R repertoires and diurnal lemurs showing less diverse repertoires. Recognized subfamilies among V1Rs show unique signatures of diversifying positive selection, as might be expected if they have each evolved to respond to specific stimuli. Furthermore, a detailed syntenic comparison of mouse lemurs with mouse (genus Mus) and other mammalian outgroups shows that orthologous mammalian subfamilies, predicted to be of ancient origin, tend to cluster in a densely populated region across syntenic chromosomes that we refer to as a V1R "hotspot."
Collapse
Affiliation(s)
- Kelsie E Hunnicutt
- Department of Biology, Duke University, Durham, North Carolina
- Department of Biological Sciences, University of Denver, Denver, Colorado
| | - George P Tiley
- Department of Biology, Duke University, Durham, North Carolina
| | - Rachel C Williams
- Department of Biology, Duke University, Durham, North Carolina
- Duke Lemur Center, Duke University, Durham, North Carolina
| | - Peter A Larsen
- Department of Biology, Duke University, Durham, North Carolina
- Department of Veterinary and Biomedical Sciences, University of Minnesota, Saint Paul, Minnesota
| | | | - Rodin M Rasoloarison
- Behavioral Ecology and Sociobiology Unit, German Primate Centre, Göttingen, Germany
- Département de Biologie Animale, Université d’Antananarivo, Madagascar, Antananarivo, Madagascar
| | - C Ryan Campbell
- Department of Biology, Duke University, Durham, North Carolina
| | - Kevin Zhu
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina
| | - David W Weisrock
- Department of Biology, University of Kentucky, Lexington, Kentucky
| | - Hiroaki Matsunami
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina
- Department of Neurobiology, Duke Institute for Brain Sciences, Duke University Medical Center, Durham, North Carolina
| | - Anne D Yoder
- Department of Biology, Duke University, Durham, North Carolina
| |
Collapse
|
10
|
Da Lage JL, Thomas GWC, Bonneau M, Courtier-Orgogozo V. Evolution of salivary glue genes in Drosophila species. BMC Evol Biol 2019; 19:36. [PMID: 30696414 PMCID: PMC6352337 DOI: 10.1186/s12862-019-1364-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Accepted: 01/17/2019] [Indexed: 11/23/2022] Open
Abstract
Background At the very end of the larval stage Drosophila expectorate a glue secreted by their salivary glands to attach themselves to a substrate while pupariating. The glue is a mixture of apparently unrelated proteins, some of which are highly glycosylated and possess internal repeats. Because species adhere to distinct substrates (i.e. leaves, wood, rotten fruits), glue genes are expected to evolve rapidly. Results We used available genome sequences and PCR-sequencing of regions of interest to investigate the glue genes in 20 Drosophila species. We discovered a new gene in addition to the seven glue genes annotated in D. melanogaster. We also identified a phase 1 intron at a conserved position present in five of the eight glue genes of D. melanogaster, suggesting a common origin for those glue genes. A slightly significant rate of gene turnover was inferred. Both the number of repeats and the repeat sequence were found to diverge rapidly, even between closely related species. We also detected high repeat number variation at the intrapopulation level in D. melanogaster. Conclusion Most conspicuous signs of accelerated evolution are found in the repeat regions of several glue genes. Electronic supplementary material The online version of this article (10.1186/s12862-019-1364-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jean-Luc Da Lage
- UMR 9191 Évolution, Génomes, Comportement, Écologie. CNRS, IRD, Université Paris-Sud. Université Paris-Saclay, F-91198, Gif-sur-Yvette, France.
| | - Gregg W C Thomas
- Department of Biology and Department of Computer Science, Indiana University, Bloomington, IN, 47405, USA
| | - Magalie Bonneau
- UMR 9191 Évolution, Génomes, Comportement, Écologie. CNRS, IRD, Université Paris-Sud. Université Paris-Saclay, F-91198, Gif-sur-Yvette, France
| | | |
Collapse
|
11
|
Lin M, Jia R, Li J, Zhang M, Chen H, Zhang D, Zhang J, Chen X. Evolution and expression patterns of the trehalose-6-phosphate synthase gene family in drumstick tree (Moringa oleifera Lam.). PLANTA 2018; 248:999-1015. [PMID: 30006657 DOI: 10.1007/s00425-018-2945-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2018] [Accepted: 06/21/2018] [Indexed: 05/28/2023]
Abstract
Moringa oleifera TPSs were genome-wide identified for the first time, and a phylogenetic analysis was performed to investigate evolutionary divergence. The qRT-PCR data show that MoTPS genes response to different stress treatments. The trehalose-6-phosphate synthase (TPS) family is involved in a wide range of stress-resistance processes in plants. Its direct product, trehalose-6-phosphate, acts as a specific signal of sucrose status and a regulator to modulate carbon metabolism within the plant. In this study, eight TPS genes were identified and cloned based on the M. oleifera genome; only MoTPS1 exhibited TPS activity among Group I proteins. The characteristics of the MoTPS gene family were determined by analyzing phylogenetic relationships, gene structures, conserved motifs, selective forces, and expression patterns. The Group II MoTPS genes were under relaxed purifying selection or positive selection. The glycosyltransferase family 20 domains generally had lower Ka/Ks ratios and nonsynonymous (Ka) changes compared with those of trehalose-phosphatase domains, which is consistent with stronger purifying selection due to functional constraints in performing TPS enzyme activity. Phylogenetic analyses of TPS proteins from M. oleifera and 17 other plant species indicated that TPS were present before the monocot-dicot split, whereas Group II TPSs were duplicated after the separation of dicots and monocots. Quantitative real-time PCR analysis showed that the expression patterns of TPSs displayed group specificities in M. oleifera. Particularly, Group I MoTPS genes closely relate to reproductive development and Group II MoTPS genes closely relate to high temperature resistance in leaves, stem, stem tip and roots. This work provides a scientific classification of plant TPSs, dissects the internal relationships between their evolution and expressions, and promotes functional researches.
Collapse
Affiliation(s)
- Mengfei Lin
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, South China Agricultural University, Guangzhou, China
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, Guangzhou, China
- Guangdong Province Research Center of Woody Forage Engineering Technology, Guangzhou, China
- College of Forestry and Landscape Architecture, South China Agricultural University, Guangzhou, China
| | - Ruihu Jia
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, South China Agricultural University, Guangzhou, China
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, Guangzhou, China
- Guangdong Province Research Center of Woody Forage Engineering Technology, Guangzhou, China
- College of Forestry and Landscape Architecture, South China Agricultural University, Guangzhou, China
| | - Juncheng Li
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, South China Agricultural University, Guangzhou, China
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, Guangzhou, China
- Guangdong Province Research Center of Woody Forage Engineering Technology, Guangzhou, China
- College of Forestry and Landscape Architecture, South China Agricultural University, Guangzhou, China
| | - Mengjie Zhang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, South China Agricultural University, Guangzhou, China
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, Guangzhou, China
- Guangdong Province Research Center of Woody Forage Engineering Technology, Guangzhou, China
- College of Forestry and Landscape Architecture, South China Agricultural University, Guangzhou, China
| | - Hanbin Chen
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, South China Agricultural University, Guangzhou, China
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, Guangzhou, China
- Guangdong Province Research Center of Woody Forage Engineering Technology, Guangzhou, China
- College of Forestry and Landscape Architecture, South China Agricultural University, Guangzhou, China
| | - Deng Zhang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, South China Agricultural University, Guangzhou, China
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, Guangzhou, China
- Guangdong Province Research Center of Woody Forage Engineering Technology, Guangzhou, China
- College of Forestry and Landscape Architecture, South China Agricultural University, Guangzhou, China
| | - Junjie Zhang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, South China Agricultural University, Guangzhou, China
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, Guangzhou, China
- Guangdong Province Research Center of Woody Forage Engineering Technology, Guangzhou, China
- College of Forestry and Landscape Architecture, South China Agricultural University, Guangzhou, China
| | - Xiaoyang Chen
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, South China Agricultural University, Guangzhou, China.
- Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, Guangzhou, China.
- Guangdong Province Research Center of Woody Forage Engineering Technology, Guangzhou, China.
- College of Forestry and Landscape Architecture, South China Agricultural University, Guangzhou, China.
| |
Collapse
|
12
|
Liu G, Arnaud P, Offmann B, Picimbon JF. Genotyping and Bio-Sensing Chemosensory Proteins in Insects. SENSORS (BASEL, SWITZERLAND) 2017; 17:E1801. [PMID: 28777348 PMCID: PMC5579523 DOI: 10.3390/s17081801] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/24/2017] [Revised: 07/28/2017] [Accepted: 07/29/2017] [Indexed: 12/20/2022]
Abstract
Genotyping is the process of determining differences in the genetic make-up of an individual and comparing it to that of another individual. Focus on the family of chemosensory proteins (CSPs) in insects reveals differences at the genomic level across various strains and biotypes, but none at the level of individuals, which could be extremely useful in the biotyping of insect pest species necessary for the agricultural, medical and veterinary industries. Proposed methods of genotyping CSPs include not only restriction enzymatic cleavage and amplification of cleaved polymorphic sequences, but also detection of retroposons in some specific regions of the insect chromosome. Design of biosensors using CSPs addresses tissue-specific RNA mutations in a particular subtype of the protein, which could be used as a marker of specific physiological conditions. Additionally, we refer to the binding properties of CSP proteins tuned to lipids and xenobiotic insecticides for the development of a new generation of biosensor chips, monitoring lipid blood concentration and chemical environmental pollution.
Collapse
Affiliation(s)
- Guoxia Liu
- Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Jinan 250100, China.
| | - Philippe Arnaud
- Protein Engineering and Functionality Unit, University of Nantes, Nantes 44322, France.
| | - Bernard Offmann
- Protein Engineering and Functionality Unit, University of Nantes, Nantes 44322, France.
| | - Jean-François Picimbon
- Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Jinan 250100, China.
- QILU University of Technology, School of Bioengineering, Jinan 250353, China.
| |
Collapse
|
13
|
Brunet FG, Volff JN, Schartl M. Whole Genome Duplications Shaped the Receptor Tyrosine Kinase Repertoire of Jawed Vertebrates. Genome Biol Evol 2016; 8:1600-13. [PMID: 27260203 PMCID: PMC4898815 DOI: 10.1093/gbe/evw103] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
The receptor tyrosine kinase (RTK) gene family, involved primarily in cell growth and differentiation, comprises proteins with a common enzymatic tyrosine kinase intracellular domain adjacent to a transmembrane region. The amino-terminal portion of RTKs is extracellular and made of different domains, the combination of which characterizes each of the 20 RTK subfamilies among mammals. We analyzed a total of 7,376 RTK sequences among 143 vertebrate species to provide here the first comprehensive census of the jawed vertebrate repertoire. We ascertained the 58 genes previously described in the human and mouse genomes and established their phylogenetic relationships. We also identified five additional RTKs amounting to a total of 63 genes in jawed vertebrates. We found that the vertebrate RTK gene family has been shaped by the two successive rounds of whole genome duplications (WGD) called 1R and 2R (1R/2R) that occurred at the base of the vertebrates. In addition, the Vegfr and Ephrin receptor subfamilies were expanded by single gene duplications. In teleost fish, 23 additional RTK genes have been retained after another expansion through the fish-specific third round (3R) of WGD. Several lineage-specific gene losses were observed. For instance, birds have lost three RTKs, and different genes are missing in several fish sublineages. The RTK gene family presents an unusual high gene retention rate from the vertebrate WGDs (58.75% after 1R/2R, 64.4% after 3R), resulting in an expansion that might be correlated with the evolution of complexity of vertebrate cellular communication and intracellular signaling.
Collapse
Affiliation(s)
- Frédéric G Brunet
- Institut de Génomique Fonctionnelle de Lyon, Ecole Normale Supérieure de Lyon, UMR5242 CNRS, Université Claude Bernard Lyon I, Lyon, France
| | - Jean-Nicolas Volff
- Institut de Génomique Fonctionnelle de Lyon, Ecole Normale Supérieure de Lyon, UMR5242 CNRS, Université Claude Bernard Lyon I, Lyon, France
| | - Manfred Schartl
- Physiologische Chemie, Biozentrum, University of Würzburg, Am Hubland, and Comprehensive Cancer Center, University Clinic Würzburg, Würzburg, Germany Texas Institute for Advanced Study and Department of Biology, Texas A&M University, College Station, USA
| |
Collapse
|
14
|
Purifying selection shapes the coincident SNP distribution of primate coding sequences. Sci Rep 2016; 6:27272. [PMID: 27255481 PMCID: PMC4891680 DOI: 10.1038/srep27272] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2016] [Accepted: 05/17/2016] [Indexed: 12/13/2022] Open
Abstract
Genome-wide analysis has observed an excess of coincident single nucleotide polymorphisms (coSNPs) at human-chimpanzee orthologous positions, and suggested that this is due to cryptic variation in the mutation rate. While this phenomenon primarily corresponds with non-coding coSNPs, the situation in coding sequences remains unclear. Here we calculate the observed-to-expected ratio of coSNPs (coSNPO/E) to estimate the prevalence of human-chimpanzee coSNPs, and show that the excess of coSNPs is also present in coding regions. Intriguingly, coSNPO/E is much higher at zero-fold than at nonzero-fold degenerate sites; such a difference is due to an elevation of coSNPO/E at zero-fold degenerate sites, rather than a reduction at nonzero-fold degenerate ones. These trends are independent of chimpanzee subpopulation, population size, or sequencing techniques; and hold in broad generality across primates. We find that this discrepancy cannot fully explained by sequence contexts, shared ancestral polymorphisms, SNP density, and recombination rate, and that coSNPO/E in coding sequences is significantly influenced by purifying selection. We also show that selection and mutation rate affect coSNPO/E independently, and coSNPs tend to be less damaging and more correlated with human diseases than non-coSNPs. These suggest that coSNPs may represent a “signature” during primate protein evolution.
Collapse
|
15
|
Zhang SD, Ling LZ, Yi TS. Evolution and divergence of SBP-box genes in land plants. BMC Genomics 2015; 16:787. [PMID: 26467431 PMCID: PMC4606839 DOI: 10.1186/s12864-015-1998-y] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2015] [Accepted: 10/03/2015] [Indexed: 01/24/2023] Open
Abstract
Background Squamosa promoter binding protein (SBP)-box family genes encode plant-specific transcription factors that control many important biological functions, including phase transition, inflorescence branching, fruit ripening, and copper homeostasis. Nevertheless, the evolutionary patterns of SBP-box genes and evolutionary forces driving them are still not well understood. Methods 104 SBP-box gene candidates of five representative land plants were obtained from Phytozome database (v10.3). Phylogenetic combined with gene structure analyses were used to identify SBP-box gene lineages in land plants. Gene copy number and the sequence and structure features were then compared among these different SBP-box lineages. Selection analysis, relative rate tests and expression divergence were finally used to interpret the evolutionary relationships and divergence of SBP-box genes in land plants. Results We investigated 104 SBP-box genes from moss, Arabidopsis, poplar, rice, and maize. These genes are divided into group I and II, and the latter is further divided into two subgroups (subgroup II-1 and II-2) based on phylogenetic analysis. Interestingly, subgroup II-1 genes have similar sequence and structural features to group I genes, whereas subgroup II-2 genes exhibit intrinsic differences on these features, including high copy numbers and the presence of miR156/miR529 regulation. Further analyses indicate that subgroup II-1 genes are constrained by stronger purifying selection and evolve at a lower substitution rate than II-2 genes, just as group I genes do when compared to II genes. Among subgroup II-2 genes, miR156 targets evolve more rapidly than miR529 targets and experience comparatively relaxed purifying selection. These results suggest that group I and subgroup II-1 genes under strong selective constraint are conserved. By contrast, subgroup II-2 genes evolve under relaxed purifying selection and have diversified through gene copy duplications and changes in miR156/529 regulation, which might contribute to morphological diversifications of land plants. Conclusions Our results indicate that different evolutionary rates and selection strengths lead to differing evolutionary patterns in SBP-box genes in land plants, providing a guide for future functional diversity analyses of these genes. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1998-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Shu-Dong Zhang
- Germplasm Bank of Wild Species, Kunming Institute of Botany of the Chinese Academy of Sciences, Kunming, 650201, China.
| | - Li-Zhen Ling
- BGI-Yunnan, BGI-Shenzhen, Kunming, 650106, China.
| | - Ting-Shuang Yi
- Germplasm Bank of Wild Species, Kunming Institute of Botany of the Chinese Academy of Sciences, Kunming, 650201, China.
| |
Collapse
|
16
|
De La Torre AR, Lin YC, Van de Peer Y, Ingvarsson PK. Genome-wide analysis reveals diverged patterns of codon bias, gene expression, and rates of sequence evolution in picea gene families. Genome Biol Evol 2015; 7:1002-15. [PMID: 25747252 PMCID: PMC4419791 DOI: 10.1093/gbe/evv044] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
The recent sequencing of several gymnosperm genomes has greatly facilitated studying the evolution of their genes and gene families. In this study, we examine the evidence for expression-mediated selection in the first two fully sequenced representatives of the gymnosperm plant clade (Picea abies and Picea glauca). We use genome-wide estimates of gene expression (>50,000 expressed genes) to study the relationship between gene expression, codon bias, rates of sequence divergence, protein length, and gene duplication. We found that gene expression is correlated with rates of sequence divergence and codon bias, suggesting that natural selection is acting on Picea protein-coding genes for translational efficiency. Gene expression, rates of sequence divergence, and codon bias are correlated with the size of gene families, with large multicopy gene families having, on average, a lower expression level and breadth, lower codon bias, and higher rates of sequence divergence than single-copy gene families. Tissue-specific patterns of gene expression were more common in large gene families with large gene expression divergence than in single-copy families. Recent family expansions combined with large gene expression variation in paralogs and increased rates of sequence evolution suggest that some Picea gene families are rapidly evolving to cope with biotic and abiotic stress. Our study highlights the importance of gene expression and natural selection in shaping the evolution of protein-coding genes in Picea species, and sets the ground for further studies investigating the evolution of individual gene families in gymnosperms.
Collapse
Affiliation(s)
| | - Yao-Cheng Lin
- Department of Plant Systems Biology, VIB, and Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
| | - Yves Van de Peer
- Department of Plant Systems Biology, VIB, and Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium Genomics Research Institute, University of Pretoria, South Africa
| | - Pär K Ingvarsson
- Department of Ecology and Environmental Science, Umeå University, Sweden Umeå Plant Science Centre, Umeå, Sweden
| |
Collapse
|
17
|
Chuang TJ, Chiang TW. Impacts of pretranscriptional DNA methylation, transcriptional transcription factor, and posttranscriptional microRNA regulations on protein evolutionary rate. Genome Biol Evol 2014; 6:1530-41. [PMID: 24923326 PMCID: PMC4080426 DOI: 10.1093/gbe/evu124] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Gene expression is largely regulated by DNA methylation, transcription factor (TF), and
microRNA (miRNA) before, during, and after transcription, respectively. Although the
evolutionary effects of TF/miRNA regulations have been widely studied, evolutionary
analysis of simultaneously accounting for DNA methylation, TF, and miRNA regulations and
whether promoter methylation and gene body (coding regions) methylation have different
effects on the rate of gene evolution remain uninvestigated. Here, we compared
human–macaque and human–mouse protein evolutionary rates against
experimentally determined single base-resolution DNA methylation data, revealing that
promoter methylation level is positively correlated with protein evolutionary rates but
negatively correlated with TF/miRNA regulations, whereas the opposite was observed for
gene body methylation level. Our results showed that the relative importance of these
regulatory factors in determining the rate of mammalian protein evolution is as follows:
Promoter methylation ≈ miRNA regulation > gene body methylation > TF regulation,
and further indicated that promoter methylation and miRNA regulation have a significant
dependent effect on protein evolutionary rates. Although the mechanisms underlying
cooperation between DNA methylation and TFs/miRNAs in gene regulation remain unclear, our
study helps to not only illuminate the impact of these regulatory factors on mammalian
protein evolution but also their intricate interaction within gene regulatory
networks.
Collapse
Affiliation(s)
- Trees-Juen Chuang
- Division of Physical & Computational Genomics, Genomics Research Center, Academia Sinica, Taipei, Taiwan
| | - Tai-Wei Chiang
- Division of Physical & Computational Genomics, Genomics Research Center, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
18
|
Abstract
The evolutionary mechanisms underlying duplicate gene maintenance and divergence remain highly debated. Epigenetic modifications, such as DNA methylation, may contribute to duplicate gene evolution by facilitating tissue-specific regulation. However, the role of epigenetic divergence on duplicate gene evolution remains little understood. Here we show, using comprehensive data across 10 diverse human tissues, that DNA methylation plays critical roles in several aspects of duplicate gene evolution. We first demonstrate that duplicate genes are initially heavily methylated, before gradually losing DNA methylation as they age. Within each pair, DNA methylation divergence between duplicate partners increases with evolutionary age. Importantly, tissue-specific DNA methylation of duplicates correlates with tissue-specific expression, implicating DNA methylation as a causative factor for functional divergence of duplicate genes. These patterns are apparent in promoters but not in gene bodies, in accord with the complex relationship between gene-body DNA methylation and transcription. Remarkably, many duplicate gene pairs exhibit consistent division of DNA methylation across multiple, divergent tissues: For the majority (73%) of duplicate gene pairs, one partner is always hypermethylated compared with the other. This is indicative of a common underlying determinant of DNA methylation. The division of DNA methylation is also consistent with their chromatin accessibility profiles. Moreover, at least two sequence motifs known to interact with the Sp1 transcription factor mark promoters of more hypomethylated duplicate partners. These results demonstrate critical roles of DNA methylation, as well as complex interaction between genome and epigenome, on duplicate gene evolution.
Collapse
|
19
|
Chen YC, Cheng JH, Tsai ZTY, Tsai HK, Chuang TJ. The impact of trans-regulation on the evolutionary rates of metazoan proteins. Nucleic Acids Res 2013; 41:6371-80. [PMID: 23658220 PMCID: PMC3711421 DOI: 10.1093/nar/gkt349] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2013] [Revised: 04/10/2013] [Accepted: 04/14/2013] [Indexed: 11/13/2022] Open
Abstract
Transcription factor (TF) and microRNA (miRNA) are two crucial trans-regulatory factors that coordinately control gene expression. Understanding the impacts of these two factors on the rate of protein sequence evolution is of great importance in evolutionary biology. While many biological factors associated with evolutionary rate variations have been studied, evolutionary analysis of simultaneously accounting for TF and miRNA regulations across metazoans is still uninvestigated. Here, we provide a series of statistical analyses to assess the influences of TF and miRNA regulations on evolutionary rates across metazoans (human, mouse and fruit fly). Our results reveal that the negative correlations between trans-regulation and evolutionary rates hold well across metazoans, but the strength of TF regulation as a rate indicator becomes weak when the other confounding factors that may affect evolutionary rates are controlled. We show that miRNA regulation tends to be a more essential indicator of evolutionary rates than TF regulation, and the combination of TF and miRNA regulations has a significant dependent effect on protein evolutionary rates. We also show that trans-regulation (especially miRNA regulation) is much more important in human/mouse than in fruit fly in determining protein evolutionary rates, suggesting a considerable variation in rate determinants between vertebrates and invertebrates.
Collapse
Affiliation(s)
- Yi-Ching Chen
- Institute of Information Science, Academia Sinica, Taipei 115, Taiwan, Bioinformatics Program, Taiwan International Graduate Program, Academia Sinica, Taipei 115, Taiwan and Genomic Research Center, Academia Sinica, Taipei 115, Taiwan
| | - Jen-Hao Cheng
- Institute of Information Science, Academia Sinica, Taipei 115, Taiwan, Bioinformatics Program, Taiwan International Graduate Program, Academia Sinica, Taipei 115, Taiwan and Genomic Research Center, Academia Sinica, Taipei 115, Taiwan
| | - Zing Tsung-Yeh Tsai
- Institute of Information Science, Academia Sinica, Taipei 115, Taiwan, Bioinformatics Program, Taiwan International Graduate Program, Academia Sinica, Taipei 115, Taiwan and Genomic Research Center, Academia Sinica, Taipei 115, Taiwan
| | - Huai-Kuang Tsai
- Institute of Information Science, Academia Sinica, Taipei 115, Taiwan, Bioinformatics Program, Taiwan International Graduate Program, Academia Sinica, Taipei 115, Taiwan and Genomic Research Center, Academia Sinica, Taipei 115, Taiwan
| | - Trees-Juen Chuang
- Institute of Information Science, Academia Sinica, Taipei 115, Taiwan, Bioinformatics Program, Taiwan International Graduate Program, Academia Sinica, Taipei 115, Taiwan and Genomic Research Center, Academia Sinica, Taipei 115, Taiwan
| |
Collapse
|
20
|
Comparative genomics of chemosensory protein genes reveals rapid evolution and positive selection in ant-specific duplicates. Heredity (Edinb) 2013; 110:538-47. [PMID: 23403962 DOI: 10.1038/hdy.2012.122] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
Gene duplications can have a major role in adaptation, and gene families underlying chemosensation are particularly interesting due to their essential role in chemical recognition of mates, predators and food resources. Social insects add yet another dimension to the study of chemosensory genomics, as the key components of their social life rely on chemical communication. Still, chemosensory gene families are little studied in social insects. Here we annotated chemosensory protein (CSP) genes from seven ant genomes and studied their evolution. The number of functional CSP genes ranges from 11 to 21 depending on species, and the estimated rates of gene birth and death indicate high turnover of genes. Ant CSP genes include seven conservative orthologous groups present in all the ants, and a group of genes that has expanded independently in different ant lineages. Interestingly, the expanded group of genes has a differing mode of evolution from the orthologous groups. The expanded group shows rapid evolution as indicated by a high dN/dS (nonsynonymous to synonymous changes) ratio, several sites under positive selection and many pseudogenes, whereas the genes in the seven orthologous groups evolve slowly under purifying selection and include only one pseudogene. These results show that adaptive changes have played a role in ant CSP evolution. The expanded group of ant-specific genes is phylogenetically close to a conservative orthologous group CSP7, which includes genes known to be involved in ant nestmate recognition, raising an interesting possibility that the expanded CSPs function in ant chemical communication.
Collapse
|
21
|
Position-dependent correlations between DNA methylation and the evolutionary rates of mammalian coding exons. Proc Natl Acad Sci U S A 2012; 109:15841-6. [PMID: 23019368 DOI: 10.1073/pnas.1208214109] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
DNA cytosine methylation is a central epigenetic marker that is usually mutagenic and may increase the level of sequence divergence. However, methylated genes have been reported to evolve more slowly than unmethylated genes. Hence, there is a controversy on whether DNA methylation is correlated with increased or decreased protein evolutionary rates. We hypothesize that this controversy has resulted from the differential correlations between DNA methylation and the evolutionary rates of coding exons in different genic positions. To test this hypothesis, we compare human-mouse and human-macaque exonic evolutionary rates against experimentally determined single-base resolution DNA methylation data derived from multiple human cell types. We show that DNA methylation is significantly related to within-gene variations in evolutionary rates. First, DNA methylation level is more strongly correlated with C-to-T mutations at CpG dinucleotides in the first coding exons than in the internal and last exons, although it is positively correlated with the synonymous substitution rate in all exon positions. Second, for the first exons, DNA methylation level is negatively correlated with exonic expression level, but positively correlated with both nonsynonymous substitution rate and the sample specificity of DNA methylation level. For the internal and last exons, however, we observe the opposite correlations. Our results imply that DNA methylation level is differentially correlated with the biological (and evolutionary) features of coding exons in different genic positions. The first exons appear more prone to the mutagenic effects, whereas the other exons are more influenced by the regulatory effects of DNA methylation.
Collapse
|
22
|
Chia N, Guttenberg N. Dynamics of gene duplication and transposons in microbial genomes following a sudden environmental change. Mob Genet Elements 2012; 1:221-224. [PMID: 22479691 DOI: 10.4161/mge.1.3.18030] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2011] [Accepted: 09/12/2011] [Indexed: 11/19/2022] Open
Abstract
The impact of gene duplication on evolution is as ubiquitous as point mutation, but this realization is not yet reflected in our quantitative models of population genetics. In this Commentary article, we explore the implications of such models of gene duplication, specifically expanding on our previous work. We lay down a framework for understanding the impact of gene duplications on the evolution a biological network and give an analytical argument based on the concept of mutational error threshold for the necessity of gene duplications for the evolution of complex networks. In other words, by realizing that the impact of mutations must act appropriately in order to allow for the maintenance of complex networks, we develop a mathematical scaling argument that shows why gene duplication provides the types of mutations more favorable to increasing complexity. In the process of doing so, we seek to explain the relationship between per base pair mutation rates and genome size.
Collapse
Affiliation(s)
- Nicholas Chia
- Institute for Genomic Biology; Physics of the Living Cell; University of Illinois at Urbana-Champaign; Urbana, IL USA
| | | |
Collapse
|
23
|
Weeks AM, Chang MCY. Constructing de novo biosynthetic pathways for chemical synthesis inside living cells. Biochemistry 2011; 50:5404-18. [PMID: 21591680 DOI: 10.1021/bi200416g] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Living organisms have evolved a vast array of catalytic functions that make them ideally suited for the production of medicinally and industrially relevant small-molecule targets. Indeed, native metabolic pathways in microbial hosts have long been exploited and optimized for the scalable production of both fine and commodity chemicals. Our increasing capacity for DNA sequencing and synthesis has revealed the molecular basis for the biosynthesis of a variety of complex and useful metabolites and allows the de novo construction of novel metabolic pathways for the production of new and exotic molecular targets in genetically tractable microbes. However, the development of commercially viable processes for these engineered pathways is currently limited by our ability to quickly identify or engineer enzymes with the correct reaction and substrate selectivity as well as the speed by which metabolic bottlenecks can be determined and corrected. Efforts to understand the relationship among sequence, structure, and function in the basic biochemical sciences can advance these goals for synthetic biology applications while also serving as an experimental platform for elucidating the in vivo specificity and function of enzymes and reconstituting complex biochemical traits for study in a living model organism. Furthermore, the continuing discovery of natural mechanisms for the regulation of metabolic pathways has revealed new principles for the design of high-flux pathways with minimized metabolic burden and has inspired the development of new tools and approaches to engineering synthetic pathways in microbial hosts for chemical production.
Collapse
Affiliation(s)
- Amy M Weeks
- Department of Chemistry, University of California, Berkeley, California 94720-1460, USA
| | | |
Collapse
|
24
|
Dalmolin RJS, Castro MAA, Rybarczyk Filho JL, Souza LHT, de Almeida RMC, Moreira JCF. Evolutionary plasticity determination by orthologous groups distribution. Biol Direct 2011; 6:22. [PMID: 21586164 PMCID: PMC3117832 DOI: 10.1186/1745-6150-6-22] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2011] [Accepted: 05/17/2011] [Indexed: 12/29/2022] Open
Abstract
Background Genetic plasticity may be understood as the ability of a functional gene network to tolerate alterations in its components or structure. Usually, the studies involving gene modifications in the course of the evolution are concerned to nucleotide sequence alterations in closely related species. However, the analysis of large scale data about the distribution of gene families in non-exclusively closely related species can provide insights on how plastic or how conserved a given gene family is. Here, we analyze the abundance and diversity of all Eukaryotic Clusters of Orthologous Groups (KOG) present in STRING database, resulting in a total of 4,850 KOGs. This dataset comprises 481,421 proteins distributed among 55 eukaryotes. Results We propose an index to evaluate the evolutionary plasticity and conservation of an orthologous group based on its abundance and diversity across eukaryotes. To further KOG plasticity analysis, we estimate the evolutionary distance average among all proteins which take part in the same orthologous group. As a result, we found a strong correlation between the evolutionary distance average and the proposed evolutionary plasticity index. Additionally, we found low evolutionary plasticity in Saccharomyces cerevisiae genes associated with inviability and Mus musculus genes associated with early lethality. At last, we plot the evolutionary plasticity value in different gene networks from yeast and humans. As a result, it was possible to discriminate among higher and lower plastic areas of the gene networks analyzed. Conclusions The distribution of gene families brings valuable information on evolutionary plasticity which might be related with genetic plasticity. Accordingly, it is possible to discriminate among conserved and plastic orthologous groups by evaluating their abundance and diversity across eukaryotes. Reviewers This article was reviewed by Prof Manyuan Long, Hiroyuki Toh, and Sebastien Halary.
Collapse
Affiliation(s)
- Rodrigo J S Dalmolin
- Department of Biochemistry, Institute of Basic Health Sciences, Federal University of Rio Grande do Sul, Rio Grande do Sul, Brazil.
| | | | | | | | | | | |
Collapse
|
25
|
Chen SCC, Chuang TJ, Li WH. The relationships among microRNA regulation, intrinsically disordered regions, and other indicators of protein evolutionary rate. Mol Biol Evol 2011; 28:2513-20. [PMID: 21398349 DOI: 10.1093/molbev/msr068] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Many indicators of protein evolutionary rate have been proposed, but some of them are interrelated. The purpose of this study is to disentangle their correlations. We assess the strength of each indicator by controlling for the other indicators under study. We find that the number of microRNA (miRNA) types that regulate a gene is the strongest rate indicator (a negative correlation), followed by disorder content (the percentage of disordered regions in a protein, a positive correlation); the strength of disorder content as a rate indicator is substantially increased after controlling for the number of miRNA types. By dividing proteins into lowly and highly intrinsically disordered proteins (L-IDPs and H-IDPs), we find that proteins interacting with more H-IDPs tend to evolve more slowly, which largely explains the previous observation of a negative correlation between the number of protein-protein interactions and evolutionary rate. Moreover, all of the indicators examined here, except for the number of miRNA types, have different strengths in L-IDPs and in H-IDPs. Finally, the number of phosphorylation sites is weakly correlated with the number of miRNA types, and its strength as a rate indicator is substantially reduced when other indicators are considered. Our study reveals the relative strength of each rate indicator and increases our understanding of protein evolution.
Collapse
Affiliation(s)
- Sean Chun-Chang Chen
- Institute of BioMedical Informatics, National Yang-Ming University, Taipei, Taiwan
| | | | | |
Collapse
|