1
|
Jia H, Tan S, Cai Y, Guo Y, Shen J, Zhang Y, Ma H, Zhang Q, Chen J, Qiao G, Ruan J, Zhang YE. Low-input PacBio sequencing generates high-quality individual fly genomes and characterizes mutational processes. Nat Commun 2024; 15:5644. [PMID: 38969648 PMCID: PMC11226609 DOI: 10.1038/s41467-024-49992-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 06/20/2024] [Indexed: 07/07/2024] Open
Abstract
Long-read sequencing, exemplified by PacBio, revolutionizes genomics, overcoming challenges like repetitive sequences. However, the high DNA requirement ( > 1 µg) is prohibitive for small organisms. We develop a low-input (100 ng), low-cost, and amplification-free library-generation method for PacBio sequencing (LILAP) using Tn5-based tagmentation and DNA circularization within one tube. We test LILAP with two Drosophila melanogaster individuals, and generate near-complete genomes, surpassing preexisting single-fly genomes. By analyzing variations in these two genomes, we characterize mutational processes: complex transpositions (transposon insertions together with extra duplications and/or deletions) prefer regions characterized by non-B DNA structures, and gene conversion of transposons occurs on both DNA and RNA levels. Concurrently, we generate two complete assemblies for the endosymbiotic bacterium Wolbachia in these flies and similarly detect transposon conversion. Thus, LILAP promises a broad PacBio sequencing adoption for not only mutational studies of flies and their symbionts but also explorations of other small organisms or precious samples.
Collapse
Affiliation(s)
- Hangxing Jia
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
| | - Shengjun Tan
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
| | - Yingao Cai
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yanyan Guo
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jieyu Shen
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yaqiong Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Huijing Ma
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Qingzhu Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jinfeng Chen
- University of Chinese Academy of Sciences, Beijing, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Gexia Qiao
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jue Ruan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China.
| | - Yong E Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
2
|
Nguyen AK, Blacksmith MS, Kidd JM. Duplications and Retrogenes Are Numerous and Widespread in Modern Canine Genomic Assemblies. Genome Biol Evol 2024; 16:evae142. [PMID: 38946312 PMCID: PMC11259980 DOI: 10.1093/gbe/evae142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 05/08/2024] [Accepted: 06/24/2024] [Indexed: 07/02/2024] Open
Abstract
Recent years have seen a dramatic increase in the number of canine genome assemblies available. Duplications are an important source of evolutionary novelty and are also prone to misassembly. We explored the duplication content of nine canine genome assemblies using both genome self-alignment and read-depth approaches. We find that 8.58% of the genome is duplicated in the canFam4 assembly, derived from the German Shepherd Dog Mischka, including 90.15% of unplaced contigs. Highlighting the continued difficulty in properly assembling duplications, less than half of read-depth and assembly alignment duplications overlap, but the mCanLor1.2 Greenland wolf assembly shows greater concordance. Further study shows the presence of multiple segments that have alignments to four or more duplicate copies. These high-recurrence duplications correspond to gene retrocopies. We identified 3,892 candidate retrocopies from 1,316 parental genes in the canFam4 assembly and find that ∼8.82% of duplicated base pairs involve a retrocopy, confirming this mechanism as a major driver of gene duplication in canines. Similar patterns are found across eight other recent canine genome assemblies, with metrics supporting a greater quality of the PacBio HiFi mCanLor1.2 assembly. Comparison between the wolf and other canine assemblies found that 92% of retrocopy insertions are shared between assemblies. By calculating the number of generations since genome divergence, we estimate that new retrocopy insertions appear, on average, in 1 out of 3,514 births. Our analyses illustrate the impact of retrogene formation on canine genomes and highlight the variable representation of duplicated sequences among recently completed canine assemblies.
Collapse
Affiliation(s)
- Anthony K Nguyen
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Matthew S Blacksmith
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Jeffrey M Kidd
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
3
|
VanKuren NW, Chen J, Long M. Sexual conflict drive in the rapid evolution of new gametogenesis genes. Semin Cell Dev Biol 2024; 159-160:27-37. [PMID: 38309142 DOI: 10.1016/j.semcdb.2024.01.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 01/19/2024] [Accepted: 01/19/2024] [Indexed: 02/05/2024]
Abstract
The evolutionary forces underlying the rapid evolution in sequences and functions of new genes remain a mystery. Adaptation by natural selection explains the evolution of some new genes. However, many new genes perform sex-biased functions that have rapidly evolved over short evolutionary time scales, suggesting that new gene evolution may often be driven by conflicting selective pressures on males and females. It is well established that such sexual conflict (SC) plays a central role in maintaining phenotypic and genetic variation within populations, but the role of SC in driving new gene evolution remains essentially unknown. This review explores the connections between SC and new gene evolution through discussions of the concept of SC, the phenotypic and genetic signatures of SC in evolving populations, and the molecular mechanisms by which SC could drive the evolution of new genes. We synthesize recent work in this area with a discussion of the case of Apollo and Artemis, two extremely young genes (<200,000 years) in Drosophila melanogaster, which offered the first empirical insights into the evolutionary process by which SC could drive the evolution of new genes. These new duplicate genes exhibit the hallmarks of sexually antagonistic selection: rapid DNA and protein sequence evolution, essential sex-specific functions in gametogenesis, and complementary sex-biased expression patterns. Importantly, Apollo is essential for male fitness but detrimental to female fitness, while Artemis is essential for female fitness but detrimental to male fitness. These sexually antagonistic fitness effects and complementary changes to expression, sequence, and function suggest that these duplicates were selected for mitigating SC, but that SC has not been fully resolved. Finally, we propose Sexual Conflict Drive as a self-driven model to interpret the rapid evolution of new genes, explain the potential for SC and sexually antagonistic selection to contribute to long-term evolution, and suggest its utility for understanding the rapid evolution of new genes in gametogenesis.
Collapse
Affiliation(s)
- Nicholas W VanKuren
- Department of Ecology and Evolution, The University of Chicago, United States.
| | - Jianhai Chen
- Department of Ecology and Evolution, The University of Chicago, United States
| | - Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, United States.
| |
Collapse
|
4
|
Castellanos MDP, Wickramasinghe CD, Betrán E. The roles of gene duplications in the dynamics of evolutionary conflicts. Proc Biol Sci 2024; 291:20240555. [PMID: 38865605 DOI: 10.1098/rspb.2024.0555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 04/02/2024] [Indexed: 06/14/2024] Open
Abstract
Evolutionary conflicts occur when there is antagonistic selection between different individuals of the same or different species, life stages or between levels of biological organization. Remarkably, conflicts can occur within species or within genomes. In the dynamics of evolutionary conflicts, gene duplications can play a major role because they can bring very specific changes to the genome: changes in protein dose, the generation of novel paralogues with different functions or expression patterns or the evolution of small antisense RNAs. As we describe here, by having those effects, gene duplication might spark evolutionary conflict or fuel arms race dynamics that takes place during conflicts. Interestingly, gene duplication can also contribute to the resolution of a within-locus evolutionary conflict by partitioning the functions of the gene that is under an evolutionary trade-off. In this review, we focus on intraspecific conflicts, including sexual conflict and illustrate the various roles of gene duplications with a compilation of examples. These examples reveal the level of complexity and the differences in the patterns of gene duplications within genomes under different conflicts. These examples also reveal the gene ontologies involved in conflict and the genomic location of the elements of the conflict. The examples provide a blueprint for the direct study of these conflicts or the exploration of the presence of similar conflicts in other lineages.
Collapse
Affiliation(s)
| | | | - Esther Betrán
- Department of Biology, University of Texas at Arlington , Arlington, TX 76019, USA
| |
Collapse
|
5
|
Plaitakis A, Sidiropoulou K, Kotzamani D, Litso I, Zaganas I, Spanaki C. Evolution of Glutamate Metabolism via GLUD2 Enhances Lactate-Dependent Synaptic Plasticity and Complex Cognition. Int J Mol Sci 2024; 25:5297. [PMID: 38791334 PMCID: PMC11120665 DOI: 10.3390/ijms25105297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 05/07/2024] [Accepted: 05/09/2024] [Indexed: 05/26/2024] Open
Abstract
Human evolution is characterized by rapid brain enlargement and the emergence of unique cognitive abilities. Besides its distinctive cytoarchitectural organization and extensive inter-neuronal connectivity, the human brain is also defined by high rates of synaptic, mainly glutamatergic, transmission, and energy utilization. While these adaptations' origins remain elusive, evolutionary changes occurred in synaptic glutamate metabolism in the common ancestor of humans and apes via the emergence of GLUD2, a gene encoding the human glutamate dehydrogenase 2 (hGDH2) isoenzyme. Driven by positive selection, hGDH2 became adapted to function upon intense excitatory firing, a process central to the long-term strengthening of synaptic connections. It also gained expression in brain astrocytes and cortical pyramidal neurons, including the CA1-CA3 hippocampal cells, neurons crucial to cognition. In mice transgenic for GLUD2, theta-burst-evoked long-term potentiation (LTP) is markedly enhanced in hippocampal CA3-CA1 synapses, with patch-clamp recordings from CA1 pyramidal neurons revealing increased sNMDA receptor currents. D-lactate blocked LTP enhancement, implying that glutamate metabolism via hGDH2 potentiates L-lactate-dependent glia-neuron interaction, a process essential to memory consolidation. The transgenic (Tg) mice exhibited increased dendritic spine density/synaptogenesis in the hippocampus and improved complex cognitive functions. Hence, enhancement of neuron-glia communication, via GLUD2 evolution, likely contributed to human cognitive advancement by potentiating synaptic plasticity and inter-neuronal connectivity.
Collapse
Affiliation(s)
- Andreas Plaitakis
- Department of Neurology, School of Health Sciences, Faculty of Medicine, University of Crete, Voutes, 71003 Heraklion, Crete, Greece; (D.K.); (I.L.); (I.Z.)
| | - Kyriaki Sidiropoulou
- Department of Biology, University of Crete, Voutes, 71003 Heraklion, Crete, Greece;
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology Hellas (IMBB-FORTH), 70013 Heraklion, Crete, Greece
| | - Dimitra Kotzamani
- Department of Neurology, School of Health Sciences, Faculty of Medicine, University of Crete, Voutes, 71003 Heraklion, Crete, Greece; (D.K.); (I.L.); (I.Z.)
| | - Ionela Litso
- Department of Neurology, School of Health Sciences, Faculty of Medicine, University of Crete, Voutes, 71003 Heraklion, Crete, Greece; (D.K.); (I.L.); (I.Z.)
| | - Ioannis Zaganas
- Department of Neurology, School of Health Sciences, Faculty of Medicine, University of Crete, Voutes, 71003 Heraklion, Crete, Greece; (D.K.); (I.L.); (I.Z.)
- Neurology Department, PaGNI University General Hospital of Heraklion, 71500 Heraklion, Crete, Greece
| | - Cleanthe Spanaki
- Department of Neurology, School of Health Sciences, Faculty of Medicine, University of Crete, Voutes, 71003 Heraklion, Crete, Greece; (D.K.); (I.L.); (I.Z.)
- Neurology Department, PaGNI University General Hospital of Heraklion, 71500 Heraklion, Crete, Greece
| |
Collapse
|
6
|
Kaku Y, Isono Y, Tanaka H, Kobayashi T, Kanemori Y, Kashiwabara SI. Intronless Pabpc6 encodes a testis-specific, cytoplasmic poly(A)-binding protein but is dispensable for spermatogenesis in the mouse†. Biol Reprod 2024; 110:834-847. [PMID: 38281153 DOI: 10.1093/biolre/ioae017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 12/25/2023] [Accepted: 01/23/2024] [Indexed: 01/30/2024] Open
Abstract
Besides ubiquitous poly(A)-binding protein, cytoplasmic 1 (PABPC1), testis-specific PABPC2/PABPt (in humans, referred to as PABPC3), and female and male germline-specific PABPC1L/ePAB, have been reported in the mouse testis. Recent in silico analysis additionally identified testis-specific Pabpc6 in the mouse. In this study, we characterized PABPC6 and its mutant mice. PABPC6 was initially detectable in the cytoplasm of pachytene spermatocytes, increased in abundance in round spermatids, and decreased in elongating spermatids. PABPC6 was capable of binding to poly(A) tails of various mRNAs and interacting with translation-associated factors, including EIF4G, PAIP1, and PAIP2. Noteworthy was that PABPC6, unlike PABPC1, was barely associated with translationally active polysomes and enriched in chromatoid bodies of round spermatids. Despite these unique characteristics, neither synthesis of testicular proteins nor spermatogenesis was affected in the mutant mice lacking PABPC6, suggesting that PABPC6 is functionally redundant with other co-existing PABPC proteins during spermatogenesis.
Collapse
Affiliation(s)
- Yuko Kaku
- Institute of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Ibaraki, Japan
| | - Yuka Isono
- Institute of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Ibaraki, Japan
| | - Hideto Tanaka
- Institute of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Ibaraki, Japan
| | - Tomohiro Kobayashi
- Institute of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Ibaraki, Japan
| | - Yoshinori Kanemori
- Institute of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Ibaraki, Japan
| | - Shin-Ichi Kashiwabara
- Institute of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Ibaraki, Japan
| |
Collapse
|
7
|
Gidhi A, Jha SK, Kumar M, Mukhopadhyay K. The F-box protein encoding genes of the leaf-rust fungi Puccinia triticina: genome-wide identification, characterization and expression dynamics during pathogenesis. Arch Microbiol 2024; 206:209. [PMID: 38587657 DOI: 10.1007/s00203-024-03936-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 01/31/2024] [Accepted: 03/19/2024] [Indexed: 04/09/2024]
Abstract
The F-box proteins in fungi perform diverse functions including regulation of cell cycle, circadian clock, development, signal transduction and nutrient sensing. Genome-wide analysis revealed 10 F-box genes in Puccinia triticina, the causal organism for the leaf rust disease in wheat and were characterized using in silico approaches for revealing phylogenetic relationships, gene structures, gene ontology, protein properties, sequence analysis and gene expression studies. Domain analysis predicted functional domains like WD40 and LRR at C-terminus along with the obvious presence of F-box motif in N-terminus. MSA showed amino acid replacements, which might be due to nucleotide substitution during replication. Phylogenetic analysis revealed the F-box proteins with similar domains to be clustered together while some sequences were spread out in different clades, which might be due to functional diversity. The clustering of Puccinia triticina GG705409 with Triticum aestivum TaAFB4/TaAFB5 in a single clade suggested the possibilities of horizontal gene transfer during the coevolution of P. triticina and wheat. Gene ontological annotation categorized them into three classes and were functionally involved in protein degradation through the protein ubiquitination pathway. Protein-protein interaction network revealed F-box proteins to interact with other components of the SCF complex involved in protein ubiquitination. Relative expression analysis of five F-box genes in a time course experiment denoted their involvement in leaf rust susceptible wheat plants. This study provides information on structure elucidation of F-box proteins of a basidiomycetes plant pathogenic fungi and their role during pathogenesis.
Collapse
Affiliation(s)
- Anupama Gidhi
- School of Genomics and Molecular Breeding, ICAR-Indian Institute of Agricultural Biotechnology, Garhkhatanga, Ranchi, Jharkhand, 834003, India
| | - Shailendra Kumar Jha
- Division of Genetics, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Manish Kumar
- Department of Bioengineering and Biotechnology, Birla Institute of Technology, Mesra, Ranchi, Jharkhand, 835215, India
| | - Kunal Mukhopadhyay
- Department of Bioengineering and Biotechnology, Birla Institute of Technology, Mesra, Ranchi, Jharkhand, 835215, India.
| |
Collapse
|
8
|
Zhou F, Tan P, Liu S, Chang L, Yang J, Sun M, Guo Y, Si Y, Wang D, Yu J, Ma Y. Subcellular RNA distribution and its change during human embryonic stem cell differentiation. Stem Cell Reports 2024; 19:126-140. [PMID: 38134924 PMCID: PMC10828685 DOI: 10.1016/j.stemcr.2023.11.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 11/16/2023] [Accepted: 11/17/2023] [Indexed: 12/24/2023] Open
Abstract
The spatial localization of RNA within cells is closely related to its function and also involved in cell fate determination. However, the atlas of RNA distribution within cells and dynamic changes during the developmental process are largely unknown. In this study, five subcellular components, including cytoplasmic extract, membrane extract, soluble nuclear extract, chromatin-bound nuclear extract, and cytoskeletal extract, were isolated and the rules of subcellular RNA distribution in human embryonic stem cells (hESCs) and its change during hESC differentiation are summarized for the first time. The overall distribution patterns of coding and non-coding RNAs are revealed. Interestingly, some developmental genes are found to be transcribed but confined to the chromatin in undifferentiated hESC. Unexpectedly, alternative splicing and polyadenylation endow spatial heterogeneity among different isoforms of the same gene. Finally, the dynamic pattern of RNA distribution during hESC differentiation is characterized, which provides new clues for a comprehensive understanding hESC pluripotency and differentiation.
Collapse
Affiliation(s)
- Fanqi Zhou
- State Key Laboratory of Common Mechanism Research for Major Diseases, Haihe laboratory of Cell Ecosystem, Key Laboratory of RNA and Hematopoietic Regulation, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Beijing, China
| | - Puwen Tan
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
| | - Siqi Liu
- State Key Laboratory of Common Mechanism Research for Major Diseases, Haihe laboratory of Cell Ecosystem, Key Laboratory of RNA and Hematopoietic Regulation, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Beijing, China
| | - Le Chang
- State Key Laboratory of Common Mechanism Research for Major Diseases, Haihe laboratory of Cell Ecosystem, Key Laboratory of RNA and Hematopoietic Regulation, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Beijing, China
| | - Jiabin Yang
- State Key Laboratory of Common Mechanism Research for Major Diseases, Haihe laboratory of Cell Ecosystem, Key Laboratory of RNA and Hematopoietic Regulation, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Beijing, China
| | - Mengyao Sun
- State Key Laboratory of Common Mechanism Research for Major Diseases, Haihe laboratory of Cell Ecosystem, Key Laboratory of RNA and Hematopoietic Regulation, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Beijing, China
| | - Yuehong Guo
- State Key Laboratory of Common Mechanism Research for Major Diseases, Haihe laboratory of Cell Ecosystem, Key Laboratory of RNA and Hematopoietic Regulation, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Beijing, China
| | - Yanmin Si
- State Key Laboratory of Common Mechanism Research for Major Diseases, Haihe laboratory of Cell Ecosystem, Key Laboratory of RNA and Hematopoietic Regulation, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Beijing, China
| | - Dong Wang
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China.
| | - Jia Yu
- State Key Laboratory of Common Mechanism Research for Major Diseases, Haihe laboratory of Cell Ecosystem, Key Laboratory of RNA and Hematopoietic Regulation, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Beijing, China; Institute of Blood Transfusion, Chinese Academy of Medical Sciences, Chengdu, Sichuan, China; State Key Laboratory of Experimental Hematology, National Clinical Research Center for Blood Diseases, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin, China.
| | - Yanni Ma
- State Key Laboratory of Common Mechanism Research for Major Diseases, Haihe laboratory of Cell Ecosystem, Key Laboratory of RNA and Hematopoietic Regulation, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Beijing, China; Institute of Blood Transfusion, Chinese Academy of Medical Sciences, Chengdu, Sichuan, China.
| |
Collapse
|
9
|
Yan Y, Tian Y, Wu Z, Zhang K, Yang R. Interchromosomal Colocalization with Parental Genes Is Linked to the Function and Evolution of Mammalian Retrocopies. Mol Biol Evol 2023; 40:msad265. [PMID: 38060983 PMCID: PMC10733166 DOI: 10.1093/molbev/msad265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 10/25/2023] [Accepted: 11/29/2023] [Indexed: 12/22/2023] Open
Abstract
Retrocopies are gene duplicates arising from reverse transcription of mature mRNA transcripts and their insertion back into the genome. While long being regarded as processed pseudogenes, more and more functional retrocopies have been discovered. How the stripped-down retrocopies recover expression capability and become functional paralogs continually intrigues evolutionary biologists. Here, we investigated the function and evolution of retrocopies in the context of 3D genome organization. By mapping retrocopy-parent pairs onto sequencing-based and imaging-based chromatin contact maps in human and mouse cell lines and onto Hi-C interaction maps in 5 other mammals, we found that retrocopies and their parental genes show a higher-than-expected interchromosomal colocalization frequency. The spatial interactions between retrocopies and parental genes occur frequently at loci in active subcompartments and near nuclear speckles. Accordingly, colocalized retrocopies are more actively transcribed and translated and are more evolutionarily conserved than noncolocalized ones. The active transcription of colocalized retrocopies may result from their permissive epigenetic environment and shared regulatory elements with parental genes. Population genetic analysis of retroposed gene copy number variants in human populations revealed that retrocopy insertions are not entirely random in regard to interchromosomal interactions and that colocalized retroposed gene copy number variants are more likely to reach high frequencies, suggesting that both insertion bias and natural selection contribute to the colocalization of retrocopy-parent pairs. Further dissection implies that reduced selection efficacy, rather than positive selection, contributes to the elevated allele frequency of colocalized retroposed gene copy number variants. Overall, our results hint a role of interchromosomal colocalization in the "resurrection" of initially neutral retrocopies.
Collapse
Affiliation(s)
- Yubin Yan
- College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
| | - Yuhan Tian
- College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
| | - Zefeng Wu
- College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
| | - Kunling Zhang
- College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
| | - Ruolin Yang
- College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
| |
Collapse
|
10
|
Rosspopoff O, Trono D. Take a walk on the KRAB side. Trends Genet 2023; 39:844-857. [PMID: 37716846 DOI: 10.1016/j.tig.2023.08.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 08/18/2023] [Accepted: 08/18/2023] [Indexed: 09/18/2023]
Abstract
Canonical Krüppel-associated box (KRAB)-containing zinc finger proteins (KZFPs) act as major repressors of transposable elements (TEs) via the KRAB-mediated recruitment of the heterochromatin scaffold KRAB-associated protein (KAP)1. KZFP genes emerged some 420 million years ago in the last common ancestor of coelacanth, lungfish, and tetrapods, and dramatically expanded to give rise to lineage-specific repertoires in contemporary species paralleling their TE load and turnover. However, the KRAB domain displays sequence and function variations that reveal repeated diversions from a linear TE-KZFP trajectory. This Review summarizes current knowledge on the evolution of KZFPs and discusses how ancestral noncanonical KZFPs endowed with variant KRAB, SCAN or DUF3669 domains have been utilized to achieve KAP1-independent functions.
Collapse
Affiliation(s)
- Olga Rosspopoff
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Didier Trono
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland.
| |
Collapse
|
11
|
Duhamel M, Hood ME, Rodríguez de la Vega RC, Giraud T. Dynamics of transposable element accumulation in the non-recombining regions of mating-type chromosomes in anther-smut fungi. Nat Commun 2023; 14:5692. [PMID: 37709766 PMCID: PMC10502011 DOI: 10.1038/s41467-023-41413-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 08/30/2023] [Indexed: 09/16/2023] Open
Abstract
In the absence of recombination, the number of transposable elements (TEs) increases due to less efficient selection, but the dynamics of such TE accumulations are not well characterized. Leveraging a dataset of 21 independent events of recombination cessation of different ages in mating-type chromosomes of Microbotryum fungi, we show that TEs rapidly accumulated in regions lacking recombination, but that TE content reached a plateau at ca. 50% of occupied base pairs by 1.5 million years following recombination suppression. The same TE superfamilies have expanded in independently evolved non-recombining regions, in particular rolling-circle replication elements (Helitrons). Long-terminal repeat (LTR) retrotransposons of the Copia and Ty3 superfamilies also expanded, through transposition bursts (distinguished from gene conversion based on LTR divergence), with both non-recombining regions and autosomes affected, suggesting that non-recombining regions constitute TE reservoirs. This study improves our knowledge of genome evolution by showing that TEs can accumulate through bursts, following non-linear decelerating dynamics.
Collapse
Affiliation(s)
- Marine Duhamel
- Ecologie Systématique Evolution, IDEEV, CNRS, Université Paris-Saclay, AgroParisTech, Bâtiment 680, 12 route RD128, 91190, Gif-sur-Yvette, France.
- Evolution der Pflanzen und Pilze, Ruhr-Universität Bochum, Universitätsstraße 150, 44780, Bochum, Germany.
| | - Michael E Hood
- Department of Biology, Amherst College, 01002-5000, Amherst, MA, USA
| | - Ricardo C Rodríguez de la Vega
- Ecologie Systématique Evolution, IDEEV, CNRS, Université Paris-Saclay, AgroParisTech, Bâtiment 680, 12 route RD128, 91190, Gif-sur-Yvette, France
| | - Tatiana Giraud
- Ecologie Systématique Evolution, IDEEV, CNRS, Université Paris-Saclay, AgroParisTech, Bâtiment 680, 12 route RD128, 91190, Gif-sur-Yvette, France
| |
Collapse
|
12
|
Mercuri RLV, Conceição HB, Guardia GDA, Goldstein G, Vibranovski MD, Hinske LC, Galante PAF. Retro-miRs: novel and functional miRNAs originating from mRNA retrotransposition. Mob DNA 2023; 14:12. [PMID: 37684690 PMCID: PMC10486083 DOI: 10.1186/s13100-023-00301-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Accepted: 08/30/2023] [Indexed: 09/10/2023] Open
Abstract
BACKGROUND Reverse-transcribed gene copies (retrocopies) have emerged as major sources of evolutionary novelty. MicroRNAs (miRNAs) are small and highly conserved RNA molecules that serve as key post-transcriptional regulators of gene expression. The origin and subsequent evolution of miRNAs have been addressed but not fully elucidated. RESULTS In this study, we performed a comprehensive investigation of miRNA origination through retroduplicated mRNA sequences (retro-miRs). We identified 17 retro-miRs that emerged from the mRNA retrocopies. Four of these retro-miRs had de novo origins within retrocopied sequences, while 13 retro-miRNAs were located within exon regions and duplicated along with their host mRNAs. We found that retro-miRs were primate-specific, including five retro-miRs conserved among all primates and two human-specific retro-miRs. All retro-miRs were expressed, with predicted and experimentally validated target genes except miR-10527. Notably, the target genes of retro-miRs are involved in key biological processes such as metabolic processes, cell signaling, and regulation of neurotransmitters in the central nervous system. Additionally, we found that these retro-miRs play a potential oncogenic role in cancer by targeting key cancer genes and are overexpressed in several cancer types, including liver hepatocellular carcinoma and stomach adenocarcinoma. CONCLUSIONS Our findings demonstrated that mRNA retrotransposition is a key mechanism for the generation of novel miRNAs (retro-miRs) in primates. These retro-miRs are expressed, conserved, have target genes with important cellular functions, and play important roles in cancer.
Collapse
Affiliation(s)
- Rafael L V Mercuri
- Hospital Sirio-Libanes, São Paulo, 01308-060, Brazil
- Interunidades Em Bioinformática, Universidade de São Paulo, São Paulo, 05508-000, Brazil
| | - Helena B Conceição
- Hospital Sirio-Libanes, São Paulo, 01308-060, Brazil
- Interunidades Em Bioinformática, Universidade de São Paulo, São Paulo, 05508-000, Brazil
| | | | - Gabriel Goldstein
- Department of Genetics and Evolutionary Biology, University of São Paulo, São Paulo, Brazil
| | - Maria D Vibranovski
- Department of Genetics and Evolutionary Biology, University of São Paulo, São Paulo, Brazil
- School of Mathematical and Natural Sciences, New College of Interdisciplinary Arts and Sciences, Arizona State University, Tempe, AZ, USA
| | - Ludwig C Hinske
- Institute for Digital Medicine/Clinic of Anaesthesiology, University of Augsburg, Augsburg, Germany
| | | |
Collapse
|
13
|
Dudka D, Akins RB, Lampson MA. FREEDA: An automated computational pipeline guides experimental testing of protein innovation. J Cell Biol 2023; 222:e202212084. [PMID: 37358475 PMCID: PMC10292211 DOI: 10.1083/jcb.202212084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2022] [Revised: 04/22/2023] [Accepted: 06/07/2023] [Indexed: 06/27/2023] Open
Abstract
Cell biologists typically focus on conserved regions of a protein, overlooking innovations that can shape its function over evolutionary time. Computational analyses can reveal potential innovations by detecting statistical signatures of positive selection that lead to rapid accumulation of beneficial mutations. However, these approaches are not easily accessible to non-specialists, limiting their use in cell biology. Here, we present an automated computational pipeline FREEDA that provides a simple graphical user interface requiring only a gene name; integrates widely used molecular evolution tools to detect positive selection in rodents, primates, carnivores, birds, and flies; and maps results onto protein structures predicted by AlphaFold. Applying FREEDA to >100 centromere proteins, we find statistical evidence of positive selection within loops and turns of ancient domains, suggesting innovation of essential functions. As a proof-of-principle experiment, we show innovation in centromere binding of mouse CENP-O. Overall, we provide an accessible computational tool to guide cell biology research and apply it to experimentally demonstrate functional innovation.
Collapse
Affiliation(s)
- Damian Dudka
- Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, USA
| | - R. Brian Akins
- Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, USA
| | - Michael A. Lampson
- Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
14
|
Nakamura-García AK, Espinal-Enríquez J. Pseudogenes in Cancer: State of the Art. Cancers (Basel) 2023; 15:4024. [PMID: 37627052 PMCID: PMC10452131 DOI: 10.3390/cancers15164024] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 07/24/2023] [Accepted: 07/26/2023] [Indexed: 08/27/2023] Open
Abstract
Pseudogenes are duplicates of protein-coding genes that have accumulated multiple detrimental alterations, rendering them unable to produce the protein they encode. Initially disregarded as "junk DNA" due to their perceived lack of functionality, research on their biological roles has been hindered by this assumption. Nevertheless, recent focus has shifted towards these molecules due to their abnormal expression in cancer phenotypes. In this review, our objective is to provide a thorough overview of the current understanding of pseudogene formation, the mechanisms governing their expression, and the roles they may play in promoting tumorigenesis.
Collapse
|
15
|
Schelle L, Abrantes J, Baldauf HM, Esteves PJ. Evolution of primate interferon-induced transmembrane proteins (IFITMs): a story of gain and loss with a differentiation into a canonical cluster and IFITM retrogenes. Front Microbiol 2023; 14:1213685. [PMID: 37577422 PMCID: PMC10415907 DOI: 10.3389/fmicb.2023.1213685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 07/06/2023] [Indexed: 08/15/2023] Open
Abstract
Interferon-inducible transmembrane proteins (IFITMs) are a family of transmembrane proteins. The subgroup of immunity-related (IR-)IFITMs is involved in adaptive and innate immune responses, being especially active against viruses. Here, we suggest that IFITMs should be classified as (1) a canonical IFITM gene cluster, which is located on the same chromosome, and (2) IFITM retrogenes, with a random and unique location at different positions within the genome. Phylogenetic analyses of the canonical cluster revealed the existence of three novel groups of primate IFITMs (pIFITM) in the IR-IFITM clade: the prosimian pIFITMs(pro), the new world monkey pIFITMs(nwm) and the old world monkey pIFITMs(owm). Therefore, we propose a new nomenclature: IR-pIFITM1, IR-pIFITM2, IR-pIFITM3, IR-pIFITMnwm, IR-pIFITMowm, and IR-pIFITMpro. We observed divergent evolution for pIFITM5 and pIFITM10, and evidence for concerted evolution and a mechanism of birth-and-death evolution model for the IR-pIFITMs. In contrast, the IFITMs scattered throughout the genomes possessed features of retrogenes retrotransposed by class 1 transposable elements. The origin of the IFITM retrogenes correspond to more recent events. We hypothesize that the transcript of a canonical IFITM3 has been constantly retrotransposed using class 1 transposable elements resulting in the IFITM retro(pseudo)genes. The unique pattern of each species has most likely been caused by constant pseudogenization and loss of the retro(pseudo)genes. This suggests a third mechanism of evolution for the IR-IFITMs in primates, similar to the birth-and-death model of evolution, but via a transposable element mechanism, which resulted in retro(pseudo)genes.
Collapse
Affiliation(s)
- Luca Schelle
- Faculty of Medicine, Max von Pettenkofer Institute and Gene Center, Virology, National Reference Center for Retroviruses, LMU München, Munich, Germany
| | - Joana Abrantes
- CIBIO-InBIO, Research Center in Biodiversity and Genetic Resources, University of Porto, Vairão, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Vairão, Portugal
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Porto, Portugal
| | - Hanna-Mari Baldauf
- Faculty of Medicine, Max von Pettenkofer Institute and Gene Center, Virology, National Reference Center for Retroviruses, LMU München, Munich, Germany
| | - Pedro José Esteves
- CIBIO-InBIO, Research Center in Biodiversity and Genetic Resources, University of Porto, Vairão, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Vairão, Portugal
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Porto, Portugal
- CITS - Center of Investigation in Health Technologies, CESPU, Gandra, Portugal
| |
Collapse
|
16
|
Ma H, Wang M, Zhang YE, Tan S. The power of "controllers": Transposon-mediated duplicated genes evolve towards neofunctionalization. J Genet Genomics 2023; 50:462-472. [PMID: 37068629 DOI: 10.1016/j.jgg.2023.04.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 04/04/2023] [Accepted: 04/05/2023] [Indexed: 04/19/2023]
Abstract
Since the discovery of the first transposon by Dr. Barbara McClintock, the prevalence and diversity of transposable elements (TEs) have been gradually recognized. As fundamental genetic components, TEs drive organismal evolution not only by contributing functional sequences (e.g., regulatory elements or "controllers" as phrased by Dr. McClintock) but also by shuffling genomic sequences. In the latter respect, TE-mediated gene duplications have contributed to the origination of new genes and attracted extensive interest. In response to the development of this field, we herein attempt to provide an overview of TE-mediated duplication by focusing on common rules emerging across duplications generated by different TE types. Specifically, despite the huge divergence of transposition machinery across TEs, we identify three common features of various TE-mediated duplication mechanisms, including end bypass, template switching, and recurrent transposition. These three features lead to one common functional outcome, namely, TE-mediated duplicates tend to be subjected to exon shuffling and neofunctionalization. Therefore, the intrinsic properties of the mutational mechanism constrain the evolutionary trajectories of these duplicates. We finally discuss the future of this field including an in-depth characterization of both the duplication mechanisms and functions of TE-mediated duplicates.
Collapse
Affiliation(s)
- Huijing Ma
- Key Laboratory of Zoological Systematics and Evolution & State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Mengxia Wang
- Key Laboratory of Zoological Systematics and Evolution & State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yong E Zhang
- Key Laboratory of Zoological Systematics and Evolution & State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnan 650223, China; Chinese Institute for Brain Research, Beijing 102206, China.
| | - Shengjun Tan
- Key Laboratory of Zoological Systematics and Evolution & State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China.
| |
Collapse
|
17
|
Fujiyabu C, Sato K, Ohuchi H, Yamashita T. Diversification processes of teleost intron-less opsin genes. J Biol Chem 2023:104899. [PMID: 37295773 PMCID: PMC10339062 DOI: 10.1016/j.jbc.2023.104899] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 05/31/2023] [Accepted: 06/01/2023] [Indexed: 06/12/2023] Open
Abstract
Opsins are universal photosensitive proteins in animals. Vertebrates have a variety of opsin genes for visual and non-visual photoreceptions. Analysis of the gene structures shows that most opsin genes have introns in their coding regions. However, teleosts exceptionally have several intron-less opsin genes which are presumed to have been duplicated by an RNA-based gene duplication mechanism, retroduplication. Among these retrogenes, we focused on the Opn4 (melanopsin) gene responsible for non-image-forming photoreception. Many teleosts have five Opn4 genes including one intron-less gene, which is speculated to have been formed from a parental intron-containing gene in the Actinopterygii. In this study, to reveal the evolutionary history of Opn4 genes, we analyzed them in teleost (zebrafish and medaka) and non-teleost (bichir, sturgeon and gar) fishes. Our synteny analysis suggests that the intron-less Opn4 gene emerged by retroduplication after branching of the bichir lineage. In addition, our biochemical and histochemical analyses showed that, in the teleost lineage, the newly acquired intron-less Opn4 gene became abundantly used without substantial changes of the molecular properties of the Opn4 protein. This stepwise evolutionary model of Opn4 genes is quite similar to that of rhodopsin genes in the Actinopterygii. The unique acquisition of rhodopsin and Opn4 retrogenes would have contributed to the diversification of the opsin gene repertoires in the Actinopterygii and the adaptation of teleosts to various aquatic environments.
Collapse
Affiliation(s)
- Chihiro Fujiyabu
- Department of Biophysics, Graduate School of Science, Kyoto University, Kyoto 606-8502, Japan
| | - Keita Sato
- Department of Cytology and Histology, Okayama University Faculty of Medicine, Dentistry and Pharmaceutical Sciences, Okayama 700-8558, Japan
| | - Hideyo Ohuchi
- Department of Cytology and Histology, Okayama University Faculty of Medicine, Dentistry and Pharmaceutical Sciences, Okayama 700-8558, Japan
| | - Takahiro Yamashita
- Department of Biophysics, Graduate School of Science, Kyoto University, Kyoto 606-8502, Japan.
| |
Collapse
|
18
|
Titus-McQuillan JE, Nanni AV, McIntyre LM, Rogers RL. Estimating transcriptome complexities across eukaryotes. BMC Genomics 2023; 24:254. [PMID: 37170194 PMCID: PMC10173493 DOI: 10.1186/s12864-023-09326-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 04/20/2023] [Indexed: 05/13/2023] Open
Abstract
BACKGROUND Genomic complexity is a growing field of evolution, with case studies for comparative evolutionary analyses in model and emerging non-model systems. Understanding complexity and the functional components of the genome is an untapped wealth of knowledge ripe for exploration. With the "remarkable lack of correspondence" between genome size and complexity, there needs to be a way to quantify complexity across organisms. In this study, we use a set of complexity metrics that allow for evaluating changes in complexity using TranD. RESULTS We ascertain if complexity is increasing or decreasing across transcriptomes and at what structural level, as complexity varies. In this study, we define three metrics - TpG, EpT, and EpG- to quantify the transcriptome's complexity that encapsulates the dynamics of alternative splicing. Here we compare complexity metrics across 1) whole genome annotations, 2) a filtered subset of orthologs, and 3) novel genes to elucidate the impacts of orthologs and novel genes in transcript model analysis. Effective Exon Number (EEN) issued to compare the distribution of exon sizes within transcripts against random expectations of uniform exon placement. EEN accounts for differences in exon size, which is important because novel gene differences in complexity for orthologs and whole-transcriptome analyses are biased towards low-complexity genes with few exons and few alternative transcripts. CONCLUSIONS With our metric analyses, we are able to quantify changes in complexity across diverse lineages with greater precision and accuracy than previous cross-species comparisons under ortholog conditioning. These analyses represent a step toward whole-transcriptome analysis in the emerging field of non-model evolutionary genomics, with key insights for evolutionary inference of complexity changes on deep timescales across the tree of life. We suggest a means to quantify biases generated in ortholog calling and correct complexity analysis for lineage-specific effects. With these metrics, we directly assay the quantitative properties of newly formed lineage-specific genes as they lower complexity.
Collapse
Affiliation(s)
- James E Titus-McQuillan
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, 28223, USA.
| | - Adalena V Nanni
- Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, 32611, USA
- University of Florida Genetics Institute, University of Florida, Gainesville, FL, 32611, USA
| | - Lauren M McIntyre
- Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, 32611, USA
- University of Florida Genetics Institute, University of Florida, Gainesville, FL, 32611, USA
| | - Rebekah L Rogers
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| |
Collapse
|
19
|
Ryan CJ, Mehta I, Kebabci N, Adams DJ. Targeting synthetic lethal paralogs in cancer. Trends Cancer 2023; 9:397-409. [PMID: 36890003 DOI: 10.1016/j.trecan.2023.02.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Revised: 02/02/2023] [Accepted: 02/07/2023] [Indexed: 03/08/2023]
Abstract
Synthetic lethal interactions, where mutation of one gene renders cells sensitive to inhibition of another gene, can be exploited for the development of targeted therapeutics in cancer. Pairs of duplicate genes (paralogs) often share common functionality and hence are a potentially rich source of synthetic lethal interactions. Because the majority of human genes have paralogs, exploiting such interactions could be a widely applicable approach for targeting gene loss in cancer. Moreover, existing small-molecule drugs may exploit synthetic lethal interactions by inhibiting multiple paralogs simultaneously. Consequently, the identification of synthetic lethal interactions between paralogs could be extremely informative for drug development. Here we review approaches to identify such interactions and discuss some of the challenges of exploiting them.
Collapse
Affiliation(s)
- Colm J Ryan
- Conway Institute and School of Computer Science, University College Dublin, Dublin, Ireland; Systems Biology Ireland, University College Dublin, Dublin, Ireland.
| | - Ishan Mehta
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Narod Kebabci
- Conway Institute and School of Computer Science, University College Dublin, Dublin, Ireland; Science Foundation Ireland (SFI) Centre for Research Training in Genomics Data Science, University College Dublin, Dublin, Ireland
| | - David J Adams
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| |
Collapse
|
20
|
Seczynska M, Lehner PJ. The sound of silence: mechanisms and implications of HUSH complex function. Trends Genet 2023; 39:251-267. [PMID: 36754727 DOI: 10.1016/j.tig.2022.12.005] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 12/14/2022] [Accepted: 12/30/2022] [Indexed: 02/08/2023]
Abstract
The vertebrate genome is under constant threat of invasion by genetic parasites. Whether the host can immediately recognize and respond to invading elements has been unclear. The discovery of the human silencing hub (HUSH) complex, and the finding that it provides immediate protection from genome invasion by silencing products of reverse transcription, have important implications for mammalian genome evolution. In this review, we summarize recent insights into HUSH function and describe how cellular introns provide a novel means of self-nonself discrimination, allowing HUSH to recognize and transcriptionally repress a broad range of intronless genetic elements. We discuss how HUSH contributes to genome evolution, and highlight studies reporting the critical role of HUSH in development and implicating HUSH in the control of immune signaling and cancer progression.
Collapse
Affiliation(s)
- Marta Seczynska
- Cambridge Institute for Therapeutic Immunology & Infectious Disease, Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, University of Cambridge, Cambridge CB2 0AW, UK.
| | - Paul J Lehner
- Cambridge Institute for Therapeutic Immunology & Infectious Disease, Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, University of Cambridge, Cambridge CB2 0AW, UK.
| |
Collapse
|
21
|
Evolutionary honing in and mutational replacement: how long-term directed mutational responses to specific environmental pressures are possible. Theory Biosci 2023; 142:87-105. [PMID: 36899155 DOI: 10.1007/s12064-023-00387-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Accepted: 01/13/2023] [Indexed: 03/12/2023]
Abstract
Recent results have shown that the human malaria-resistant hemoglobin S mutation originates de novo more frequently in the gene and in the population where it is of adaptive significance, namely, in the hemoglobin subunit beta gene compared to the nonresistant but otherwise identical 20A[Formula: see text]T mutation in the hemoglobin subunit delta gene, and in sub-Saharan Africans, who have been subject to intense malarial pressure for many generations, compared to northern Europeans, who have not. This finding raises a fundamental challenge to the traditional notion of accidental mutation. Here, we address this finding with the replacement hypothesis, according to which preexisting genetic interactions can lead directly and mechanistically to mutations that simplify and replace them. Thus, an evolutionary process under selection can gradually hone in on interactions of importance for the currently evolving adaptations, from which large-effect mutations follow that are relevant to these adaptations. We exemplify this hypothesis using multiple types of mutation, including gene fusion mutations, gene duplication mutations, A[Formula: see text]G mutations in RNA-edited sites and transcription-associated mutations, and place it in the broader context of a system-level view of mutation origination called interaction-based evolution. Potential consequences include that similarity of mutation pressures may contribute to parallel evolution in genetically related species, that the evolution of genome organization may be driven by mutational mechanisms, that transposable element movements may also be explained by replacement, and that long-term directed mutational responses to specific environmental pressures are possible. Such mutational phenomena need to be further tested by future studies in natural and artificial settings.
Collapse
|
22
|
Dudka D, Akins RB, Lampson MA. FREEDA: an automated computational pipeline guides experimental testing of protein innovation by detecting positive selection. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.27.530329. [PMID: 36909479 PMCID: PMC10002610 DOI: 10.1101/2023.02.27.530329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
Abstract
Cell biologists typically focus on conserved regions of a protein, overlooking innovations that can shape its function over evolutionary time. Computational analyses can reveal potential innovations by detecting statistical signatures of positive selection that leads to rapid accumulation of beneficial mutations. However, these approaches are not easily accessible to non-specialists, limiting their use in cell biology. Here, we present an automated computational pipeline FREEDA (Finder of Rapidly Evolving Exons in De novo Assemblies) that provides a simple graphical user interface requiring only a gene name, integrates widely used molecular evolution tools to detect positive selection, and maps results onto protein structures predicted by AlphaFold. Applying FREEDA to >100 mouse centromere proteins, we find evidence of positive selection in intrinsically disordered regions of ancient domains, suggesting innovation of essential functions. As a proof-of-principle experiment, we show innovation in centromere binding of CENP-O. Overall, we provide an accessible computational tool to guide cell biology research and apply it to experimentally demonstrate functional innovation.
Collapse
|
23
|
Zhang L, Bisht P, Flamier A, Barrasa MI, Friesen M, Richards A, Hughes SH, Jaenisch R. LINE1-Mediated Reverse Transcription and Genomic Integration of SARS-CoV-2 mRNA Detected in Virus-Infected but Not in Viral mRNA-Transfected Cells. Viruses 2023; 15:629. [PMID: 36992338 PMCID: PMC10057545 DOI: 10.3390/v15030629] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 02/22/2023] [Accepted: 02/23/2023] [Indexed: 03/03/2023] Open
Abstract
SARS-CoV-2 sequences can be reverse-transcribed and integrated into the genomes of virus-infected cells by a LINE1-mediated retrotransposition mechanism. Whole-genome sequencing (WGS) methods detected retrotransposed SARS-CoV-2 subgenomic sequences in virus-infected cells overexpressing LINE1, while an enrichment method (TagMap) identified retrotranspositions in cells that did not overexpress LINE1. LINE1 overexpression increased retrotranspositions about 1000-fold as compared to non-overexpressing cells. Nanopore WGS can directly recover retrotransposed viral and flanking host sequences, but its sensitivity depends on the depth of sequencing (a typical 20-fold sequencing depth would only examine 10 diploid cell equivalents). In contrast, TagMap enriches the host-virus junctions and can interrogate up to 20,000 cells and is able to detect rare viral retrotranspositions in LINE1 non-overexpressing cells. Although Nanopore WGS is 10-20-fold more sensitive per tested cell, TagMap can interrogate 1000-2000-fold more cells and, therefore, can identify infrequent retrotranspositions. When comparing SARS-CoV-2 infection and viral nucleocapsid mRNA transfection by TagMap, retrotransposed SARS-CoV-2 sequences were only detected in infected but not in transfected cells. Retrotransposition in virus-infected cells, in contrast to transfected cells, may be facilitated because virus infection, in contrast to viral RNA transfection, results in significantly higher viral RNA levels and stimulates LINE1 expression by causing cellular stress.
Collapse
Affiliation(s)
- Liguo Zhang
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Punam Bisht
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Anthony Flamier
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | | | - Max Friesen
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Alexsia Richards
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Stephen H. Hughes
- HIV Dynamics and Replication Program, Center for Cancer Research, National Cancer Institute, Frederick, MD 21702, USA
| | - Rudolf Jaenisch
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| |
Collapse
|
24
|
Chen Y, Ma T, Zhang T, Ma L. Trends in the evolution of intronless genes in Poaceae. FRONTIERS IN PLANT SCIENCE 2023; 14:1065631. [PMID: 36875616 PMCID: PMC9978806 DOI: 10.3389/fpls.2023.1065631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 02/01/2023] [Indexed: 06/18/2023]
Abstract
Intronless genes (IGs), which are a feature of prokaryotes, are a fascinating group of genes that are also present in eukaryotes. In the current study, a comparison of Poaceae genomes revealed that the origin of IGs may have involved ancient intronic splicing, reverse transcription, and retrotranspositions. Additionally, IGs exhibit the typical features of rapid evolution, including recent duplications, variable copy numbers, low divergence between paralogs, and high non-synonymous to synonymous substitution ratios. By tracing IG families along the phylogenetic tree, we determined that the evolutionary dynamics of IGs differed among Poaceae subfamilies. IG families developed rapidly before the divergence of Pooideae and Oryzoideae and expanded slowly after the divergence. In contrast, they emerged gradually and consistently in the Chloridoideae and Panicoideae clades during evolution. Furthermore, IGs are expressed at low levels. Under relaxed selection pressure, retrotranspositions, intron loss, and gene duplications and conversions may promote the evolution of IGs. The comprehensive characterization of IGs is critical for in-depth studies on intron functions and evolution as well as for assessing the importance of introns in eukaryotes.
Collapse
Affiliation(s)
- Yong Chen
- *Correspondence: Tingting Zhang, ; Lei Ma,
| | | | | | - Lei Ma
- *Correspondence: Tingting Zhang, ; Lei Ma,
| |
Collapse
|
25
|
Liang J, Kong L, Hu X, Fu C, Bai S. Chromosomal-level genome assembly of the high-quality Xian/Indica rice (Oryza sativa L.) Xiangyaxiangzhan. BMC PLANT BIOLOGY 2023; 23:94. [PMID: 36782126 PMCID: PMC9926808 DOI: 10.1186/s12870-023-04114-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 02/10/2023] [Indexed: 06/18/2023]
Abstract
The indica rice variety XYXZ carries elite traits including appearance and eating quality. Here, we report the de novo assembly of XYXZ using Illumine paired-end whole-genome shotgun sequencing and Nanopore sequencing. We annotated 39,722 protein-coding genes in the 395.04 Mb assembly. In comparison to other cultivars, XYXZ showed a larger gene size including the transcripts and introns, and more exons per gene. And hundreds of ultra-long genes were also detected. A total of 4362 complete LTRs were annotated, and among them, many were located next to or in protein-coding genes including several genes related to rice quality. We observed the different distributions of LTRs in these genes among XYXZ, Nipponbare, and R498, implying these LTRs might potentially affect expressions of the proximal genes and rice quality. Overall, This chromosome-length genome assembly of XYXZ provides a valuable resource for gene discovery, genetic variation and evolution, and the breeding of high-quality rice.
Collapse
Affiliation(s)
- Jiayan Liang
- Rice Research Institute Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Leilei Kong
- Rice Research Institute Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Xiaodan Hu
- Rice Research Institute Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Chongyun Fu
- Rice Research Institute Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China.
| | - Song Bai
- Rice Research Institute Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China.
| |
Collapse
|
26
|
Comprehensive Comparative Analysis Sheds Light on the Patterns of Microsatellite Distribution across Birds Based on the Chromosome-Level Genomes. Animals (Basel) 2023; 13:ani13040655. [PMID: 36830442 PMCID: PMC9951716 DOI: 10.3390/ani13040655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 01/30/2023] [Accepted: 02/09/2023] [Indexed: 02/16/2023] Open
Abstract
Microsatellites (SSRs) are widely distributed in the genomes of organisms and are an important genetic basis for genome evolution and phenotypic adaptation. Although the distribution patterns of microsatellites have been investigated in many phylogenetic lineages, they remain unclear within the morphologically and physiologically diverse avian clades. Here, based on high-quality chromosome-level genomes, we examined the microsatellite distribution patterns for 53 birds from 16 orders. The results demonstrated that each type of SSR had the same ratio between taxa. For example, the frequency of imperfect SSRs (I-SSRs) was 69.90-84.61%, while perfect SSRs (P-SSRs) were 14.86-28.13% and compound SSRs (C-SSRs) were 0.39-2.24%. Mononucleotide SSRs were dominant for perfect SSRs (32.66-76.48%) in most bird species (98.11%), and A(n) was the most abundant repeat motifs of P-SSRs in all birds (5.42-68.22%). Our study further confirmed that the abundance and diversity of microsatellites were less effected by evolutionary history but its length. The number of P-SSRs decreased with increasing repeat times, and longer P-SSRs motifs had a higher variability coefficient of the repeat copy number and lower diversity, indicating that longer motifs tended to have more stable preferences in avian genomes. We also found that P-SSRs were mainly distributed at the gene ends, and the functional annotation for these genes demonstrated that they were related to signal transduction and cellular process. In conclusion, our research provided avian SSR distribution patterns, which will help to explore the genetic basis for phenotypic diversity in birds.
Collapse
|
27
|
Zhang L, Bisht P, Flamier A, Barrasa MI, Richards A, Hughes SH, Jaenisch R. LINE1-mediated reverse transcription and genomic integration of SARS-CoV-2 mRNA detected in virus-infected but not in viral mRNA-transfected cells. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.10.527906. [PMID: 37293025 PMCID: PMC10245962 DOI: 10.1101/2023.02.10.527906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
SARS-CoV-2 sequences can be reverse-transcribed and integrated into the genomes of virus-infected cells by a LINE1-mediated retrotransposition mechanism. Whole genome sequencing (WGS) methods detected retrotransposed SARS-CoV-2 subgenomic sequences in virus-infected cells overexpressing LINE1, while an enrichment method (TagMap) identified retrotranspositions in cells that did not overexpress LINE1. LINE1 overexpression increased retrotranspositions about 1,000-fold as compared to non-overexpressing cells. Nanopore WGS can directly recover retrotransposed viral and flanking host sequences but its sensitivity depends on the depth of sequencing (a typical 20-fold sequencing depth would only examine 10 diploid cell equivalents). In contrast, TagMap enriches for the host-virus junctions and can interrogate up to 20,000 cells and is able to detect rare viral retrotranspositions in LINE1 non-overexpressing cells. Although Nanopore WGS is 10 - 20-fold more sensitive per tested cell, TagMap can interrogate 1,000 - 2,000-fold more cells and therefore can identify infrequent retrotranspositions. When comparing SARS-CoV-2 infection and viral nucleocapsid mRNA transfection by TagMap, retrotransposed SARS-CoV-2 sequences were only detected in infected but not in transfected cells. Retrotransposition in virus-infected in contrast to transfected cells may be facilitated because virus infection in contrast to viral RNA transfection results in significantly higher viral RNA levels and stimulates LINE1-expression which causes cellular stress.
Collapse
|
28
|
Craddock V, Mahajan A, Spikes L, Krishnamachary B, Ram AK, Kumar A, Chen L, Chalise P, Dhillon NK. Persistent circulation of soluble and extracellular vesicle-linked Spike protein in individuals with postacute sequelae of COVID-19. J Med Virol 2023; 95:e28568. [PMID: 36756925 PMCID: PMC10048846 DOI: 10.1002/jmv.28568] [Citation(s) in RCA: 37] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 02/03/2023] [Accepted: 02/06/2023] [Indexed: 02/10/2023]
Abstract
SARS-CoV-2, the causative agent of COVID-19 disease, has resulted in the death of millions worldwide since the beginning of the pandemic in December 2019. While much progress has been made to understand acute manifestations of SARS-CoV-2 infection, less is known about post-acute sequelae of COVID-19 (PASC). We investigated the levels of both Spike protein (Spike) and viral RNA circulating in patients hospitalized with acute COVID-19 and in patients with and without PASC. We found that Spike and viral RNA were more likely to be present in patients with PASC. Among these patients, 30% were positive for both Spike and viral RNA; whereas, none of the individuals without PASC were positive for both. The levels of Spike and/or viral RNA in the PASC+ve patients were found to be increased or remained the same as in the acute phase; whereas, in the PASC-ve group, these viral components decreased or were totally absent. Additionally, this is the first report to show that part of the circulating Spike is linked to extracellular vesicles without any presence of viral RNA in these vesicles. In conclusion, our findings suggest that Spike and/or viral RNA fragments persist in the recovered COVID-19 patients with PASC up to 1 year or longer after acute SARS-CoV-2 infection.
Collapse
Affiliation(s)
- Vaughn Craddock
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS
| | - Aatish Mahajan
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS
| | - Leslie Spikes
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS
| | - Balaji Krishnamachary
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS
| | - Anil Kumar Ram
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS
| | - Ashok Kumar
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS
| | - Ling Chen
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS
| | - Prabhakar Chalise
- Department of Biostatistics & Data Science, University of Kansas Medical Center, Kansas City, KS
| | - Navneet K. Dhillon
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS
| |
Collapse
|
29
|
Amici DR, Cingoz H, Alasady MJ, Alhayek S, Phoumyvong CM, Sahni N, Yi SS, Mendillo ML. The HAPSTR2 retrogene buffers stress signaling and resilience in mammals. Nat Commun 2023; 14:152. [PMID: 36631436 PMCID: PMC9834230 DOI: 10.1038/s41467-022-35697-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 12/20/2022] [Indexed: 01/12/2023] Open
Abstract
We recently identified HAPSTR1 (C16orf72) as a key component in a novel pathway which regulates the cellular response to molecular stressors, such as DNA damage, nutrient scarcity, and protein misfolding. Here, we identify a functional paralog to HAPSTR1: HAPSTR2. HAPSTR2 formed early in mammalian evolution, via genomic integration of a reverse transcribed HAPSTR1 transcript, and has since been preserved under purifying selection. HAPSTR2, expressed primarily in neural and germline tissues and a subset of cancers, retains established biochemical features of HAPSTR1 to achieve two functions. In normal physiology, HAPSTR2 directly interacts with HAPSTR1, markedly augmenting HAPSTR1 protein stability in a manner independent from HAPSTR1's canonical E3 ligase, HUWE1. Alternatively, in the context of HAPSTR1 loss, HAPSTR2 expression is sufficient to buffer stress signaling and resilience. Thus, we discover a mammalian retrogene which safeguards fitness.
Collapse
Affiliation(s)
- David R Amici
- Dept. of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Simpson Querrey Center for Epigenetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
| | - Harun Cingoz
- Dept. of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Simpson Querrey Center for Epigenetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
| | - Milad J Alasady
- Dept. of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Simpson Querrey Center for Epigenetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
| | - Sammy Alhayek
- Dept. of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Simpson Querrey Center for Epigenetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
| | - Claire M Phoumyvong
- Dept. of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Simpson Querrey Center for Epigenetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
| | - Nidhi Sahni
- Department of Epigenetics and Molecular Carcinogenesis, and Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
- Quantitative and Computational Biosciences Program, Baylor College of Medicine, Houston, TX, 77030, USA
| | - S Stephen Yi
- Livestrong Cancer Institutes, Department of Oncology, and Department of Biomedical Engineering, The University of Texas at Austin, Austin, TX, 78712, USA
- Interdisciplinary Life Sciences Graduate Programs (ILSGP), and Oden Institute for Computational Engineering and Sciences (ICES), The University of Texas at Austin, Austin, TX, 78712, USA
| | - Marc L Mendillo
- Dept. of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA.
- Simpson Querrey Center for Epigenetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA.
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA.
| |
Collapse
|
30
|
López-Cortegano E, Craig RJ, Chebib J, Balogun EJ, Keightley PD. Rates and spectra of de novo structural mutations in Chlamydomonas reinhardtii. Genome Res 2023; 33:45-60. [PMID: 36617667 PMCID: PMC9977147 DOI: 10.1101/gr.276957.122] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 12/06/2022] [Indexed: 12/14/2022]
Abstract
Genetic variation originates from several types of spontaneous mutation, including single-nucleotide substitutions, short insertions and deletions (indels), and larger structural changes. Structural mutations (SMs) drive genome evolution and are thought to play major roles in evolutionary adaptation, speciation, and genetic disease, including cancers. Sequencing of mutation accumulation (MA) lines has provided estimates of rates and spectra of single-nucleotide and indel mutations in many species, yet the rate of new SMs is largely unknown. Here, we use long-read sequencing to determine the full mutation spectrum in MA lines derived from two strains (CC-1952 and CC-2931) of the green alga Chlamydomonas reinhardtii The SM rate is highly variable between strains and between MA lines, and SMs represent a substantial proportion of all mutations in both strains (CC-1952 6%; CC-2931 12%). The SM spectra differ considerably between the two strains, with almost all inversions and translocations occurring in CC-2931 MA lines. This variation is associated with heterogeneity in the number and type of active transposable elements (TEs), which comprise major proportions of SMs in both strains (CC-1952 22%; CC-2931 38%). In CC-2931, a Crypton and a previously undescribed type of DNA element have caused 71% of chromosomal rearrangements, whereas in CC-1952, a Dualen LINE is associated with 87% of duplications. Other SMs, notably large duplications in CC-2931, are likely products of various double-strand break repair pathways. Our results show that diverse types of SMs occur at substantial rates, and support prominent roles for SMs and TEs in evolution.
Collapse
Affiliation(s)
- Eugenio López-Cortegano
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| | - Rory J. Craig
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom;,California Institute for Quantitative Biosciences, UC Berkeley, Berkeley, California 94720, USA
| | - Jobran Chebib
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| | - Eniolaye J. Balogun
- Department of Ecology and Evolutionary Biology, University of Toronto, Ontario ON M5S 3B2, Canada;,Department of Biology, University of Toronto Mississauga, Mississauga ON L5L 1C6, Canada
| | - Peter D. Keightley
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| |
Collapse
|
31
|
Vaidya M, Smith J, Field M, Sugaya K. Analysis of regulatory sequences in exosomal DNA of NANOGP8. PLoS One 2023; 18:e0280959. [PMID: 36696426 PMCID: PMC9876286 DOI: 10.1371/journal.pone.0280959] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Accepted: 12/20/2022] [Indexed: 01/26/2023] Open
Abstract
Exosomes participate in intercellular communication by transporting functionally active molecules. Such cargo from the original cells comprising proteins, micro-RNA, mRNA, single-stranded (ssDNA) and double-stranded DNA (dsDNA) molecules pleiotropically transforms the target cells. Although cancer cells secrete exosomes carrying a significant level of DNA capable of modulating oncogene expression in a recipient cell, the regulatory mechanism is unknown. We have previously reported that cancer cells produce exosomes containing NANOGP8 DNA. NANOGP8 is an oncogenic paralog of embryonic stem cell transcription factor NANOG and does not express in cells since it is a pseudogene. However, in this study, we evaluated NANOGP8 expression in glioblastoma multiforme (GBM) tissue from a surgically removed brain tumor of a patient. Significantly higher NANOGP8 transcription was observed in GBM cancer stem cells (CSCs) than in GBM cancer cells or neural stem cells (NSCs), despite identical sequences of NANOGP8-upstream genomic region in all the cell lines. This finding suggests that upstream genomic sequences of NANOGP8 may have environment-dependent promoter activity. We also found that the regulatory sequences upstream of exosomal NANOGP8 GBM DNA contain multiple core promoter elements, transcription factor binding sites, and segments of human viruses known for their oncogenic role. The exosomal sequence of NANOGP8-upstream GBM DNA is different from corresponding genomic sequences in CSCs, cancer cells, and NSCs as well as from the sequences reported by NCBI. These sequence dissimilarities suggest that exosomal NANOGP8 GBM DNA may not be a part of the genomic DNA. Exosomes possibly acquire this DNA from other sources where it is synthesized by an unknown mechanism. The significance of exosome-bestowed regulatory elements in the transcription of promoter-less retrogene such as NANOGP8 remains to be determined.
Collapse
Affiliation(s)
- Manjusha Vaidya
- Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida, Orlando, FL, United States of America
| | - Jonhoi Smith
- Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida, Orlando, FL, United States of America
| | - Melvin Field
- Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida, Orlando, FL, United States of America
- AdventHealth Cancer Institute, Orlando, FL, United States of America
| | - Kiminobu Sugaya
- Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida, Orlando, FL, United States of America
- * E-mail:
| |
Collapse
|
32
|
Genes that are Used Together are More Likely to be Fused Together in Evolution by Mutational Mechanisms: A Bioinformatic Test of the Used-Fused Hypothesis. Evol Biol 2023; 50:30-55. [PMID: 36816837 PMCID: PMC9925542 DOI: 10.1007/s11692-022-09579-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2021] [Accepted: 09/11/2022] [Indexed: 12/05/2022]
Abstract
Cases of parallel or recurrent gene fusions in evolution as well as in genetic disease and cancer are difficult to explain, because unlike point mutations, they can require the repetition of a similar configuration of multiple breakpoints rather than the repetition of a single point mutation. The used-together-fused-together hypothesis holds that genes that are used together repeatedly and persistently in a specific context are more likely to undergo fusion mutation in the course of evolution for mechanistic reasons. This hypothesis offers to explain gene fusion in both evolution and disease under one umbrella. Using bioinformatic data, we tested this hypothesis against alternatives, including that all gene pairs can fuse by random mutation, but among pairs thus fused, those that had interacted previously are more likely to be favored by selection. Results show that across multiple measures of gene interaction, human genes whose orthologs are fused in one or more species are more likely to interact with each other than random pairs of genes of the same genomic distance between pair members; that an overlap exists between genes that fused in the course of evolution in non-human species and genes that undergo fusion in human cancers; and that across six primate species studied, fusions predominate over fissions and exhibit substantial evolutionary parallelism. Together, these results support the used-together-fused-together hypothesis over its alternatives. Multiple implications are discussed, including the relevance of mutational mechanisms to the evolution of genome organization, to the distribution of fitness effects of mutation, to evolutionary parallelism and more. Supplementary Information The online version contains supplementary material available at 10.1007/s11692-022-09579-9.
Collapse
|
33
|
Gabrielli F, Antinucci M, Tofanelli S. Gene Structure Evolution of the Short-Chain Dehydrogenase/Reductase (SDR) Family. Genes (Basel) 2022; 14:110. [PMID: 36672851 PMCID: PMC9859523 DOI: 10.3390/genes14010110] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Revised: 12/12/2022] [Accepted: 12/22/2022] [Indexed: 12/31/2022] Open
Abstract
SDR (Short-chain Dehydrogenases/Reductases) are one of the oldest and heterogeneous superfamily of proteins, whose classification is problematic because of the low percent identity, even within families. To get clearer insights into SDR molecular evolution, we explored the splicing site organization of the 75 human SDR genes across their vertebrate and invertebrate orthologs. We found anomalous gene structures in members of the human SDR7C and SDR42E families that provide clues of retrogene properties and independent evolutionary trajectories from a common invertebrate ancestor. The same analyses revealed that the identity value between human and invertebrate non-allelic variants is not necessarily associated with the homologous gene structure. Accordingly, a revision of the SDR nomenclature is proposed by including the human SDR40C1 and SDR7C gene in the same family.
Collapse
Affiliation(s)
- Franco Gabrielli
- Department of Biology, University of Pisa, Via Ghini, 13-56126 Pisa, Italy
| | - Marco Antinucci
- Department of Medicine and Life Sciences, Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, 08002 Barcelona, Spain
| | - Sergio Tofanelli
- Department of Biology, University of Pisa, Via Ghini, 13-56126 Pisa, Italy
| |
Collapse
|
34
|
Qian SH, Chen L, Xiong YL, Chen ZX. Evolution and function of developmentally dynamic pseudogenes in mammals. Genome Biol 2022; 23:235. [PMID: 36348461 PMCID: PMC9641868 DOI: 10.1186/s13059-022-02802-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Accepted: 10/23/2022] [Indexed: 11/11/2022] Open
Abstract
BACKGROUND Pseudogenes are excellent markers for genome evolution, which are emerging as crucial regulators of development and disease, especially cancer. However, systematic functional characterization and evolution of pseudogenes remain largely unexplored. RESULTS To systematically characterize pseudogenes, we date the origin of human and mouse pseudogenes across vertebrates and observe a burst of pseudogene gain in these two lineages. Based on a hybrid sequencing dataset combining full-length PacBio sequencing, sample-matched Illumina sequencing, and public time-course transcriptome data, we observe that abundant mammalian pseudogenes could be transcribed, which contribute to the establishment of organ identity. Our analyses reveal that developmentally dynamic pseudogenes are evolutionarily conserved and show an increasing weight during development. Besides, they are involved in complex transcriptional and post-transcriptional modulation, exhibiting the signatures of functional enrichment. Coding potential evaluation suggests that 19% of human pseudogenes could be translated, thus serving as a new way for protein innovation. Moreover, pseudogenes carry disease-associated SNPs and conduce to cancer transcriptome perturbation. CONCLUSIONS Our discovery reveals an unexpectedly high abundance of mammalian pseudogenes that can be transcribed and translated, and these pseudogenes represent a novel regulatory layer. Our study also prioritizes developmentally dynamic pseudogenes with signatures of functional enrichment and provides a hybrid sequencing dataset for further unraveling their biological mechanisms in organ development and carcinogenesis in the future.
Collapse
Affiliation(s)
- Sheng Hu Qian
- grid.35155.370000 0004 1790 4137Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan, 430070 PR China ,grid.35155.370000 0004 1790 4137Hubei Key Laboratory of Agricultural Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070 PR China
| | - Lu Chen
- grid.35155.370000 0004 1790 4137Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan, 430070 PR China ,grid.35155.370000 0004 1790 4137Hubei Key Laboratory of Agricultural Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070 PR China
| | - Yu-Li Xiong
- grid.35155.370000 0004 1790 4137Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan, 430070 PR China ,grid.35155.370000 0004 1790 4137Hubei Key Laboratory of Agricultural Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070 PR China
| | - Zhen-Xia Chen
- grid.35155.370000 0004 1790 4137Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan, 430070 PR China ,grid.35155.370000 0004 1790 4137Hubei Key Laboratory of Agricultural Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070 PR China ,grid.35155.370000 0004 1790 4137Interdisciplinary Sciences Institute, Huazhong Agricultural University, Wuhan, 430070 PR China ,grid.35155.370000 0004 1790 4137Shenzhen Institute of Nutrition and Health, Huazhong Agricultural University, Shenzhen, 518124 PR China ,grid.488316.00000 0004 4912 1102Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124 PR China
| |
Collapse
|
35
|
Kuang L, Shen Q, Chen L, Ye L, Yan T, Chen ZH, Waugh R, Li Q, Huang L, Cai S, Fu L, Xing P, Wang K, Shao J, Wu F, Jiang L, Wu D, Zhang G. The genome and gene editing system of sea barleygrass provide a novel platform for cereal domestication and stress tolerance studies. PLANT COMMUNICATIONS 2022; 3:100333. [PMID: 35643085 PMCID: PMC9482977 DOI: 10.1016/j.xplc.2022.100333] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 04/24/2022] [Accepted: 05/02/2022] [Indexed: 06/15/2023]
Abstract
The tribe Triticeae provides important staple cereal crops and contains elite wild species with wide genetic diversity and high tolerance to abiotic stresses. Sea barleygrass (Hordeum marinum Huds.), a wild Triticeae species, thrives in saline marshlands and is well known for its high tolerance to salinity and waterlogging. Here, a 3.82-Gb high-quality reference genome of sea barleygrass is assembled de novo, with 3.69 Gb (96.8%) of its sequences anchored onto seven chromosomes. In total, 41 045 high-confidence (HC) genes are annotated by homology, de novo prediction, and transcriptome analysis. Phylogenetics, non-synonymous/synonymous mutation ratios (Ka/Ks), and transcriptomic and functional analyses provide genetic evidence for the divergence in morphology and salt tolerance among sea barleygrass, barley, and wheat. The large variation in post-domestication genes (e.g. IPA1 and MOC1) may cause interspecies differences in plant morphology. The extremely high salt tolerance of sea barleygrass is mainly attributed to low Na+ uptake and root-to-shoot translocation, which are mainly controlled by SOS1, HKT, and NHX transporters. Agrobacterium-mediated transformation and CRISPR/Cas9-mediated gene editing systems were developed for sea barleygrass to promote its utilization for exploration and functional studies of hub genes and for the genetic improvement of cereal crops.
Collapse
Affiliation(s)
- Liuhui Kuang
- Department of Agronomy, Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou 310058, China
| | - Qiufang Shen
- Department of Agronomy, Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou 310058, China
| | - Liyang Chen
- Novogene Bioinformatics Institute, Beijing 100083, China
| | - Lingzhen Ye
- Department of Agronomy, Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou 310058, China
| | - Tao Yan
- College of Agronomy, Hunan Agricultural University, Changsha 410128, China
| | - Zhong-Hua Chen
- School of Science, Hawkesbury Institute for the Environment, Western Sydney University, Richmond, NSW 2753, Australia
| | - Robbie Waugh
- The James Hutton Institute, Dundee DD2 5DA, UK; The Division of Plant Sciences, School of Life Sciences, University of Dundee, Dundee DD2 5DA, UK; School of Agriculture and Wine & Waite Research Institute, University of Adelaide, Waite Campus, Glen Osmond, SA 5064, Australia
| | - Qi Li
- Department of Agronomy, Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou 310058, China
| | - Lu Huang
- Department of Agronomy, Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou 310058, China
| | - Shengguan Cai
- Department of Agronomy, Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou 310058, China
| | - Liangbo Fu
- Department of Agronomy, Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou 310058, China
| | - Pengwei Xing
- Novogene Bioinformatics Institute, Beijing 100083, China
| | - Kai Wang
- Novogene Bioinformatics Institute, Beijing 100083, China
| | - Jiari Shao
- Novogene Bioinformatics Institute, Beijing 100083, China
| | - Feibo Wu
- Department of Agronomy, Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou 310058, China
| | - Lixi Jiang
- Department of Agronomy, Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou 310058, China
| | - Dezhi Wu
- College of Agronomy, Hunan Agricultural University, Changsha 410128, China.
| | - Guoping Zhang
- Department of Agronomy, Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou 310058, China.
| |
Collapse
|
36
|
Evolutionary New Genes in a Growing Paradigm. Genes (Basel) 2022; 13:genes13091605. [PMID: 36140774 PMCID: PMC9498540 DOI: 10.3390/genes13091605] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 08/28/2022] [Indexed: 11/26/2022] Open
|
37
|
Guo Z, Xu S, Xie W, Shao S, Feng X, He Z, Zhong C, Huang K, Wu CI, Shi S. Adaptation to a new environment with pre-adaptive genomic features - Evidence from woody plants colonizing the land-sea interface. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 111:1411-1424. [PMID: 35796621 DOI: 10.1111/tpj.15899] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Revised: 06/12/2022] [Accepted: 07/05/2022] [Indexed: 06/15/2023]
Abstract
Adaptation to new environments is a key evolutionary process which presumably involves complex genomic changes. Mangroves, a collection of approximately 80 woody plants that have independently invaded intertidal zones >20 times, are ideal for studying this process. We assembled near-chromosome-scale genomes of three Xylocarpus species as well as an outgroup species using single-molecule real-time sequencing. Phylogenomic analysis reveals two separate lineages, one with the mangrove Xylocarpus granatum and the other comprising a mangrove Xylocarpus moluccensis and a terrestrial Xylocarpus rumphii. In conjunction with previous studies, we identified several genomic features associated with mangroves: (i) signals of positive selection in genes related to salt tolerance and root development; (ii) genome-wide elevated ratios of non-synonymous to synonymous substitution relative to terrestrial relatives; and (iii) active elimination of long terminal repeats. These features are found in the terrestrial X. rumphii in addition to the two mangroves. These genomic features, not being strictly mangrove-specific, are hence considered pre-adaptive. We infer that the coastal but non-intertidal habitat of X. rumphii may have predisposed the common ancestor to invasion of true mangrove habitats. Other features including the preferential retention of duplicated genes and intolerance to pseudogenization are not found in X. rumphii and are likely true adaptive features in mangroves. In conclusion, by studying adaptive shift and partial shifts among closely related species, we set up a framework to study genomic features that are acquired at different stages of the pre-adaptation and adaptation to new environments.
Collapse
Affiliation(s)
- Zixiao Guo
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-Sen University, 510275, Guangzhou, Guangdong, China
| | - Shaohua Xu
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-Sen University, 510275, Guangzhou, Guangdong, China
| | - Wei Xie
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-Sen University, 510275, Guangzhou, Guangdong, China
| | - Shao Shao
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-Sen University, 510275, Guangzhou, Guangdong, China
| | - Xiao Feng
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-Sen University, 510275, Guangzhou, Guangdong, China
| | - Ziwen He
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-Sen University, 510275, Guangzhou, Guangdong, China
| | - Cairong Zhong
- Hainan Academy of Forestry (Hainan Academy of Mangrove), 571199, Haikou, Hainan, China
| | - Kaichi Huang
- Department of Botany and Biodiversity Research Centre, University of British Columbia, Vancouver, British Columbia, Canada
| | - Chung-I Wu
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-Sen University, 510275, Guangzhou, Guangdong, China
| | - Suhua Shi
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-Sen University, 510275, Guangzhou, Guangdong, China
| |
Collapse
|
38
|
Functional Diversity and Evolution of the Drosophila Sperm Proteome. Mol Cell Proteomics 2022; 21:100281. [PMID: 35985624 PMCID: PMC9494239 DOI: 10.1016/j.mcpro.2022.100281] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 07/28/2022] [Accepted: 08/08/2022] [Indexed: 01/18/2023] Open
Abstract
Spermatozoa are central to fertilization and the evolutionary fitness of sexually reproducing organisms. As such, a deeper understanding of sperm proteomes (and associated reproductive tissues) has proven critical to the advancement of the fields of sexual selection and reproductive biology. Due to their extraordinary complexity, proteome depth-of-coverage is dependent on advancements in technology and related bioinformatics, both of which have made significant advancements in the decade since the last Drosophila sperm proteome was published. Here, we provide an updated version of the Drosophila melanogaster sperm proteome (DmSP3) using improved separation and detection methods and an updated genome annotation. Combined with previous versions of the sperm proteome, the DmSP3 contains a total of 3176 proteins, and we provide the first label-free quantitation of the sperm proteome for 2125 proteins. The top 20 most abundant proteins included the structural elements α- and β-tubulins and sperm leucyl-aminopeptidases. Both gene content and protein abundance were significantly reduced on the X chromosome, consistent with prior genomic studies of X chromosome evolution. We identified 9 of the 16 Y-linked proteins, including known testis-specific male fertility factors. We also identified almost one-half of known Drosophila ribosomal proteins in the DmSP3. The role of this subset of ribosomal proteins in sperm is unknown. Surprisingly, our expanded sperm proteome also identified 122 seminal fluid proteins (Sfps), proteins originally identified in the accessory glands. We show that a significant fraction of 'sperm-associated Sfps' are recalcitrant to concentrated salt and detergent treatments, suggesting this subclass of Sfps are expressed in testes and may have additional functions in sperm, per se. Overall, our results add to a growing landscape of both sperm and seminal fluid protein biology and in particular provides quantitative evidence at the protein level for prior findings supporting the meiotic sex-chromosome inactivation model for male-specific gene and X chromosome evolution.
Collapse
|
39
|
Zhou Y, Zhang C, Zhang L, Ye Q, Liu N, Wang M, Long G, Fan W, Long M, Wing RA. Gene fusion as an important mechanism to generate new genes in the genus Oryza. Genome Biol 2022; 23:130. [PMID: 35706016 PMCID: PMC9199173 DOI: 10.1186/s13059-022-02696-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 05/30/2022] [Indexed: 11/16/2022] Open
Abstract
Background Events of gene fusion have been reported in several organisms. However, the general role of gene fusion as part of new gene origination remains unknown. Results We conduct genome-wide interrogations of four Oryza genomes by designing and implementing novel pipelines to detect fusion genes. Based on the phylogeny of ten plant species, we detect 310 fusion genes across four Oryza species. The estimated rate of origination of fusion genes in the Oryza genus is as high as 63 fusion genes per species per million years, which is fixed at 16 fusion genes per species per million years and much higher than that in flies. By RNA sequencing analysis, we find more than 44% of the fusion genes are expressed and 90% of gene pairs show strong signals of purifying selection. Further analysis of CRISPR/Cas9 knockout lines indicates that newly formed fusion genes regulate phenotype traits including seed germination, shoot length and root length, suggesting the functional significance of these genes. Conclusions We detect new fusion genes that may drive phenotype evolution in Oryza. This study provides novel insights into the genome evolution of Oryza. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02696-w.
Collapse
Affiliation(s)
- Yanli Zhou
- Germplasm Bank of Wild species, Kunming Institute of Botany, Chinese Academy of Science, Kunming, Yunnan, 650201, China
| | - Chengjun Zhang
- Germplasm Bank of Wild species, Kunming Institute of Botany, Chinese Academy of Science, Kunming, Yunnan, 650201, China. .,Department of Ecology and Evolution, The University of Chicago, 1101 E. 57th Street, Chicago, IL, 60637, USA.
| | - Li Zhang
- Department of Ecology and Evolution, The University of Chicago, 1101 E. 57th Street, Chicago, IL, 60637, USA.,Chinese Institute for Brain Research, (CIBR), Beijing, 102206, China
| | - Qiannan Ye
- Germplasm Bank of Wild species, Kunming Institute of Botany, Chinese Academy of Science, Kunming, Yunnan, 650201, China
| | - Ningyawen Liu
- Germplasm Bank of Wild species, Kunming Institute of Botany, Chinese Academy of Science, Kunming, Yunnan, 650201, China
| | - Muhua Wang
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA.,State Key Laboratory for Biocontrol, School of Marine Sciences, Sun Yat-sen University, Zhuhai, 519000, China
| | - Guangqiang Long
- Key Laboratory of Medicinal Plant Biology of Yunnan Province, Yunnan Agricultural University, Kunming, Yunnan, 650201, China
| | - Wei Fan
- Key Laboratory of Medicinal Plant Biology of Yunnan Province, Yunnan Agricultural University, Kunming, Yunnan, 650201, China
| | - Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, 1101 E. 57th Street, Chicago, IL, 60637, USA.
| | - Rod A Wing
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA. .,Center for Desert Agriculture, King Abdullah University of Science & Technology, Thuwal, 23955-6900, Kingdom of Saudi Arabia.
| |
Collapse
|
40
|
Retrotransposition facilitated the establishment of a primary plastid in the thecate amoeba Paulinella. Proc Natl Acad Sci U S A 2022; 119:e2121241119. [PMID: 35639693 DOI: 10.1073/pnas.2121241119] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
SignificancePrimary endosymbiosis allowed the evolution of complex life on Earth. In this process, a prokaryote was engulfed and retained in the cytoplasm of another microbe, where it developed into a new organelle (mitochondria and plastids). During organelle evolution, genes from the endosymbiont are transferred to the host nuclear genome, where they must become active despite differences in the genetic nature of the "partner" organisms. Here, we show that in the amoeba Paulinella micropora, which harbors a nascent photosynthetic organelle, the "copy-paste" mechanism of retrotransposition allowed domestication of endosymbiont-derived genes in the host nuclear genome. This duplication mechanism is widespread in eukaryotes and may be a major facilitator for host-endosymbiont integration and the evolution of organelles.
Collapse
|
41
|
Miller D, Chen J, Liang J, Betrán E, Long M, Sharakhov IV. Retrogene Duplication and Expression Patterns Shaped by the Evolution of Sex Chromosomes in Malaria Mosquitoes. Genes (Basel) 2022; 13:genes13060968. [PMID: 35741730 PMCID: PMC9222922 DOI: 10.3390/genes13060968] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Revised: 05/23/2022] [Accepted: 05/25/2022] [Indexed: 12/19/2022] Open
Abstract
Genes that originate during evolution are an important source of novel biological functions. Retrogenes are functional copies of genes produced by retroduplication and as such are located in different genomic positions. To investigate retroposition patterns and retrogene expression, we computationally identified interchromosomal retroduplication events in nine portions of the phylogenetic history of malaria mosquitoes, making use of species that do or do not have classical sex chromosomes to test the roles of sex-linkage. We found 40 interchromosomal events and a significant excess of retroduplications from the X chromosome to autosomes among a set of young retrogenes. These young retroposition events occurred within the last 100 million years in lineages where all species possessed differentiated sex chromosomes. An analysis of available microarray and RNA-seq expression data for Anopheles gambiae showed that many of the young retrogenes evolved male-biased expression in the reproductive organs. Young autosomal retrogenes with increased meiotic or postmeiotic expression in the testes tend to be male biased. In contrast, older retrogenes, i.e., in lineages with undifferentiated sex chromosomes, do not show this particular chromosomal bias and are enriched for female-biased expression in reproductive organs. Our reverse-transcription PCR data indicates that most of the youngest retrogenes, which originated within the last 47.6 million years in the subgenus Cellia, evolved non-uniform expression patterns across body parts in the males and females of An. coluzzii. Finally, gene annotation revealed that mitochondrial function is a prominent feature of the young autosomal retrogenes. We conclude that mRNA-mediated gene duplication has produced a set of genes that contribute to mosquito reproductive functions and that different biases are revealed after the sex chromosomes evolve. Overall, these results suggest potential roles for the evolution of meiotic sex chromosome inactivation in males and of sexually antagonistic conflict related to mitochondrial energy function as the main selective pressures for X-to-autosome gene reduplication and testis-biased expression in these mosquito lineages.
Collapse
Affiliation(s)
- Duncan Miller
- Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA; (D.M.); (J.L.)
| | - Jianhai Chen
- Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637, USA;
| | - Jiangtao Liang
- Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA; (D.M.); (J.L.)
| | - Esther Betrán
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA;
| | - Manyuan Long
- Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637, USA;
- Correspondence: (M.L.); (I.V.S.)
| | - Igor V. Sharakhov
- Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA; (D.M.); (J.L.)
- Department of Genetics and Cell Biology, Tomsk State University, 634050 Tomsk, Russia
- Correspondence: (M.L.); (I.V.S.)
| |
Collapse
|
42
|
The Role of Transposable Elements of the Human Genome in Neuronal Function and Pathology. Int J Mol Sci 2022; 23:ijms23105847. [PMID: 35628657 PMCID: PMC9148063 DOI: 10.3390/ijms23105847] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 05/17/2022] [Accepted: 05/19/2022] [Indexed: 12/13/2022] Open
Abstract
Transposable elements (TEs) have been extensively studied for decades. In recent years, the introduction of whole-genome and whole-transcriptome approaches, as well as single-cell resolution techniques, provided a breakthrough that uncovered TE involvement in host gene expression regulation underlying multiple normal and pathological processes. Of particular interest is increased TE activity in neuronal tissue, and specifically in the hippocampus, that was repeatedly demonstrated in multiple experiments. On the other hand, numerous neuropathologies are associated with TE dysregulation. Here, we provide a comprehensive review of literature about the role of TEs in neurons published over the last three decades. The first chapter of the present review describes known mechanisms of TE interaction with host genomes in general, with the focus on mammalian and human TEs; the second chapter provides examples of TE exaptation in normal neuronal tissue, including TE involvement in neuronal differentiation and plasticity; and the last chapter lists TE-related neuropathologies. We sought to provide specific molecular mechanisms of TE involvement in neuron-specific processes whenever possible; however, in many cases, only phenomenological reports were available. This underscores the importance of further studies in this area.
Collapse
|
43
|
Chang CH. Correlated Expression of the Opsin Retrogene LWS-R and its Host Gene in Two Poeciliid Fishes. Zool Stud 2022; 61:e16. [PMID: 36330033 PMCID: PMC9579955 DOI: 10.6620/zs.2022.61-16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 02/22/2022] [Indexed: 06/16/2023]
Abstract
The important role of retrogenes in genome evolution and species differentiation is becoming increasingly accepted. One synapomorphy among cyprinodontoid fish is a retrotransposed version of a long-wavelength sensitive (LWS) opsin gene, LWS-R, within an intron of the gephyrin (GPHN) gene. These two genes display opposing orientations. It had been speculated that LWS-R hijacks the cis-regulatory elements of GPHN for transcription, but whether their expression is correlated had remained unclear. Here, in silico predictions identified putative promoters upstream of the translation start site of LWS-R, indicating that its transcription is driven by its own promoter rather than by the GPHN promoter. However, consistent expression ratios of LWS-R:GPHN in the eyeball and brain of fishes indicate that the respective gene transcriptions are correlated. Co-expression is potentially modulated by histone exchange during GPHN transcription. Two isoforms were detected in this study, i.e., intron-free and intron-retaining. Intron-free LWS-R was only expressed in the eyeball of fishes, whereas intron-retaining LWS-R occurred in both eyeball and brain. Expression of vision-associated LWS-R beyond the eyeball supports that it is co-expressed with more ubiquitous GPHN.
Collapse
Affiliation(s)
- Chia-Hao Chang
- Department of Science Education, National Taipei University of Education, No.134, Sec.2, Heping E. Rd., Da'an District, Taipei City 10671, Taiwan. E-mail: (Chang)
| |
Collapse
|
44
|
Domazet-Lošo T. mRNA Vaccines: Why Is the Biology of Retroposition Ignored? Genes (Basel) 2022; 13:719. [PMID: 35627104 PMCID: PMC9141755 DOI: 10.3390/genes13050719] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 04/14/2022] [Accepted: 04/15/2022] [Indexed: 02/07/2023] Open
Abstract
The major advantage of mRNA vaccines over more conventional approaches is their potential for rapid development and large-scale deployment in pandemic situations. In the current COVID-19 crisis, two mRNA COVID-19 vaccines have been conditionally approved and broadly applied, while others are still in clinical trials. However, there is no previous experience with the use of mRNA vaccines on a large scale in the general population. This warrants a careful evaluation of mRNA vaccine safety properties by considering all available knowledge about mRNA molecular biology and evolution. Here, I discuss the pervasive claim that mRNA-based vaccines cannot alter genomes. Surprisingly, this notion is widely stated in the mRNA vaccine literature but never supported by referencing any primary scientific papers that would specifically address this question. This discrepancy becomes even more puzzling if one considers previous work on the molecular and evolutionary aspects of retroposition in murine and human populations that clearly documents the frequent integration of mRNA molecules into genomes, including clinical contexts. By performing basic comparisons, I show that the sequence features of mRNA vaccines meet all known requirements for retroposition using L1 elements-the most abundant autonomously active retrotransposons in the human genome. In fact, many factors associated with mRNA vaccines increase the possibility of their L1-mediated retroposition. I conclude that is unfounded to a priori assume that mRNA-based therapeutics do not impact genomes and that the route to genome integration of vaccine mRNAs via endogenous L1 retroelements is easily conceivable. This implies that we urgently need experimental studies that would rigorously test for the potential retroposition of vaccine mRNAs. At present, the insertional mutagenesis safety of mRNA-based vaccines should be considered unresolved.
Collapse
Affiliation(s)
- Tomislav Domazet-Lošo
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Bijenička Cesta 54, HR-10000 Zagreb, Croatia;
- School of Medicine, Catholic University of Croatia, Ilica 242, HR-10000 Zagreb, Croatia
| |
Collapse
|
45
|
Rheinemann L, Downhour DM, Davenport KA, McKeown AN, Sundquist WI, Elde NC. Recurrent evolution of an inhibitor of ESCRT-dependent virus budding and LINE-1 retrotransposition in primates. Curr Biol 2022; 32:1511-1522.e6. [PMID: 35245459 PMCID: PMC9007875 DOI: 10.1016/j.cub.2022.02.018] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Revised: 12/20/2021] [Accepted: 02/03/2022] [Indexed: 12/15/2022]
Abstract
Most antiviral proteins recognize specific features of viruses. In contrast, the recently described antiviral factor retroCHMP3 interferes with the "host endosomal complexes required for transport" (ESCRT) pathway to inhibit the budding of enveloped viruses. RetroCHMP3 arose independently on multiple occasions via duplication and truncation of the gene encoding the ESCRT-III factor CHMP3. However, since the ESCRT pathway is essential for cellular membrane fission reactions, ESCRT inhibition is potentially cytotoxic. This raises fundamental questions about how hosts can repurpose core cellular functions into antiviral functions without incurring a fitness cost due to excess cellular toxicity. We reveal the evolutionary process of detoxification for retroCHMP3 in New World monkeys using a combination of ancestral reconstructions, cytotoxicity, and virus release assays. A duplicated, full-length copy of retroCHMP3 in the ancestors of New World monkeys provides modest inhibition of virus budding while exhibiting subtle cytotoxicity. Ancient retroCHMP3 then accumulated mutations that reduced cytotoxicity but preserved virus inhibition before a truncating stop codon arose in the more recent ancestors of squirrel monkeys, resulting in potent inhibition. In species where full-length copies of retroCHMP3 still exist, their artificial truncation generated potent virus-budding inhibitors with little cytotoxicity, revealing the potential for future antiviral defenses in modern species. In addition, we discovered that retroCHMP3 restricts LINE-1 retrotransposition, revealing how different challenges to genome integrity might explain multiple independent origins of retroCHMP3 in different species to converge on new immune functions.
Collapse
Affiliation(s)
- Lara Rheinemann
- Department of Biochemistry, University of Utah School of Medicine, 15 N Medical Drive East, Salt Lake City, UT 84112, USA
| | - Diane Miller Downhour
- Department of Human Genetics, University of Utah School of Medicine, 15 N 2030 E, Salt Lake City, UT 84112, USA
| | - Kristen A Davenport
- Department of Biochemistry, University of Utah School of Medicine, 15 N Medical Drive East, Salt Lake City, UT 84112, USA; Department of Human Genetics, University of Utah School of Medicine, 15 N 2030 E, Salt Lake City, UT 84112, USA
| | - Alesia N McKeown
- Department of Human Genetics, University of Utah School of Medicine, 15 N 2030 E, Salt Lake City, UT 84112, USA
| | - Wesley I Sundquist
- Department of Biochemistry, University of Utah School of Medicine, 15 N Medical Drive East, Salt Lake City, UT 84112, USA
| | - Nels C Elde
- Department of Human Genetics, University of Utah School of Medicine, 15 N 2030 E, Salt Lake City, UT 84112, USA; Howard Hughes Medical Institute, 4000 Jones Bridge Rd, Chevy Chase, MD 20815, USA.
| |
Collapse
|
46
|
Xu T, Yang X, Jia Y, Li Z, Tang G, Li X, Wang B, Wang T, Lin J, Guo L, Ye K. A global survey of the transcriptome of the opium poppy (Papaver somniferum) based on single-molecule long-read isoform sequencing. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 110:607-620. [PMID: 35092713 DOI: 10.1111/tpj.15689] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 01/13/2022] [Accepted: 01/19/2022] [Indexed: 06/14/2023]
Affiliation(s)
- Tun Xu
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
| | - Xiaofei Yang
- MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- School of Computer Science and Technology, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- Genome Institute, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
| | - Yanyan Jia
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
| | - Zihang Li
- School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China
| | - Guangbo Tang
- School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China
| | - Xiujuan Li
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
| | - Bo Wang
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
| | - Tingjie Wang
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
| | - Jiadong Lin
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- Faculty of Science, Leiden University, Leiden, The Netherlands
| | - Li Guo
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China
| | - Kai Ye
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, China
- Genome Institute, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
- School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China
- Faculty of Science, Leiden University, Leiden, The Netherlands
| |
Collapse
|
47
|
Wang X, Yan X, Hu Y, Qin L, Wang D, Jia J, Jiao Y. A recent burst of gene duplications in Triticeae. PLANT COMMUNICATIONS 2022; 3:100268. [PMID: 35529951 PMCID: PMC9073319 DOI: 10.1016/j.xplc.2021.100268] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 11/09/2021] [Accepted: 12/09/2021] [Indexed: 06/13/2023]
Abstract
Gene duplication provides raw genetic materials for evolution and potentially novel genes for crop improvement. The two seminal genomic studies of Aegilops tauschii both mentioned the large number of genes independently duplicated in recent years, but the duplication mechanism and the evolutionary significance of these gene duplicates have not yet been investigated. Here, we found that a recent burst of gene duplications (hereafter abbreviated as the RBGD) has probably occurred in all sequenced Triticeae species. Further investigations of the characteristics of the gene duplicates and their flanking sequences suggested that transposable element (TE) activity may have been involved in generating the RBGD. We also characterized the duplication timing, retention pattern, diversification, and expression of the duplicates following the evolution of Triticeae. Multiple subgenome-specific comparisons of the duplicated gene pairs clearly supported extensive differential regulation and related functional diversity among such pairs in the three subgenomes of bread wheat. Moreover, several duplicated genes from the RBGD have evolved into key factors that influence important agronomic traits of wheat. Our results provide insights into a unique source of gene duplicates in Triticeae species, which has increased the gene dosage together with the two polyploidization events in the evolutionary history of wheat.
Collapse
Affiliation(s)
- Xiaoliang Wang
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xueqing Yan
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yiheng Hu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Liuyu Qin
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Daowen Wang
- College of Agronomy, Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou, Henan 450046, China
| | - Jizeng Jia
- College of Agronomy, Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou, Henan 450046, China
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Yuannian Jiao
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
48
|
Witkop EM, Proestou DA, Gomez-Chiarri M. The expanded inhibitor of apoptosis gene family in oysters possesses novel domain architectures and may play diverse roles in apoptosis following immune challenge. BMC Genomics 2022; 23:201. [PMID: 35279090 PMCID: PMC8917759 DOI: 10.1186/s12864-021-08233-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Accepted: 12/06/2021] [Indexed: 11/23/2022] Open
Abstract
Background Apoptosis plays important roles in a variety of functions, including immunity and response to environmental stress. The Inhibitor of Apoptosis (IAP) gene family of apoptosis regulators is expanded in molluscs, including eastern, Crassostrea virginica, and Pacific, Crassostrea gigas, oysters. The functional importance of IAP expansion in apoptosis and immunity in oysters remains unknown. Results Phylogenetic analysis of IAP genes in 10 molluscs identified lineage specific gene expansion in bivalve species. Greater IAP gene family expansion was observed in C. virginica than C. gigas (69 vs. 40), resulting mainly from tandem duplications. Functional domain analysis of oyster IAP proteins revealed 3 novel Baculoviral IAP Repeat (BIR) domain types and 14 domain architecture types across gene clusters, 4 of which are not present in model organisms. Phylogenetic analysis of bivalve IAPs suggests a complex history of domain loss and gain. Most IAP genes in oysters (76% of C. virginica and 82% of C. gigas), representing all domain architecture types, were expressed in response to immune challenge (Ostreid Herpesvirus OsHV-1, bacterial probionts Phaeobacter inhibens and Bacillus pumilus, several Vibrio spp., pathogenic Aliiroseovarius crassostreae, and protozoan parasite Perkinsus marinus). Patterns of IAP and apoptosis-related differential gene expression differed between the two oyster species, where C. virginica, in general, differentially expressed a unique set of IAP genes in each challenge, while C. gigas differentially expressed an overlapping set of IAP genes across challenges. Apoptosis gene expression patterns clustered mainly by resistance/susceptibility of the oyster host to immune challenge. Weighted Gene Correlation Network Analysis (WGCNA) revealed unique combinations of transcripts for 1 to 12 IAP domain architecture types, including novel types, were significantly co-expressed in response to immune challenge with transcripts in apoptosis-related pathways. Conclusions Unprecedented diversity characterized by novel BIR domains and protein domain architectures was observed in oyster IAPs. Complex patterns of gene expression of novel and conserved IAPs in response to a variety of ecologically-relevant immune challenges, combined with evidence of direct co-expression of IAP genes with apoptosis-related transcripts, suggests IAP expansion facilitates complex and nuanced regulation of apoptosis and other immune responses in oysters. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-08233-6.
Collapse
|
49
|
Identification of multiple TAR DNA binding protein retropseudogene lineages during the evolution of primates. Sci Rep 2022; 12:3823. [PMID: 35264686 PMCID: PMC8907276 DOI: 10.1038/s41598-022-07908-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Accepted: 02/22/2022] [Indexed: 11/08/2022] Open
Abstract
The TAR DNA Binding Protein (TARDBP) gene has become relevant after the discovery of its several pathogenic mutations. The lack of evolutionary history is in contrast to the amount of studies found in the literature. This study investigated the evolutionary dynamics associated with the retrotransposition of the TARDBP gene in primates. We identified novel retropseudogenes that likely originated in the ancestors of anthropoids, catarrhines, and lemuriformes, i.e. the strepsirrhine clade that inhabit Madagascar. We also found species-specific retropseudogenes in the Philippine tarsier, Bolivian squirrel monkey, capuchin monkey and vervet. The identification of a retropseudocopy of the TARDBP gene overlapping a lncRNA that is potentially expressed opens a new avenue to investigate TARDBP gene regulation, especially in the context of TARDBP associated pathologies.
Collapse
|
50
|
Farhat S, Bonnivard E, Pales Espinosa E, Tanguy A, Boutet I, Guiglielmoni N, Flot JF, Allam B. Comparative analysis of the Mercenaria mercenaria genome provides insights into the diversity of transposable elements and immune molecules in bivalve mollusks. BMC Genomics 2022; 23:192. [PMID: 35260071 PMCID: PMC8905726 DOI: 10.1186/s12864-021-08262-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Accepted: 12/15/2021] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND The hard clam Mercenaria mercenaria is a major marine resource along the Atlantic coasts of North America and has been introduced to other continents for resource restoration or aquaculture activities. Significant mortality events have been reported in the species throughout its native range as a result of diseases (microbial infections, leukemia) and acute environmental stress. In this context, the characterization of the hard clam genome can provide highly needed resources to enable basic (e.g., oncogenesis and cancer transmission, adaptation biology) and applied (clam stock enhancement, genomic selection) sciences. RESULTS Using a combination of long and short-read sequencing technologies, a 1.86 Gb chromosome-level assembly of the clam genome was generated. The assembly was scaffolded into 19 chromosomes, with an N50 of 83 Mb. Genome annotation yielded 34,728 predicted protein-coding genes, markedly more than the few other members of the Venerida sequenced so far, with coding regions representing only 2% of the assembly. Indeed, more than half of the genome is composed of repeated elements, including transposable elements. Major chromosome rearrangements were detected between this assembly and another recent assembly derived from a genetically segregated clam stock. Comparative analysis of the clam genome allowed the identification of a marked diversification in immune-related proteins, particularly extensive tandem duplications and expansions in tumor necrosis factors (TNFs) and C1q domain-containing proteins, some of which were previously shown to play a role in clam interactions with infectious microbes. The study also generated a comparative repertoire highlighting the diversity and, in some instances, the specificity of LTR-retrotransposons elements, particularly Steamer elements in bivalves. CONCLUSIONS The diversity of immune molecules in M. mercenaria may allow this species to cope with varying and complex microbial and environmental landscapes. The repertoire of transposable elements identified in this study, particularly Steamer elements, should be a prime target for the investigation of cancer cell development and transmission among bivalve mollusks.
Collapse
Affiliation(s)
- Sarah Farhat
- Marine Animal Disease Laboratory, School of Marine and Atmospheric Sciences, 100 Nicolls Road, Stony Brook University, Stony Brook, NY, 11794-5000, USA
| | - Eric Bonnivard
- Sorbonne Université, CNRS, UMR 7144 AD2M, Station Biologique de Roscoff, Place Georges Teissier, 29688, Roscoff, France
| | - Emmanuelle Pales Espinosa
- Marine Animal Disease Laboratory, School of Marine and Atmospheric Sciences, 100 Nicolls Road, Stony Brook University, Stony Brook, NY, 11794-5000, USA
| | - Arnaud Tanguy
- Sorbonne Université, CNRS, UMR 7144 AD2M, Station Biologique de Roscoff, Place Georges Teissier, 29688, Roscoff, France
| | - Isabelle Boutet
- Sorbonne Université, CNRS, UMR 7144 AD2M, Station Biologique de Roscoff, Place Georges Teissier, 29688, Roscoff, France
| | - Nadège Guiglielmoni
- Université libre de Bruxelles (ULB), Evolutionary Biology & Ecology, Avenue F.D. Roosevelt 50, B-1050, Brussels, Belgium
| | - Jean-François Flot
- Université libre de Bruxelles (ULB), Evolutionary Biology & Ecology, Avenue F.D. Roosevelt 50, B-1050, Brussels, Belgium.,Interuniversity Institute of Bioinformatics in Brussels - (IB)2, B-1050, Brussels, Belgium
| | - Bassem Allam
- Marine Animal Disease Laboratory, School of Marine and Atmospheric Sciences, 100 Nicolls Road, Stony Brook University, Stony Brook, NY, 11794-5000, USA.
| |
Collapse
|