1
|
Cheng L, Wang N, Bao Z, Zhou Q, Guarracino A, Yang Y, Wang P, Zhang Z, Tang D, Zhang P, Wu Y, Zhou Y, Zheng Y, Hu Y, Lian Q, Ma Z, Lassois L, Zhang C, Lucas WJ, Garrison E, Stein N, Städler T, Zhou Y, Huang S. Leveraging a phased pangenome for haplotype design of hybrid potato. Nature 2025:10.1038/s41586-024-08476-9. [PMID: 39843749 DOI: 10.1038/s41586-024-08476-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Accepted: 12/02/2024] [Indexed: 01/24/2025]
Abstract
The tetraploid genome and clonal propagation of the cultivated potato (Solanum tuberosum L.)1,2 dictate a slow, non-accumulative breeding mode of the most important tuber crop. Transitioning potato breeding to a seed-propagated hybrid system based on diploid inbred lines has the potential to greatly accelerate its improvement3. Crucially, the development of inbred lines is impeded by manifold deleterious variants; explaining their nature and finding ways to eliminate them is the current focus of hybrid potato research4-10. However, most published diploid potato genomes are unphased, concealing crucial information on haplotype diversity and heterozygosity11-13. Here we develop a phased potato pangenome graph of 60 haplotypes from cultivated diploids and the ancestral wild species, and find evidence for the prevalence of transposable elements in generating structural variants. Compared with the linear reference, the graph pangenome represents a broader diversity (3,076 Mb versus 742 Mb). Notably, we observe enhanced heterozygosity in cultivated diploids compared with wild ones (14.0% versus 9.5%), indicating extensive hybridization during potato domestication. Using conservative criteria, we identify 19,625 putatively deleterious structural variants (dSVs) and reveal a biased accumulation of deleterious single nucleotide polymorphisms (dSNPs) around dSVs in coupling phase. Based on the graph pangenome, we computationally design ideal potato haplotypes with minimal dSNPs and dSVs. These advances provide critical insights into the genomic basis of clonal propagation and will guide breeders to develop a suite of promising inbred lines.
Collapse
Affiliation(s)
- Lin Cheng
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- Plant Genetics and Rhizosphere Processes Laboratory, TERRA Teaching and Research Center, Gembloux Agro-Bio Tech, University of Liège, Gembloux, Belgium
| | - Nan Wang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- National Key Laboratory of Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Zhigui Bao
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| | - Qian Zhou
- School of Agriculture and Biotechnology, Sun Yat-Sen University, Shenzhen, China
| | - Andrea Guarracino
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Yuting Yang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Pei Wang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Zhiyang Zhang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Dié Tang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
| | - Pingxian Zhang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Yaoyao Wu
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- College of Horticulture, Nanjing Agricultural University, Nanjing, China
| | - Yao Zhou
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Yi Zheng
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Yong Hu
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Qun Lian
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Zhaoxu Ma
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Ludivine Lassois
- Plant Genetics and Rhizosphere Processes Laboratory, TERRA Teaching and Research Center, Gembloux Agro-Bio Tech, University of Liège, Gembloux, Belgium
| | - Chunzhi Zhang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - William J Lucas
- Department of Plant Biology, College of Biological Sciences, University of California, Davis, Davis, CA, USA
| | - Erik Garrison
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
- Crop Plant Genetics, Institute of Agricultural and Nutritional Sciences, Martin-Luther-University of Halle-Wittenberg, Halle (Saale), Germany
| | - Thomas Städler
- Institute of Integrative Biology and Zurich-Basel Plant Science Center, ETH Zurich, Zurich, Switzerland
| | - Yongfeng Zhou
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- National Key Laboratory of Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Sanwen Huang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China.
- National Key Laboratory of Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou, China.
| |
Collapse
|
2
|
López-Cortegano E, Chebib J, Jonas A, Vock A, Künzel S, Keightley PD, Tautz D. The rate and spectrum of new mutations in mice inferred by long-read sequencing. Genome Res 2025; 35:43-54. [PMID: 39622636 DOI: 10.1101/gr.279982.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2024] [Accepted: 11/26/2024] [Indexed: 01/12/2025]
Abstract
All forms of genetic variation originate from new mutations, making it crucial to understand their rates and mechanisms. Here, we use long-read sequencing from Pacific Biosciences (PacBio) to investigate de novo mutations that accumulated in 12 inbred mouse lines derived from three commonly used inbred strains (C3H, C57BL/6, and FVB) maintained for 8 to 15 generations in a mutation accumulation (MA) experiment. We built chromosome-level genome assemblies based on the MA line founders' genomes and then employed a combination of read and assembly-based methods to call the complete spectrum of new mutations. On average, there are about 45 mutations per haploid genome per generation, about half of which (54%) are insertions and deletions shorter than 50 bp (indels). The remainder are single-nucleotide mutations (SNMs; 44%) and large structural mutations (SMs; 2%). We found that the degree of DNA repetitiveness is positively correlated with SNM and indel rates and that a substantial fraction of SMs can be explained by homology-dependent mechanisms associated with repeat sequences. Most (90%) indels can be attributed to microsatellite contractions and expansions, and there is a marked bias toward 4 bp indels. Among the different types of SMs, tandem repeat mutations have the highest mutation rate, followed by insertions of transposable elements (TEs). We uncover a rich landscape of active TEs, notable differences in their spectrum among MA lines and strains, and a high rate of gene retroposition. Our study offers novel insights into mammalian genome evolution and highlights the importance of repetitive elements in shaping genomic diversity.
Collapse
Affiliation(s)
- Eugenio López-Cortegano
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom;
| | - Jobran Chebib
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| | - Anika Jonas
- Department for Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Anastasia Vock
- Department for Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Sven Künzel
- Department for Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Peter D Keightley
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| | - Diethard Tautz
- Department for Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| |
Collapse
|
3
|
Collins RL, Talkowski ME. Diversity and consequences of structural variation in the human genome. Nat Rev Genet 2025:10.1038/s41576-024-00808-9. [PMID: 39838028 DOI: 10.1038/s41576-024-00808-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/26/2024] [Indexed: 01/23/2025]
Abstract
The biomedical community is increasingly invested in capturing all genetic variants across human genomes, interpreting their functional consequences and translating these findings to the clinic. A crucial component of this endeavour is the discovery and characterization of structural variants (SVs), which are ubiquitous in the human population, heterogeneous in their mutational processes, key substrates for evolution and adaptation, and profound drivers of human disease. The recent emergence of new technologies and the remarkable scale of sequence-based population studies have begun to crystalize our understanding of SVs as a mutational class and their widespread influence across phenotypes. In this Review, we summarize recent discoveries and new insights into SVs in the human genome in terms of their mutational patterns, population genetics, functional consequences, and impact on human traits and disease. We conclude by outlining three frontiers to be explored by the field over the next decade.
Collapse
Affiliation(s)
- Ryan L Collins
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Michael E Talkowski
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
4
|
Trost H, Lopezcolorado FW, Merkell A, Stark JM. Functions of PMS2 and MLH1 important for regulation of divergent repeat-mediated deletions. DNA Repair (Amst) 2025; 145:103791. [PMID: 39615226 DOI: 10.1016/j.dnarep.2024.103791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2024] [Revised: 11/15/2024] [Accepted: 11/22/2024] [Indexed: 01/25/2025]
Abstract
Repeat-mediated deletions (RMDs) are a type of deletion rearrangement that utilizes two repetitive elements to bridge a DNA double-strand break (DSB) that leads to loss of the intervening sequence and one of the repeats. Sequence divergence between repeats causes RMD suppression and indeed this divergence must be resolved in the RMD products. The mismatch repair factor, MLH1, was shown to be critical for both RMD suppression and a polarity of sequence divergence resolution in RMDs. Here, we sought to study the interrelationship between these two aspects of RMD regulation (i.e., RMD suppression and polar divergence resolution), by examining several mutants of MLH1 and its binding partner PMS2. To begin with, we show that PMS2 is also critical for both RMD suppression and polar resolution of sequence divergence in RMD products. Then, with six mutants of the MLH1-PMS2 heterodimer, we found several different patterns: three mutants showed defects in both functions, one mutant showed loss of RMD suppression but not polar divergence resolution, whereas another mutant showed the opposite, and finally one mutant showed loss of RMD suppression but had a complex effect on polar divergence resolution. These findings indicate that RMD suppression vs. polar resolution of sequence divergence are distinct functions of MLH1-PMS2.
Collapse
Affiliation(s)
- Hannah Trost
- Department of Cancer Genetics and Epigenetics, Beckman Research Institute of the City of Hope, Duarte, CA 91010, USA; Irell and Manella Graduate School of Biological Sciences, Beckman Research Institute of the City of Hope, Duarte, CA 91010, USA
| | | | - Arianna Merkell
- Department of Cancer Genetics and Epigenetics, Beckman Research Institute of the City of Hope, Duarte, CA 91010, USA
| | - Jeremy M Stark
- Department of Cancer Genetics and Epigenetics, Beckman Research Institute of the City of Hope, Duarte, CA 91010, USA; Irell and Manella Graduate School of Biological Sciences, Beckman Research Institute of the City of Hope, Duarte, CA 91010, USA.
| |
Collapse
|
5
|
van der Kuyl AC. Mutation Rate Variation and Other Challenges in 2-LTR Dating of Primate Endogenous Retrovirus Integrations. J Mol Evol 2024:10.1007/s00239-024-10225-5. [PMID: 39715846 DOI: 10.1007/s00239-024-10225-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Accepted: 12/07/2024] [Indexed: 12/25/2024]
Abstract
The time of integration of germline-targeting Long Terminal Repeat (LTR) retroposons, such as endogenous retroviruses (ERVs), can be estimated by assessing the nucleotide divergence between the LTR sequences flanking the viral genes. Due to the viral replication mechanism, both LTRs are identical at the moment of integration, when the provirus becomes part of the host genome. After that time, proviral sequences evolve within the host DNA. When the mutation rate is known, nucleotide divergence between the LTRs would then be a measure of time elapsed since integration. Though frequently used, the approach has been complicated by the choice of host mutation rate and, to a lesser extent, by the method selected to estimate nucleotide divergence. As a result, outcomes can be incompatible with, for instance, speciation events identified from the fossil record. The review will give an overview of research reporting LTR-retroposon dating, and a summary of important factors to consider, including the quality, assembly, and alignment of sequences, the mutation rate of foreign DNA in host genomes, and the choice of a distance estimation method. Primates will here be the focus of the analysis because their genomes, ERVs, and fossil record have been extensively studied. However, most of the factors discussed have a wide applicability in the vertebrate field.
Collapse
Affiliation(s)
- Antoinette Cornelia van der Kuyl
- Laboratory of Experimental Virology, Department of Medical Microbiology and Infection Prevention, Amsterdam UMC, Location AMC, University of Amsterdam, Meibergdreef 9, 1105 AZ, Amsterdam, The Netherlands.
- Amsterdam Institute for Immunology & Infectious Diseases, 1100 DD, Amsterdam, The Netherlands.
| |
Collapse
|
6
|
Guo M, Bi G, Wang H, Ren H, Chen J, Lian Q, Wang X, Fang W, Zhang J, Dong Z, Pang Y, Zhang Q, Huang S, Yan J, Zhao X. Genomes of autotetraploid wild and cultivated Ziziphus mauritiana reveal polyploid evolution and crop domestication. PLANT PHYSIOLOGY 2024; 196:2701-2720. [PMID: 39325737 DOI: 10.1093/plphys/kiae512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 08/28/2024] [Accepted: 09/12/2024] [Indexed: 09/28/2024]
Abstract
Indian jujube (Ziziphus mauritiana) holds a prominent position in the global fruit and pharmaceutical markets. Here, we report the assemblies of haplotype-resolved, telomere-to-telomere genomes of autotetraploid wild and cultivated Indian jujube plants using a 2-stage assembly strategy. The generation of these genomes permitted in-depth investigations into the divergence and evolutionary history of this important fruit crop. Using a graph-based pan-genome constructed from 8 monoploid genomes, we identified structural variation (SV)-FST hotspots and SV hotspots. Gap-free genomes provide a means to obtain a global view of centromere structures. We identified presence-absence variation-related genes in 4 monoploid genomes (cI, cIII, wI, and wIII) and resequencing populations. We also present the population structure and domestication trajectory of the Indian jujube based on the resequencing of 73 wild and cultivated accessions. Metabolomic and transcriptomic analyses of mature fruits of wild and cultivated accessions unveiled the genetic basis underlying loss of fruit astringency during domestication of Indian jujube. This study reveals mechanisms underlying the divergence, evolution, and domestication of the autotetraploid Indian jujube and provides rich and reliable genetic resources for future research.
Collapse
Affiliation(s)
- Mingxin Guo
- College of Life Sciences, Luoyang Normal University, Luoyang 471934, China
| | - Guiqi Bi
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
| | - Huan Wang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
- Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Centre, and College of Plant Protection, South China Agricultural University, Guangzhou 510642, China
| | - Hui Ren
- Horticultural Research Institute, Guangxi Academy of Agricultural Sciences, Nanning 530007, China
| | - Jiaying Chen
- South Subtropical Crops Research Institute, Chinese Academy of Tropical Agricultural Sciences, Zhanjiang 524000, China
| | - Qun Lian
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
| | - Xiaomei Wang
- Horticultural Research Institute, Guangxi Academy of Agricultural Sciences, Nanning 530007, China
| | - Weikuan Fang
- Horticultural Research Institute, Guangxi Academy of Agricultural Sciences, Nanning 530007, China
| | - Jiangjiang Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
| | - Zhaonian Dong
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
| | - Yi Pang
- College of Life Sciences, Luoyang Normal University, Luoyang 471934, China
| | - Quanling Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
| | - Sanwen Huang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
| | - Jianbin Yan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
| | - Xusheng Zhao
- College of Life Sciences, Luoyang Normal University, Luoyang 471934, China
| |
Collapse
|
7
|
Singh J, Gudi S, Maughan PJ, Liu Z, Kolmer J, Wang M, Chen X, Rouse MN, Lasserre‐Zuber P, Rimbert H, Sehgal S, Fiedler JD, Choulet F, Acevedo M, Gupta R, Gill U. Genomes of Aegilops umbellulata provide new insights into unique structural variations and genetic diversity in the U-genome for wheat improvement. PLANT BIOTECHNOLOGY JOURNAL 2024; 22:3505-3519. [PMID: 39292731 PMCID: PMC11606429 DOI: 10.1111/pbi.14470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Revised: 08/09/2024] [Accepted: 08/26/2024] [Indexed: 09/20/2024]
Abstract
Aegilops umbellulata serve as an important reservoir for novel biotic and abiotic stress tolerance for wheat improvement. However, chromosomal rearrangements and evolutionary trajectory of this species remain to be elucidated. Here, we present a comprehensive investigation into Ae. umbellulata genome by generating a high-quality near telomere-to-telomere genome assembly of PI 554389 and resequencing 20 additional Ae. umbellulata genomes representing diverse geographical and phenotypic variations. Our analysis unveils complex chromosomal rearrangements, most prominently in 4U and 6U chromosomes, delineating a distinct evolutionary trajectory of Ae. umbellulata from wheat and its relatives. Furthermore, our data rectified the erroneous naming of chromosomes 4U and 6U in the past and highlighted multiple major evolutionary events that led to the present-day U-genome. Resequencing of diverse Ae. umbellulata accessions revealed high genetic diversity within the species, partitioning into three distinct evolutionary sub-populations and supported by extensive phenotypic variability in resistance against several races/pathotypes of five major wheat diseases. Disease evaluations indicated the presence of several novel resistance genes in the resequenced lines for future studies. Resequencing also resulted in the identification of six new haplotypes for Lr9, the first resistance gene cloned from Ae. umbellulata. The extensive genomic and phenotypic resources presented in this study will expedite the future genetic exploration of Ae. umbellulata, facilitating efforts aimed at enhancing resiliency and productivity in wheat.
Collapse
Affiliation(s)
| | - Santosh Gudi
- North Dakota State UniversityFargoNorth DakotaUSA
| | | | - Zhaohui Liu
- North Dakota State UniversityFargoNorth DakotaUSA
| | - James Kolmer
- Cereal Disease LaboratoryUnited States Department of Agriculture (USDA) Agricultural Research Service (ARS)St. PaulMinnesotaUSA
| | - Meinan Wang
- Washington State UniversityPullmanWashingtonUSA
| | - Xianming Chen
- Washington State UniversityPullmanWashingtonUSA
- Wheat Health, Genetics, and Quality Research UnitUnited States Department of Agriculture (USDA) Agricultural Research Service (ARS)PullmanWashingtonUSA
| | - Matthew N. Rouse
- Cereal Disease LaboratoryUnited States Department of Agriculture (USDA) Agricultural Research Service (ARS)St. PaulMinnesotaUSA
- University of MinnesotaSt. PaulMinnesotaUSA
| | | | - Héléne Rimbert
- Université Clermont Auvergne, INRAE, GDECClermont‐FerrandFrance
| | - Sunish Sehgal
- South Dakota State UniversityBrookingsSouth DakotaUSA
| | - Jason D. Fiedler
- United States Department of Agriculture (USDA) Agricultural Research Service (ARS)Cereal Crops Research Unit, Edward T. Schafer Agriculture Research CenterFargoNorth DakotaUSA
| | | | | | - Rajeev Gupta
- North Dakota State UniversityFargoNorth DakotaUSA
- United States Department of Agriculture (USDA) Agricultural Research Service (ARS)Cereal Crops Research Unit, Edward T. Schafer Agriculture Research CenterFargoNorth DakotaUSA
| | - Upinder Gill
- North Dakota State UniversityFargoNorth DakotaUSA
| |
Collapse
|
8
|
Vazquez JM, Lauterbur ME, Mottaghinia S, Bucci M, Fraser D, Gray-Sandoval G, Gaucherand L, Haidar ZR, Han M, Kohler W, Lama TM, Le Corf A, Loyer C, Maesen S, McMillan D, Li S, Lo J, Rey C, Capel SLR, Singer M, Slocum K, Thomas W, Tyburec JD, Villa S, Miller R, Buchalski M, Vazquez-Medina JP, Pfeffer S, Etienne L, Enard D, Sudmant PH. Extensive longevity and DNA virus-driven adaptation in nearctic Myotis bats. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.10.617725. [PMID: 39416019 PMCID: PMC11482938 DOI: 10.1101/2024.10.10.617725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
The genus Myotis is one of the largest clades of bats, and exhibits some of the most extreme variation in lifespans among mammals alongside unique adaptations to viral tolerance and immune defense. To study the evolution of longevity-associated traits and infectious disease, we generated near-complete genome assemblies and cell lines for 8 closely related species of Myotis. Using genome-wide screens of positive selection, analyses of structural variation, and functional experiments in primary cell lines, we identify new patterns of adaptation contributing to longevity, cancer resistance, and viral interactions in bats. We find that Myotis bats have some of the most significant variation in cancer risk across mammals and demonstrate a unique DNA damage response in primary cells of the long-lived M. lucifugus. We also find evidence of abundant adaptation in response to DNA viruses - but not RNA viruses - in Myotis and other bats in sharp contrast with other mammals, potentially contributing to the role of bats as reservoirs of zoonoses. Together, our results demonstrate how genomics and primary cells derived from diverse taxa uncover the molecular bases of extreme adaptations in non-model organisms.
Collapse
Affiliation(s)
- Juan M Vazquez
- Department of Integrative Biology, University of California, Berkeley, Berkeley, CA USA
- These authors contributed equally
| | - M. Elise Lauterbur
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ USA
- Current affiliation: Department of Biology, University of Vermont, Burlington, VT USA
- These authors contributed equally
| | - Saba Mottaghinia
- Centre International de Recherche en Infectiologie (CIRI), Inserm U1111, UCBL1, CNRS UMR5308, Ecole Normale Supérieure ENS de Lyon, Université de Lyon, Lyon, France
| | - Melanie Bucci
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ USA
| | - Devaughn Fraser
- Wildlife Genetics Research Unit, Wildlife Health Laboratory, California Department of Fish and Wildlife, Sacramento, CA, United States
- Current affiliation: Wildlife Diversity Program, Wildlife Division, Connecticut Department of Energy and Environmental Protection, Burlington, CT, United States
| | | | - Léa Gaucherand
- Université de Strasbourg, Architecture et Réactivité de l’ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Strasbourg, France
| | - Zeinab R Haidar
- Department of Biology, California State Polytechnic University, Humboldt, Arcata, CA USA
- Current affiliation: Western EcoSystems Technology Inc, Cheyenne, WY USA
| | - Melissa Han
- Department of Pathology and Clinical Laboratories, University of Michigan, Ann Arbor, MI USA
| | - William Kohler
- Department of Pathology and Clinical Laboratories, University of Michigan, Ann Arbor, MI USA
| | - Tanya M. Lama
- Department of Biological Sciences, Smith College, Northampton, MA USA
| | - Amandine Le Corf
- Centre International de Recherche en Infectiologie (CIRI), Inserm U1111, UCBL1, CNRS UMR5308, Ecole Normale Supérieure ENS de Lyon, Université de Lyon, Lyon, France
| | - Clara Loyer
- Centre International de Recherche en Infectiologie (CIRI), Inserm U1111, UCBL1, CNRS UMR5308, Ecole Normale Supérieure ENS de Lyon, Université de Lyon, Lyon, France
| | - Sarah Maesen
- Centre International de Recherche en Infectiologie (CIRI), Inserm U1111, UCBL1, CNRS UMR5308, Ecole Normale Supérieure ENS de Lyon, Université de Lyon, Lyon, France
| | - Dakota McMillan
- Department of Integrative Biology, University of California, Berkeley, Berkeley, CA USA
- Department of Science and Biotechnology, Berkeley City College, Berkeley, CA USA
| | - Stacy Li
- Department of Integrative Biology, University of California, Berkeley, Berkeley, CA USA
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA USA
| | - Johnathan Lo
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA USA
| | - Carine Rey
- Centre International de Recherche en Infectiologie (CIRI), Inserm U1111, UCBL1, CNRS UMR5308, Ecole Normale Supérieure ENS de Lyon, Université de Lyon, Lyon, France
| | - Samantha LR Capel
- Current affiliation: Wildlife Diversity Program, Wildlife Division, Connecticut Department of Energy and Environmental Protection, Burlington, CT, United States
| | - Michael Singer
- Department of Molecular and Cellular Biology, University of California, Berkeley, Berkeley, CA USA
| | | | - William Thomas
- Department of Ecology and Evolution, Stony Brook University, Stony Brook NY USA
| | | | - Sarah Villa
- Department of Molecular and Cellular Biology, University of California, Berkeley, Berkeley, CA USA
| | - Richard Miller
- Department of Pathology and Clinical Laboratories, University of Michigan, Ann Arbor, MI USA
| | - Michael Buchalski
- Wildlife Genetics Research Unit, Wildlife Health Laboratory, California Department of Fish and Wildlife, Sacramento, CA, United States
| | | | - Sébastien Pfeffer
- Université de Strasbourg, Architecture et Réactivité de l’ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Strasbourg, France
| | - Lucie Etienne
- Centre International de Recherche en Infectiologie (CIRI), Inserm U1111, UCBL1, CNRS UMR5308, Ecole Normale Supérieure ENS de Lyon, Université de Lyon, Lyon, France
- Senior author
| | - David Enard
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ USA
- Senior author
- These authors contributed equally
| | - Peter H Sudmant
- Department of Integrative Biology, University of California, Berkeley, Berkeley, CA USA
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA USA
- Senior author
- These authors contributed equally
- Lead contact
| |
Collapse
|
9
|
Shah S, Yu S, Zhang C, Ali I, Wang X, Qian Y, Xiao T. Retrotransposon SINEs in age-related diseases: Mechanisms and therapeutic implications. Ageing Res Rev 2024; 101:102539. [PMID: 39395576 DOI: 10.1016/j.arr.2024.102539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Revised: 09/27/2024] [Accepted: 10/03/2024] [Indexed: 10/14/2024]
Abstract
Retrotransposons are self-replicating genomic elements that move from one genomic location to another using a "copy-and-paste" method involving RNA intermediaries. One family of retrotransposon that has garnered considerable attention for its association with age-related diseases and anti-aging interventions is the short interspersed nuclear elements (SINEs). This review summarizes current knowledge on the roles of SINEs in aging processes and therapies. To underscore the significant research on the involvement of SINEs in aging-related diseases, we commence by outlining compelling evidence on the classification and mechanism, highlighting implications in age-related phenomena. The intricate relationship between SINEs and diseases such as neurodegenerative disorders, heart failure, high blood pressure, atherosclerosis, type 2 diabetes mellitus, osteoporosis, visual system dysfunctions, and cancer is explored, emphasizing their roles in various age-related diseases. Recent investigations into the anti-aging potential of SINE-targeted treatments are examined, with particular attention to how SINE antisense RNA mitigate age-related alterations at the cellular and molecular levels, offering insights into potential therapeutic targets for age-related pathologies. This review aims to compile the most recent advances on the multifaceted roles of SINE retrotransposons in age-related diseases and anti-aging interventions, providing valuable insights into underlying mechanisms and therapeutic avenues for promoting healthy aging.
Collapse
Affiliation(s)
- Suleman Shah
- Thoracic Surgery Department of the First Affiliated Hospital, Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, Department of Cell Biology and Genetics, Shenzhen University Medical School, Shenzhen University, Shenzhen 518055, China; Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, School of Biomedical Engineering, Shenzhen University Medical school, Shenzhen 518055, China
| | - Siyi Yu
- State Key Laboratory of Oral & Maxillofacial Reconstruction and Regeneration, Key Laboratory of Oral Biomedicine Ministry of Education, Hubei Key Laboratory of Stomatology, School & Hospital of Stomatology, Wuhan University, Wuhan 430079, China
| | - Chen Zhang
- Department of Thoracic Surgery, The People's Hospital of Guangxi Zhuang Autonomous Region, Guangxi Academy of Medical Sciences, Nanning 530021, China
| | - Ilyas Ali
- Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, School of Biomedical Engineering, Shenzhen University Medical school, Shenzhen 518055, China
| | - Xiufang Wang
- Department of Genetics, Hebei Medical University, Hebei Key Lab of Laboratory Animal, Shijiazhuang 050017, China
| | - Youhui Qian
- Thoracic Surgery Department of the First Affiliated Hospital, Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, Department of Cell Biology and Genetics, Shenzhen University Medical School, Shenzhen University, Shenzhen 518055, China.
| | - Tian Xiao
- Thoracic Surgery Department of the First Affiliated Hospital, Guangdong Key Laboratory of Genome Instability and Human Disease Prevention, Department of Cell Biology and Genetics, Shenzhen University Medical School, Shenzhen University, Shenzhen 518055, China.
| |
Collapse
|
10
|
Ali A, Liang P. Transposable elements contribute to tissue-specific gene regulation in humans. Genes Genomics 2024; 46:1327-1343. [PMID: 39088190 PMCID: PMC11602805 DOI: 10.1007/s13258-024-01550-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Accepted: 07/15/2024] [Indexed: 08/02/2024]
Abstract
BACKGROUND Transposable elements (TEs) contribute to approximately half of the human genome, and along with many other functions, they have been known to play a role in gene regulation in the genome. With TEs' active/repressed states varying across tissue and cell types, they have the potential to regulate gene expression in a tissue-specific manner. OBJECTIVE AND METHODS To provide a systematic analysis of TEs' contribution in tissue-specific gene regulation, we examined the regulatory elements and genes in association with TE-derived regulatory sequences in 14 human cell lines belonging to 10 different tissue types using the functional genomics data from the ENCODE project. Specifically, we separately analyzed regulatory regions identified by three different approaches (DNase hypersensitive sites (DHS), histone active sites (HA), and histone repressive sites (HR)). RESULTS These regulatory regions showed to be distinct from each other by sharing less than 2.5% among all three types and more than 95% showed to be cell line-specific. Despite a lower total TE content overall than the genome average, each regulatory sequence type showed enrichment for one or two specific TE type(s): DHS for long terminal repeats (LTRs) and DNA transposons, HA for short interspersed nucleotide elements (SINEs), and HR for LTRs. In contrast, SINE was shown to be overrepresented in all three types of regulatory sequences located in gene-neighboring regions. TE-regulated genes were mostly shown to have cell line specific pattern, and tissue-specific genes (TSGs) showed higher usage of TE regulatory sequences in the tissue of their expression. While TEs in the regulatory sequences showed to be older than their genome-wide counterparts, younger TEs were shown to be more likely used in cell line specific regulatory sequences. CONCLUSIONS Collectively, our study provided further evidence enforcing an important contribution of TEs to tissue-specific gene regulation in humans.
Collapse
Affiliation(s)
- Arsala Ali
- Department of Biological Sciences, Brock University, St. Catharines, ON, L2S 3A1, Canada
| | - Ping Liang
- Department of Biological Sciences, Brock University, St. Catharines, ON, L2S 3A1, Canada.
- Centre of Biotechnologies, Brock University, St. Catharines, ON, L2S 3A1, Canada.
| |
Collapse
|
11
|
Martelossi J, Iannello M, Ghiselli F, Luchetti A. Widespread HCD-tRNA derived SINEs in bivalves rely on multiple LINE partners and accumulate in genic regions. Mob DNA 2024; 15:22. [PMID: 39415259 PMCID: PMC11481361 DOI: 10.1186/s13100-024-00332-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Accepted: 10/03/2024] [Indexed: 10/18/2024] Open
Abstract
BACKGROUND Short interspersed nuclear elements (SINEs) are non-autonomous non-LTR retrotransposons widespread across eukaryotes. They exist both as lineage-specific, fast-evolving elements and as ubiquitous superfamilies characterized by highly conserved domains (HCD). Several of these superfamilies have been described in bivalves, however their overall distribution and impact on host genome evolution are still unknown due to the extreme scarcity of transposon libraries for the clade. In this study, we examined more than 40 bivalve genomes to uncover the distribution of HCD-tRNA-related SINEs, discover novel SINE-LINE partnerships, and understand their possible role in shaping bivalve genome evolution. RESULTS We found that bivalve HCD SINEs have an ancient origin, and they can rely on at least four different LINE clades. According to a "mosaic" evolutionary scenario, multiple LINE partner can promote the amplification of the same HCD SINE superfamilies while homologues LINE-derived tails are present between different superfamilies. Multiple SINEs were found to be highly similar between phylogenetically related species but separated by extremely long evolutionary timescales, up to ~ 400 million years. Studying their genomic distribution in a subset of five species, we observed different patterns of SINE enrichment in various genomic compartments as well as differences in the tendency of SINEs to form tandem-like and palindromic structures also within intronic sequences. Despite these differences, we observed that SINEs, especially older ones, tend to accumulate preferentially within genes, or in their close proximity, consistently with a model of survival bias for less harmful, short non-coding transposons in euchromatic genomic regions. CONCLUSION Here we conducted a wide characterization of tRNA-related SINEs in bivalves revealing their taxonomic distribution and LINE partnerships across the clade. Moreover, through the study of their genomic distribution in five species, we highlighted commonalities and differences with other previously studied eukaryotes, thus extending our understanding of SINE evolution across the tree of life.
Collapse
Affiliation(s)
- Jacopo Martelossi
- Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy.
| | - Mariangela Iannello
- Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
| | - Fabrizio Ghiselli
- Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy.
| | - Andrea Luchetti
- Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
| |
Collapse
|
12
|
Logsdon GA, Ebert P, Audano PA, Loftus M, Porubsky D, Ebler J, Yilmaz F, Hallast P, Prodanov T, Yoo D, Paisie CA, Harvey WT, Zhao X, Martino GV, Henglin M, Munson KM, Rabbani K, Chin CS, Gu B, Ashraf H, Austine-Orimoloye O, Balachandran P, Bonder MJ, Cheng H, Chong Z, Crabtree J, Gerstein M, Guethlein LA, Hasenfeld P, Hickey G, Hoekzema K, Hunt SE, Jensen M, Jiang Y, Koren S, Kwon Y, Li C, Li H, Li J, Norman PJ, Oshima KK, Paten B, Phillippy AM, Pollock NR, Rausch T, Rautiainen M, Scholz S, Song Y, Söylev A, Sulovari A, Surapaneni L, Tsapalou V, Zhou W, Zhou Y, Zhu Q, Zody MC, Mills RE, Devine SE, Shi X, Talkowski ME, Chaisson MJP, Dilthey AT, Konkel MK, Korbel JO, Lee C, Beck CR, Eichler EE, Marschall T. Complex genetic variation in nearly complete human genomes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.24.614721. [PMID: 39372794 PMCID: PMC11451754 DOI: 10.1101/2024.09.24.614721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/08/2024]
Abstract
Diverse sets of complete human genomes are required to construct a pangenome reference and to understand the extent of complex structural variation. Here, we sequence 65 diverse human genomes and build 130 haplotype-resolved assemblies (130 Mbp median continuity), closing 92% of all previous assembly gaps1,2 and reaching telomere-to-telomere (T2T) status for 39% of the chromosomes. We highlight complete sequence continuity of complex loci, including the major histocompatibility complex (MHC), SMN1/SMN2, NBPF8, and AMY1/AMY2, and fully resolve 1,852 complex structural variants (SVs). In addition, we completely assemble and validate 1,246 human centromeres. We find up to 30-fold variation in α-satellite high-order repeat (HOR) array length and characterize the pattern of mobile element insertions into α-satellite HOR arrays. While most centromeres predict a single site of kinetochore attachment, epigenetic analysis suggests the presence of two hypomethylated regions for 7% of centromeres. Combining our data with the draft pangenome reference1 significantly enhances genotyping accuracy from short-read data, enabling whole-genome inference3 to a median quality value (QV) of 45. Using this approach, 26,115 SVs per sample are detected, substantially increasing the number of SVs now amenable to downstream disease association studies.
Collapse
Affiliation(s)
- Glennis A Logsdon
- Perelman School of Medicine, University of Pennsylvania, Department of Genetics, Epigenetics Institute, Philadelphia, PA, USA
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Peter Ebert
- Core Unit Bioinformatics, Medical Faculty and University Hospital Düsseldorf, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Peter A Audano
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Mark Loftus
- Clemson University, Department of Genetics & Biochemistry, Clemson, SC, USA
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Jana Ebler
- Institute for Medical Biometry and Bioinformatics, Medical Faculty and University Hospital Düsseldorf, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Feyza Yilmaz
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Pille Hallast
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Timofey Prodanov
- Institute for Medical Biometry and Bioinformatics, Medical Faculty and University Hospital Düsseldorf, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - DongAhn Yoo
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Carolyn A Paisie
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Xuefang Zhao
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Gianni V Martino
- Clemson University, Department of Genetics & Biochemistry, Clemson, SC, USA
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
- Medical University of South Carolina, College of Graduate Studies, Charleston, SC, USA
| | - Mir Henglin
- Institute for Medical Biometry and Bioinformatics, Medical Faculty and University Hospital Düsseldorf, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Keon Rabbani
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Chen-Shan Chin
- Foundation of Biological Data Sciences, Belmont, CA, USA
| | - Bida Gu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Hufsah Ashraf
- Institute for Medical Biometry and Bioinformatics, Medical Faculty and University Hospital Düsseldorf, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Olanrewaju Austine-Orimoloye
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
| | | | - Marc Jan Bonder
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands; Oncode Institute, Utrecht, The Netherlands
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center, Heidelberg, Germany
| | - Haoyu Cheng
- Department of Biomedical Informatics and Data Science, Yale School of Medicine, New Haven, CT, USA
| | - Zechen Chong
- Department of Biomedical Informatics and Data Science, Heersink School of Medicine, University of Alabama, Birmingham, AL, USA
| | - Jonathan Crabtree
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Mark Gerstein
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - Lisbeth A Guethlein
- Department of Structural Biology, School of Medicine, Stanford University, Stanford, CA, USA
| | - Patrick Hasenfeld
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Glenn Hickey
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Sarah E Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Matthew Jensen
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - Yunzhe Jiang
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - Sergey Koren
- Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Youngjun Kwon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Chong Li
- Temple University, Department of Computer and Information Sciences, College of Science and Technology, Philadelphia, PA, USA
- Temple University, Institute for Genomics and Evolutionary Medicine, Philadelphia, PA, USA
| | - Heng Li
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Jiaqi Li
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - Paul J Norman
- Department of Biomedical Informatics, University of Colorado School of Medicine, Aurora, CO, USA
- Department of Immunology and Microbiology, University of Colorado School of Medicine, Aurora, CO, USA
| | - Keisuke K Oshima
- Perelman School of Medicine, University of Pennsylvania, Department of Genetics, Epigenetics Institute, Philadelphia, PA, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Adam M Phillippy
- Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Nicholas R Pollock
- Department of Biomedical Informatics, University of Colorado School of Medicine, Aurora, CO, USA
| | - Tobias Rausch
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Mikko Rautiainen
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| | - Stephan Scholz
- Institute of Medical Microbiology and Hospital Hygiene, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
| | - Yuwei Song
- Department of Biomedical Informatics and Data Science, Heersink School of Medicine, University of Alabama, Birmingham, AL, USA
| | - Arda Söylev
- Institute for Medical Biometry and Bioinformatics, Medical Faculty and University Hospital Düsseldorf, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Arvis Sulovari
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Likhitha Surapaneni
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Vasiliki Tsapalou
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Weichen Zhou
- Department of Computational Medicine & Bioinformatics, University of Michigan, MI, USA
| | - Ying Zhou
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Qihui Zhu
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- Stanford Health Care, Palo Alto, CA, USA
| | | | - Ryan E Mills
- Department of Computational Medicine & Bioinformatics, University of Michigan, MI, USA
| | - Scott E Devine
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Xinghua Shi
- Temple University, Department of Computer and Information Sciences, College of Science and Technology, Philadelphia, PA, USA
- Temple University, Institute for Genomics and Evolutionary Medicine, Philadelphia, PA, USA
| | - Mike E Talkowski
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Harvard Medical School, Boston, MA, USA
| | - Mark J P Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Alexander T Dilthey
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
- Institute of Medical Microbiology and Hospital Hygiene, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
| | - Miriam K Konkel
- Clemson University, Department of Genetics & Biochemistry, Clemson, SC, USA
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
| | - Jan O Korbel
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Christine R Beck
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- The University of Connecticut Health Center, Farmington, CT, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Tobias Marschall
- Institute for Medical Biometry and Bioinformatics, Medical Faculty and University Hospital Düsseldorf, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| |
Collapse
|
13
|
Höps W, Rausch T, Jendrusch M, Korbel JO, Sedlazeck FJ. Impact and characterization of serial structural variations across humans and great apes. Nat Commun 2024; 15:8007. [PMID: 39266513 PMCID: PMC11393467 DOI: 10.1038/s41467-024-52027-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Accepted: 08/23/2024] [Indexed: 09/14/2024] Open
Abstract
Modern sequencing technology enables the systematic detection of complex structural variation (SV) across genomes. However, extensive DNA rearrangements arising through a series of mutations, a phenomenon we refer to as serial SV (sSV), remain underexplored, posing a challenge for SV discovery. Here, we present NAHRwhals ( https://github.com/WHops/NAHRwhals ), a method to infer repeat-mediated series of SVs in long-read genomic assemblies. Applying NAHRwhals to haplotype-resolved human genomes from 28 individuals reveals 37 sSV loci of various length and complexity. These sSVs explain otherwise cryptic variation in medically relevant regions such as the TPSAB1 gene, 8p23.1, 22q11 and Sotos syndrome regions. Comparisons with great ape assemblies indicate that most human sSVs formed recently, after the human-ape split, and involved non-repeat-mediated processes in addition to non-allelic homologous recombination. NAHRwhals reliably discovers and characterizes sSVs at scale and independent of species, uncovering their genomic abundance and suggesting broader implications for disease.
Collapse
Affiliation(s)
- Wolfram Höps
- European Molecular Biology Laboratory, Genome Biology Unit, Meyerhofstr. 1, 69117, Heidelberg, Germany
| | - Tobias Rausch
- European Molecular Biology Laboratory, Genome Biology Unit, Meyerhofstr. 1, 69117, Heidelberg, Germany
- Molecular Medicine Partnership Unit, European Molecular Biology Laboratory, University of Heidelberg, Heidelberg, Germany
| | - Michael Jendrusch
- European Molecular Biology Laboratory, Genome Biology Unit, Meyerhofstr. 1, 69117, Heidelberg, Germany
| | - Jan O Korbel
- European Molecular Biology Laboratory, Genome Biology Unit, Meyerhofstr. 1, 69117, Heidelberg, Germany.
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, 77030, USA
- Department of Computer Science, Rice University, Houston, TX, USA
| |
Collapse
|
14
|
Wang Y, Xu H, He Q, Wu Z, Han GZ. Natural Transposable Element Insertions Contribute to Host Fitness in Model Yeasts. Genome Biol Evol 2024; 16:evae193. [PMID: 39228319 PMCID: PMC11403283 DOI: 10.1093/gbe/evae193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Revised: 08/30/2024] [Accepted: 08/31/2024] [Indexed: 09/05/2024] Open
Abstract
Transposable elements (TEs) are ubiquitous in the eukaryote genomes, but their evolutionary and functional significance remains largely obscure and contentious. Here, we explore the evolution and functional impact of TEs in two model unicellular eukaryotes, the fission yeast Schizosaccharomyces pombe and the budding yeast Saccharomyces cerevisiae, which diverged around 330 to 420 million years ago. We analyze the distribution of LTR retrotransposons (LTR-RTs, the only TE order identified in both species) and their solo-LTR derivatives in 35 strains of S. pombe and 128 strains of S. cerevisiae. We find that natural LTR-RT and solo-LTR insertions exhibit high presence-absence polymorphism among individuals in both species. Population genetics analyses show that solo-LTR insertions experienced functional constraints similar to synonymous sites of host genes in both species, indicating a majority of solo-LTR insertions might have evolved in a neutral manner. When knocking out nine representative solo-LTR insertions separately in the S. pombe strain 972h- and 12 representative solo-LTR insertions separately in the S. cerevisiae strain S288C, we find that one solo-LTR insertion in S. pombe has a significant effect on the fitness and transcriptome of its host. Together, our findings indicate that a fraction of natural TE insertions likely shape their host transcriptomes and thereby contribute to their host fitness, with implications for understanding the functional significance of TEs in eukaryotes.
Collapse
Affiliation(s)
- Yan Wang
- College of Life Sciences, Nanjing Normal University, Nanjing, Jiangsu 210023, China
| | - Hao Xu
- College of Life Sciences, Nanjing Normal University, Nanjing, Jiangsu 210023, China
| | - Qinliu He
- College of Life Sciences, Nanjing Normal University, Nanjing, Jiangsu 210023, China
| | - Zhiwei Wu
- College of Life Sciences, Nanjing Normal University, Nanjing, Jiangsu 210023, China
| | - Guan-Zhu Han
- College of Life Sciences, Nanjing Normal University, Nanjing, Jiangsu 210023, China
| |
Collapse
|
15
|
Trost H, Lopezcolorado FW, Merkell A, Stark JM. Functions of PMS2 and MLH1 important for regulation of divergent repeat-mediated deletions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.05.606388. [PMID: 39149360 PMCID: PMC11326157 DOI: 10.1101/2024.08.05.606388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
Repeat-mediated deletions (RMDs) are a type of deletion rearrangement that utilizes two repetitive elements to bridge a DNA double-strand break (DSB) that leads to loss of the intervening sequence and one of the repeats. Sequence divergence between repeats causes RMD suppression and indeed this divergence must be resolved in the RMD products. The mismatch repair factor, MLH1, was shown to be critical for both RMD suppression and a polarity of sequence divergence resolution in RMDs. Here, we sought to study the interrelationship between these two aspects of RMD regulation (i.e., RMD suppression and polar divergence resolution), by examining several mutants of MLH1 and its binding partner PMS2. To begin with, we show that PMS2 is also critical for both RMD suppression and polar resolution of sequence divergence in RMD products. Then, with six mutants of the MLH1-PMS2 heterodimer, we found several different patterns: three mutants showed defects in both functions, one mutant showed loss of RMD suppression but not polar divergence resolution, whereas another mutant showed the opposite, and finally one mutant showed loss of RMD suppression but had a complex effect on polar divergence resolution. These findings indicate that RMD suppression vs. polar resolution of sequence divergence are distinct functions of MLH1-PMS2.
Collapse
Affiliation(s)
- Hannah Trost
- Department of Cancer Genetics and Epigenetics, Beckman Research Institute of the City of Hope, Duarte, California 91010, USA
- Irell and Manella Graduate School of Biological Sciences, Beckman Research Institute of the City of Hope, Duarte, California 91010, USA
| | - Felicia Wednesday Lopezcolorado
- Department of Cancer Genetics and Epigenetics, Beckman Research Institute of the City of Hope, Duarte, California 91010, USA
| | - Arianna Merkell
- Department of Cancer Genetics and Epigenetics, Beckman Research Institute of the City of Hope, Duarte, California 91010, USA
| | - Jeremy M. Stark
- Department of Cancer Genetics and Epigenetics, Beckman Research Institute of the City of Hope, Duarte, California 91010, USA
- Irell and Manella Graduate School of Biological Sciences, Beckman Research Institute of the City of Hope, Duarte, California 91010, USA
| |
Collapse
|
16
|
Betancourt AJ, Wei KHC, Huang Y, Lee YCG. Causes and Consequences of Varying Transposable Element Activity: An Evolutionary Perspective. Annu Rev Genomics Hum Genet 2024; 25:1-25. [PMID: 38603565 DOI: 10.1146/annurev-genom-120822-105708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2024]
Abstract
Transposable elements (TEs) are genomic parasites found in nearly all eukaryotes, including humans. This evolutionary success of TEs is due to their replicative activity, involving insertion into new genomic locations. TE activity varies at multiple levels, from between taxa to within individuals. The rapidly accumulating evidence of the influence of TE activity on human health, as well as the rapid growth of new tools to study it, motivated an evaluation of what we know about TE activity thus far. Here, we discuss why TE activity varies, and the consequences of this variation, from an evolutionary perspective. By studying TE activity in nonhuman organisms in the context of evolutionary theories, we can shed light on the factors that affect TE activity. While the consequences of TE activity are usually deleterious, some have lasting evolutionary impacts by conferring benefits on the host or affecting other evolutionary processes.
Collapse
Affiliation(s)
- Andrea J Betancourt
- Institute of Infection, Veterinary, and Ecological Sciences, University of Liverpool, Liverpool, United Kingdom
| | - Kevin H-C Wei
- Department of Zoology, University of British Columbia, Vancouver, British Columbia, Canada
| | - Yuheng Huang
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California, USA
| | - Yuh Chwen G Lee
- Center for Complex Biological Systems, University of California, Irvine, California, USA;
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California, USA
| |
Collapse
|
17
|
Russo A, Alessandrini M, El Baidouri M, Frei D, Galise TR, Gaidusch L, Oertel HF, Garcia Morales SE, Potente G, Tian Q, Smetanin D, Bertrand JAM, Onstein RE, Panaud O, Frey JE, Cozzolino S, Wicker T, Xu S, Grossniklaus U, Schlüter PM. Genome of the early spider-orchid Ophrys sphegodes provides insights into sexual deception and pollinator adaptation. Nat Commun 2024; 15:6308. [PMID: 39060266 PMCID: PMC11282089 DOI: 10.1038/s41467-024-50622-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 07/17/2024] [Indexed: 07/28/2024] Open
Abstract
Pollinator-driven evolution of floral traits is thought to be a major driver of angiosperm speciation and diversification. Ophrys orchids mimic female insects to lure male pollinators into pseudocopulation. This strategy, called sexual deception, is species-specific, thereby providing strong premating reproductive isolation. Identifying the genomic architecture underlying pollinator adaptation and speciation may shed light on the mechanisms of angiosperm diversification. Here, we report the 5.2 Gb chromosome-scale genome sequence of Ophrys sphegodes. We find evidence for transposable element expansion that preceded the radiation of the O. sphegodes group, and for gene duplication having contributed to the evolution of chemical mimicry. We report a highly differentiated genomic candidate region for pollinator-mediated evolution on chromosome 2. The Ophrys genome will prove useful for investigations into the repeated evolution of sexual deception, pollinator adaptation and the genomic architectures that facilitate evolutionary radiations.
Collapse
Affiliation(s)
- Alessia Russo
- Department of Plant Evolutionary Biology, Institute of Biology, University of Hohenheim, Stuttgart, Germany.
- Department of Plant and Microbial Biology and Zürich-Basel Plant Science Centre, University of Zurich, Zürich, Switzerland.
- Department of Systematic and Evolutionary Botany and Zürich-Basel Plant Science Centre, University of Zurich, Zürich, Switzerland.
| | - Mattia Alessandrini
- Department of Plant Evolutionary Biology, Institute of Biology, University of Hohenheim, Stuttgart, Germany
| | - Moaine El Baidouri
- Université Perpignan Via Domitia, Laboratoire Génome et Développement des Plantes, UMR5096, Perpignan, France
- CNRS, Laboratoire Génome et Développement des Plantes, UMR5096, Perpignan, France
- EMR269 MANGO, Institut de Recherche pour le Développement, Perpignan, France
| | - Daniel Frei
- Department of Methods Development and Analytics, Agroscope, Wädenswil, Switzerland
| | | | - Lara Gaidusch
- Department of Plant Evolutionary Biology, Institute of Biology, University of Hohenheim, Stuttgart, Germany
| | - Hannah F Oertel
- Department of Plant Evolutionary Biology, Institute of Biology, University of Hohenheim, Stuttgart, Germany
| | - Sara E Garcia Morales
- Department of Plant Evolutionary Biology, Institute of Biology, University of Hohenheim, Stuttgart, Germany
| | - Giacomo Potente
- Department of Systematic and Evolutionary Botany and Zürich-Basel Plant Science Centre, University of Zurich, Zürich, Switzerland
| | - Qin Tian
- Naturalis Biodiversity Centre, Leiden, The Netherlands
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - Dmitry Smetanin
- Department of Plant and Microbial Biology and Zürich-Basel Plant Science Centre, University of Zurich, Zürich, Switzerland
| | - Joris A M Bertrand
- Université Perpignan Via Domitia, Laboratoire Génome et Développement des Plantes, UMR5096, Perpignan, France
- CNRS, Laboratoire Génome et Développement des Plantes, UMR5096, Perpignan, France
- EMR269 MANGO, Institut de Recherche pour le Développement, Perpignan, France
| | - Renske E Onstein
- Naturalis Biodiversity Centre, Leiden, The Netherlands
- German Centre for Integrative Biodiversity Research (iDiv) Halle - Jena - Leipzig, Leipzig, Germany
| | - Olivier Panaud
- Université Perpignan Via Domitia, Laboratoire Génome et Développement des Plantes, UMR5096, Perpignan, France
- CNRS, Laboratoire Génome et Développement des Plantes, UMR5096, Perpignan, France
- EMR269 MANGO, Institut de Recherche pour le Développement, Perpignan, France
| | - Jürg E Frey
- Department of Methods Development and Analytics, Agroscope, Wädenswil, Switzerland
| | | | - Thomas Wicker
- Department of Plant and Microbial Biology and Zürich-Basel Plant Science Centre, University of Zurich, Zürich, Switzerland
| | - Shuqing Xu
- Institute of Organismic and Molecular Evolution, University of Mainz, Mainz, Germany
| | - Ueli Grossniklaus
- Department of Plant and Microbial Biology and Zürich-Basel Plant Science Centre, University of Zurich, Zürich, Switzerland
| | - Philipp M Schlüter
- Department of Plant Evolutionary Biology, Institute of Biology, University of Hohenheim, Stuttgart, Germany.
- Department of Systematic and Evolutionary Botany and Zürich-Basel Plant Science Centre, University of Zurich, Zürich, Switzerland.
| |
Collapse
|
18
|
Cinar M, Martinez-Medina L, Puvvula P, Arakelyan A, Vardarajan B, Anthony N, Nagaraju G, Park D, Feng L, Sheff F, Mosunjac M, Saxe D, Flygare S, Alese O, Kaufman J, Lonial S, Sarmiento J, Lossos I, Vertino P, Lopez J, El-Rayes B, Bernal-Mizrachi L. Transposon DNA sequences facilitate the tissue-specific gene transfer of circulating tumor DNA between human cells. Nucleic Acids Res 2024; 52:7539-7555. [PMID: 38783375 PMCID: PMC11260451 DOI: 10.1093/nar/gkae427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Revised: 05/01/2024] [Accepted: 05/09/2024] [Indexed: 05/25/2024] Open
Abstract
The exchange of genes between cells is known to play an important physiological and pathological role in many organisms. We show that circulating tumor DNA (ctDNA) facilitates cell-specific gene transfer between human cancer cells and explain part of the mechanisms behind this phenomenon. As ctDNA migrates into the nucleus, genetic information is transferred. Cell targeting and ctDNA integration require ERVL, SINE or LINE DNA sequences. Chemically manufactured AluSp and MER11C sequences replicated multiple myeloma (MM) ctDNA cell targeting and integration. Additionally, we found that ctDNA may alter the treatment response of MM and pancreatic cancer models. This study shows that retrotransposon DNA sequences promote cancer gene transfer. However, because cell-free DNA has been detected in physiological and other pathological conditions, our findings have a broader impact than just cancer. Furthermore, the discovery that transposon DNA sequences mediate tissue-specific targeting will open up a new avenue for the delivery of genes and therapies.
Collapse
Affiliation(s)
- Munevver Cinar
- Department of Hematology and Medical Oncology, Winship Cancer Institute of Emory University, Atlanta, GA, USA
| | | | | | - Arsen Arakelyan
- Bioinformatics group, Institute of Molecular Biology NAS RA, Yerevan, Armenia
| | | | - Neil Anthony
- Integrated Cellular Imaging Core, Winship Cancer Institute of Emory University, Atlanta, GA, USA
| | - Ganji P Nagaraju
- Division of hematology and oncology, O’Neal Comprehensive Cancer Center, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Dongkyoo Park
- Department of Hematology and Medical Oncology, Winship Cancer Institute of Emory University, Atlanta, GA, USA
| | - Lei Feng
- Kodikaz Therapeutic Solutions, Inc, New York, NY, USA
| | - Faith Sheff
- Pathology and Laboratory Medicine, Winship Cancer Institute of Emory University, Atlanta, GA, USA
| | - Marina Mosunjac
- Pathology and Laboratory Medicine, Winship Cancer Institute of Emory University, Atlanta, GA, USA
| | - Debra Saxe
- Pathology and Laboratory Medicine, Winship Cancer Institute of Emory University, Atlanta, GA, USA
| | - Steven Flygare
- Department of Computational Biology/ Genetics, The University of Utah, Salt Lake City, UT, USA
| | - Olatunji B Alese
- Department of Hematology and Medical Oncology, Winship Cancer Institute of Emory University, Atlanta, GA, USA
| | - Jonathan L Kaufman
- Department of Hematology and Medical Oncology, Winship Cancer Institute of Emory University, Atlanta, GA, USA
| | - Sagar Lonial
- Department of Hematology and Medical Oncology, Winship Cancer Institute of Emory University, Atlanta, GA, USA
| | - Juan M Sarmiento
- Department of Surgery, Winship Cancer Institute of Emory University, Atlanta, GA, USA
| | - Izidore S Lossos
- Department of Medicine, Division of Hematology-Oncology and Molecular and Cellular Pharmacology, Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| | - Paula M Vertino
- Department of Biomedical Genetics and the Wilmot Cancer Institute, University of Rochester Medical Center, Rochester, NY, USA
| | - Jose A Lopez
- Bloodworks Northwest Research Institute, Division of Hematology, University of Washington School of Medicine, Seattle, WA, USA
| | - Bassel El-Rayes
- Division of hematology and oncology, O’Neal Comprehensive Cancer Center, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Leon Bernal-Mizrachi
- Department of Hematology and Medical Oncology, Winship Cancer Institute of Emory University, Atlanta, GA, USA
| |
Collapse
|
19
|
Hu K, Ni P, Xu M, Zou Y, Chang J, Gao X, Li Y, Ruan J, Hu B, Wang J. HiTE: a fast and accurate dynamic boundary adjustment approach for full-length transposable element detection and annotation. Nat Commun 2024; 15:5573. [PMID: 38956036 PMCID: PMC11219922 DOI: 10.1038/s41467-024-49912-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Accepted: 06/25/2024] [Indexed: 07/04/2024] Open
Abstract
Recent advancements in genome assembly have greatly improved the prospects for comprehensive annotation of Transposable Elements (TEs). However, existing methods for TE annotation using genome assemblies suffer from limited accuracy and robustness, requiring extensive manual editing. In addition, the currently available gold-standard TE databases are not comprehensive, even for extensively studied species, highlighting the critical need for an automated TE detection method to supplement existing repositories. In this study, we introduce HiTE, a fast and accurate dynamic boundary adjustment approach designed to detect full-length TEs. The experimental results demonstrate that HiTE outperforms RepeatModeler2, the state-of-the-art tool, across various species. Furthermore, HiTE has identified numerous novel transposons with well-defined structures containing protein-coding domains, some of which are directly inserted within crucial genes, leading to direct alterations in gene expression. A Nextflow version of HiTE is also available, with enhanced parallelism, reproducibility, and portability.
Collapse
Affiliation(s)
- Kang Hu
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China
- Xiangjiang Laboratory, Changsha, 410205, China
- Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, China
| | - Peng Ni
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China
- Xiangjiang Laboratory, Changsha, 410205, China
- Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, China
| | - Minghua Xu
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China
- Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, China
| | - You Zou
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China
- Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, China
| | - Jianye Chang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
| | - Xin Gao
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
- Center of Excellence on Smart Health, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Yaohang Li
- Department of Computer Science, Old Dominion University, Norfolk, VA, 23529, USA
| | - Jue Ruan
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
| | - Bin Hu
- Key Laboratory of Brain Health Intelligent Evaluation and Intervention, Ministry of Education (Beijing Institute of Technology), Beijing, P. R. China.
- School of Medical Technology, Beijing Institute of Technology, Beijing, P. R. China.
| | - Jianxin Wang
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China.
- Xiangjiang Laboratory, Changsha, 410205, China.
- Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, China.
| |
Collapse
|
20
|
Lee RJ, Horton CA, Van Treeck B, McIntyre JJR, Collins K. Conserved and divergent DNA recognition specificities and functions of R2 retrotransposon N-terminal domains. Cell Rep 2024; 43:114239. [PMID: 38753487 PMCID: PMC11204384 DOI: 10.1016/j.celrep.2024.114239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 04/04/2024] [Accepted: 05/01/2024] [Indexed: 05/18/2024] Open
Abstract
R2 non-long terminal repeat (non-LTR) retrotransposons are among the most extensively distributed mobile genetic elements in multicellular eukaryotes and show promise for applications in transgene supplementation of the human genome. They insert new gene copies into a conserved site in 28S ribosomal DNA with exquisite specificity. R2 clades are defined by the number of zinc fingers (ZFs) at the N terminus of the retrotransposon-encoded protein, postulated to additively confer DNA site specificity. Here, we illuminate general principles of DNA recognition by R2 N-terminal domains across and between clades, with extensive, specific recognition requiring only one or two compact domains. DNA-binding and protection assays demonstrate broadly shared as well as clade-specific DNA interactions. Gene insertion assays in cells identify the N-terminal domains sufficient for target-site insertion and reveal roles in second-strand cleavage or synthesis for clade-specific ZFs. Our results have implications for understanding evolutionary diversification of non-LTR retrotransposon insertion mechanisms and the design of retrotransposon-based gene therapies.
Collapse
Affiliation(s)
- Rosa Jooyoung Lee
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA
| | - Connor A Horton
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA
| | - Briana Van Treeck
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA
| | - Jeremy J R McIntyre
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA
| | - Kathleen Collins
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA.
| |
Collapse
|
21
|
Schloissnig S, Pani S, Rodriguez-Martin B, Ebler J, Hain C, Tsapalou V, Söylev A, Hüther P, Ashraf H, Prodanov T, Asparuhova M, Hunt S, Rausch T, Marschall T, Korbel JO. Long-read sequencing and structural variant characterization in 1,019 samples from the 1000 Genomes Project. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.18.590093. [PMID: 38659906 PMCID: PMC11042266 DOI: 10.1101/2024.04.18.590093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
Structural variants (SVs) contribute significantly to human genetic diversity and disease 1-4 . Previously, SVs have remained incompletely resolved by population genomics, with short-read sequencing facing limitations in capturing the whole spectrum of SVs at nucleotide resolution 5-7 . Here we leveraged nanopore sequencing 8 to construct an intermediate coverage resource of 1,019 long-read genomes sampled within 26 human populations from the 1000 Genomes Project. By integrating linear and graph-based approaches for SV analysis via pangenome graph-augmentation, we uncover 167,291 sequence-resolved SVs in these samples, considerably advancing SV characterization compared to population-wide short-read sequencing studies 3,4 . Our analysis details diverse SV classes-deletions, duplications, insertions, and inversions-at population-scale. LINE-1 and SVA retrotransposition activities frequently mediate transductions 9,10 of unique sequences, with both mobile element classes transducing sequences at either the 3'- or 5'-end, depending on the source element locus. Furthermore, analyses of SV breakpoint junctions suggest a continuum of homology-mediated rearrangement processes are integral to SV formation, and highlight evidence for SV recurrence involving repeat sequences. Our open-access dataset underscores the transformative impact of long-read sequencing in advancing the characterisation of polymorphic genomic architectures, and provides a resource for guiding variant prioritisation in future long-read sequencing-based disease studies.
Collapse
|
22
|
Jin SW, Seong Y, Yoon D, Kwon YS, Song H. Dissolution of ribonucleoprotein condensates by the embryonic stem cell protein L1TD1. Nucleic Acids Res 2024; 52:3310-3326. [PMID: 38165001 PMCID: PMC11014241 DOI: 10.1093/nar/gkad1244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 11/22/2023] [Accepted: 12/18/2023] [Indexed: 01/03/2024] Open
Abstract
L1TD1 is a cytoplasmic RNA-binding protein specifically expressed in pluripotent stem cells and, unlike its mouse ortholog, is essential for the maintenance of stemness in human cells. Although L1TD1 is the only known protein-coding gene domesticated from a LINE-1 (L1) retroelement, the functional legacy of its ancestral protein, ORF1p of L1, and how it is manifested in L1TD1 are still unknown. Here, we determined RNAs associated with L1TD1 and found that, like ORF1p, L1TD1 binds L1 RNAs and localizes to high-density ribonucleoprotein (RNP) condensates. Unexpectedly, L1TD1 enhanced the translation of a subset of mRNAs enriched in the condensates. L1TD1 depletion promoted the formation of stress granules in embryonic stem cells. In HeLa cells, ectopically expressed L1TD1 facilitated the dissolution of stress granules and granules formed by pathological mutations of TDP-43 and FUS. The glutamate-rich domain and the ORF1-homology domain of L1TD1 facilitated dispersal of the RNPs and induced autophagy, respectively. These results provide insights into how L1TD1 regulates gene expression in pluripotent stem cells. We propose that the ability of L1TD1 to dissolve stress granules may provide novel opportunities for treatment of neurodegenerative diseases caused by disturbed stress granule dynamics.
Collapse
Affiliation(s)
- Sang Woo Jin
- Department of Biomedical Sciences, College of Medicine, Korea University, Seoul 02841, Republic of Korea
| | - Youngmo Seong
- Department of Biomedical Sciences, College of Medicine, Korea University, Seoul 02841, Republic of Korea
| | - Dayoung Yoon
- Department of Biomedical Sciences, College of Medicine, Korea University, Seoul 02841, Republic of Korea
| | - Young-Soo Kwon
- Department of Integrative Bioscience & Biotechnology, Sejong University, Seoul 05006, Republic of Korea
| | - Hoseok Song
- Department of Biomedical Sciences, College of Medicine, Korea University, Seoul 02841, Republic of Korea
| |
Collapse
|
23
|
Le Breton A, Bettencourt MP, Gendrel AV. Navigating the brain and aging: exploring the impact of transposable elements from health to disease. Front Cell Dev Biol 2024; 12:1357576. [PMID: 38476259 PMCID: PMC10927736 DOI: 10.3389/fcell.2024.1357576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 02/08/2024] [Indexed: 03/14/2024] Open
Abstract
Transposable elements (TEs) are mobile genetic elements that constitute on average 45% of mammalian genomes. Their presence and activity in genomes represent a major source of genetic variability. While this is an important driver of genome evolution, TEs can also have deleterious effects on their hosts. A growing number of studies have focused on the role of TEs in the brain, both in physiological and pathological contexts. In the brain, their activity is believed to be important for neuronal plasticity. In neurological and age-related disorders, aberrant activity of TEs may contribute to disease etiology, although this remains unclear. After providing a comprehensive overview of transposable elements and their interactions with the host, this review summarizes the current understanding of TE activity within the brain, during the aging process, and in the context of neurological and age-related conditions.
Collapse
Affiliation(s)
| | | | - Anne-Valerie Gendrel
- Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| |
Collapse
|
24
|
Audano PA, Beck CR. Small polymorphisms are a source of ancestral bias in structural variant breakpoint placement. Genome Res 2024; 34:7-19. [PMID: 38176712 PMCID: PMC10904011 DOI: 10.1101/gr.278203.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 01/02/2024] [Indexed: 01/06/2024]
Abstract
High-quality genome assemblies and sophisticated algorithms have increased sensitivity for a wide range of variant types, and breakpoint accuracy for structural variants (SVs, ≥50 bp) has improved to near base pair precision. Despite these advances, many SV breakpoint locations are subject to systematic bias affecting variant representation. To understand why SV breakpoints are inconsistent across samples, we reanalyzed 64 phased haplotypes constructed from long-read assemblies released by the Human Genome Structural Variation Consortium (HGSVC). We identify 882 SV insertions and 180 SV deletions with variable breakpoints not anchored in tandem repeats (TRs) or segmental duplications (SDs). SVs called from aligned sequencing reads increase breakpoint disagreements by 2×-16×. Sequence accuracy had a minimal impact on breakpoints, but we observe a strong effect of ancestry. We confirm that SNP and indel polymorphisms are enriched at shifted breakpoints and are also absent from variant callsets. Breakpoint homology increases the likelihood of imprecise SV calls and the distance they are shifted, and tandem duplications are the most heavily affected SVs. Because graph genome methods normalize SV calls across samples, we investigated graphs generated by two different methods and find the resulting breakpoints are subject to other technical biases affecting breakpoint accuracy. The breakpoint inconsistencies we characterize affect ∼5% of the SVs called in a human genome and can impact variant interpretation and annotation. These limitations underscore a need for algorithm development to improve SV databases, mitigate the impact of ancestry on breakpoints, and increase the value of callsets for investigating breakpoint features.
Collapse
Affiliation(s)
- Peter A Audano
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
| | - Christine R Beck
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA;
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030, USA
| |
Collapse
|
25
|
Mendez-Dorantes C, Burns KH. LINE-1 retrotransposition and its deregulation in cancers: implications for therapeutic opportunities. Genes Dev 2023; 37:948-967. [PMID: 38092519 PMCID: PMC10760644 DOI: 10.1101/gad.351051.123] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023]
Abstract
Long interspersed element 1 (LINE-1) is the only protein-coding transposon that is active in humans. LINE-1 propagates in the genome using RNA intermediates via retrotransposition. This activity has resulted in LINE-1 sequences occupying approximately one-fifth of our genome. Although most copies of LINE-1 are immobile, ∼100 copies are retrotransposition-competent. Retrotransposition is normally limited via epigenetic silencing, DNA repair, and other host defense mechanisms. In contrast, LINE-1 overexpression and retrotransposition are hallmarks of cancers. Here, we review mechanisms of LINE-1 regulation and how LINE-1 may promote genetic heterogeneity in tumors. Finally, we discuss therapeutic strategies to exploit LINE-1 biology in cancers.
Collapse
Affiliation(s)
- Carlos Mendez-Dorantes
- Department of Pathology, Dana-Farber Cancer Institute, Boston, Massachusetts 02115, USA;
- Department of Pathology, Harvard Medical School, Boston, Massachusetts 02115, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
| | - Kathleen H Burns
- Department of Pathology, Dana-Farber Cancer Institute, Boston, Massachusetts 02115, USA;
- Department of Pathology, Harvard Medical School, Boston, Massachusetts 02115, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
| |
Collapse
|
26
|
Lukšíková K, Pavlica T, Altmanová M, Štundlová J, Pelikánová Š, Simanovsky SA, Krysanov EY, Jankásek M, Hiřman M, Reichard M, Ráb P, Sember A. Conserved satellite DNA motif and lack of interstitial telomeric sites in highly rearranged African Nothobranchius killifish karyotypes. JOURNAL OF FISH BIOLOGY 2023; 103:1501-1514. [PMID: 37661806 DOI: 10.1111/jfb.15550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 08/27/2023] [Accepted: 08/29/2023] [Indexed: 09/05/2023]
Abstract
Using African annual killifishes of the genus Nothobranchius from temporary savannah pools with rapid karyotype and sex chromosome evolution, we analysed the chromosomal distribution of telomeric (TTAGGG)n repeat and Nfu-SatC satellite DNA (satDNA; isolated from Nothobranchius furzeri) in 15 species across the Nothobranchius killifish phylogeny, and with Fundulosoma thierryi as an out-group. Our fluorescence in situ hybridization experiments revealed that all analysed taxa share the presence of Nfu-SatC repeat but with diverse organization and distribution on chromosomes. Nfu-SatC landscape was similar in conspecific populations of Nothobranchius guentheri and Nothobranchius melanospilus but slightly-to-moderately differed between populations of Nothobranchius pienaari, and between closely related Nothobranchius kuhntae and Nothobranchius orthonotus. Inter-individual variability in Nfu-SatC patterns was found in N. orthonotus and Nothobranchius krysanovi. We revealed mostly no sex-linked patterns of studied repetitive DNA distribution. Only in Nothobranchius brieni, possessing multiple sex chromosomes, Nfu-SatC repeat occupied a substantial portion of the neo-Y chromosome, similarly as formerly found in the XY sex chromosome system of turquoise killifish N. furzeri and its sister species Nothobranchius kadleci-representatives not closely related to N. brieni. All studied species further shared patterns of expected telomeric repeats at the ends of all chromosomes and no additional interstitial telomeric sites. In summary, we revealed (i) the presence of conserved satDNA class in Nothobranchius clades (a rare pattern among ray-finned fishes); (ii) independent trajectories of Nothobranchius sex chromosome differentiation, with recurrent and convergent accumulation of Nfu-SatC on the Y chromosome in some species; and (iii) genus-wide shared tendency to loss of telomeric repeats during interchromosomal rearrangements. Collectively, our findings advance our understanding of genome structure, mechanisms of karyotype reshuffling, and sex chromosome differentiation in Nothobranchius killifishes from the genus-wide perspective.
Collapse
Affiliation(s)
- Karolína Lukšíková
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
- Department of Genetics and Microbiology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Tomáš Pavlica
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Marie Altmanová
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
- Department of Ecology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Jana Štundlová
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
- University of South Bohemia, Faculty of Science, České Budějovice, Czech Republic
| | - Šárka Pelikánová
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
| | - Sergey A Simanovsky
- Severtsov Institute of Ecology and Evolution, Russian Academy of Sciences, Moscow, Russia
| | - Eugene Yu Krysanov
- Severtsov Institute of Ecology and Evolution, Russian Academy of Sciences, Moscow, Russia
| | - Marek Jankásek
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Matyáš Hiřman
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Martin Reichard
- Institute of Vertebrate Biology, Czech Academy of Sciences, Czech Republic
- Department of Ecology and Vertebrate Zoology, University of Łódź, Łódź, Poland
- Department of Botany and Zoology, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Petr Ráb
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
| | - Alexandr Sember
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
| |
Collapse
|
27
|
Jiang S, Luo Z, Wu J, Yu K, Zhao S, Cai Z, Yu W, Wang H, Cheng L, Liang Z, Gao H, Monti M, Schindler D, Huang L, Zeng C, Zhang W, Zhou C, Tang Y, Li T, Ma Y, Cai Y, Boeke JD, Zhao Q, Dai J. Building a eukaryotic chromosome arm by de novo design and synthesis. Nat Commun 2023; 14:7886. [PMID: 38036514 PMCID: PMC10689750 DOI: 10.1038/s41467-023-43531-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 11/13/2023] [Indexed: 12/02/2023] Open
Abstract
The genome of an organism is inherited from its ancestor and continues to evolve over time, however, the extent to which the current version could be altered remains unknown. To probe the genome plasticity of Saccharomyces cerevisiae, here we replace the native left arm of chromosome XII (chrXIIL) with a linear artificial chromosome harboring small sets of reconstructed genes. We find that as few as 12 genes are sufficient for cell viability, whereas 25 genes are required to recover the partial fitness defects observed in the 12-gene strain. Next, we demonstrate that these genes can be reconstructed individually using synthetic regulatory sequences and recoded open-reading frames with a "one-amino-acid-one-codon" strategy to remain functional. Finally, a synthetic neochromsome with the reconstructed genes is assembled which could substitute chrXIIL for viability. Together, our work not only highlights the high plasticity of yeast genome, but also illustrates the possibility of making functional eukaryotic chromosomes from entirely artificial sequences.
Collapse
Grants
- National Natural Science Foundation of China (31725002), Shenzhen Science and Technology Program (KQTD20180413181837372), Guangdong Provincial Key Laboratory of Synthetic Genomics (2019B030301006),Bureau of International Cooperation,Chinese Academy of Sciences (172644KYSB20180022) and Shenzhen Outstanding Talents Training Fund.
- National Key Research and Development Program of China (2018YFA0900100),National Natural Science Foundation of China (31800069),Guangdong Basic and Applied Basic Research Foundation (2023A1515030285)
- National Key Research and Development Program of China (2018YFA0900100), National Natural Science Foundation of China (31800082 and 32122050),Guangdong Natural Science Funds for Distinguished Young Scholar (2021B1515020060)
- UK Biotechnology and Biological Sciences Research Council (BBSRC) grants BB/M005690/1, BB/P02114X/1 and BB/W014483/1, and a Volkswagen Foundation “Life? Initiative” Grant (Ref. 94 771)
- US NSF grants MCB-1026068, MCB-1443299, MCB-1616111 and MCB-1921641
Collapse
Affiliation(s)
- Shuangying Jiang
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Zhouqing Luo
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- State Key Laboratory of Cellular Stress Biology, Innovation Center for Cell Signaling Network, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, Xiamen, Fujian, 361102, China
| | - Jie Wu
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Kang Yu
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Shijun Zhao
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Zelin Cai
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Wenfei Yu
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Hui Wang
- State Key Laboratory of Cellular Stress Biology, Innovation Center for Cell Signaling Network, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, Xiamen, Fujian, 361102, China
| | - Li Cheng
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Zhenzhen Liang
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Hui Gao
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Institute of Molecular Physiology, Shenzhen Bay Laboratory, Shenzhen, China
| | - Marco Monti
- Manchester Institute of Biotechnology, University of Manchester, Manchester, M1 7DN, UK
| | - Daniel Schindler
- Manchester Institute of Biotechnology, University of Manchester, Manchester, M1 7DN, UK
| | - Linsen Huang
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Cheng Zeng
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Weimin Zhang
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY, 10016, USA
| | - Chun Zhou
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yuanwei Tang
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Tianyi Li
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Yingxin Ma
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Yizhi Cai
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Manchester Institute of Biotechnology, University of Manchester, Manchester, M1 7DN, UK
| | - Jef D Boeke
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY, 10016, USA
- Department of Biomedical Engineering, NYU Tandon School of Engineering, Brooklyn, NY, 11201, USA
| | - Qiao Zhao
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
| | - Junbiao Dai
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
- University of Chinese Academy of Sciences, Beijing, China.
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China.
| |
Collapse
|
28
|
Liao X, Zhu W, Zhou J, Li H, Xu X, Zhang B, Gao X. Repetitive DNA sequence detection and its role in the human genome. Commun Biol 2023; 6:954. [PMID: 37726397 PMCID: PMC10509279 DOI: 10.1038/s42003-023-05322-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 09/04/2023] [Indexed: 09/21/2023] Open
Abstract
Repetitive DNA sequences playing critical roles in driving evolution, inducing variation, and regulating gene expression. In this review, we summarized the definition, arrangement, and structural characteristics of repeats. Besides, we introduced diverse biological functions of repeats and reviewed existing methods for automatic repeat detection, classification, and masking. Finally, we analyzed the type, structure, and regulation of repeats in the human genome and their role in the induction of complex diseases. We believe that this review will facilitate a comprehensive understanding of repeats and provide guidance for repeat annotation and in-depth exploration of its association with human diseases.
Collapse
Affiliation(s)
- Xingyu Liao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Wufei Zhu
- Department of Endocrinology, Yichang Central People's Hospital, The First College of Clinical Medical Science, China Three Gorges University, 443000, Yichang, P.R. China
| | - Juexiao Zhou
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Haoyang Li
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Xiaopeng Xu
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Bin Zhang
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Xin Gao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia.
| |
Collapse
|
29
|
Duhamel M, Hood ME, Rodríguez de la Vega RC, Giraud T. Dynamics of transposable element accumulation in the non-recombining regions of mating-type chromosomes in anther-smut fungi. Nat Commun 2023; 14:5692. [PMID: 37709766 PMCID: PMC10502011 DOI: 10.1038/s41467-023-41413-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 08/30/2023] [Indexed: 09/16/2023] Open
Abstract
In the absence of recombination, the number of transposable elements (TEs) increases due to less efficient selection, but the dynamics of such TE accumulations are not well characterized. Leveraging a dataset of 21 independent events of recombination cessation of different ages in mating-type chromosomes of Microbotryum fungi, we show that TEs rapidly accumulated in regions lacking recombination, but that TE content reached a plateau at ca. 50% of occupied base pairs by 1.5 million years following recombination suppression. The same TE superfamilies have expanded in independently evolved non-recombining regions, in particular rolling-circle replication elements (Helitrons). Long-terminal repeat (LTR) retrotransposons of the Copia and Ty3 superfamilies also expanded, through transposition bursts (distinguished from gene conversion based on LTR divergence), with both non-recombining regions and autosomes affected, suggesting that non-recombining regions constitute TE reservoirs. This study improves our knowledge of genome evolution by showing that TEs can accumulate through bursts, following non-linear decelerating dynamics.
Collapse
Affiliation(s)
- Marine Duhamel
- Ecologie Systématique Evolution, IDEEV, CNRS, Université Paris-Saclay, AgroParisTech, Bâtiment 680, 12 route RD128, 91190, Gif-sur-Yvette, France.
- Evolution der Pflanzen und Pilze, Ruhr-Universität Bochum, Universitätsstraße 150, 44780, Bochum, Germany.
| | - Michael E Hood
- Department of Biology, Amherst College, 01002-5000, Amherst, MA, USA
| | - Ricardo C Rodríguez de la Vega
- Ecologie Systématique Evolution, IDEEV, CNRS, Université Paris-Saclay, AgroParisTech, Bâtiment 680, 12 route RD128, 91190, Gif-sur-Yvette, France
| | - Tatiana Giraud
- Ecologie Systématique Evolution, IDEEV, CNRS, Université Paris-Saclay, AgroParisTech, Bâtiment 680, 12 route RD128, 91190, Gif-sur-Yvette, France
| |
Collapse
|
30
|
Soto DC, Uribe-Salazar JM, Shew CJ, Sekar A, McGinty S, Dennis MY. Genomic structural variation: A complex but important driver of human evolution. AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY 2023; 181 Suppl 76:118-144. [PMID: 36794631 PMCID: PMC10329998 DOI: 10.1002/ajpa.24713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2022] [Revised: 01/21/2023] [Accepted: 02/05/2023] [Indexed: 02/17/2023]
Abstract
Structural variants (SVs)-including duplications, deletions, and inversions of DNA-can have significant genomic and functional impacts but are technically difficult to identify and assay compared with single-nucleotide variants. With the aid of new genomic technologies, it has become clear that SVs account for significant differences across and within species. This phenomenon is particularly well-documented for humans and other primates due to the wealth of sequence data available. In great apes, SVs affect a larger number of nucleotides than single-nucleotide variants, with many identified SVs exhibiting population and species specificity. In this review, we highlight the importance of SVs in human evolution by (1) how they have shaped great ape genomes resulting in sensitized regions associated with traits and diseases, (2) their impact on gene functions and regulation, which subsequently has played a role in natural selection, and (3) the role of gene duplications in human brain evolution. We further discuss how to incorporate SVs in research, including the strengths and limitations of various genomic approaches. Finally, we propose future considerations in integrating existing data and biospecimens with the ever-expanding SV compendium propelled by biotechnology advancements.
Collapse
Affiliation(s)
- Daniela C. Soto
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - José M. Uribe-Salazar
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - Colin J. Shew
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - Aarthi Sekar
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - Sean McGinty
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| | - Megan Y. Dennis
- Genome Center, MIND Institute, and Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, USA
- Integrative Genetics and Genomics Graduate Group, University of California, Davis, CA, USA
| |
Collapse
|
31
|
Reitz D, Djeghmoum Y, Watson RA, Rajput P, Argueso JL, Heyer WD, Piazza A. Delineation of two multi-invasion-induced rearrangement pathways that differently affect genome stability. Genes Dev 2023; 37:621-639. [PMID: 37541760 PMCID: PMC10499017 DOI: 10.1101/gad.350618.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 07/14/2023] [Indexed: 08/06/2023]
Abstract
Punctuated bursts of structural genomic variations (SVs) have been described in various organisms, but their etiology remains incompletely understood. Homologous recombination (HR) is a template-guided mechanism of repair of DNA double-strand breaks and stalled or collapsed replication forks. We recently identified a DNA break amplification and genome rearrangement pathway originating from the endonucleolytic processing of a multi-invasion (MI) DNA joint molecule formed during HR. Genome-wide approaches confirmed that multi-invasion-induced rearrangement (MIR) frequently leads to several repeat-mediated SVs and aneuploidies. Using molecular and genetic analysis and a novel, highly sensitive proximity ligation-based assay for chromosomal rearrangement quantification, we further delineate two MIR subpathways. MIR1 is a universal pathway occurring in any sequence context, which generates secondary breaks and frequently leads to additional SVs. MIR2 occurs only if recombining donors exhibit substantial homology and results in sequence insertion without additional breaks or SVs. The most detrimental MIR1 pathway occurs late on a subset of persisting DNA joint molecules in a PCNA/Polδ-independent manner, unlike recombinational DNA synthesis. This work provides a refined mechanistic understanding of these HR-based SV formation pathways and shows that complex repeat-mediated SVs can occur without displacement DNA synthesis. Sequence signatures for inferring MIR1 from long-read data are proposed.
Collapse
Affiliation(s)
- Diedre Reitz
- Department of Microbiology and Molecular Genetics, University of California, Davis, Davis, California 95616, USA
| | - Yasmina Djeghmoum
- Laboratory of Biology and Modelling of the Cell (UMR5239), Ecole Normale Supérieure de Lyon, 69007 Lyon, France
| | - Ruth A Watson
- Department of Environmental and Radiological Health Sciences, Colorado State University, Fort Collins, Colorado 80523, USA
| | - Pallavi Rajput
- Department of Microbiology and Molecular Genetics, University of California, Davis, Davis, California 95616, USA
| | - Juan Lucas Argueso
- Department of Environmental and Radiological Health Sciences, Colorado State University, Fort Collins, Colorado 80523, USA
| | - Wolf-Dietrich Heyer
- Department of Microbiology and Molecular Genetics, University of California, Davis, Davis, California 95616, USA;
- Department of Molecular and Cellular Biology, University of California, Davis, Davis, California 95616, USA
| | - Aurèle Piazza
- Laboratory of Biology and Modelling of the Cell (UMR5239), Ecole Normale Supérieure de Lyon, 69007 Lyon, France;
| |
Collapse
|
32
|
Audano PA, Beck CR. Small allelic variants are a source of ancestral bias in structural variant breakpoint placement. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.25.546295. [PMID: 37425850 PMCID: PMC10327140 DOI: 10.1101/2023.06.25.546295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
High-quality genome assemblies and sophisticated algorithms have increased sensitivity for a wide range of variant types, and breakpoint accuracy for structural variants (SVs, ≥ 50 bp) has improved to near basepair precision. Despite these advances, many SVs in unique regions of the genome are subject to systematic bias that affects breakpoint location. This ambiguity leads to less accurate variant comparisons across samples, and it obscures true breakpoint features needed for mechanistic inferences. To understand why SVs are not consistently placed, we reanalyzed 64 phased haplotypes constructed from long-read assemblies released by the Human Genome Structural Variation Consortium (HGSVC). We identified variable breakpoints for 882 SV insertions and 180 SV deletions not anchored in tandem repeats (TRs) or segmental duplications (SDs). While this is unexpectedly high for genome assemblies in unique loci, we find read-based callsets from the same sequencing data yielded 1,566 insertions and 986 deletions with inconsistent breakpoints also not anchored in TRs or SDs. When we investigated causes for breakpoint inaccuracy, we found sequence and assembly errors had minimal impact, but we observed a strong effect of ancestry. We confirmed that polymorphic mismatches and small indels are enriched at shifted breakpoints and that these polymorphisms are generally lost when breakpoints shift. Long tracts of homology, such as SVs mediated by transposable elements, increase the likelihood of imprecise SV calls and the distance they are shifted. Tandem Duplication (TD) breakpoints are the most heavily affected SV class with 14% of TDs placed at different locations across haplotypes. While graph genome methods normalize SV calls across many samples, the resulting breakpoints are sometimes incorrect, highlighting a need to tune graph methods for breakpoint accuracy. The breakpoint inconsistencies we characterize collectively affect ~5% of the SVs called in a human genome and underscore a need for algorithm development to improve SV databases, mitigate the impact of ancestry on breakpoint placement, and increase the value of callsets for investigating mutational processes.
Collapse
Affiliation(s)
- Peter A Audano
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Christine R Beck
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, CT, USA
| |
Collapse
|
33
|
Girardini KN, Olthof AM, Kanadia RN. Introns: the "dark matter" of the eukaryotic genome. Front Genet 2023; 14:1150212. [PMID: 37260773 PMCID: PMC10228655 DOI: 10.3389/fgene.2023.1150212] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Accepted: 04/28/2023] [Indexed: 06/02/2023] Open
Abstract
The emergence of introns was a significant evolutionary leap that is a major distinguishing feature between prokaryotic and eukaryotic genomes. While historically introns were regarded merely as the sequences that are removed to produce spliced transcripts encoding functional products, increasingly data suggests that introns play important roles in the regulation of gene expression. Here, we use an intron-centric lens to review the role of introns in eukaryotic gene expression. First, we focus on intron architecture and how it may influence mechanisms of splicing. Second, we focus on the implications of spliceosomal snRNAs and their variants on intron splicing. Finally, we discuss how the presence of introns and the need to splice them influences transcription regulation. Despite the abundance of introns in the eukaryotic genome and their emerging role regulating gene expression, a lot remains unexplored. Therefore, here we refer to introns as the "dark matter" of the eukaryotic genome and discuss some of the outstanding questions in the field.
Collapse
Affiliation(s)
- Kaitlin N. Girardini
- Physiology and Neurobiology Department, University of Connecticut, Storrs, CT, United States
| | - Anouk M. Olthof
- Physiology and Neurobiology Department, University of Connecticut, Storrs, CT, United States
- Department of Cellular and Molecular Medicine, University of Copenhagen, Copenhagen, Denmark
| | - Rahul N. Kanadia
- Physiology and Neurobiology Department, University of Connecticut, Storrs, CT, United States
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, United States
| |
Collapse
|
34
|
Reitz D, Djeghmoum Y, Watson RA, Rajput P, Argueso JL, Heyer WD, Piazza A. Delineation of two multi-invasion-induced rearrangement pathways that differently affect genome stability. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.15.532751. [PMID: 36993162 PMCID: PMC10055120 DOI: 10.1101/2023.03.15.532751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Punctuated bursts of structural genomic variations (SVs) have been described in various organisms, but their etiology remains incompletely understood. Homologous recombination (HR) is a template-guided mechanism of repair of DNA double-strand breaks and stalled or collapsed replication forks. We recently identified a DNA break amplification and genome rearrangement pathway originating from the endonucleolytic processing of a multi-invasion (MI) DNA joint molecule formed during HR. Genome-wide sequencing approaches confirmed that multi-invasion-induced rearrangement (MIR) frequently leads to several repeat-mediated SVs and aneuploidies. Using molecular and genetic analysis, and a novel, highly sensitive proximity ligation-based assay for chromosomal rearrangement quantification, we further delineate two MIR sub-pathways. MIR1 is a universal pathway occurring in any sequence context, which generates secondary breaks and frequently leads to additional SVs. MIR2 occurs only if recombining donors exhibit substantial homology, and results in sequence insertion without additional break or SV. The most detrimental MIR1 pathway occurs late on a subset of persisting DNA joint molecules in a PCNA/Polδ-independent manner, unlike recombinational DNA synthesis. This work provides a refined mechanistic understanding of these HR-based SV formation pathways and shows that complex repeat-mediated SVs can occur without displacement DNA synthesis. Sequence signatures for inferring MIR1 from long-read data are proposed.
Collapse
Affiliation(s)
- Diedre Reitz
- Department of Microbiology and Molecular Genetics, One Shields Ave, University of California, Davis, CA 95616, USA
| | - Yasmina Djeghmoum
- Univ Lyon, ENS, UCBL, CNRS, INSERM, Laboratory of Biology and Modelling of the Cell, UMR5239, U 1210, F-69364, Lyon, France
| | - Ruth A. Watson
- Department of Environmental and Radiological Health Sciences, Colorado State University, Fort Collins, CO 80523
| | - Pallavi Rajput
- Department of Microbiology and Molecular Genetics, One Shields Ave, University of California, Davis, CA 95616, USA
| | - Juan Lucas Argueso
- Department of Environmental and Radiological Health Sciences, Colorado State University, Fort Collins, CO 80523
| | - Wolf-Dietrich Heyer
- Department of Microbiology and Molecular Genetics, One Shields Ave, University of California, Davis, CA 95616, USA
- Department of Molecular and Cellular Biology, One Shields Ave, University of California, Davis, CA 95616, USA
| | - Aurèle Piazza
- Univ Lyon, ENS, UCBL, CNRS, INSERM, Laboratory of Biology and Modelling of the Cell, UMR5239, U 1210, F-69364, Lyon, France
| |
Collapse
|