1
|
Zhang T, Tan S, Tang N, Li Y, Zhang C, Sun J, Guo Y, Gao H, Cai Y, Sun W, Wang C, Fu L, Ma H, Wu Y, Hu X, Zhang X, Gee P, Yan W, Zhao Y, Chen Q, Guo B, Wang H, Zhang YE. Heterologous survey of 130 DNA transposons in human cells highlights their functional divergence and expands the genome engineering toolbox. Cell 2024; 187:3741-3760.e30. [PMID: 38843831 DOI: 10.1016/j.cell.2024.05.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 03/11/2024] [Accepted: 05/02/2024] [Indexed: 07/14/2024]
Abstract
Experimental studies on DNA transposable elements (TEs) have been limited in scale, leading to a lack of understanding of the factors influencing transposition activity, evolutionary dynamics, and application potential as genome engineering tools. We predicted 130 active DNA TEs from 102 metazoan genomes and evaluated their activity in human cells. We identified 40 active (integration-competent) TEs, surpassing the cumulative number (20) of TEs found previously. With this unified comparative data, we found that the Tc1/mariner superfamily exhibits elevated activity, potentially explaining their pervasive horizontal transfers. Further functional characterization of TEs revealed additional divergence in features such as insertion bias. Remarkably, in CAR-T therapy for hematological and solid tumors, Mariner2_AG (MAG), the most active DNA TE identified, largely outperformed two widely used vectors, the lentiviral vector and the TE-based vector SB100X. Overall, this study highlights the varied transposition features and evolutionary dynamics of DNA TEs and increases the TE toolbox diversity.
Collapse
Affiliation(s)
- Tongtong Zhang
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China
| | - Shengjun Tan
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Na Tang
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China; Beijing Institute for Stem Cell and Regenerative Medicine, Beijing 100101, China
| | - Yuanqing Li
- University of Chinese Academy of Sciences, Beijing 100049, China; Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Chenze Zhang
- National Key Laboratory of Efficacy and Mechanism on Chinese Medicine for Metabolic Diseases, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Jing Sun
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China
| | - Yanyan Guo
- University of Chinese Academy of Sciences, Beijing 100049, China; Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Hui Gao
- Rengene Biotechnology Co., Ltd., Beijing 100036, China
| | - Yujia Cai
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China
| | - Wen Sun
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China; Beijing Institute for Stem Cell and Regenerative Medicine, Beijing 100101, China
| | - Chenxin Wang
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China; Beijing Institute for Stem Cell and Regenerative Medicine, Beijing 100101, China
| | - Liangzheng Fu
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China
| | - Huijing Ma
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Yachao Wu
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China
| | - Xiaoxuan Hu
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China
| | - Xuechun Zhang
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China
| | - Peter Gee
- MaxCyte Inc., Rockville, MD 20850, USA
| | - Weihua Yan
- Cold Spring Biotech Corp., Beijing 100031, China
| | - Yahui Zhao
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Qiang Chen
- Department of Biotherapy, Cancer Center and State Key Laboratory of Biotherapy, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Baocheng Guo
- University of Chinese Academy of Sciences, Beijing 100049, China; Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; Academy of Plateau Science and Sustainability, Qinghai Normal University, Xining 810008, China
| | - Haoyi Wang
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China; Beijing Institute for Stem Cell and Regenerative Medicine, Beijing 100101, China.
| | - Yong E Zhang
- University of Chinese Academy of Sciences, Beijing 100049, China; Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China.
| |
Collapse
|
2
|
Abraham LN, Oggenfuss U, Croll D. Population-level transposable element expression dynamics influence trait evolution in a fungal crop pathogen. mBio 2024; 15:e0284023. [PMID: 38349152 PMCID: PMC10936205 DOI: 10.1128/mbio.02840-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 01/22/2024] [Indexed: 03/14/2024] Open
Abstract
The rapid adaptive evolution of microbes is driven by strong selection pressure acting on genetic variation. How adaptive genetic variation is generated within species and how such variation influences phenotypic trait expression is often not well understood though. We focused on the recent activity of transposable elements (TEs) using deep population genomics and transcriptomics analyses of a fungal plant pathogen with a highly active content of TEs in the genome. Zymoseptoria tritici causes one of the most damaging diseases on wheat, with recent adaptation to the host and environment being facilitated by TE-associated mutations. We obtained genomic and RNA-sequencing data from 146 isolates collected from a single wheat field. We established a genome-wide map of TE insertion polymorphisms in the population by analyzing recent TE insertions among individuals. We quantified the locus-specific transcription of individual TE copies and found considerable population variation at individual TE loci in the population. About 20% of all TE copies show transcription in the genome suggesting that genomic defenses such as repressive epigenetic marks and repeat-induced polymorphisms are at least partially ineffective at preventing the proliferation of TEs in the genome. A quarter of recent TE insertions are associated with expression variation of neighboring genes providing broad potential to influence trait expression. We indeed found that TE insertions are likely responsible for variation in virulence on the host and potentially diverse components of secondary metabolite production. Our large-scale transcriptomics study emphasizes how TE-derived polymorphisms segregate even in individual microbial populations and can broadly underpin trait variation in pathogens.IMPORTANCEPathogens can rapidly adapt to new hosts, antimicrobials, or changes in the environment. Adaptation arises often from mutations in the genome; however, how such variation is generated remains poorly understood. We investigated the most dynamic regions of the genome of Zymoseptoria tritici, a major fungal pathogen of wheat. We focused on the transcription of transposable elements. A large proportion of the transposable elements not only show signatures of potential activity but are also variable within a single population of the pathogen. We find that this variation in activity is likely influencing many important traits of the pathogen. Hence, our work provides insights into how a microbial species can adapt over the shortest time periods based on the activity of transposable elements.
Collapse
Affiliation(s)
- Leen Nanchira Abraham
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Ursula Oggenfuss
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| |
Collapse
|
3
|
Pianezza R, Scarpa A, Narayanan P, Signor S, Kofler R. Spoink, a LTR retrotransposon, invaded D. melanogaster populations in the 1990s. PLoS Genet 2024; 20:e1011201. [PMID: 38530818 DOI: 10.1371/journal.pgen.1011201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 02/27/2024] [Indexed: 03/28/2024] Open
Abstract
During the last few centuries D. melanogaster populations were invaded by several transposable elements, the most recent of which was thought to be the P-element between 1950 and 1980. Here we describe a novel TE, which we named Spoink, that has invaded D. melanogaster. It is a 5216nt LTR retrotransposon of the Ty3/gypsy superfamily. Relying on strains sampled at different times during the last century we show that Spoink invaded worldwide D. melanogaster populations after the P-element between 1983 and 1993. This invasion was likely triggered by a horizontal transfer from the D. willistoni group, much as the P-element. Spoink is probably silenced by the piRNA pathway in natural populations and about 1/3 of the examined strains have an insertion into a canonical piRNA cluster such as 42AB. Given the degree of genetic investigation of D. melanogaster it is perhaps surprising that Spoink was able to invade unnoticed.
Collapse
Affiliation(s)
- Riccardo Pianezza
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
- Vienna Graduate School of Population Genetics, Vetmeduni Vienna, Vienna, Austria
| | - Almorò Scarpa
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
- Vienna Graduate School of Population Genetics, Vetmeduni Vienna, Vienna, Austria
| | - Prakash Narayanan
- Biological Sciences, North Dakota State University, Fargo, North Dakota, United States of America
| | - Sarah Signor
- Biological Sciences, North Dakota State University, Fargo, North Dakota, United States of America
| | - Robert Kofler
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
| |
Collapse
|
4
|
Kim ES, Rajan A, Chang K, Govindarajan S, Gulick C, English E, Rodriguez B, Bloomfield O, Nakada S, Beard C, O’Connor S, Mastroianni S, Downey E, Feigenbaum M, Tolentino C, Pace A, Khan M, Moon S, DiPrima J, Syed A, Lin F, Abukhadra Y, Bacon I, Beckerle J, Cho S, Donkor NE, Garberg L, Harrington A, Hoang M, Lawani N, Noori A, Park E, Parsons E, Oravitan P, Chen M, Molina C, Richmond C, Reddi A, Huang J, Shugrue C, Coviello R, Unver S, Indelicarto M, Islamovic E, McIlroy R, Yang A, Hamad M, Griffin E, Ahmed Z, Alla A, Fitzgerald P, Choi A, Das T, Cheng Y, Yu J, Roderiques T, Lee E, Liu L, Harper J, Wang J, Suhr C, Tan M, Luque J, Tam AR, Chen E, Triff M, Zimmermann L, Zhang E, Wood J, Clark K, Kpodonu N, Dey A, Ecker A, Chuang M, López RKS, Sun H, Wei Z, Stone H, Chi CYJ, Silvestri A, Orloff P, Nedumaran N, Zou A, Ünver L, Page O, Kim M, Chan TYT, Tulloch A, Hernandez A, Pillai A, Chen C, Chowdhury N, Huang L, Mudide A, Paik G, Wingate A, Quinn L, Conybere C, Baumgardt LL, Buckley R, Kolberg Z, Pattison R, Shazli AA, Ganske P, Sfragara L, Strub A, Collier B, Tamana H, Ravindran D, Howden J, Stewart M, Shimizu S, Braniff J, Fong M, Gutman L, Irvine D, Malholtra S, Medina J, Park J, Yin A, Abromavage H, Barrett B, Chen J, Cho R, Dilatush M, Gaw G, Gu C, Huang J, Kilby H, Markel E, McClure K, Phillips W, Polaski B, Roselli A, Saint-Cyr S, Shin E, Tatum K, Tumpunyawat T, Wetherill L, Ptaszynska S, Zeleznik M, Pesendorfer A, Nolan A, Tao J, Sammeta D, Nicholson L, Dinh GV, Foltz M, Vo A, Ross M, Tokarski A, Hariharan S, Wang E, Baziuk M, Tay A, Wong YHM, Floyd J, Cui A, Pierre K, Coppisetti N, Kutam M, Khurjekar D, Gadzi A, Gubbay B, Pedretti S, Belovich S, Yeung T, Fey M, Shaffer L, Li A, Beritela G, Huyghue K, Foster G, Durso-Finley G, Thierfelder Q, Kiernan H, Lenkowsky A, Thomas T, Cheng N, Chao O, L’Etoile-Goga P, King A, McKinley P, Read N, Milberg D, Lin L, Wong M, Gilman I, Brown S, Chen L, Kosai J, Verbinsky M, Belshaw-Hood A, Lee H, Zhou C, Lobo M, Tse A, Tran K, Lewis K, Sonawane P, Ngo J, Zuzga S, Chow L, Huynh V, Yang W, Lim S, Stites B, Chang S, Cruz-Balleza R, Pelta M, Kujawski S, Yuan C, Standen-Bloom E, Witt O, Anders K, Duane A, Huynh N, Lester B, Fung-Lee S, Fung M, Situ M, Canigiula P, Dijkgraaf M, Romero W, Baula SK, Wong K, Xu I, Martinez B, Nuygen R, Norris L, Nijensohn N, Altman N, Maajid E, Burkhardt O, Chanda J, Doscher C, Gopal A, Good A, Good J, Herrera N, Lanting L, Liem S, Marks A, McLaughlin E, Lee A, Mohr C, Patton E, Pyarali N, Oczon C, Richards D, Good N, Goss S, Khan A, Madonia R, Mitchell V, Sun N, Vranka T, Garcia D, Arroyo F, Morales E, Camey S, Cano G, Bernabe A, Arroyo J, Lopez Y, Gonzalez E, Zumba B, Garcia J, Vargas E, Trinidad A, Candelaria N, Valdez V, Campuzano F, Pereznegron E, Medrano J, Gutierrez J, Gutierrez E, Abrego ET, Gutierrez D, Ortiz C, Barnes A, Arms E, Mitchell L, Balanzá C, Bradford J, Detroy H, Ferguson D, Guillermo E, Manapragada A, Nanula D, Serna B, Singh K, Sramaty E, Wells B, Wiggins M, Dowling M, Schmadeke G, Cafferky S, Good S, Reese M, Fleig M, Gannett A, Cain C, Lee M, Oberto P, Rinehart J, Pan E, Mathis SA, Joiner J, Barr L, Evans CJ, Baena-Lopez A, Beatty A, Collette J, Smullen R, Suttie J, Chisholm T, Rotondo C, Lewis G, Turner V, Stark L, Fox E, Amirapu A, Park S, Lantz N, Rankin AE, Kim SK, Kockel L. Generation of LexA enhancer-trap lines in Drosophila by an international scholastic network. G3 (BETHESDA, MD.) 2023; 13:jkad124. [PMID: 37279923 PMCID: PMC10468311 DOI: 10.1093/g3journal/jkad124] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Revised: 05/05/2023] [Accepted: 05/10/2023] [Indexed: 06/08/2023]
Abstract
Conditional gene regulation in Drosophila through binary expression systems like the LexA-LexAop system provides a superb tool for investigating gene and tissue function. To increase the availability of defined LexA enhancer trap insertions, we present molecular, genetic, and tissue expression studies of 301 novel Stan-X LexA enhancer traps derived from mobilization of the index SX4 line. This includes insertions into distinct loci on the X, II, and III chromosomes that were not previously associated with enhancer traps or targeted LexA constructs, an insertion into ptc, and seventeen insertions into natural transposons. A subset of enhancer traps was expressed in CNS neurons known to produce and secrete insulin, an essential regulator of growth, development, and metabolism. Fly lines described here were generated and characterized through studies by students and teachers in an international network of genetics classes at public, independent high schools, and universities serving a diversity of students, including those underrepresented in science. Thus, a unique partnership between secondary schools and university-based programs has produced and characterized novel resources in Drosophila, establishing instructional paradigms devoted to unscripted experimental science.
Collapse
Affiliation(s)
- Ella S Kim
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Arjun Rajan
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Kathleen Chang
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | | | | | - Eva English
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | | | | | | | - Sarah O’Connor
- Commack High School, 1 Scholar Ln, Commack, NY 11725, USA
| | | | - Emma Downey
- Commack High School, 1 Scholar Ln, Commack, NY 11725, USA
| | | | | | - Abigail Pace
- Commack High School, 1 Scholar Ln, Commack, NY 11725, USA
| | - Marina Khan
- Commack High School, 1 Scholar Ln, Commack, NY 11725, USA
| | - Soyoun Moon
- Commack High School, 1 Scholar Ln, Commack, NY 11725, USA
| | - Jordan DiPrima
- Commack High School, 1 Scholar Ln, Commack, NY 11725, USA
| | - Amber Syed
- Commack High School, 1 Scholar Ln, Commack, NY 11725, USA
| | - Flora Lin
- Commack High School, 1 Scholar Ln, Commack, NY 11725, USA
| | | | | | | | - Sophia Cho
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | | | | | - Mai Hoang
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Nosa Lawani
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Ayush Noori
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Euwie Park
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | | | | | | | | | - Adith Reddi
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Jason Huang
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | | | - Selma Unver
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | | | | | - Alana Yang
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Mahdi Hamad
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | - Zara Ahmed
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Asha Alla
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | - Audrey Choi
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Tanya Das
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | - Joshua Yu
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | - Ethan Lee
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | | | - Jason Wang
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Chris Suhr
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Max Tan
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | | | - Emma Chen
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Max Triff
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | - Eric Zhang
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Jackie Wood
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | - Nat Kpodonu
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Antar Dey
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | | | | | - Harry Sun
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Zijing Wei
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Henry Stone
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | | | | | | | | | - Leyla Ünver
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Oscair Page
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | - Minseo Kim
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | | | | | | | | | | | - Lina Huang
- Phillips Exeter Academy, Exeter, NH 03833, USA
| | | | | | | | - Lily Quinn
- Haileybury School, Hertford SG13 7NU, UK
| | | | | | | | | | | | | | - Pia Ganske
- Haileybury School, Hertford SG13 7NU, UK
| | | | | | | | | | | | | | | | | | - Julia Braniff
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Melanie Fong
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Lucy Gutman
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Danny Irvine
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Sahil Malholtra
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Jillian Medina
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - John Park
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Alicia Yin
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | | | - Breanna Barrett
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Jacqueline Chen
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Rachelle Cho
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Mac Dilatush
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Gabriel Gaw
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Caitlin Gu
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Jupiter Huang
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Houston Kilby
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Ethan Markel
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Katie McClure
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - William Phillips
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Benjamin Polaski
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Amelia Roselli
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Soleil Saint-Cyr
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Ellie Shin
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Kylan Tatum
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Tai Tumpunyawat
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Lucia Wetherill
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Sara Ptaszynska
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Maddie Zeleznik
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | | | - Anna Nolan
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Jeffrey Tao
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Divya Sammeta
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Laney Nicholson
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Giao Vu Dinh
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Merrin Foltz
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - An Vo
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Maggie Ross
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Andrew Tokarski
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Samika Hariharan
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Elaine Wang
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Martha Baziuk
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Ashley Tay
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | | | - Jax Floyd
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Aileen Cui
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Kieran Pierre
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Nikita Coppisetti
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Matthew Kutam
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Dhruv Khurjekar
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Anthony Gadzi
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Ben Gubbay
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Sophia Pedretti
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Sofiya Belovich
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Tiffany Yeung
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Mercy Fey
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Layla Shaffer
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Arthur Li
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | | | - Kyle Huyghue
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Greg Foster
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | | | - Quinn Thierfelder
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Holly Kiernan
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Andrew Lenkowsky
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Tesia Thomas
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Nicole Cheng
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Olivia Chao
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Pia L’Etoile-Goga
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Alexa King
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Paris McKinley
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Nicole Read
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - David Milberg
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Leila Lin
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Melinda Wong
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Io Gilman
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Samantha Brown
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Lila Chen
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Jordyn Kosai
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Mark Verbinsky
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | | | - Honon Lee
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Cathy Zhou
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Maya Lobo
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Asia Tse
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Kyle Tran
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Kira Lewis
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Pratmesh Sonawane
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Jonathan Ngo
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Sophia Zuzga
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Lillian Chow
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Vianne Huynh
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Wenyi Yang
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Samantha Lim
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Brandon Stites
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Shannon Chang
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | | | - Michaela Pelta
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Stella Kujawski
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Christopher Yuan
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | | | - Oliver Witt
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Karina Anders
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Audrey Duane
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Nancy Huynh
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Benjamin Lester
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Samantha Fung-Lee
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Melanie Fung
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Mandy Situ
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Paolo Canigiula
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Matijs Dijkgraaf
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Wilbert Romero
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | | | - Kimberly Wong
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Ivana Xu
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | | | - Reena Nuygen
- Latin School of Chicago, 59 W North Blvd, Chicago, IL 60610, USA
| | - Lucy Norris
- Latin School of Chicago, 59 W North Blvd, Chicago, IL 60610, USA
| | - Noah Nijensohn
- Latin School of Chicago, 59 W North Blvd, Chicago, IL 60610, USA
| | - Naomi Altman
- Latin School of Chicago, 59 W North Blvd, Chicago, IL 60610, USA
| | - Elise Maajid
- Latin School of Chicago, 59 W North Blvd, Chicago, IL 60610, USA
| | | | | | | | - Alex Gopal
- Albuquerque Academy, Albuquerque, NM 87109, USA
| | - Aaron Good
- Albuquerque Academy, Albuquerque, NM 87109, USA
| | - Jonah Good
- Albuquerque Academy, Albuquerque, NM 87109, USA
| | | | | | - Sophia Liem
- Albuquerque Academy, Albuquerque, NM 87109, USA
| | - Anila Marks
- Albuquerque Academy, Albuquerque, NM 87109, USA
| | | | - Audrey Lee
- Albuquerque Academy, Albuquerque, NM 87109, USA
| | - Collin Mohr
- Albuquerque Academy, Albuquerque, NM 87109, USA
| | - Emma Patton
- Albuquerque Academy, Albuquerque, NM 87109, USA
| | | | | | | | - Nathan Good
- Albuquerque Academy, Albuquerque, NM 87109, USA
| | | | - Adeeb Khan
- Albuquerque Academy, Albuquerque, NM 87109, USA
| | | | | | - Natasha Sun
- Albuquerque Academy, Albuquerque, NM 87109, USA
| | | | | | | | | | | | | | | | | | | | | | - Bryan Zumba
- Pritzker College Prep, Chicago, IL 60639, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Jake Bradford
- Loyola Marymount University, Los Angeles, CA 90045, USA
| | | | | | | | | | | | | | - Khushi Singh
- Loyola Marymount University, Los Angeles, CA 90045, USA
| | - Emily Sramaty
- Loyola Marymount University, Los Angeles, CA 90045, USA
| | - Brian Wells
- Loyola Marymount University, Los Angeles, CA 90045, USA
| | | | - Melissa Dowling
- Latin School of Chicago, 59 W North Blvd, Chicago, IL 60610, USA
| | | | | | | | | | | | | | - Cory Cain
- Pritzker College Prep, Chicago, IL 60639, USA
| | - Melody Lee
- Harvard-Westlake School, Los Angeles, CA 90077, USA
| | | | | | | | | | | | - Leslie Barr
- Westtown School, West Chester, PA 19382, USA
| | - Cory J Evans
- Loyola Marymount University, Los Angeles, CA 90045, USA
| | | | - Andrea Beatty
- Commack High School, 1 Scholar Ln, Commack, NY 11725, USA
| | | | - Robert Smullen
- Commack High School, 1 Scholar Ln, Commack, NY 11725, USA
| | - Jeanne Suttie
- Commack High School, 1 Scholar Ln, Commack, NY 11725, USA
| | | | | | | | | | | | - Elizabeth Fox
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | - Anjana Amirapu
- Lowell High School, 1101 Eucalyptus Dr, San Francisco, CA 94132, USA
| | - Sangbin Park
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Nicole Lantz
- The Lawrenceville School, 2500 Main St, Lawrenceville, NJ 08648, USA
| | | | - Seung K Kim
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Lutz Kockel
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| |
Collapse
|
5
|
Chen J, Basting PJ, Han S, Garfinkel DJ, Bergman CM. Reproducible evaluation of transposable element detectors with McClintock 2 guides accurate inference of Ty insertion patterns in yeast. Mob DNA 2023; 14:8. [PMID: 37452430 PMCID: PMC10347736 DOI: 10.1186/s13100-023-00296-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 06/09/2023] [Indexed: 07/18/2023] Open
Abstract
BACKGROUND Many computational methods have been developed to detect non-reference transposable element (TE) insertions using short-read whole genome sequencing data. The diversity and complexity of such methods often present challenges to new users seeking to reproducibly install, execute, or evaluate multiple TE insertion detectors. RESULTS We previously developed the McClintock meta-pipeline to facilitate the installation, execution, and evaluation of six first-generation short-read TE detectors. Here, we report a completely re-implemented version of McClintock written in Python using Snakemake and Conda that improves its installation, error handling, speed, stability, and extensibility. McClintock 2 now includes 12 short-read TE detectors, auxiliary pre-processing and analysis modules, interactive HTML reports, and a simulation framework to reproducibly evaluate the accuracy of component TE detectors. When applied to the model microbial eukaryote Saccharomyces cerevisiae, we find substantial variation in the ability of McClintock 2 components to identify the precise locations of non-reference TE insertions, with RelocaTE2 showing the highest recall and precision in simulated data. We find that RelocaTE2, TEMP, TEMP2 and TEBreak provide consistent estimates of [Formula: see text]50 non-reference TE insertions per strain and that Ty2 has the highest number of non-reference TE insertions in a species-wide panel of [Formula: see text]1000 yeast genomes. Finally, we show that best-in-class predictors for yeast applied to resequencing data have sufficient resolution to reveal a dyad pattern of integration in nucleosome-bound regions upstream of yeast tRNA genes for Ty1, Ty2, and Ty4, allowing us to extend knowledge about fine-scale target preferences revealed previously for experimentally-induced Ty1 insertions to spontaneous insertions for other copia-superfamily retrotransposons in yeast. CONCLUSION McClintock ( https://github.com/bergmanlab/mcclintock/ ) provides a user-friendly pipeline for the identification of TEs in short-read WGS data using multiple TE detectors, which should benefit researchers studying TE insertion variation in a wide range of different organisms. Application of the improved McClintock system to simulated and empirical yeast genome data reveals best-in-class methods and novel biological insights for one of the most widely-studied model eukaryotes and provides a paradigm for evaluating and selecting non-reference TE detectors in other species.
Collapse
Affiliation(s)
- Jingxuan Chen
- Institute of Bioinformatics, University of Georgia, Athens, GA USA
| | | | - Shunhua Han
- Institute of Bioinformatics, University of Georgia, Athens, GA USA
| | - David J. Garfinkel
- Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA USA
| | - Casey M. Bergman
- Institute of Bioinformatics, University of Georgia, Athens, GA USA
- Department of Genetics, University of Georgia, Athens, GA USA
| |
Collapse
|
6
|
Feurtey A, Lorrain C, McDonald MC, Milgate A, Solomon PS, Warren R, Puccetti G, Scalliet G, Torriani SFF, Gout L, Marcel TC, Suffert F, Alassimone J, Lipzen A, Yoshinaga Y, Daum C, Barry K, Grigoriev IV, Goodwin SB, Genissel A, Seidl MF, Stukenbrock EH, Lebrun MH, Kema GHJ, McDonald BA, Croll D. A thousand-genome panel retraces the global spread and adaptation of a major fungal crop pathogen. Nat Commun 2023; 14:1059. [PMID: 36828814 PMCID: PMC9958100 DOI: 10.1038/s41467-023-36674-y] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Accepted: 02/10/2023] [Indexed: 02/26/2023] Open
Abstract
Human activity impacts the evolutionary trajectories of many species worldwide. Global trade of agricultural goods contributes to the dispersal of pathogens reshaping their genetic makeup and providing opportunities for virulence gains. Understanding how pathogens surmount control strategies and cope with new climates is crucial to predicting the future impact of crop pathogens. Here, we address this by assembling a global thousand-genome panel of Zymoseptoria tritici, a major fungal pathogen of wheat reported in all production areas worldwide. We identify the global invasion routes and ongoing genetic exchange of the pathogen among wheat-growing regions. We find that the global expansion was accompanied by increased activity of transposable elements and weakened genomic defenses. Finally, we find significant standing variation for adaptation to new climates encountered during the global spread. Our work shows how large population genomic panels enable deep insights into the evolutionary trajectory of a major crop pathogen.
Collapse
Affiliation(s)
- Alice Feurtey
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, CH-2000, Neuchâtel, Switzerland
- Plant Pathology, D-USYS, ETH Zurich, CH-8092, Zurich, Switzerland
- Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Cécile Lorrain
- Plant Pathology, D-USYS, ETH Zurich, CH-8092, Zurich, Switzerland
| | - Megan C McDonald
- Division of Plant Science, Research School of Biology, The Australian National University, Canberra, ACT, Australia
- School of Biosciences, Institute of Microbiology and Infection, University of Birmingham, Birmingham, UK
| | - Andrew Milgate
- NSW Department of Primary Industries, Wagga Wagga Agricultural Institute, Pine Gully Road, Wagga Wagga, NSW, 2650, Australia
| | - Peter S Solomon
- Division of Plant Science, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Rachael Warren
- The New Zealand Institute for Plant and Food Research Limited, Lincoln, New Zealand
| | - Guido Puccetti
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, CH-2000, Neuchâtel, Switzerland
- Syngenta Crop Protection AG, CH-4332, Stein, Switzerland
| | | | | | - Lilian Gout
- Université Paris Saclay, INRAE, UR BIOGER, 91120, Palaiseau, France
| | - Thierry C Marcel
- Université Paris Saclay, INRAE, UR BIOGER, 91120, Palaiseau, France
| | - Frédéric Suffert
- Université Paris Saclay, INRAE, UR BIOGER, 91120, Palaiseau, France
| | | | - Anna Lipzen
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Yuko Yoshinaga
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Christopher Daum
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Kerrie Barry
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Igor V Grigoriev
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, 9472, USA
| | | | - Anne Genissel
- Université Paris Saclay, INRAE, UR BIOGER, 91120, Palaiseau, France
| | - Michael F Seidl
- Wageningen University and Research, Laboratory of Phytopathology, Wageningen, The Netherlands
- Utrecht University, Theoretical Biology and Bioinformatics, Utrecht, The Netherlands
| | - Eva H Stukenbrock
- Max Planck Institute for Evolutionary Biology, Plön, Germany
- Environmental Genomics, Christian-Albrechts University of Kiel, Kiel, Germany
| | | | - Gert H J Kema
- Wageningen University and Research, Laboratory of Phytopathology, Wageningen, The Netherlands
| | - Bruce A McDonald
- Plant Pathology, D-USYS, ETH Zurich, CH-8092, Zurich, Switzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, CH-2000, Neuchâtel, Switzerland.
| |
Collapse
|
7
|
Chen J, Basting PJ, Han S, Garfinkel DJ, Bergman CM. Reproducible evaluation of short-read transposable element detectors and species-wide data mining of insertion patterns in yeast. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.13.528343. [PMID: 36824955 PMCID: PMC9948991 DOI: 10.1101/2023.02.13.528343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
Background Many computational methods have been developed to detect non-reference transposable element (TE) insertions using short-read whole genome sequencing data. The diversity and complexity of such methods often present challenges to new users seeking to reproducibly install, execute or evaluate multiple TE insertion detectors. Results We previously developed the McClintock meta-pipeline to facilitate the installation, execution, and evaluation of six first-generation short-read TE detectors. Here, we report a completely re-implemented version of McClintock written in Python using Snakemake and Conda that improves its installation, error handling, speed, stability, and extensibility. McClintock 2 now includes 12 short-read TE detectors, auxiliary pre-processing and analysis modules, interactive HTML reports, and a simulation framework to reproducibly evaluate the accuracy of component TE detectors. When applied to the model microbial eukaryote Saccharomyces cerevisiae , we find substantial variation in the ability of McClintock 2 components to identify the precise locations of non-reference TE insertions, with RelocaTE2 showing the highest recall and precision in simulated data. We find that RelocaTE2, TEMP, TEMP2 and TEBreak provide a consistent and biologically meaningful view of non-reference TE insertions in a species-wide panel of ∼ 1000 yeast genomes, as evaluated by coverage-based abundance estimates and expected patterns of tRNA promoter targeting. Finally, we show that best-in-class predictors for yeast have sufficient resolution to reveal a dyad pattern of integration in nucleosome-bound regions upstream of yeast tRNA genes for Ty1, Ty2, and Ty4, allowing us to extend knowledge aboutfine-scale target preferences first revealed experimentally for Ty1 to natural insertions and related copia -superfamily retrotransposons in yeast. Conclusion McClintock ( https://github.com/bergmanlab/mcclintock/ ) provides a user-friendly pipeline for the identification of TEs in short-read WGS data using multiple TE detectors, which should benefit researchers studying TE insertion variation in a wide range of different organisms. Application of the improved McClintock system to simulated and empirical yeast genome data reveals best-in-class methods and novel biological insights for one of the most widely-studied model eukaryotes and provides a paradigm for evaluating and selecting non-reference TE detectors for other species.
Collapse
|
8
|
Gozashti L, Roy SW, Thornlow B, Kramer A, Ares M, Corbett-Detig R. Transposable elements drive intron gain in diverse eukaryotes. Proc Natl Acad Sci U S A 2022; 119:e2209766119. [PMID: 36417430 PMCID: PMC9860276 DOI: 10.1073/pnas.2209766119] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 10/19/2022] [Indexed: 11/24/2022] Open
Abstract
There is massive variation in intron numbers across eukaryotic genomes, yet the major drivers of intron content during evolution remain elusive. Rapid intron loss and gain in some lineages contrast with long-term evolutionary stasis in others. Episodic intron gain could be explained by recently discovered specialized transposons called Introners, but so far Introners are only known from a handful of species. Here, we performed a systematic search across 3,325 eukaryotic genomes and identified 27,563 Introner-derived introns in 175 genomes (5.2%). Species with Introners span remarkable phylogenetic diversity, from animals to basal protists, representing lineages whose last common ancestor dates to over 1.7 billion years ago. Aquatic organisms were 6.5 times more likely to contain Introners than terrestrial organisms. Introners exhibit mechanistic diversity but most are consistent with DNA transposition, indicating that Introners have evolved convergently hundreds of times from nonautonomous transposable elements. Transposable elements and aquatic taxa are associated with high rates of horizontal gene transfer, suggesting that this combination of factors may explain the punctuated and biased diversity of species containing Introners. More generally, our data suggest that Introners may explain the episodic nature of intron gain across the eukaryotic tree of life. These results illuminate the major source of ongoing intron creation in eukaryotic genomes.
Collapse
Affiliation(s)
- Landen Gozashti
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA95064
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA95064
| | - Scott W. Roy
- Department of Biology, San Francisco State University, San Francisco, CA94117
| | - Bryan Thornlow
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA95064
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA95064
| | - Alexander Kramer
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA95064
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA95064
| | - Manuel Ares
- Department of Molecular, Cell, and Developmental Biology, University of California Santa Cruz, Santa Cruz, CA95064
| | - Russell Corbett-Detig
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA95064
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA95064
| |
Collapse
|
9
|
Ecovoiu AA, Bologa AM, Chifiriuc DIM, Ciuca AM, Constantin ND, Ghionoiu IC, Ghita IC, Ratiu AC. Genome ARTIST_v2-An Autonomous Bioinformatics Tool for Annotation of Natural Transposons in Sequenced Genomes. Int J Mol Sci 2022; 23:ijms232012686. [PMID: 36293549 PMCID: PMC9604107 DOI: 10.3390/ijms232012686] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 10/15/2022] [Accepted: 10/18/2022] [Indexed: 11/05/2022] Open
Abstract
The annotation of transposable elements (transposons) is a very dynamic field of genomics and various tools assigned to support this bioinformatics endeavor have been developed and described. Genome ARTIST v1.19 (GA_v1.19) software was conceived for mapping artificial transposons mobilized during insertional mutagenesis projects, but the new functions of GA_v2 qualify it as a tool for the mapping and annotation of natural transposons (NTs) in long reads, contigs and assembled genomes. The tabular export of mapping and annotation data for high-throughput data analysis, the generation of a list of flanking sequences around the coordinates of insertion or around the target site duplications and the computing of a consensus sequence for the flanking sequences are all key assets of GA_v2. Additionally, we developed a set of scripts that enable the user to annotate NTs, to harness annotations offered by FlyBase for Drosophila melanogaster genome, to convert sequence files from .fasta to .raw, and to extract junction query sequences essential for NTs mapping. Herein, we present the applicability of GA_v2 for a preliminary annotation of P-element and hobo class II NTs and copia retrotransposon in the genome of D. melanogaster strain Horezu_LaPeri (Horezu), Romania, which was sequenced with Nanopore technology in our laboratory. We used contigs assembled with Flye tool and a Q10 quality filter of the reads. Our results suggest that GA_v2 is a reliable autonomous tool able to perform mapping and annotation of NTs in genomes sequenced by long sequencing technology. GA_v2 is open-source software compatible with Linux, Mac OS and Windows and is available at GitHub repository and dedicated website.
Collapse
Affiliation(s)
- Alexandru Al. Ecovoiu
- Department of Genetics, Faculty of Biology, University of Bucharest, 060101 Bucharest, Romania
- Correspondence: (A.A.E.); (A.C.R.); Tel.: +40-725564416 (A.A.E.); +40-722250366 (A.C.R.)
| | - Alexandru Marian Bologa
- Department of Genetics, Faculty of Biology, University of Bucharest, 060101 Bucharest, Romania
| | | | | | | | | | - Iulian Cristian Ghita
- Department of Genetics, Faculty of Biology, University of Bucharest, 060101 Bucharest, Romania
| | - Attila Cristian Ratiu
- Department of Genetics, Faculty of Biology, University of Bucharest, 060101 Bucharest, Romania
- Correspondence: (A.A.E.); (A.C.R.); Tel.: +40-725564416 (A.A.E.); +40-722250366 (A.C.R.)
| |
Collapse
|
10
|
Han S, Dias GB, Basting PJ, Viswanatha R, Perrimon N, Bergman C. Local assembly of long reads enables phylogenomics of transposable elements in a polyploid cell line. Nucleic Acids Res 2022; 50:e124. [PMID: 36156149 PMCID: PMC9757076 DOI: 10.1093/nar/gkac794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 07/21/2022] [Accepted: 09/16/2022] [Indexed: 12/24/2022] Open
Abstract
Animal cell lines often undergo extreme genome restructuring events, including polyploidy and segmental aneuploidy that can impede de novo whole-genome assembly (WGA). In some species like Drosophila, cell lines also exhibit massive proliferation of transposable elements (TEs). To better understand the role of transposition during animal cell culture, we sequenced the genome of the tetraploid Drosophila S2R+ cell line using long-read and linked-read technologies. WGAs for S2R+ were highly fragmented and generated variable estimates of TE content across sequencing and assembly technologies. We therefore developed a novel WGA-independent bioinformatics method called TELR that identifies, locally assembles, and estimates allele frequency of TEs from long-read sequence data (https://github.com/bergmanlab/telr). Application of TELR to a ∼130x PacBio dataset for S2R+ revealed many haplotype-specific TE insertions that arose by transposition after initial cell line establishment and subsequent tetraploidization. Local assemblies from TELR also allowed phylogenetic analysis of paralogous TEs, which revealed that proliferation of TE families in vitro can be driven by single or multiple source lineages. Our work provides a model for the analysis of TEs in complex heterozygous or polyploid genomes that are recalcitrant to WGA and yields new insights into the mechanisms of genome evolution in animal cell culture.
Collapse
Affiliation(s)
| | | | - Preston J Basting
- Institute of Bioinformatics, University of Georgia, 120 E. Green St., Athens, GA, USA
| | - Raghuvir Viswanatha
- Department of Genetics, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, USA
| | - Norbert Perrimon
- Department of Genetics, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, USA,Howard Hughes Medical Institute, Boston, MA, USA
| | - Casey M Bergman
- To whom correspondence should be addressed. Tel: +1 706 542 1764; Fax: +1 706 542 3910;
| |
Collapse
|
11
|
Yan H, Haak DC, Li S, Huang L, Bombarely A. Exploring transposable element-based markers to identify allelic variations underlying agronomic traits in rice. PLANT COMMUNICATIONS 2022; 3:100270. [PMID: 35576152 PMCID: PMC9251385 DOI: 10.1016/j.xplc.2021.100270] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Revised: 10/29/2021] [Accepted: 12/16/2021] [Indexed: 06/10/2023]
Abstract
Transposable elements (TEs) are a major force in the production of new alleles during domestication; nevertheless, their use in association studies has been limited because of their complexity. We have developed a TE genotyping pipeline (TEmarker) and applied it to whole-genome genome-wide association study (GWAS) data from 176 Oryza sativa subsp. japonica accessions to identify genetic elements associated with specific agronomic traits. TE markers recovered a large proportion (69%) of single-nucleotide polymorphism (SNP)-based GWAS peaks, and these TE peaks retained ca. 25% of the SNPs. The use of TEs in GWASs may reduce false positives associated with linkage disequilibrium (LD) among SNP markers. A genome scan revealed positive selection on TEs associated with agronomic traits. We found several cases of insertion and deletion variants that potentially resulted from the direct action of TEs, including an allele of LOC_Os11g08410 associated with plant height and panicle length traits. Together, these findings reveal the utility of TE markers for connecting genotype to phenotype and suggest a potential role for TEs in influencing phenotypic variations in rice that impact agronomic traits.
Collapse
Affiliation(s)
- Haidong Yan
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA 24061, USA
| | - David C Haak
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA 24061, USA; Graduate Program in Genetics, Bioinformatics and Computational Biology (GBCB), Virginia Tech, Blacksburg, VA 24061, USA
| | - Song Li
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA 24061, USA; Graduate Program in Genetics, Bioinformatics and Computational Biology (GBCB), Virginia Tech, Blacksburg, VA 24061, USA
| | - Linkai Huang
- Department of Grassland Science, Animal Science and Technology College, Sichuan Agricultural University, Chengdu 611130, China
| | - Aureliano Bombarely
- Department of Bioscience, Universita degli Studi di Milano (UNIMI), 20133 Milano, Italy; Instituto de Biologıa Molecular y Celular de Plantas (IBMCP), UPV-CSIC, 46022 Valencia, Spain.
| |
Collapse
|
12
|
Wei KHC, Mai D, Chatla K, Bachtrog D. Dynamics and Impacts of Transposable Element Proliferation in the Drosophila nasuta Species Group Radiation. Mol Biol Evol 2022; 39:msac080. [PMID: 35485457 PMCID: PMC9075770 DOI: 10.1093/molbev/msac080] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Transposable element (TE) mobilization is a constant threat to genome integrity. Eukaryotic organisms have evolved robust defensive mechanisms to suppress their activity, yet TEs can escape suppression and proliferate, creating strong selective pressure for host defense to adapt. This genomic conflict fuels a never-ending arms race that drives the rapid evolution of TEs and recurrent positive selection of genes involved in host defense; the latter has been shown to contribute to postzygotic hybrid incompatibility. However, how TE proliferation impacts genome and regulatory divergence remains poorly understood. Here, we report the highly complete and contiguous (N50 = 33.8-38.0 Mb) genome assemblies of seven closely related Drosophila species that belong to the nasuta species group-a poorly studied group of flies that radiated in the last 2 My. We constructed a high-quality de novo TE library and gathered germline RNA-seq data, which allowed us to comprehensively annotate and compare TE insertion patterns between the species, and infer the evolutionary forces controlling their spread. We find a strong negative association between TE insertion frequency and expression of genes nearby; this likely reflects survivor bias from reduced fitness impact of TEs inserting near lowly expressed, nonessential genes, with limited TE-induced epigenetic silencing. Phylogenetic analyses of insertions of 147 TE families reveal that 53% of them show recent amplification in at least one species. The most highly amplified TE is a nonautonomous DNA element (Drosophila INterspersed Element; DINE) which has gone through multiple bouts of expansions with thousands of full-length copies littered throughout each genome. Across all TEs, we find that TEs expansions are significantly associated with high expression in the expanded species consistent with suppression escape. Thus, whereas horizontal transfer followed by the invasion of a naïve genome has been highlighted to explain the long-term survival of TEs, our analysis suggests that evasion of host suppression of resident TEs is a major strategy to persist over evolutionary times. Altogether, our results shed light on the heterogenous and context-dependent nature in which TEs affect gene regulation and the dynamics of rampant TE proliferation amidst a recently radiated species group.
Collapse
Affiliation(s)
- Kevin H.-C. Wei
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - Dat Mai
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - Kamalakar Chatla
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - Doris Bachtrog
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
| |
Collapse
|
13
|
Navarro-Dominguez B, Chang CH, Brand CL, Muirhead CA, Presgraves DC, Larracuente AM. Epistatic selection on a selfish Segregation Distorter supergene - drive, recombination, and genetic load. eLife 2022; 11:e78981. [PMID: 35486424 PMCID: PMC9122502 DOI: 10.7554/elife.78981] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Accepted: 04/20/2022] [Indexed: 11/13/2022] Open
Abstract
Meiotic drive supergenes are complexes of alleles at linked loci that together subvert Mendelian segregation resulting in preferential transmission. In males, the most common mechanism of drive involves the disruption of sperm bearing one of a pair of alternative alleles. While at least two loci are important for male drive-the driver and the target-linked modifiers can enhance drive, creating selection pressure to suppress recombination. In this work, we investigate the evolution and genomic consequences of an autosomal, multilocus, male meiotic drive system, Segregation Distorter (SD) in the fruit fly, Drosophila melanogaster. In African populations, the predominant SD chromosome variant, SD-Mal, is characterized by two overlapping, paracentric inversions on chromosome arm 2R and nearly perfect (~100%) transmission. We study the SD-Mal system in detail, exploring its components, chromosomal structure, and evolutionary history. Our findings reveal a recent chromosome-scale selective sweep mediated by strong epistatic selection for haplotypes carrying Sd, the main driving allele, and one or more factors within the double inversion. While most SD-Mal chromosomes are homozygous lethal, SD-Mal haplotypes can recombine with other, complementing haplotypes via crossing over, and with wildtype chromosomes via gene conversion. SD-Mal chromosomes have nevertheless accumulated lethal mutations, excess non-synonymous mutations, and excess transposable element insertions. Therefore, SD-Mal haplotypes evolve as a small, semi-isolated subpopulation with a history of strong selection. These results may explain the evolutionary turnover of SD haplotypes in different populations around the world and have implications for supergene evolution broadly.
Collapse
Affiliation(s)
| | - Ching-Ho Chang
- Department of Biology, University of RochesterRochesterUnited States
| | - Cara L Brand
- Department of Biology, University of RochesterRochesterUnited States
| | - Christina A Muirhead
- Department of Biology, University of RochesterRochesterUnited States
- Ronin InstituteMontclairUnited States
| | | | | |
Collapse
|
14
|
Rech GE, Radío S, Guirao-Rico S, Aguilera L, Horvath V, Green L, Lindstadt H, Jamilloux V, Quesneville H, González J. Population-scale long-read sequencing uncovers transposable elements associated with gene expression variation and adaptive signatures in Drosophila. Nat Commun 2022; 13:1948. [PMID: 35413957 PMCID: PMC9005704 DOI: 10.1038/s41467-022-29518-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 03/15/2022] [Indexed: 12/16/2022] Open
Abstract
High quality reference genomes are crucial to understanding genome function, structure and evolution. The availability of reference genomes has allowed us to start inferring the role of genetic variation in biology, disease, and biodiversity conservation. However, analyses across organisms demonstrate that a single reference genome is not enough to capture the global genetic diversity present in populations. In this work, we generate 32 high-quality reference genomes for the well-known model species D. melanogaster and focus on the identification and analysis of transposable element variation as they are the most common type of structural variant. We show that integrating the genetic variation across natural populations from five climatic regions increases the number of detected insertions by 58%. Moreover, 26% to 57% of the insertions identified using long-reads were missed by short-reads methods. We also identify hundreds of transposable elements associated with gene expression variation and new TE variants likely to contribute to adaptive evolution in this species. Our results highlight the importance of incorporating the genetic variation present in natural populations to genomic studies, which is essential if we are to understand how genomes function and evolve. Even in well-studied species, there is still substantial natural genetic variation that has not been characterized. Here, the authors use long read sequencing to discover transposable elements in the Drosophila genome not detected by short read sequencing, and link them to gene expression.
Collapse
Affiliation(s)
- Gabriel E Rech
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Santiago Radío
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Sara Guirao-Rico
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Laura Aguilera
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Vivien Horvath
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Llewellyn Green
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Hannah Lindstadt
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | | | | | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain.
| |
Collapse
|
15
|
Finding and Characterizing Repeats in Plant Genomes. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2443:327-385. [PMID: 35037215 DOI: 10.1007/978-1-0716-2067-0_18] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Plant genomes contain a particularly high proportion of repeated structures of various types. This chapter proposes a guided tour of the available software that can help biologists to scan automatically for these repeats in sequence data or check hypothetical models intended to characterize their structures. Since transposable elements (TEs) are a major source of repeats in plants, many methods have been used or developed for this broad class of sequences. They are representative of the range of tools available for other classes of repeats and we have provided two sections on this topic (for the analysis of genomes or directly of sequenced reads), as well as a selection of the main existing software. It may be hard to keep up with the profusion of proposals in this dynamic field and the rest of the chapter is devoted to the foundations of an efficient search for repeats and more complex patterns. We first introduce the key concepts of the art of indexing and mapping or querying sequences. We end the chapter with the more prospective issue of building models of repeat families. We present the Machine Learning approach first, seeking to build predictors automatically for some families of ET, from a set of sequences known to belong to this family. A second approach, the linguistic (or syntactic) approach, allows biologists to describe themselves and check the validity of models of their favorite repeat family.
Collapse
|
16
|
Han S, Basting PJ, Dias GB, Luhur A, Zelhof AC, Bergman CM. Transposable element profiles reveal cell line identity and loss of heterozygosity in Drosophila cell culture. Genetics 2021; 219:6321957. [PMID: 34849875 PMCID: PMC8633141 DOI: 10.1093/genetics/iyab113] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Accepted: 07/01/2021] [Indexed: 11/28/2022] Open
Abstract
Cell culture systems allow key insights into biological mechanisms yet suffer from irreproducible outcomes in part because of cross-contamination or mislabeling of cell lines. Cell line misidentification can be mitigated by the use of genotyping protocols, which have been developed for human cell lines but are lacking for many important model species. Here, we leverage the classical observation that transposable elements (TEs) proliferate in cultured Drosophila cells to demonstrate that genome-wide TE insertion profiles can reveal the identity and provenance of Drosophila cell lines. We identify multiple cases where TE profiles clarify the origin of Drosophila cell lines (Sg4, mbn2, and OSS_E) relative to published reports, and also provide evidence that insertions from only a subset of long-terminal repeat retrotransposon families are necessary to mark Drosophila cell line identity. We also develop a new bioinformatics approach to detect TE insertions and estimate intra-sample allele frequencies in legacy whole-genome sequencing data (called ngs_te_mapper2), which revealed loss of heterozygosity as a mechanism shaping the unique TE profiles that identify Drosophila cell lines. Our work contributes to the general understanding of the forces impacting metazoan genomes as they evolve in cell culture and paves the way for high-throughput protocols that use TE insertions to authenticate cell lines in Drosophila and other organisms.
Collapse
Affiliation(s)
- Shunhua Han
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Preston J Basting
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Guilherme B Dias
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA.,Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | - Arthur Luhur
- Drosophila Genomics Resource Center, Indiana University, Bloomington, IN 47405, USA.,Department of Biology, Indiana University, Bloomington, IN 47405, USA
| | - Andrew C Zelhof
- Drosophila Genomics Resource Center, Indiana University, Bloomington, IN 47405, USA.,Department of Biology, Indiana University, Bloomington, IN 47405, USA
| | - Casey M Bergman
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA.,Department of Genetics, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
17
|
Oggenfuss U, Badet T, Wicker T, Hartmann FE, Singh NK, Abraham L, Karisto P, Vonlanthen T, Mundt C, McDonald BA, Croll D. A population-level invasion by transposable elements triggers genome expansion in a fungal pathogen. eLife 2021; 10:e69249. [PMID: 34528512 PMCID: PMC8445621 DOI: 10.7554/elife.69249] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 08/28/2021] [Indexed: 12/16/2022] Open
Abstract
Genome evolution is driven by the activity of transposable elements (TEs). The spread of TEs can have deleterious effects including the destabilization of genome integrity and expansions. However, the precise triggers of genome expansions remain poorly understood because genome size evolution is typically investigated only among deeply divergent lineages. Here, we use a large population genomics dataset of 284 individuals from populations across the globe of Zymoseptoria tritici, a major fungal wheat pathogen. We built a robust map of genome-wide TE insertions and deletions to track a total of 2456 polymorphic loci within the species. We show that purifying selection substantially depressed TE frequencies in most populations, but some rare TEs have recently risen in frequency and likely confer benefits. We found that specific TE families have undergone a substantial genome-wide expansion from the pathogen's center of origin to more recently founded populations. The most dramatic increase in TE insertions occurred between a pair of North American populations collected in the same field at an interval of 25 years. We find that both genome-wide counts of TE insertions and genome size have increased with colonization bottlenecks. Hence, the demographic history likely played a major role in shaping genome evolution within the species. We show that both the activation of specific TEs and relaxed purifying selection underpin this incipient expansion of the genome. Our study establishes a model to recapitulate TE-driven genome evolution over deeper evolutionary timescales.
Collapse
Affiliation(s)
- Ursula Oggenfuss
- Laboratory of Evolutionary Genetics, Institute of Biology, University of NeuchâtelNeuchatelSwitzerland
| | - Thomas Badet
- Laboratory of Evolutionary Genetics, Institute of Biology, University of NeuchâtelNeuchatelSwitzerland
| | - Thomas Wicker
- Institute for Plant and Microbial Biology, University of ZurichZurichSwitzerland
| | - Fanny E Hartmann
- Ecologie Systématique Evolution, Bâtiment 360, Univ. Paris-Sud, AgroParisTech, CNRS, Université Paris-SaclayOrsayFrance
- Plant Pathology, Institute of Integrative Biology, ETH ZurichZurichSwitzerland
| | - Nikhil Kumar Singh
- Laboratory of Evolutionary Genetics, Institute of Biology, University of NeuchâtelNeuchatelSwitzerland
| | - Leen Abraham
- Laboratory of Evolutionary Genetics, Institute of Biology, University of NeuchâtelNeuchatelSwitzerland
| | - Petteri Karisto
- Plant Pathology, Institute of Integrative Biology, ETH ZurichZurichSwitzerland
| | - Tiziana Vonlanthen
- Plant Pathology, Institute of Integrative Biology, ETH ZurichZurichSwitzerland
| | - Christopher Mundt
- Department of Botany and Plant Pathology, Oregon State UniversityCorvallisUnited States
| | - Bruce A McDonald
- Plant Pathology, Institute of Integrative Biology, ETH ZurichZurichSwitzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of NeuchâtelNeuchatelSwitzerland
| |
Collapse
|
18
|
Pereira D, Oggenfuss U, McDonald BA, Croll D. Population genomics of transposable element activation in the highly repressive genome of an agricultural pathogen. Microb Genom 2021; 7:000540. [PMID: 34424154 PMCID: PMC8549362 DOI: 10.1099/mgen.0.000540] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 02/03/2021] [Indexed: 12/13/2022] Open
Abstract
The activity of transposable elements (TEs) can be an important driver of genetic diversity with TE-mediated mutations having a wide range of fitness consequences. To avoid deleterious effects of TE activity, some fungi have evolved highly sophisticated genomic defences to reduce TE proliferation across the genome. Repeat-induced point mutation (RIP) is a fungal-specific TE defence mechanism efficiently targeting duplicated sequences. The rapid accumulation of RIPs is expected to deactivate TEs over the course of a few generations. The evolutionary dynamics of TEs at the population level in a species with highly repressive genome defences is poorly understood. Here, we analyse 366 whole-genome sequences of Parastagonospora nodorum, a fungal pathogen of wheat with efficient RIP. A global population genomics analysis revealed high levels of genetic diversity and signs of frequent sexual recombination. Contrary to expectations for a species with RIP, we identified recent TE activity in multiple populations. The TE composition and copy numbers showed little divergence among global populations regardless of the demographic history. Miniature inverted-repeat transposable elements (MITEs) and terminal repeat retrotransposons in miniature (TRIMs) were largely underlying recent intra-species TE expansions. We inferred RIP footprints in individual TE families and found that recently active, high-copy TEs have possibly evaded genomic defences. We find no evidence that recent positive selection acted on TE-mediated mutations rather that purifying selection maintained new TE insertions at low insertion frequencies in populations. Our findings highlight the complex evolutionary equilibria established by the joint action of TE activity, selection and genomic repression.
Collapse
Affiliation(s)
- Danilo Pereira
- Plant Pathology, Institute of Integrative Biology, ETH Zürich, Zürich, Switzerland
- Present address: Max Planck Institute for Evolutionary Biology, August-Thienemann-Straße 2, D-24306 Plön, Germany
| | - Ursula Oggenfuss
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Bruce A. McDonald
- Plant Pathology, Institute of Integrative Biology, ETH Zürich, Zürich, Switzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| |
Collapse
|
19
|
Siudeja K, van den Beek M, Riddiford N, Boumard B, Wurmser A, Stefanutti M, Lameiras S, Bardin AJ. Unraveling the features of somatic transposition in the Drosophila intestine. EMBO J 2021; 40:e106388. [PMID: 33634906 PMCID: PMC8090852 DOI: 10.15252/embj.2020106388] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 01/20/2021] [Accepted: 01/27/2021] [Indexed: 12/22/2022] Open
Abstract
Transposable elements (TEs) play a significant role in evolution, contributing to genetic variation. However, TE mobilization in somatic cells is not well understood. Here, we address the prevalence of transposition in a somatic tissue, exploiting the Drosophila midgut as a model. Using whole-genome sequencing of in vivo clonally expanded gut tissue, we have mapped hundreds of high-confidence somatic TE integration sites genome-wide. We show that somatic retrotransposon insertions are associated with inactivation of the tumor suppressor Notch, likely contributing to neoplasia formation. Moreover, applying Oxford Nanopore long-read sequencing technology we provide evidence for tissue-specific differences in retrotransposition. Comparing somatic TE insertional activity with transcriptomic and small RNA sequencing data, we demonstrate that transposon mobility cannot be simply predicted by whole tissue TE expression levels or by small RNA pathway activity. Finally, we reveal that somatic TE insertions in the adult fly intestine are enriched in genic regions and in transcriptionally active chromatin. Together, our findings provide clear evidence of ongoing somatic transposition in Drosophila and delineate previously unknown features underlying somatic TE mobility in vivo.
Collapse
Affiliation(s)
- Katarzyna Siudeja
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| | - Marius van den Beek
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| | - Nick Riddiford
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| | - Benjamin Boumard
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| | - Annabelle Wurmser
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| | - Marine Stefanutti
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| | - Sonia Lameiras
- ICGex Next‐Generation Sequencing PlatformInstitut CuriePSL Research UniversityParisFrance
| | - Allison J Bardin
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| |
Collapse
|
20
|
Vogan AA, Ament-Velásquez SL, Bastiaans E, Wallerman O, Saupe SJ, Suh A, Johannesson H. The Enterprise, a massive transposon carrying Spok meiotic drive genes. Genome Res 2021; 31:789-798. [PMID: 33875482 PMCID: PMC8092012 DOI: 10.1101/gr.267609.120] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Accepted: 03/05/2021] [Indexed: 12/19/2022]
Abstract
The genomes of eukaryotes are full of parasitic sequences known as transposable elements (TEs). Here, we report the discovery of a putative giant tyrosine-recombinase-mobilized DNA transposon, Enterprise, from the model fungus Podospora anserina Previously, we described a large genomic feature called the Spok block which is notable due to the presence of meiotic drive genes of the Spok gene family. The Spok block ranges from 110 kb to 247 kb and can be present in at least four different genomic locations within P. anserina, despite what is an otherwise highly conserved genome structure. We propose that the reason for its varying positions is that the Spok block is not only capable of meiotic drive but is also capable of transposition. More precisely, the Spok block represents a unique case where the Enterprise has captured the Spoks, thereby parasitizing a resident genomic parasite to become a genomic hyperparasite. Furthermore, we demonstrate that Enterprise (without the Spoks) is found in other fungal lineages, where it can be as large as 70 kb. Lastly, we provide experimental evidence that the Spok block is deleterious, with detrimental effects on spore production in strains which carry it. This union of meiotic drivers and a transposon has created a selfish element of impressive size in Podospora, challenging our perception of how TEs influence genome evolution and broadening the horizons in terms of what the upper limit of transposition may be.
Collapse
Affiliation(s)
- Aaron A Vogan
- Systematic Biology, Department of Organismal Biology, Uppsala University, 752 36 Uppsala, Sweden
| | - S Lorena Ament-Velásquez
- Systematic Biology, Department of Organismal Biology, Uppsala University, 752 36 Uppsala, Sweden
| | - Eric Bastiaans
- Systematic Biology, Department of Organismal Biology, Uppsala University, 752 36 Uppsala, Sweden
- Laboratory of Genetics, Wageningen University, 6703 BD, Wageningen, The Netherlands
| | - Ola Wallerman
- Department of Medical Biochemistry and Microbiology, Comparative Genetics and Functional Genomics; Uppsala University, 752 37 Uppsala, Sweden
| | - Sven J Saupe
- IBGC, UMR 5095, CNRS Université de Bordeaux, 33077 Bordeaux Cedex, France
| | - Alexander Suh
- Systematic Biology, Department of Organismal Biology, Uppsala University, 752 36 Uppsala, Sweden
| | - Hanna Johannesson
- Systematic Biology, Department of Organismal Biology, Uppsala University, 752 36 Uppsala, Sweden
| |
Collapse
|
21
|
Gilbert C, Peccoud J, Cordaux R. Transposable Elements and the Evolution of Insects. ANNUAL REVIEW OF ENTOMOLOGY 2021; 66:355-372. [PMID: 32931312 DOI: 10.1146/annurev-ento-070720-074650] [Citation(s) in RCA: 55] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Insects are major contributors to our understanding of the interaction between transposable elements (TEs) and their hosts, owing to seminal discoveries, as well as to the growing number of sequenced insect genomes and population genomics and functional studies. Insect TE landscapes are highly variable both within and across insect orders, although phylogenetic relatedness appears to correlate with similarity in insect TE content. This correlation is unlikely to be solely due to inheritance of TEs from shared ancestors and may partly reflect preferential horizontal transfer of TEs between closely related species. The influence of insect traits on TE landscapes, however, remains unclear. Recent findings indicate that, in addition to being involved in insect adaptations and aging, TEs are seemingly at the cornerstone of insect antiviral immunity. Thus, TEs are emerging as essential insect symbionts that may have deleterious or beneficial consequences on their hosts, depending on context.
Collapse
Affiliation(s)
- Clément Gilbert
- Université Paris-Saclay, CNRS, IRD, UMR Évolution, Génomes, Comportement et Écologie, 91198 Gif-sur-Yvette, France;
| | - Jean Peccoud
- Laboratoire Ecologie et Biologie des Interactions, Equipe Ecologie Evolution Symbiose, Unité Mixte de Recherche 7267 Centre National de la Recherche Scientifique, Université de Poitiers, 86073 Poitiers CEDEX 9, France
| | - Richard Cordaux
- Laboratoire Ecologie et Biologie des Interactions, Equipe Ecologie Evolution Symbiose, Unité Mixte de Recherche 7267 Centre National de la Recherche Scientifique, Université de Poitiers, 86073 Poitiers CEDEX 9, France
| |
Collapse
|
22
|
Mair WJ, Thomas GJ, Dodhia K, Hills AL, Jayasena KW, Ellwood SR, Oliver RP, Lopez-Ruiz FJ. Parallel evolution of multiple mechanisms for demethylase inhibitor fungicide resistance in the barley pathogen Pyrenophora teres f. sp. maculata. Fungal Genet Biol 2020; 145:103475. [DOI: 10.1016/j.fgb.2020.103475] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Revised: 09/11/2020] [Accepted: 09/25/2020] [Indexed: 10/23/2022]
|
23
|
R2 and Non-Site-Specific R2-Like Retrotransposons of the German Cockroach, Blattella germanica. Genes (Basel) 2020; 11:genes11101202. [PMID: 33076367 PMCID: PMC7650587 DOI: 10.3390/genes11101202] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Revised: 10/10/2020] [Accepted: 10/12/2020] [Indexed: 11/17/2022] Open
Abstract
The structural and functional organization of the ribosomal RNA gene cluster and the full-length R2 non-LTR retrotransposon (integrated into a specific site of 28S ribosomal RNA genes) of the German cockroach, Blattella germanica, is described. A partial sequence of the R2 retrotransposon of the cockroach Rhyparobia maderae is also analyzed. The analysis of previously published next-generation sequencing data from the B. germanica genome reveals a new type of retrotransposon closely related to R2 retrotransposons but with a random distribution in the genome. Phylogenetic analysis reveals that these newly described retrotransposons form a separate clade. It is shown that proteins corresponding to the open reading frames of newly described retrotransposons exhibit unequal structural domains. Within these retrotransposons, a recombination event is described. New mechanism of transposition activity is discussed. The essential structural features of R2 retrotransposons are conserved in cockroaches and are typical of previously described R2 retrotransposons. However, the investigation of the number and frequency of 5′-truncated R2 retrotransposon insertion variants in eight B. germanica populations suggests recent mobile element activity. It is shown that the pattern of 5′-truncated R2 retrotransposon copies can be an informative molecular genetic marker for revealing genetic distances between insect populations.
Collapse
|
24
|
Son W, Choi KW. The Classic Lobe Eye Phenotype of Drosophila Is Caused by Transposon Insertion-Induced Misexpression of a Zinc-Finger Transcription Factor. Genetics 2020; 216:117-134. [PMID: 32641295 PMCID: PMC7463288 DOI: 10.1534/genetics.120.303486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Accepted: 07/06/2020] [Indexed: 11/18/2022] Open
Abstract
Drosophila Lobe (L) alleles were first discovered ∼100 years ago as spontaneous dominant mutants with characteristic developmental eye defects. However, the molecular basis for L dominant eye phenotypes has not been clearly understood. A previous work reported identification of CG10109/PRAS40 as the L gene, but subsequent analyses suggested that PRAS40 may not be related to L Here, we revisited the L gene to clarify this discrepancy and understand the basis for the dominance of L mutations. Genetic analysis localized the L gene to Oaz, which encodes a homolog of the vertebrate zinc finger protein 423 (Zfp423) family transcriptional regulators. We demonstrate that RNAi knockdown of Oaz almost completely restores all L dominant alleles tested. Lrev6-3 , a revertant allele of the L2 dominant eye phenotype, has an inframe deletion in the Oaz coding sequence. Molecular analysis of L dominant mutants identified allele-specific insertions of natural transposons (roo[ ]L1 , hopper[ ]L5 , and roo[ ]Lr ) or alterations of a preexisting transposon (L2 -specific mutations in roo[ ]Mohr) in the Oaz region. In addition, we generated additional L2 -reversion alleles by CRISPR targeting at Oaz These new loss-of-function Oaz mutations suppress the dominant L eye phenotype. Oaz protein is not expressed in wild-type eye disc but is expressed ectopically in L2/+ mutant eye disc. We induced male recombination between Oaz-GAL4 insertions and the L2 mutation through homologous recombination. By using the L2 -recombined GAL4 reporters, we show that Oaz-GAL4 is expressed ectopically in L2 eye imaginal disc. Taken together, our data suggest that neomorphic L eye phenotypes are likely due to misregulation of Oaz by spontaneous transposon insertions.
Collapse
Affiliation(s)
- Wonseok Son
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Korea
| | - Kwang-Wook Choi
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Korea
| |
Collapse
|
25
|
Mohamed M, Dang NTM, Ogyama Y, Burlet N, Mugat B, Boulesteix M, Mérel V, Veber P, Salces-Ortiz J, Severac D, Pélisson A, Vieira C, Sabot F, Fablet M, Chambeyron S. A Transposon Story: From TE Content to TE Dynamic Invasion of Drosophila Genomes Using the Single-Molecule Sequencing Technology from Oxford Nanopore. Cells 2020; 9:E1776. [PMID: 32722451 PMCID: PMC7465170 DOI: 10.3390/cells9081776] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2020] [Revised: 07/17/2020] [Accepted: 07/23/2020] [Indexed: 11/17/2022] Open
Abstract
Transposable elements (TEs) are the main components of genomes. However, due to their repetitive nature, they are very difficult to study using data obtained with short-read sequencing technologies. Here, we describe an efficient pipeline to accurately recover TE insertion (TEI) sites and sequences from long reads obtained by Oxford Nanopore Technology (ONT) sequencing. With this pipeline, we could precisely describe the landscapes of the most recent TEIs in wild-type strains of Drosophila melanogaster and Drosophila simulans. Their comparison suggests that this subset of TE sequences is more similar than previously thought in these two species. The chromosome assemblies obtained using this pipeline also allowed recovering piRNA cluster sequences, which was impossible using short-read sequencing. Finally, we used our pipeline to analyze ONT sequencing data from a D. melanogaster unstable line in which LTR transposition was derepressed for 73 successive generations. We could rely on single reads to identify new insertions with intact target site duplications. Moreover, the detailed analysis of TEIs in the wild-type strains and the unstable line did not support the trap model claiming that piRNA clusters are hotspots of TE insertions.
Collapse
Affiliation(s)
- Mourdas Mohamed
- Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France; (M.M.); (Y.O.); (B.M.); (A.P.)
| | - Nguyet Thi-Minh Dang
- IRD/UM UMR DIADE, 911 avenue Agropolis BP64501, 34394 Montpellier, France; (N.T.-M.D.); (F.S.)
| | - Yuki Ogyama
- Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France; (M.M.); (Y.O.); (B.M.); (A.P.)
| | - Nelly Burlet
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
| | - Bruno Mugat
- Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France; (M.M.); (Y.O.); (B.M.); (A.P.)
| | - Matthieu Boulesteix
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
| | - Vincent Mérel
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
| | - Philippe Veber
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
| | - Judit Salces-Ortiz
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, 08003 Barcelona, Spain
| | - Dany Severac
- MGX-Montpellier GenomiX, c/o Institut de Génomique Fonctionnelle, CNRS, INSERM, Université de Montpellier, 34094 Montpellier, France;
| | - Alain Pélisson
- Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France; (M.M.); (Y.O.); (B.M.); (A.P.)
| | - Cristina Vieira
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
| | - François Sabot
- IRD/UM UMR DIADE, 911 avenue Agropolis BP64501, 34394 Montpellier, France; (N.T.-M.D.); (F.S.)
| | - Marie Fablet
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
| | - Séverine Chambeyron
- Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France; (M.M.); (Y.O.); (B.M.); (A.P.)
| |
Collapse
|
26
|
Kofler R. piRNA Clusters Need a Minimum Size to Control Transposable Element Invasions. Genome Biol Evol 2020; 12:736-749. [PMID: 32219390 PMCID: PMC7259680 DOI: 10.1093/gbe/evaa064] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/23/2020] [Indexed: 12/21/2022] Open
Abstract
piRNA clusters are thought to repress transposable element (TE) activity in mammals and invertebrates. Here, we show that a simple population genetics model reveals a constraint on the size of piRNA clusters: The total size of the piRNA clusters of an organism must exceed 0.2% of a genome to repress TE invasions. Moreover, larger piRNA clusters accounting for up to 3% of the genome may be necessary when populations are small, transposition rates are high, and TE insertions are recessive. If piRNA clusters are too small, the load of deleterious TE insertions that accumulate during a TE invasion may drive populations extinct before an effective piRNA-based defense against the TE can be established. Our findings are solely based on three well-supported assumptions: 1) TEs multiply within genomes, 2) TEs are mostly deleterious, and 3) piRNA clusters act as transposon traps, where a single insertion in a cluster silences all TE copies in trans. Interestingly, the piRNA clusters of some species meet our observed minimum size requirements, whereas the clusters of other species do not. Species with small piRNA clusters, such as humans and mice, may experience severe fitness reductions during invasions of novel TEs, which is possibly even threatening the persistence of some populations. This work also raises the important question of how piRNA clusters evolve. We propose that the size of piRNA clusters may be at an equilibrium between evolutionary forces that act to expand and contract piRNA clusters.
Collapse
Affiliation(s)
- Robert Kofler
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria
| |
Collapse
|
27
|
Zhang S, Pointer B, Kelleher ES. Rapid evolution of piRNA-mediated silencing of an invading transposable element was driven by abundant de novo mutations. Genome Res 2020; 30:566-575. [PMID: 32238416 PMCID: PMC7197473 DOI: 10.1101/gr.251546.119] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Accepted: 03/24/2020] [Indexed: 11/24/2022]
Abstract
The regulation of transposable element (TE) activity by small RNAs is a ubiquitous feature of germlines. However, despite the obvious benefits to the host in terms of ensuring the production of viable gametes and maintaining the integrity of the genomes they carry, it remains controversial whether TE regulation evolves adaptively. We examined the emergence and evolutionary dynamics of repressor alleles after P-elements invaded the Drosophila melanogaster genome in the mid-twentieth century. In many animals including Drosophila, repressor alleles are produced by transpositional insertions into piRNA clusters, genomic regions encoding the Piwi-interacting RNAs (piRNAs) that regulate TEs. We discovered that ∼94% of recently collected isofemale lines in the Drosophila melanogaster Genetic Reference Panel (DGRP) contain at least one P-element insertion in a piRNA cluster, indicating that repressor alleles are produced by de novo insertion at an exceptional rate. Furthermore, in our sample of approximately 200 genomes, we uncovered no fewer than 80 unique P-element insertion alleles in at least 15 different piRNA clusters. Finally, we observe no footprint of positive selection on P-element insertions in piRNA clusters, suggesting that the rapid evolution of piRNA-mediated repression in D. melanogaster was driven primarily by mutation. Our results reveal for the first time how the unique genetic architecture of piRNA production, in which numerous piRNA clusters can encode regulatory small RNAs upon transpositional insertion, facilitates the nonadaptive rapid evolution of repression.
Collapse
Affiliation(s)
- Shuo Zhang
- Department of Biology and Biochemistry, University of Houston, Houston, Texas 77204, USA
| | - Beverly Pointer
- Department of Biology and Biochemistry, University of Houston, Houston, Texas 77204, USA
| | - Erin S Kelleher
- Department of Biology and Biochemistry, University of Houston, Houston, Texas 77204, USA
| |
Collapse
|
28
|
Abstract
Transposons are major genome constituents that can mobilize and trigger mutations, DNA breaks and chromosome rearrangements. Transposon silencing is particularly important in the germline, which is dedicated to transmission of the inherited genome. Piwi-interacting RNAs (piRNAs) guide a host defence system that transcriptionally and post-transcriptionally silences transposons during germline development. While germline control of transposons by the piRNA pathway is conserved, many piRNA pathway genes are evolving rapidly under positive selection, and the piRNA biogenesis machinery shows remarkable phylogenetic diversity. Conservation of core function combined with rapid gene evolution is characteristic of a host–pathogen arms race, suggesting that transposons and the piRNA pathway are engaged in an evolutionary tug of war that is driving divergence of the biogenesis machinery. Recent studies suggest that this process may produce biochemical incompatibilities that contribute to reproductive isolation and species divergence.
Collapse
Affiliation(s)
- Swapnil S Parhad
- Program in Molecular Medicine, University of Massachusetts Medical School , 373 Plantation Street, Worcester, MA 01605 , USA
| | - William E Theurkauf
- Program in Molecular Medicine, University of Massachusetts Medical School , 373 Plantation Street, Worcester, MA 01605 , USA
| |
Collapse
|
29
|
Cerbin S, Wai CM, VanBuren R, Jiang N. GingerRoot: A Novel DNA Transposon Encoding Integrase-Related Transposase in Plants and Animals. Genome Biol Evol 2020; 11:3181-3193. [PMID: 31633753 PMCID: PMC6839031 DOI: 10.1093/gbe/evz230] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/20/2019] [Indexed: 02/06/2023] Open
Abstract
Transposable elements represent the largest components of many eukaryotic genomes and different genomes harbor different combinations of elements. Here, we discovered a novel DNA transposon in the genome of the clubmoss Selaginella lepidophylla. Further searching for related sequences to the conserved DDE region uncovered the presence of this superfamily of elements in fish, coral, sea anemone, and other animal species. However, this element appears restricted to Bryophytes and Lycophytes in plants. This transposon, named GingerRoot, is associated with a 6 bp (base pair) target site duplication, and 100-150 bp terminal inverted repeats. Analysis of transposase sequences identified the DDE motif, a catalytic domain, which shows similarity to the integrase of Gypsy-like long terminal repeat retrotransposons, the most abundant component in plant genomes. A total of 77 intact and several hundred truncated copies of GingerRoot elements were identified in S. lepidophylla. Like Gypsy retrotransposons, GingerRoots show a lack of insertion preference near genes, which contrasts to the compact genome size of about 100 Mb. Nevertheless, a considerable portion of GingerRoot elements was found to carry gene fragments, suggesting the capacity of duplicating gene sequences is unlikely attributed to the proximity to genes. Elements carrying gene fragments appear to be less methylated, more diverged, and more distal to genes than those without gene fragments, indicating they are preferentially retained in gene-poor regions. This study has identified a broadly dispersed, novel DNA transposon, and the first plant DNA transposon with an integrase-related transposase, suggesting the possibility of de novo formation of Gypsy-like elements in plants.
Collapse
Affiliation(s)
- Stefan Cerbin
- Department of Horticulture, Michigan State University, East Lansing, MI 48824
| | - Ching Man Wai
- Department of Horticulture, Michigan State University, East Lansing, MI 48824
| | - Robert VanBuren
- Department of Horticulture, Michigan State University, East Lansing, MI 48824
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI 48824
| |
Collapse
|
30
|
Ellison CE, Cao W. Nanopore sequencing and Hi-C scaffolding provide insight into the evolutionary dynamics of transposable elements and piRNA production in wild strains of Drosophila melanogaster. Nucleic Acids Res 2020; 48:290-303. [PMID: 31754714 PMCID: PMC6943127 DOI: 10.1093/nar/gkz1080] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2019] [Revised: 10/29/2019] [Accepted: 11/01/2019] [Indexed: 01/29/2023] Open
Abstract
Illumina sequencing has allowed for population-level surveys of transposable element (TE) polymorphism via split alignment approaches, which has provided important insight into the population dynamics of TEs. However, such approaches are not able to identify insertions of uncharacterized TEs, nor can they assemble the full sequence of inserted elements. Here, we use nanopore sequencing and Hi-C scaffolding to produce de novo genome assemblies for two wild strains of Drosophila melanogaster from the Drosophila Genetic Reference Panel (DGRP). Ovarian piRNA populations and Illumina split-read TE insertion profiles have been previously produced for both strains. We find that nanopore sequencing with Hi-C scaffolding produces highly contiguous, chromosome-length scaffolds, and we identify hundreds of TE insertions that were missed by Illumina-based methods, including a novel micropia-like element that has recently invaded the DGRP population. We also find hundreds of piRNA-producing loci that are specific to each strain. Some of these loci are created by strain-specific TE insertions, while others appear to be epigenetically controlled. Our results suggest that Illumina approaches reveal only a portion of the repetitive sequence landscape of eukaryotic genomes and that population-level resequencing using long reads is likely to provide novel insight into the evolutionary dynamics of repetitive elements.
Collapse
Affiliation(s)
- Christopher E Ellison
- Department of Genetics, Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Weihuan Cao
- Department of Genetics, Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| |
Collapse
|
31
|
Miniature inverted-repeat transposable elements (MITEs), derived insertional polymorphism as a tool of marker systems for molecular plant breeding. Mol Biol Rep 2020; 47:3155-3167. [PMID: 32162128 DOI: 10.1007/s11033-020-05365-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Accepted: 02/29/2020] [Indexed: 12/20/2022]
Abstract
Plant molecular breeding is expected to give significant gains in cultivar development through development and utilization of suitable molecular marker systems for genetic diversity analysis, rapid DNA fingerprinting, identification of true hybrids, trait mapping and marker-assisted selection. Transposable elements (TEs) are the most abundant component in a genome and being used as genetic markers in the plant molecular breeding. Here, we review on the high copious transposable element belonging to class-II DNA TEs called "miniature inverted-repeat transposable elements" (MITEs). MITEs are ubiquitous, short and non-autonomous DNA transposable elements which have a tendency to insert into genes and genic regions have paved a way for the development of functional DNA marker systems in plant genomes. This review summarises the characteristics of MITEs, principles and methodologies for development of MITEs based DNA markers, bioinformatics tools and resources for plant MITE discovery and their utilization in crop improvement.
Collapse
|
32
|
Vendrell-Mir P, Barteri F, Merenciano M, González J, Casacuberta JM, Castanera R. A benchmark of transposon insertion detection tools using real data. Mob DNA 2019; 10:53. [PMID: 31892957 PMCID: PMC6937713 DOI: 10.1186/s13100-019-0197-9] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2019] [Accepted: 12/17/2019] [Indexed: 02/01/2023] Open
Abstract
Background Transposable elements (TEs) are an important source of genomic variability in eukaryotic genomes. Their activity impacts genome architecture and gene expression and can lead to drastic phenotypic changes. Therefore, identifying TE polymorphisms is key to better understand the link between genotype and phenotype. However, most genotype-to-phenotype analyses have concentrated on single nucleotide polymorphisms as they are easier to reliable detect using short-read data. Many bioinformatic tools have been developed to identify transposon insertions from resequencing data using short reads. Nevertheless, the performance of most of these tools has been tested using simulated insertions, which do not accurately reproduce the complexity of natural insertions. Results We have overcome this limitation by building a dataset of insertions from the comparison of two high-quality rice genomes, followed by extensive manual curation. This dataset contains validated insertions of two very different types of TEs, LTR-retrotransposons and MITEs. Using this dataset, we have benchmarked the sensitivity and precision of 12 commonly used tools, and our results suggest that in general their sensitivity was previously overestimated when using simulated data. Our results also show that, increasing coverage leads to a better sensitivity but with a cost in precision. Moreover, we found important differences in tool performance, with some tools performing better on a specific type of TEs. We have also used two sets of experimentally validated insertions in Drosophila and humans and show that this trend is maintained in genomes of different size and complexity. Conclusions We discuss the possible choice of tools depending on the goals of the study and show that the appropriate combination of tools could be an option for most approaches, increasing the sensitivity while maintaining a good precision.
Collapse
Affiliation(s)
- Pol Vendrell-Mir
- 1Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Bellaterra, 08193 Barcelona, Spain
| | - Fabio Barteri
- 1Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Bellaterra, 08193 Barcelona, Spain
| | - Miriam Merenciano
- 2Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Passeig Maritim Barceloneta 37-49, 08003 Barcelona, Spain
| | - Josefa González
- 2Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Passeig Maritim Barceloneta 37-49, 08003 Barcelona, Spain
| | - Josep M Casacuberta
- 1Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Bellaterra, 08193 Barcelona, Spain
| | - Raúl Castanera
- 1Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Bellaterra, 08193 Barcelona, Spain
| |
Collapse
|
33
|
Hickman AB, Voth AR, Ewis H, Li X, Craig NL, Dyda F. Structural insights into the mechanism of double strand break formation by Hermes, a hAT family eukaryotic DNA transposase. Nucleic Acids Res 2019; 46:10286-10301. [PMID: 30239795 PMCID: PMC6212770 DOI: 10.1093/nar/gky838] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Accepted: 09/07/2018] [Indexed: 12/12/2022] Open
Abstract
Some DNA transposons relocate from one genomic location to another using a mechanism that involves generating double-strand breaks at their transposon ends by forming hairpins on flanking DNA. The same double-strand break mode is employed by the V(D)J recombinase at signal-end/coding-end junctions during the generation of antibody diversity. How flanking hairpins are formed during DNA transposition has remained elusive. Here, we describe several co-crystal structures of the Hermes transposase bound to DNA that mimics the reaction step immediately prior to hairpin formation. Our results reveal a large DNA conformational change between the initial cleavage step and subsequent hairpin formation that changes which strand is acted upon by a single active site. We observed that two factors affect the conformational change: the complement of divalent metal ions bound by the catalytically essential DDE residues, and the identity of the –2 flanking base pair. Our data also provides a mechanistic link between the efficiency of hairpin formation (an A:T basepair is favored at the –2 position) and Hermes' strong target site preference. Furthermore, we have established that the histidine residue within a conserved C/DxxH motif present in many transposase families interacts directly with the scissile phosphate, suggesting a crucial role in catalysis.
Collapse
Affiliation(s)
- Alison B Hickman
- Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Andrea Regier Voth
- Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Hosam Ewis
- Howard Hughes Medical Institute, Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Xianghong Li
- Howard Hughes Medical Institute, Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Nancy L Craig
- Howard Hughes Medical Institute, Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Fred Dyda
- Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| |
Collapse
|
34
|
Merenciano M, Iacometti C, González J. A unique cluster of roo insertions in the promoter region of a stress response gene in Drosophila melanogaster. Mob DNA 2019; 10:10. [PMID: 30911338 PMCID: PMC6415491 DOI: 10.1186/s13100-019-0152-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2018] [Accepted: 02/28/2019] [Indexed: 01/19/2023] Open
Abstract
Transposable elements (TEs) are not randomly distributed in the genome. A genome-wide analysis of the D. melanogaster genome found that differences in TE density across 50 kb genomic regions was due both to transposition and duplication. At smaller genomic scales, promoter regions of hsp genes and the promoter region of CG18446 have been shown to accumulate TE insertions. In this work, we have further analyzed the promoter region of CG18446. We screened 218 strains collected in 15 natural populations, and we found that the CG18446 promoter region contains 20 independent roo insertions. Based on phylogenetic analysis, we suggest that the presence of multiple roo insertions in this region is likely to be the result of several bursts of transposition. Moreover, we found that the roo insertional cluster in the CG18446 promoter region is unique: no other promoter region in the genome contains a similar number of roo insertions. We found that, similar to hsp gene promoters, chromatin accessibility could be one of the factors explaining the recurrent insertions of roo elements in CG18446 promoter region.
Collapse
Affiliation(s)
- Miriam Merenciano
- 1Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Passeig Maritim de la Barceloneta 37,49, 08003 Barcelona, Spain
| | - Camillo Iacometti
- 1Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Passeig Maritim de la Barceloneta 37,49, 08003 Barcelona, Spain.,2Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
| | - Josefa González
- 1Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Passeig Maritim de la Barceloneta 37,49, 08003 Barcelona, Spain
| |
Collapse
|
35
|
Lerat E, Goubert C, Guirao‐Rico S, Merenciano M, Dufour A, Vieira C, González J. Population-specific dynamics and selection patterns of transposable element insertions in European natural populations. Mol Ecol 2019; 28:1506-1522. [PMID: 30506554 PMCID: PMC6849870 DOI: 10.1111/mec.14963] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Revised: 10/30/2018] [Accepted: 11/05/2018] [Indexed: 01/02/2023]
Abstract
Transposable elements (TEs) are ubiquitous sequences in genomes of virtually all species. While TEs have been investigated for several decades, only recently we have the opportunity to study their genome-wide population dynamics. Most of the studies so far have been restricted either to the analysis of the insertions annotated in the reference genome or to the analysis of a limited number of populations. Taking advantage of the European Drosophila population genomics consortium (DrosEU) sequencing data set, we have identified and measured the dynamics of TEs in a large sample of European Drosophila melanogaster natural populations. We showed that the mobilome landscape is population-specific and highly diverse depending on the TE family. In contrast with previous studies based on SNP variants, no geographical structure was observed for TE abundance or TE divergence in European populations. We further identified de novo individual insertions using two available programs and, as expected, most of the insertions were present at low frequencies. Nevertheless, we identified a subset of TEs present at high frequencies and located in genomic regions with a high recombination rate. These TEs are candidates for being the target of positive selection, although neutral processes should be discarded before reaching any conclusion on the type of selection acting on them. Finally, parallel patterns of association between the frequency of TE insertions and several geographical and temporal variables were found between European and North American populations, suggesting that TEs can be potentially implicated in the adaptation of populations across continents.
Collapse
Affiliation(s)
- Emmanuelle Lerat
- Laboratoire de Biométrie et Biologie EvolutiveUMR 5558Université de Lyon, Université Lyon 1, CNRSVilleurbanneFrance
| | - Clément Goubert
- Molecular Biology and GeneticsCornell UniversityIthacaNew York
| | - Sara Guirao‐Rico
- Institute of Evolutionary Biology (CSIC‐Universitat Pompeu Fabra)BarcelonaSpain
| | - Miriam Merenciano
- Institute of Evolutionary Biology (CSIC‐Universitat Pompeu Fabra)BarcelonaSpain
| | - Anne‐Béatrice Dufour
- Laboratoire de Biométrie et Biologie EvolutiveUMR 5558Université de Lyon, Université Lyon 1, CNRSVilleurbanneFrance
| | - Cristina Vieira
- Laboratoire de Biométrie et Biologie EvolutiveUMR 5558Université de Lyon, Université Lyon 1, CNRSVilleurbanneFrance
| | - Josefa González
- Institute of Evolutionary Biology (CSIC‐Universitat Pompeu Fabra)BarcelonaSpain
| |
Collapse
|
36
|
Bae J, Lee KW, Islam MN, Yim HS, Park H, Rho M. iMGEins: detecting novel mobile genetic elements inserted in individual genomes. BMC Genomics 2018; 19:944. [PMID: 30563451 PMCID: PMC6299635 DOI: 10.1186/s12864-018-5290-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2018] [Accepted: 11/20/2018] [Indexed: 11/10/2022] Open
Abstract
Background Recent advances in sequencing technology have allowed us to investigate personal genomes to find structural variations, which have been studied extensively to identify their association with the physiology of diseases such as cancer. In particular, mobile genetic elements (MGEs) are one of the major constituents of the human genomes, and cause genome instability by insertion, mutation, and rearrangement. Result We have developed a new program, iMGEins, to identify such novel MGEs by using sequencing reads of individual genomes, and to explore the breakpoints with the supporting reads and MGEs detected. iMGEins is the first MGE detection program that integrates three algorithmic components: discordant read-pair mapping, split-read mapping, and insertion sequence assembly. Our evaluation results showed its outstanding performance in detecting novel MGEs from simulated genomes, as well as real personal genomes. In detail, the average recall and precision rates of iMGEins are 96.67 and 100%, respectively, which are the highest among the programs compared. In the testing with real human genomes of the NA12878 sample, iMGEins shows the highest accuracy in detecting MGEs within 20 bp proximity of the breakpoints annotated. Conclusion In order to study the dynamics of MGEs in individual genomes, iMGEins was developed to accurately detect breakpoints and report inserted MGEs. Compared with other programs, iMGEins has valuable features of identifying novel MGEs and assembling the MGEs inserted. Electronic supplementary material The online version of this article (10.1186/s12864-018-5290-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Junwoo Bae
- Department of Electronics and Computer Engineering, Hanyang University, Seoul, Korea
| | - Kyeong Won Lee
- Marine Biotechnology Research Center, Korea Institute of Ocean Science and Technology, Ansan, Korea
| | - Mohammad Nazrul Islam
- Marine Biotechnology Research Center, Korea Institute of Ocean Science and Technology, Ansan, Korea.,Department of Marine Biotechnology, Korea University of Science and Technology, Daejeon, Korea.,Department of Biotechnology, Sher-e-Bangla Agricultural University, Dhaka, 1207, Bangladesh
| | - Hyung-Soon Yim
- Marine Biotechnology Research Center, Korea Institute of Ocean Science and Technology, Ansan, Korea.,Department of Marine Biotechnology, Korea University of Science and Technology, Daejeon, Korea
| | - Heejin Park
- Department of Computer Science and Engineering, Hanyang University, Seoul, Korea. .,Department of Biomedical Informatics, Hanyang University, Seoul, Korea.
| | - Mina Rho
- Department of Computer Science and Engineering, Hanyang University, Seoul, Korea. .,Department of Biomedical Informatics, Hanyang University, Seoul, Korea.
| |
Collapse
|
37
|
Lee CC, Wang J. Rapid Expansion of a Highly Germline-Expressed Mariner Element Acquired by Horizontal Transfer in the Fire Ant Genome. Genome Biol Evol 2018; 10:3262-3278. [PMID: 30304394 PMCID: PMC6307670 DOI: 10.1093/gbe/evy220] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/08/2018] [Indexed: 12/25/2022] Open
Abstract
Transposable elements (TEs) are present in almost all organisms and affect the host in various ways. TE activity can increase genomic variation and thereby affect host evolution. Currently active TEs are particularly interesting because they are likely generating new genomic diversity. These active TEs have been poorly studied outside of model organisms. In this study, we aimed to identify currently active TEs of a notorious invasive species, the red imported fire ant Solenopsis invicta. Using RNA profiling of male and female germline tissues, we found that the majority of TE-containing transcripts in the fire ant germline belong to the IS630-Tc1-Mariner superfamily. Subsequent genomic characterization of fire ant mariner content, molecular evolution analysis, and population comparisons revealed a highly expressed and highly polymorphic mariner element that is rapidly expanding in the fire ant genome. Additionally, using comparative genomics of multiple insect species we showed that this mariner has undergone several recent horizontal transfer events (<5.1 My). Our results document a rare case of a currently active TE originating from horizontal transfer.
Collapse
Affiliation(s)
- Chih-Chi Lee
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Laboratory of Insect Ecology, Division of Applied Biosciences, Graduate School of Agriculture, Kyoto University, Japan
- Research Institute for Sustainable Humanosphere, Kyoto University, Kyoto, Japan
| | - John Wang
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
38
|
Manee MM, Jackson J, Bergman CM. Conserved Noncoding Elements Influence the Transposable Element Landscape in Drosophila. Genome Biol Evol 2018; 10:1533-1545. [PMID: 29850787 PMCID: PMC6007792 DOI: 10.1093/gbe/evy104] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/22/2018] [Indexed: 12/15/2022] Open
Abstract
Highly conserved noncoding elements (CNEs) constitute a significant proportion of the genomes of multicellular eukaryotes. The function of most CNEs remains elusive, but growing evidence indicates they are under some form of purifying selection. Noncoding regions in many species also harbor large numbers of transposable element (TE) insertions, which are typically lineage specific and depleted in exons because of their deleterious effects on gene function or expression. However, it is currently unknown whether the landscape of TE insertions in noncoding regions is random or influenced by purifying selection on CNEs. Here, we combine comparative and population genomic data in Drosophila melanogaster to show that the abundance of TE insertions in intronic and intergenic CNEs is reduced relative to random expectation, supporting the idea that selective constraints on CNEs eliminate a proportion of TE insertions in noncoding regions. However, we find no evidence for differences in the allele frequency spectra for polymorphic TE insertions in CNEs versus those in unconstrained spacer regions, suggesting that the distribution of fitness effects acting on observable TE insertions is similar across different functional compartments in noncoding DNA. Our results provide evidence that selective constraints on CNEs contribute to shaping the landscape of TE insertion in eukaryotic genomes, and provide further evidence that CNEs are indeed functionally constrained and not simply mutational cold spots.
Collapse
Affiliation(s)
- Manee M Manee
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom.,National Center for Biotechnology, King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia.,Center of Excellence for Genomics (CEG), King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - John Jackson
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom.,Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Casey M Bergman
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom.,Department of Genetics, University of Georgia, Athens, GA.,Institute of Bioinformatics, University of Georgia, Athens, GA
| |
Collapse
|
39
|
|
40
|
Banuelos M, Sindi S. Modeling transposable element dynamics with fragmentation equations. Math Biosci 2018; 302:46-66. [DOI: 10.1016/j.mbs.2018.05.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Revised: 04/02/2018] [Accepted: 05/11/2018] [Indexed: 12/16/2022]
|
41
|
Serrato-Capuchina A, Matute DR. The Role of Transposable Elements in Speciation. Genes (Basel) 2018; 9:E254. [PMID: 29762547 PMCID: PMC5977194 DOI: 10.3390/genes9050254] [Citation(s) in RCA: 95] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2018] [Revised: 04/26/2018] [Accepted: 04/26/2018] [Indexed: 01/20/2023] Open
Abstract
Understanding the phenotypic and molecular mechanisms that contribute to genetic diversity between and within species is fundamental in studying the evolution of species. In particular, identifying the interspecific differences that lead to the reduction or even cessation of gene flow between nascent species is one of the main goals of speciation genetic research. Transposable elements (TEs) are DNA sequences with the ability to move within genomes. TEs are ubiquitous throughout eukaryotic genomes and have been shown to alter regulatory networks, gene expression, and to rearrange genomes as a result of their transposition. However, no systematic effort has evaluated the role of TEs in speciation. We compiled the evidence for TEs as potential causes of reproductive isolation across a diversity of taxa. We find that TEs are often associated with hybrid defects that might preclude the fusion between species, but that the involvement of TEs in other barriers to gene flow different from postzygotic isolation is still relatively unknown. Finally, we list a series of guides and research avenues to disentangle the effects of TEs on the origin of new species.
Collapse
Affiliation(s)
- Antonio Serrato-Capuchina
- Biology Department, Genome Sciences Building, University of North Carolina, Chapel Hill, NC 27514, USA.
| | - Daniel R Matute
- Biology Department, Genome Sciences Building, University of North Carolina, Chapel Hill, NC 27514, USA.
| |
Collapse
|
42
|
Bergman CM, Han S, Nelson MG, Bondarenko V, Kozeretska I. Genomic analysis of P elements in natural populations of Drosophila melanogaster. PeerJ 2017; 5:e3824. [PMID: 28929030 PMCID: PMC5602686 DOI: 10.7717/peerj.3824] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2017] [Accepted: 08/29/2017] [Indexed: 11/20/2022] Open
Abstract
The Drosophila melanogaster P transposable element provides one of the best cases of horizontal transfer of a mobile DNA sequence in eukaryotes. Invasion of natural populations by the P element has led to a syndrome of phenotypes known as P-M hybrid dysgenesis that emerges when strains differing in their P element composition mate and produce offspring. Despite extensive research on many aspects of P element biology, many questions remain about the genomic basis of variation in P-M dysgenesis phenotypes across populations. Here we compare estimates of genomic P element content with gonadal dysgenesis phenotypes for isofemale strains obtained from three worldwide populations of D. melanogaster to illuminate the molecular basis of natural variation in cytotype status. We show that P element abundance estimated from genome sequences of isofemale strains is highly correlated across different bioinformatics approaches, but that abundance estimates are sensitive to method and filtering strategies as well as incomplete inbreeding of isofemale strains. We find that P element content varies significantly across populations, with strains from a North American population having fewer P elements but a higher proportion of full-length elements than strains from populations sampled in Europe or Africa. Despite these geographic differences in P element abundance and structure, neither the number of P elements nor the ratio of full-length to internally-truncated copies is strongly correlated with the degree of gonadal dysgenesis exhibited by an isofemale strain. Thus, variation in P element abundance and structure across different populations does not necessarily lead to corresponding geographic differences in gonadal dysgenesis phenotypes. Finally, we confirm that population differences in the abundance and structure of P elements that are observed from isofemale lines can also be observed in pool-seq samples from the same populations. Our work supports the view that genomic P element content alone is not sufficient to explain variation in gonadal dysgenesis across strains of D. melanogaster, and informs future efforts to decode the genomic basis of geographic and temporal differences in P element induced phenotypes.
Collapse
Affiliation(s)
- Casey M Bergman
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom.,Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA, United States of America
| | - Shunhua Han
- Institute of Bioinformatics, University of Georgia, Athens, GA, United States of America
| | - Michael G Nelson
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
| | - Vladyslav Bondarenko
- Department of General and Molecular Genetics, Taras Shevchenko University of Kyiv, Kyiv, Ukraine
| | - Iryna Kozeretska
- Department of General and Molecular Genetics, Taras Shevchenko University of Kyiv, Kyiv, Ukraine
| |
Collapse
|
43
|
McClintock: An Integrated Pipeline for Detecting Transposable Element Insertions in Whole-Genome Shotgun Sequencing Data. G3-GENES GENOMES GENETICS 2017. [PMID: 28637810 PMCID: PMC5555480 DOI: 10.1534/g3.117.043893] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Transposable element (TE) insertions are among the most challenging types of variants to detect in genomic data because of their repetitive nature and complex mechanisms of replication . Nevertheless, the recent availability of large resequencing data sets has spurred the development of many new methods to detect TE insertions in whole-genome shotgun sequences. Here we report an integrated bioinformatics pipeline for the detection of TE insertions in whole-genome shotgun data, called McClintock (https://github.com/bergmanlab/mcclintock), which automatically runs and standardizes output for multiple TE detection methods. We demonstrate the utility of McClintock by evaluating six TE detection methods using simulated and real genome data from the model microbial eukaryote, Saccharomyces cerevisiae We find substantial variation among McClintock component methods in their ability to detect nonreference TEs in the yeast genome, but show that nonreference TEs at nearly all biologically realistic locations can be detected in simulated data by combining multiple methods that use split-read and read-pair evidence. In general, our results reveal that split-read methods detect fewer nonreference TE insertions than read-pair methods, but generally have much higher positional accuracy. Analysis of a large sample of real yeast genomes reveals that most McClintock component methods can recover known aspects of TE biology in yeast such as the transpositional activity status of families, target preferences, and target site duplication structure, albeit with varying levels of accuracy. Our work provides a general framework for integrating and analyzing results from multiple TE detection methods, as well as useful guidance for researchers studying TEs in yeast resequencing data.
Collapse
|
44
|
Cis- and Trans-regulatory Effects on Gene Expression in a Natural Population of Drosophila melanogaster. Genetics 2017; 206:2139-2148. [PMID: 28615283 DOI: 10.1534/genetics.117.201459] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Accepted: 06/06/2017] [Indexed: 12/30/2022] Open
Abstract
Cis- and trans-regulatory mutations are important contributors to transcriptome evolution. Quantifying their relative contributions to intraspecific variation in gene expression is essential for understanding the population genetic processes that underlie evolutionary changes in gene expression. Here, we have examined this issue by quantifying genome-wide, allele-specific expression (ASE) variation using a crossing scheme that produces F1 hybrids between 18 different Drosophila melanogaster strains sampled from the Drosophila Genetic Reference Panel and a reference strain from another population. Head and body samples from F1 adult females were subjected to RNA sequencing and the subsequent ASE quantification. Cis- and trans-regulatory effects on expression variation were estimated from these data. A higher proportion of genes showed significant cis-regulatory variation (∼28%) than those that showed significant trans-regulatory variation (∼9%). The sizes of cis-regulatory effects on expression variation were 1.98 and 1.88 times larger than trans-regulatory effects in heads and bodies, respectively. A generalized linear model analysis revealed that both cis- and trans-regulated expression variation was strongly associated with nonsynonymous nucleotide diversity and tissue specificity. Interestingly, trans-regulated variation showed a negative correlation with local recombination rate. Also, our analysis on proximal transposable element (TE) insertions suggested that they affect transcription levels of ovary-expressed genes more pronouncedly than genes not expressed in the ovary, possibly due to defense mechanisms against TE mobility in the germline. Collectively, our detailed quantification of ASE variations from a natural population has revealed a number of new relationships between genomic factors and the effects of cis- and trans-regulatory factors on expression variation.
Collapse
|
45
|
Mechanisms of LTR-Retroelement Transposition: Lessons from Drosophila melanogaster. Viruses 2017; 9:v9040081. [PMID: 28420154 PMCID: PMC5408687 DOI: 10.3390/v9040081] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2017] [Revised: 04/06/2017] [Accepted: 04/10/2017] [Indexed: 01/02/2023] Open
Abstract
Long terminal repeat (LTR) retrotransposons occupy a special place among all mobile genetic element families. The structure of LTR retrotransposons that have three open reading frames is identical to DNA forms of retroviruses that are integrated into the host genome. Several lines of evidence suggest that LTR retrotransposons share a common ancestry with retroviruses and thus are highly relevant to understanding mechanisms of transposition. Drosophila melanogaster is an exceptionally convenient model for studying the mechanisms of retrotransposon movement because many such elements in its genome are transpositionally active. Moreover, two LTRretrotransposons of D. melanogaster, gypsy and ZAM, have been found to have infectious properties and have been classified as errantiviruses. Despite numerous studies focusing on retroviral integration process, there is still no clear understanding of integration specificity in a target site. Most LTR retrotransposons non-specifically integrate into a target site. Site-specificity of integration at vertebrate retroviruses is rather relative. At the same time, sequence-specific integration is the exclusive property of errantiviruses and their derivatives with two open reading frames. The possible basis for the errantivirus integration specificity is discussed in the present review.
Collapse
|
46
|
Disdero E, Filée J. LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences. Mob DNA 2017; 8:5. [PMID: 28405230 PMCID: PMC5385071 DOI: 10.1186/s13100-017-0088-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2017] [Accepted: 03/17/2017] [Indexed: 02/03/2023] Open
Abstract
Background Population genomic analysis of transposable elements has greatly benefited from recent advances of sequencing technologies. However, the short size of the reads and the propensity of transposable elements to nest in highly repeated regions of genomes limits the efficiency of bioinformatic tools when Illumina or 454 technologies are used. Fortunately, long read sequencing technologies generating read length that may span the entire length of full transposons are now available. However, existing TE population genomic softwares were not designed to handle long reads and the development of new dedicated tools is needed. Results LoRTE is the first tool able to use PacBio long read sequences to identify transposon deletions and insertions between a reference genome and genomes of different strains or populations. Tested against simulated and genuine Drosophila melanogaster PacBio datasets, LoRTE appears to be a reliable and broadly applicable tool to study the dynamic and evolutionary impact of transposable elements using low coverage, long read sequences. Conclusions LoRTE is an efficient and accurate tool to identify structural genomic variants caused by TE insertion or deletion. LoRTE is available for download at http://www.egce.cnrs-gif.fr/?p=6422
Collapse
Affiliation(s)
- Eric Disdero
- Laboratoire Evolution, Génomes, Comportement, Ecologie; CNRS, IRD, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Jonathan Filée
- Laboratoire Evolution, Génomes, Comportement, Ecologie; CNRS, IRD, Université Paris-Saclay, Gif-sur-Yvette, France
| |
Collapse
|
47
|
Integration site selection by retroviruses and transposable elements in eukaryotes. Nat Rev Genet 2017; 18:292-308. [PMID: 28286338 DOI: 10.1038/nrg.2017.7] [Citation(s) in RCA: 145] [Impact Index Per Article: 20.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Transposable elements and retroviruses are found in most genomes, can be pathogenic and are widely used as gene-delivery and functional genomics tools. Exploring whether these genetic elements target specific genomic sites for integration and how this preference is achieved is crucial to our understanding of genome evolution, somatic genome plasticity in cancer and ageing, host-parasite interactions and genome engineering applications. High-throughput profiling of integration sites by next-generation sequencing, combined with large-scale genomic data mining and cellular or biochemical approaches, has revealed that the insertions are usually non-random. The DNA sequence, chromatin and nuclear context, and cellular proteins cooperate in guiding integration in eukaryotic genomes, leading to a remarkable diversity of insertion site distribution and evolutionary strategies.
Collapse
|
48
|
Tan S, Cardoso-Moreira M, Shi W, Zhang D, Huang J, Mao Y, Jia H, Zhang Y, Chen C, Shao Y, Leng L, Liu Z, Huang X, Long M, Zhang YE. LTR-mediated retroposition as a mechanism of RNA-based duplication in metazoans. Genome Res 2016; 26:1663-1675. [PMID: 27934698 PMCID: PMC5131818 DOI: 10.1101/gr.204925.116] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2016] [Accepted: 10/18/2016] [Indexed: 01/09/2023]
Abstract
In a broad range of taxa, genes can duplicate through an RNA intermediate in a process mediated by retrotransposons (retroposition). In mammals, L1 retrotransposons drive retroposition, but the elements responsible for retroposition in other animals have yet to be identified. Here, we examined young retrocopies from various animals that still retain the sequence features indicative of the underlying retroposition mechanism. In Drosophila melanogaster, we identified and de novo assembled 15 polymorphic retrocopies and found that all retroposed loci are chimeras of internal retrocopies flanked by discontinuous LTR retrotransposons. At the fusion points between the mRNAs and the LTR retrotransposons, we identified shared short similar sequences that suggest the involvement of microsimilarity-dependent template switches. By expanding our approach to mosquito, zebrafish, chicken, and mammals, we identified in all these species recently originated retrocopies with a similar chimeric structure and shared microsimilarities at the fusion points. We also identified several retrocopies that combine the sequences of two or more parental genes, demonstrating LTR-retroposition as a novel mechanism of exon shuffling. Finally, we found that LTR-mediated retrocopies are immediately cotranscribed with their flanking LTR retrotransposons. Transcriptional profiling coupled with sequence analyses revealed that the sense-strand transcription of the retrocopies often lead to the origination of in-frame proteins relative to the parental genes. Overall, our data show that LTR-mediated retroposition is highly conserved across a wide range of animal taxa; combined with previous work from plants and yeast, it represents an ancient and ongoing mechanism continuously shaping gene content evolution in eukaryotes.
Collapse
Affiliation(s)
- Shengjun Tan
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | | | - Wenwen Shi
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Dan Zhang
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jiawei Huang
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yanan Mao
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Hangxing Jia
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yaqiong Zhang
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Chunyan Chen
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yi Shao
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Liang Leng
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhonghua Liu
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Xun Huang
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois 60637, USA
| | - Yong E Zhang
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
49
|
Merenciano M, Ullastres A, de Cara MAR, Barrón MG, González J. Multiple Independent Retroelement Insertions in the Promoter of a Stress Response Gene Have Variable Molecular and Functional Effects in Drosophila. PLoS Genet 2016; 12:e1006249. [PMID: 27517860 PMCID: PMC4982627 DOI: 10.1371/journal.pgen.1006249] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2015] [Accepted: 07/18/2016] [Indexed: 12/20/2022] Open
Abstract
Promoters are structurally and functionally diverse gene regulatory regions. The presence or absence of sequence motifs and the spacing between the motifs defines the properties of promoters. Recent alternative promoter usage analyses in Drosophila melanogaster revealed that transposable elements significantly contribute to promote diversity. In this work, we analyzed in detail one of the transposable element insertions, named FBti0019985, that has been co-opted to drive expression of CG18446, a candidate stress response gene. We analyzed strains from different natural populations and we found that besides FBti0019985, there are another eight independent transposable elements inserted in the proximal promoter region of CG18446. All nine insertions are solo-LTRs that belong to the roo family. We analyzed the sequence of the nine roo insertions and we investigated whether the different insertions were functionally equivalent by performing 5'-RACE, gene expression, and cold-stress survival experiments. We found that different insertions have different molecular and functional consequences. The exact position where the transposable elements are inserted matters, as they all showed highly conserved sequences but only two of the analyzed insertions provided alternative transcription start sites, and only the FBti0019985 insertion consistently affects CG18446 expression. The phenotypic consequences of the different insertions also vary: only FBti0019985 was associated with cold-stress tolerance. Interestingly, the only previous report of transposable elements inserting repeatedly and independently in a promoter region in D. melanogaster, were also located upstream of a stress response gene. Our results suggest that functional validation of individual structural variants is needed to resolve the complexity of insertion clusters.
Collapse
Affiliation(s)
- Miriam Merenciano
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona. Spain
| | - Anna Ullastres
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona. Spain
| | - M. A. R. de Cara
- Laboratoire d’Eco-anthropologie et Ethnobiologie, UMR 7206, CNRS/MNHN/Universite Paris 7, Museum National d’Histoire Naturelle, F-75116 Paris, France
| | - Maite G. Barrón
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona. Spain
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona. Spain
- * E-mail:
| |
Collapse
|
50
|
Glukhov IA, Kotnova AP, Stefanov YE, Ilyin YV. The first complete Mag family retrotransposons discovered in Drosophila. DOKL BIOCHEM BIOPHYS 2016; 466:1-4. [PMID: 27025475 DOI: 10.1134/s1607672916010014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2015] [Indexed: 11/23/2022]
Abstract
A retrotransposon of the Mag family was found in the Drosophila simulans genome for the first time. We also identified novel transposable elements representing the Mag family in seven Drosophila species. The high similarity between the 3' and 5' long terminal repeats in the found copies of transposable elements indicates that their retrotransposition has occurred relatively recently. Thus, the Mag family of retrotransposons is quite common for the genus Drosophila.
Collapse
Affiliation(s)
- I A Glukhov
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, ul. Vavilova 32, Moscow, 119991, Russia.
| | - A P Kotnova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, ul. Vavilova 32, Moscow, 119991, Russia
| | - Y E Stefanov
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, ul. Vavilova 32, Moscow, 119991, Russia
| | - Y V Ilyin
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, ul. Vavilova 32, Moscow, 119991, Russia.,Institute of Gene Biology, Russian Academy of Sciences, ul. Vavilova 34/5, Moscow, 119334, Russia
| |
Collapse
|