1
|
Koga A, Ahmad SF, Panthum T, Singchat W, Srikulnath K. Characterization of the marsupial endogenous retrovirus walb with a focus on satellite DNA formation. Virology 2023; 588:109911. [PMID: 37918186 DOI: 10.1016/j.virol.2023.109911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 10/06/2023] [Accepted: 10/10/2023] [Indexed: 11/04/2023]
Abstract
The walbRep megasatellite DNA found in the red-necked wallaby was formed from the walb endogenous retrovirus. Our previous PCR experiments suggested the presence of walb and absence of walbRep in the genome of the tammar wallaby, which diverged from the red-necked wallaby 2-3 Mya. The results failed to exclude the possibility that certain walbRep sequences might have remained undetected owing to variation in the primer-annealing regions; therefore, the aforementioned suggestion was not confirmed. To obtain conclusive evidence, we analyzed the structure of walb sequences drawn from the tammar wallaby genome database recently updated to a chromosome-level assembly. All walb copies existed as separate DNA segments, not constituting tandem repeats. We concluded that walbRep was formed in the red-necked wallaby lineage after its divergence from the tammar wallaby. We also confirm the presence of a walb copy with an anomalistic, complex structure and propose a simple model for its generation mechanism.
Collapse
Affiliation(s)
- Akihiko Koga
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Syed Farhan Ahmad
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Thitipong Panthum
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Worapong Singchat
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand; Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand
| | - Kornsorn Srikulnath
- Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand; Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, 50 Ngamwongwan, Chatuchak, Bangkok, 10900, Thailand.
| |
Collapse
|
2
|
Larivière D, Abueg L, Brajuka N, Gallardo-Alba C, Grüning B, Ko BJ, Ostrovsky A, Palmada-Flores M, Pickett BD, Rabbani K, Balacco JR, Chaisson M, Cheng H, Collins J, Denisova A, Fedrigo O, Gallo GR, Giani AM, Gooder GM, Jain N, Johnson C, Kim H, Lee C, Marques-Bonet T, O'Toole B, Rhie A, Secomandi S, Sozzoni M, Tilley T, Uliano-Silva M, van den Beek M, Waterhouse RM, Phillippy AM, Jarvis ED, Schatz MC, Nekrutenko A, Formenti G. Scalable, accessible, and reproducible reference genome assembly and evaluation in Galaxy. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.28.546576. [PMID: 37425881 PMCID: PMC10327048 DOI: 10.1101/2023.06.28.546576] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
Improvements in genome sequencing and assembly are enabling high-quality reference genomes for all species. However, the assembly process is still laborious, computationally and technically demanding, lacks standards for reproducibility, and is not readily scalable. Here we present the latest Vertebrate Genomes Project assembly pipeline and demonstrate that it delivers high-quality reference genomes at scale across a set of vertebrate species arising over the last ~500 million years. The pipeline is versatile and combines PacBio HiFi long-reads and Hi-C-based haplotype phasing in a new graph-based paradigm. Standardized quality control is performed automatically to troubleshoot assembly issues and assess biological complexities. We make the pipeline freely accessible through Galaxy, accommodating researchers even without local computational resources and enhanced reproducibility by democratizing the training and assembly process. We demonstrate the flexibility and reliability of the pipeline by assembling reference genomes for 51 vertebrate species from major taxonomic groups (fish, amphibians, reptiles, birds, and mammals).
Collapse
Affiliation(s)
- Delphine Larivière
- Dept. of Biochemistry and Molecular Biology, Pennsylvania State University, USA
| | - Linelle Abueg
- Vertebrate Genome Laboratory, The Rockefeller University, USA
| | | | - Cristóbal Gallardo-Alba
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Freiburg, Germany
| | - Bjorn Grüning
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Freiburg, Germany
| | - Byung June Ko
- Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
| | - Alex Ostrovsky
- Departments of Biology and Computer Science, Johns Hopkins University, USA
| | - Marc Palmada-Flores
- Department of Medicine and Life Sciences (MELIS), Institut de Biologia Evolutiva, Universitat Pompeu Fabra-CSIC, Barcelona 08003, Spain
| | - Brandon D Pickett
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Keon Rabbani
- Department of Quantitative and Computational Biology, University of Southern California
| | | | - Mark Chaisson
- Department of Quantitative and Computational Biology, University of Southern California
| | - Haoyu Cheng
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Joanna Collins
- Wellcome Sanger Institute, Cambridge CB10 1SA, United Kingdom
| | - Alexandra Denisova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Russia
| | - Olivier Fedrigo
- Vertebrate Genome Laboratory, The Rockefeller University, USA
| | | | | | | | - Nivesh Jain
- Vertebrate Genome Laboratory, The Rockefeller University, USA
| | - Cassidy Johnson
- Vertebrate Genome Laboratory, The Rockefeller University, USA
| | - Heebal Kim
- Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
- eGnome, Inc, Seoul, Republic of Korea
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
| | - Chul Lee
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York City, NY, 10065, USA
| | - Tomas Marques-Bonet
- Department of Medicine and Life Sciences (MELIS), Institut de Biologia Evolutiva, Universitat Pompeu Fabra-CSIC, Barcelona 08003, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), Barcelona 08010, Spain
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona 08028, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Cerdanyola del Vallès 08193, Spain
| | - Brian O'Toole
- Vertebrate Genome Laboratory, The Rockefeller University, USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Simona Secomandi
- Department of Biological Sciences, University of Cyprus, Nicosia, Cyprus
| | - Marcella Sozzoni
- University of Florence, Department of Biology, Via Madonna del Piano 6, Sesto Fiorentino (FI)
| | - Tatiana Tilley
- Vertebrate Genome Laboratory, The Rockefeller University, USA
| | | | - Marius van den Beek
- Dept. of Biochemistry and Molecular Biology, Pennsylvania State University, USA
| | - Robert M Waterhouse
- Department of Ecology & Evolution and Swiss Institute of Bioinformatics, University of Lausanne, Lausanne, Switzerland
| | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Erich D Jarvis
- Vertebrate Genome Laboratory, The Rockefeller University, USA
| | - Michael C Schatz
- Departments of Biology and Computer Science, Johns Hopkins University, USA
| | - Anton Nekrutenko
- Dept. of Biochemistry and Molecular Biology, Pennsylvania State University, USA
| | - Giulio Formenti
- Vertebrate Genome Laboratory, The Rockefeller University, USA
| |
Collapse
|
3
|
Koga A, Hashimoto K, Honda Y, Nishihara H. Marsupial genome analysis suggests that satellite DNA formation from walb endogenous retrovirus is an event specific to the red-necked wallaby. Genes Cells 2023; 28:149-155. [PMID: 36527312 DOI: 10.1111/gtc.12999] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 11/29/2022] [Accepted: 12/11/2022] [Indexed: 12/23/2022]
Abstract
We recently identified walbRep, a satellite DNA residing in the genome of the red-necked wallaby Notamacropus rufogriseus. It originates from the walb endogenous retrovirus and is organized in a manner in which the provirus structure is retained. The walbRep repeat units feature an average pairwise nucleotide identity as high as 99.5%, raising the possibility of a recent origin. The tammar wallaby N. eugenii is a species estimated to have diverged from the red-necked wallaby 2-3 million years ago. In PCR analyses of these two and other related species, walbRep-specific fragment amplification was observed only in the red-necked wallaby. Sequence database searches for the tammar wallaby resulted in sequence alignment lists that were sufficiently powerful to exclude the possibility of walbRep existence. These results suggested that the walbRep formation occurred in the red-necked wallaby lineage after its divergence from the tammar wallaby lineage, thus in a time span of maximum 3 million years.
Collapse
Affiliation(s)
- Akihiko Koga
- Center for Evolutionary Origins of Human Behavior, Kyoto University, Inuyama, Japan
| | | | - Yusuke Honda
- Noichi Zoological Park of Kochi Prefecture, Konan, Japan
| | - Hidenori Nishihara
- School of Life Science and Technology, Tokyo Institute of Technology, Yokohama, Japan
| |
Collapse
|
4
|
Hayashi S, Honda Y, Kanesaki E, Koga A. Marsupial satellite DNA as faithful reflections of long terminal repeat (LTR) retroelement structure. Genome 2022; 65:469-478. [PMID: 35930809 DOI: 10.1139/gen-2022-0039] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Long terminal repeat (LTR) retroelements, including endogenous retroviruses, are one of the origins of satellite DNAs. However, the vast majority of satellite DNAs originating from LTR retroelements consist of parts of the element. In addition, they frequently contain sequences unrelated to that element. Here we report a novel marsupial satellite DNA (named walbRep) that contains, and consists solely of, the entire sequence of an LTR retroelement (the walb element). As is common with LTR retroelements, walb copies exhibit length variation. We focused on the abundance of copies of a specific length (2.7 kb) in the genome of the red-necked wallaby. Cloning and analyses of long genomic DNA fragments revealed a satellite DNA in which the LTR sequence (0.4 kb) and the sequence of the internal region of a nonautonomous walb copy (2.3 kb) were repeated alternately. The junctions between these two components exhibited the same end-to-end arrangements as those in the walb element. This satellite organization could be accounted for by a simple formation model that includes slippage during chromosome pairing followed by homologous recombination but does not invoke any other types of rearrangements. We discuss the possible reasons why satellite DNAs having such structures are rarely found in mammals.
Collapse
Affiliation(s)
| | - Yusuke Honda
- Noichi Zoological Park of Kochi Prefecture, Konan, Japan;
| | | | | |
Collapse
|
5
|
Feigin CY, Newton AH, Pask AJ. Widespread cis-regulatory convergence between the extinct Tasmanian tiger and gray wolf. Genome Res 2019; 29:1648-1658. [PMID: 31533979 PMCID: PMC6771401 DOI: 10.1101/gr.244251.118] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2019] [Accepted: 08/19/2019] [Indexed: 12/18/2022]
Abstract
The extinct marsupial Tasmanian tiger, or thylacine, and the eutherian gray wolf are among the most widely recognized examples of convergent evolution in mammals. Despite being distantly related, these large predators independently evolved extremely similar craniofacial morphologies, and evidence suggests that they filled similar ecological niches. Previous analyses revealed little evidence of adaptive convergence between their protein-coding genes. Thus, the genetic basis of their convergence is still unclear. Here, we identified candidate craniofacial cis-regulatory elements across vertebrates and compared their evolutionary rates in the thylacine and wolf, revealing abundant signatures of convergent positive selection. Craniofacial thylacine-wolf accelerated regions were enriched near genes involved in TGF beta (TGFB) and BMP signaling, both of which are key morphological signaling pathways with critical roles in establishing the identities and boundaries between craniofacial tissues. Similarly, enhancers of genes involved in craniofacial nerve development showed convergent selection and involvement in these pathways. Taken together, these results suggest that adaptation in cis-regulators of TGF beta and BMP signaling may provide a mechanism to explain the coevolution of developmentally and functionally integrated craniofacial structures in these species. We also found that despite major structural differences in marsupial and eutherian brains, accelerated regions in both species were common near genes with roles in brain development. Our findings support the hypothesis that, relative to protein-coding genes, positive selection on cis-regulatory elements is likely to be an essential driver of adaptive convergent evolution and may underpin thylacine-wolf phenotypic similarities.
Collapse
Affiliation(s)
- Charles Y Feigin
- School of BioSciences, The University of Melbourne, Parkville, Victoria 3010, Australia.,Department of Molecular Biology, Princeton University, Princeton, New Jersey 08544, USA
| | - Axel H Newton
- School of BioSciences, The University of Melbourne, Parkville, Victoria 3010, Australia.,Museums Victoria, Melbourne, Victoria 3053, Australia
| | - Andrew J Pask
- School of BioSciences, The University of Melbourne, Parkville, Victoria 3010, Australia.,Museums Victoria, Melbourne, Victoria 3053, Australia
| |
Collapse
|