1
|
Luis Molina-Quirós J, Hernández-Muñoz S, Antonio Baeza J. The complete mitochondrial genome of the roosterfish Nematistius pectoralis Gill 1862: purifying selection in protein coding genes, organization of the control region, and insights into family-level phylogenomic relationships in the recently erected order Carangiformes. Gene 2022; 845:146847. [PMID: 36058495 DOI: 10.1016/j.gene.2022.146847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 06/26/2022] [Accepted: 08/25/2022] [Indexed: 11/24/2022]
Abstract
The roosterfish Nematistius pectoralis is considered as one of the most magnificent sportfishes worldwide. This study developed the first genomic resource for this trophy-fish that is heavily targeted by the fly-fishing industry. The complete mitochondrial genome of N. pectoralis was assembled using short read sequences and analyzed in detail. The mitochondrial genome of N. pectoralis is 16,537 bp in length and comprises 13 protein-coding genes (PCGs), 2 ribosomal RNA genes (12S and 16S), and 22 transfer RNA genes. A long intergenic space 770 bp in length was assumed to be the D-loop or Control Region (CR). Most of the PCGs and tRNA genes are encoded in the L-strand. All PCGs are under purifying selection and atp8 and nad6 experienced the least selective pressure. All tRNAs exhibit a cloverleaf secondary structure except tRNA-Serine 1 that lacked the D-arm loop. The D-loop of N. pectoralis exhibits three domains commonly described in other fishes; extended terminal associated sequences (ETAS), central, and conserved sequence block (CSB). A ML phylogenetic reconstruction of the newly recognized order Carangiformes based on all 13 mitochondrial PCGs did not support the monophyly of this clade but recognized several families as monophyletic, including Bothidae, Carangidae, Istiophoridae, Latidae, Paralichthyidae, Polynemidae, and Rhombosoleidae. Nematistius pectoralis was sister to a clade composed of Toxotes chatareus (fam. Toxotidae) + Lactarius lactarius (fam. Lactariidae). This genomic resource developed for N. pectoralis will aid in improving our understanding of the population genomics of and strengthen conservation and management strategies in this remarkable trophy-fish.
Collapse
Affiliation(s)
- José Luis Molina-Quirós
- Biomolecular Laboratory, Center for International Programs, Universidad Veritas, San José, Costa Rica.
| | - Sebastián Hernández-Muñoz
- Biomolecular Laboratory, Center for International Programs, Universidad Veritas, San José, Costa Rica; Sala de Colecciones, Facultad de Ciencias del Mar, Universidad Católica del Norte, Coquimbo, Chile
| | - J Antonio Baeza
- Department of Biological Sciences, Clemson University, Clemson, SC, USA; Departamento de Biología Marina, Universidad Catolica del Norte, Coquimbo, IV Región, Chile; Smithsonian Marine Station at Fort Pierce, Smithsonian Institution, Fort Pierce, FL, USA
| |
Collapse
|
2
|
De Vivo M, Lee HH, Huang YS, Dreyer N, Fong CL, de Mattos FMG, Jain D, Wen YHV, Mwihaki JK, Wang TY, Machida RJ, Wang J, Chan BKK, Tsai IJ. Utilisation of Oxford Nanopore sequencing to generate six complete gastropod mitochondrial genomes as part of a biodiversity curriculum. Sci Rep 2022; 12:9973. [PMID: 35705661 PMCID: PMC9200733 DOI: 10.1038/s41598-022-14121-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Accepted: 06/01/2022] [Indexed: 11/13/2022] Open
Abstract
High-throughput sequencing has enabled genome skimming approaches to produce complete mitochondrial genomes (mitogenomes) for species identification and phylogenomics purposes. In particular, the portable sequencing device from Oxford Nanopore Technologies (ONT) has the potential to facilitate hands-on training from sampling to sequencing and interpretation of mitogenomes. In this study, we present the results from sampling and sequencing of six gastropod mitogenomes (Aplysia argus, Cellana orientalis, Cellana toreuma, Conus ebraeus, Conus miles and Tylothais aculeata) from a graduate level biodiversity course. The students were able to produce mitogenomes from sampling to annotation using existing protocols and programs. Approximately 4 Gb of sequence was produced from 16 Flongle and one MinION flow cells, averaging 235 Mb and N50 = 4.4 kb per flow cell. Five of the six 14.1-18 kb mitogenomes were circlised containing all 13 core protein coding genes. Additional Illumina sequencing revealed that the ONT assemblies spanned over highly AT rich sequences in the control region that were otherwise missing in Illumina-assembled mitogenomes, but still contained a base error of one every 70.8-346.7 bp under the fast mode basecalling with the majority occurring at homopolymer regions. Our findings suggest that the portable MinION device can be used to rapidly produce low-cost mitogenomes onsite and tailored to genomics-based training in biodiversity research.
Collapse
Affiliation(s)
- Mattia De Vivo
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Department of Life Science, National Taiwan Normal University, Taipei, Taiwan
- Biodiversity Program, Taiwan International Graduate Program, Academia Sinica and National Taiwan Normal University, Taipei, Taiwan
| | - Hsin-Han Lee
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Bioinformatics Program, Taiwan International Graduate Program, National Taiwan University, Taipei, Taiwan
- Bioinformatics Program, Institute of Information Science, Taiwan International Graduate Program, Academia Sinica, Taipei, Taiwan
| | - Yu-Sin Huang
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Department of Life Science, National Taiwan Normal University, Taipei, Taiwan
- Biodiversity Program, Taiwan International Graduate Program, Academia Sinica and National Taiwan Normal University, Taipei, Taiwan
| | - Niklas Dreyer
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Department of Life Science, National Taiwan Normal University, Taipei, Taiwan
- Biodiversity Program, Taiwan International Graduate Program, Academia Sinica and National Taiwan Normal University, Taipei, Taiwan
- Natural History Museum of Denmark, University of Copenhagen, Faculty of Science, Copenhagen, Denmark
| | - Chia-Ling Fong
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Department of Life Science, National Taiwan Normal University, Taipei, Taiwan
- Biodiversity Program, Taiwan International Graduate Program, Academia Sinica and National Taiwan Normal University, Taipei, Taiwan
| | - Felipe Monteiro Gomes de Mattos
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Department of Life Science, National Taiwan Normal University, Taipei, Taiwan
- Biodiversity Program, Taiwan International Graduate Program, Academia Sinica and National Taiwan Normal University, Taipei, Taiwan
| | - Dharmesh Jain
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
- Graduate Institute of Biotechnology, National Chung-Hsing University, Taichung, Taiwan
- Molecular and Biological Agricultural Sciences, Taiwan International Graduate Program, Academia Sinica and National Chung Hsing University, Taipei, Taiwan
| | - Yung-Hui Victoria Wen
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
- Ph.D. Program in Microbial Genomics, National Chung Hsing University and Academia Sinica, Taipei, Taiwan
| | - John Karichu Mwihaki
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Department of Life Science, National Taiwan Normal University, Taipei, Taiwan
- Biodiversity Program, Taiwan International Graduate Program, Academia Sinica and National Taiwan Normal University, Taipei, Taiwan
| | - Tzi-Yuan Wang
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Ryuji J Machida
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - John Wang
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Benny K K Chan
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | | |
Collapse
|
3
|
Baeza JA, García-De León FJ. Are we there yet? Benchmarking low-coverage nanopore long-read sequencing for the assembling of mitochondrial genomes using the vulnerable silky shark Carcharhinus falciformis. BMC Genomics 2022; 23:320. [PMID: 35459089 PMCID: PMC9027416 DOI: 10.1186/s12864-022-08482-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 03/18/2022] [Indexed: 12/28/2022] Open
Abstract
Background Whole mitochondrial genomes are quickly becoming markers of choice for the exploration of within-species genealogical and among-species phylogenetic relationships. Most often, ‘primer walking’ or ‘long PCR’ strategies plus Sanger sequencing or low-pass whole genome sequencing using Illumina short reads are used for the assembling of mitochondrial chromosomes. In this study, we first confirmed that mitochondrial genomes can be sequenced from long reads using nanopore sequencing data exclusively. Next, we examined the accuracy of the long-reads assembled mitochondrial chromosomes when comparing them to a ‘gold’ standard reference mitochondrial chromosome assembled using Illumina short-reads sequencing. Results Using a specialized bioinformatics tool, we first produced a short-reads mitochondrial genome assembly for the silky shark C. falciformis with an average base coverage of 9.8x. The complete mitochondrial genome of C. falciformis was 16,705 bp in length and 934 bp shorter than a previously assembled genome (17,639 bp in length) that used bioinformatics tools not specialized for the assembly of mitochondrial chromosomes. Next, low-pass whole genome sequencing using a MinION ONT pocket-sized platform plus customized de-novo and reference-based workflows assembled and circularized a highly accurate mitochondrial genome in the silky shark Carcharhinus falciformis. Indels at the flanks of homopolymer regions explained most of the dissimilarities observed between the ‘gold’ standard reference mitochondrial genome (assembled using Illumina short reads) and each of the long-reads mitochondrial genome assemblies. Although not completely accurate, mitophylogenomics and barcoding analyses (using entire mitogenomes and the D-Loop/Control Region, respectively) suggest that long-reads assembled mitochondrial genomes are reliable for identifying a sequenced individual, such as C. falciformis, and separating the same individual from others belonging to closely related congeneric species. Conclusions This study confirms that mitochondrial genomes can be sequenced from long-reads nanopore sequencing data exclusively. With further development, nanopore technology can be used to quickly test in situ mislabeling in the shark fin fishing industry and thus, improve surveillance protocols, law enforcement, and the regulation of this fishery. This study will also assist with the transferring of high-throughput sequencing technology to middle- and low-income countries so that international scientists can explore population genomics in sharks using inclusive research strategies. Lastly, we recommend assembling mitochondrial genomes using specialized assemblers instead of other assemblers developed for bacterial and/or nuclear genomes.
Collapse
Affiliation(s)
- J Antonio Baeza
- Department of Biological Sciences, 132 Long Hall, Clemson University, Clemson, SC, 29634, USA. .,Smithsonian Marine Station at Fort Pierce, 701 Seaway Drive, Fort Pierce, Florida, 34949, USA. .,Departamento de Biología Marina, Facultad de Ciencias del Mar, Universidad Católica del Norte, Larrondo, 1281, Coquimbo, Chile.
| | - F J García-De León
- Laboratorio de Genética para la Conservación, Centro de Investigaciones Biológicas del Noroeste, S.C., La Paz, Baja California Sur, Mexico
| |
Collapse
|
4
|
Wanner N, Larsen PA, McLain A, Faulk C. The mitochondrial genome and Epigenome of the Golden lion Tamarin from fecal DNA using Nanopore adaptive sequencing. BMC Genomics 2021; 22:726. [PMID: 34620074 PMCID: PMC8499546 DOI: 10.1186/s12864-021-08046-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 09/29/2021] [Indexed: 02/01/2023] Open
Abstract
BACKGROUND The golden lion tamarin (Leontopithecus rosalia) is an endangered Platyrrhine primate endemic to the Atlantic coastal forests of Brazil. Despite ongoing conservation efforts, genetic data on this species remains scarce. Complicating factors include limitations on sample collection and a lack of high-quality reference sequences. Here, we used nanopore adaptive sampling to resequence the L. rosalia mitogenome from feces, a sample which can be collected non-invasively. RESULTS Adaptive sampling doubled the fraction of both host-derived and mitochondrial sequences compared to sequencing without enrichment. 258x coverage of the L. rosalia mitogenome was achieved in a single flow cell by targeting the unfinished genome of the distantly related emperor tamarin (Saguinus imperator) and the mitogenome of the closely related black lion tamarin (Leontopithecus chrysopygus). The L. rosalia mitogenome has a length of 16,597 bp, sharing 99.68% sequence identity with the L. chrysopygus mitogenome. A total of 38 SNPs between them were identified, with the majority being found in the non-coding D-loop region. DNA methylation and hydroxymethylation were directly detected using a neural network model applied to the raw signal from the MinION sequencer. In contrast to prior reports, DNA methylation was negligible in mitochondria in both CpG and non-CpG contexts. Surprisingly, a quarter of the 642 CpG sites exhibited DNA hydroxymethylation greater than 1% and 44 sites were above 5%, with concentration in the 3' side of several coding regions. CONCLUSIONS Overall, we report a robust new mitogenome assembly for L. rosalia and direct detection of cytosine base modifications in all contexts.
Collapse
Affiliation(s)
- Nicole Wanner
- Department of Animal Sciences, University of Minnesota, College of Food, Agricultural, and Natural Resource Sciences, 1988 Fitch Ave., Saint Paul, MN 55108 USA
| | - Peter A. Larsen
- Department of Veterinary and Biomedical Sciences, College of Veterinary Medicine, University of Minnesota, Saint Paul, MN USA
| | - Adam McLain
- Department of Biology and Chemistry, College of Arts and Sciences, SUNY Polytechnic Institute, Utica, NY USA
| | - Christopher Faulk
- Department of Animal Sciences, University of Minnesota, College of Food, Agricultural, and Natural Resource Sciences, 1988 Fitch Ave., Saint Paul, MN 55108 USA
| |
Collapse
|
5
|
Yang Z, Slone J, Wang X, Zhan J, Huang Y, Namjou B, Kaufman KM, Pauciulo M, Harley JB, Muglia LJ, Chepelev I, Huang T. Validation of low-coverage whole-genome sequencing for mitochondrial DNA variants suggests mitochondrial DNA as a genetic cause of preterm birth. Hum Mutat 2021; 42:1602-1614. [PMID: 34467602 PMCID: PMC9290920 DOI: 10.1002/humu.24279] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 07/17/2021] [Accepted: 08/29/2021] [Indexed: 01/06/2023]
Abstract
Preterm birth (PTB), or birth that occurs earlier than 37 weeks of gestational age, is a major contributor to infant mortality and neonatal hospitalization. Mutations in the mitochondrial genome (mtDNA) have been linked to various rare mitochondrial disorders and may be a contributing factor in PTB given that maternal genetic factors have been strongly linked to PTB. However, to date, no study has found a conclusive connection between a particular mtDNA variant and PTB. Given the high mtDNA copy number per cell, an automated pipeline was developed for detecting mtDNA variants using low‐coverage whole‐genome sequencing (lcWGS) data. The pipeline was first validated against samples of known heteroplasmy, and then applied to 929 samples from a PTB cohort from diverse ethnic backgrounds with an average gestational age of 27.18 weeks (range: 21–30). Our new pipeline successfully identified haplogroups and a large number of mtDNA variants in this large PTB cohort, including 8 samples carrying known pathogenic variants and 47 samples carrying rare mtDNA variants. These results confirm that lcWGS can be utilized to reliably identify mtDNA variants. These mtDNA variants may make a contribution toward preterm birth in a small proportion of live births.
Collapse
Affiliation(s)
- Zeyu Yang
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati, Ohio, USA
| | - Jesse Slone
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati, Ohio, USA
| | - Xinjian Wang
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati, Ohio, USA
| | - Jack Zhan
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati, Ohio, USA
| | - Yongbo Huang
- Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati, Ohio, USA
| | - Bahram Namjou
- Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati, Ohio, USA
| | - Kenneth M Kaufman
- Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati, Ohio, USA.,US Department of Veterans Affairs Medical Center, Cincinnati, Ohio, USA
| | - Michael Pauciulo
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati, Ohio, USA
| | - John B Harley
- Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati, Ohio, USA.,US Department of Veterans Affairs Medical Center, Cincinnati, Ohio, USA
| | - Louis J Muglia
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati, Ohio, USA.,Burroughs Wellcome Fund, Research Triangle Park, North Carolina, USA
| | - Iouri Chepelev
- Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati, Ohio, USA
| | - Taosheng Huang
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati, Ohio, USA
| |
Collapse
|
6
|
Malukiewicz J, Cartwright RA, Dergam JA, Igayara CS, Nicola PA, Pereira LMC, Ruiz-Miranda CR, Stone AC, Silva DL, Silva FDFRD, Varsani A, Walter L, Wilson MA, Zinner D, Roos C. Genomic skimming and nanopore sequencing uncover cryptic hybridization in one of world's most threatened primates. Sci Rep 2021; 11:17279. [PMID: 34446741 PMCID: PMC8390465 DOI: 10.1038/s41598-021-96404-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 08/10/2021] [Indexed: 12/28/2022] Open
Abstract
The Brazilian buffy-tufted-ear marmoset (Callithrix aurita), one of the world's most endangered primates, is threatened by anthropogenic hybridization with exotic, invasive marmoset species. As there are few genetic data available for C. aurita, we developed a PCR-free protocol with minimal technical requirements to rapidly generate genomic data with genomic skimming and portable nanopore sequencing. With this direct DNA sequencing approach, we successfully determined the complete mitogenome of a marmoset that we initially identified as C. aurita. The obtained nanopore-assembled sequence was highly concordant with a Sanger sequenced version of the same mitogenome. Phylogenetic analyses unexpectedly revealed that our specimen was a cryptic hybrid, with a C. aurita phenotype and C. penicillata mitogenome lineage. We also used publicly available mitogenome data to determine diversity estimates for C. aurita and three other marmoset species. Mitogenomics holds great potential to address deficiencies in genomic data for endangered, non-model species such as C. aurita. However, we discuss why mitogenomic approaches should be used in conjunction with other data for marmoset species identification. Finally, we discuss the utility and implications of our results and genomic skimming/nanopore approach for conservation and evolutionary studies of C. aurita and other marmosets.
Collapse
Affiliation(s)
- Joanna Malukiewicz
- German Primate Center, Leibniz Institute for Primate Research, Primate Genetics Laboratory, Göttingen, 37077, Germany.
- Universidade de São Paulo, Faculdade de Medicina, São Paulo, São Paulo, 05403-000, Brazil.
| | - Reed A Cartwright
- School of Life Sciences and The Biodesign Institute, Arizona State University, Tempe, AZ, 85281, USA
| | - Jorge A Dergam
- Departamento de Biologia Animal, Universidade Federal de Viçosa, Viçosa, MG, 36570-900, Brazil
| | | | - Patricia A Nicola
- Centro de Conservação e Manejo de Fauna da Caatinga, Universidade Federal do Vale do São Francisco, Petrolina, PE, 56300-000, Brazil
| | - Luiz M C Pereira
- Centro de Conservação e Manejo de Fauna da Caatinga, Universidade Federal do Vale do São Francisco, Petrolina, PE, 56300-000, Brazil
| | - Carlos R Ruiz-Miranda
- Laboratório das Ciências Ambientais, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense, Campos dos Goytacazes, RJ, 28013-602, Brazil
| | - Anne C Stone
- School of Human Evolution and Social Change, Arizona State University, Tempe, AZ, 85281, USA
- Arizona State University, Institute of Human Origins, Tempe, AZ, 85281, USA
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ, 85287, USA
| | - Daniel L Silva
- Núcleo de Pesquisas em Ciências Biológicas - NUPEB, Federal University of Ouro Preto, Ouro Preto, MG, 35400-000, Brazil
| | | | - Arvind Varsani
- The Biodesign Center for Fundamental and Applied Microbiomics, Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, 85287, USA
- Structural Biology Research Unit, Department of Integrative Biomedical Sciences, University of Cape Town, Rondebosch, Cape Town, 7700, South Africa
| | - Lutz Walter
- German Primate Center, Leibniz Institute for Primate Research, Primate Genetics Laboratory, Göttingen, 37077, Germany
| | - Melissa A Wilson
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ, 85287, USA
- School of Life Sciences, Arizona State University, Tempe, AZ, 85287, USA
| | - Dietmar Zinner
- German Primate Center, Leibniz Institute for Primate Research, Cognitive Ethology Laboratory, Göttingen, 37077, Germany
- Leibniz ScienceCampus Primate Cognition, Göttingen, 37077, Germany
- Department of Primate Cognition, Georg-August-University, Göttingen, 37077, Germany
| | - Christian Roos
- German Primate Center, Leibniz Institute for Primate Research, Primate Genetics Laboratory, Göttingen, 37077, Germany
- German Primate Center, Leibniz Institute for Primate Research, Gene Bank of Primates, Göttingen, 37077, Germany
| |
Collapse
|