1
|
Inoue J, Shinzato C, Hirai J, Itoh S, Minegishi Y, Ito SI, Hyodo S. phyloBARCODER: A Web Tool for Phylogenetic Classification of Eukaryote Metabarcodes Using Custom Reference Databases. Mol Biol Evol 2024; 41:msae111. [PMID: 38850168 PMCID: PMC11297486 DOI: 10.1093/molbev/msae111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 06/02/2024] [Accepted: 06/04/2024] [Indexed: 06/10/2024] Open
Abstract
We developed phyloBARCODER (https://github.com/jun-inoue/phyloBARCODER), a new web tool that can identify short DNA sequences to the species level using metabarcoding. phyloBARCODER estimates phylogenetic trees based on the uploaded anonymous DNA sequences and reference sequences from databases. Without such phylogenetic contexts, alternative, similarity-based methods independently identify species names and anonymous sequences of the same group by pairwise comparisons between queries and database sequences, with the caveat that they must match exactly or very closely. By putting metabarcoding sequences into a phylogenetic context, phyloBARCODER accurately identifies (i) species or classification of query sequences and (ii) anonymous sequences associated with the same species or even with populations of query sequences, with clear and accurate explanations. Version 1 of phyloBARCODER stores a database comprising all eukaryotic mitochondrial gene sequences. Moreover, by uploading their own databases, phyloBARCODER users can conduct species identification specialized for sequences obtained from a local geographic region or those of nonmitochondrial genes, e.g. ITS or rbcL.
Collapse
Affiliation(s)
- Jun Inoue
- Atmosphere and Ocean Research Institute, The University of Tokyo, Kashiwa, Japan
| | - Chuya Shinzato
- Atmosphere and Ocean Research Institute, The University of Tokyo, Kashiwa, Japan
| | - Junya Hirai
- Atmosphere and Ocean Research Institute, The University of Tokyo, Kashiwa, Japan
| | - Sachihiko Itoh
- Atmosphere and Ocean Research Institute, The University of Tokyo, Kashiwa, Japan
| | - Yuki Minegishi
- Atmosphere and Ocean Research Institute, The University of Tokyo, Kashiwa, Japan
| | - Shin-ichi Ito
- Atmosphere and Ocean Research Institute, The University of Tokyo, Kashiwa, Japan
| | - Susumu Hyodo
- Atmosphere and Ocean Research Institute, The University of Tokyo, Kashiwa, Japan
| |
Collapse
|
2
|
Gwiazdowski R. Principles for Constructing DNA Barcode Reference Libraries. Methods Mol Biol 2024; 2744:491-502. [PMID: 38683337 DOI: 10.1007/978-1-0716-3581-0_29] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/01/2024]
Abstract
All DNA barcode methods rely on reference sequences linked to well-curated voucher specimens. Definitions for and locations of DNA barcode reference libraries are not standardized, and vary throughout the literature. Standardizing, and centralizing reference specimens would provide an unambiguous source, analogous to reference genomes, to reproduce identifications and improve a library. This chapter proposes a working definition of a DNA barcode reference library, consistent with DNA barcode data standards, along with principles and methods to consider when producing or using such a library. These methods allow explicit traceback to sequence-sources which elevate the value of voucher specimens, and create a potential for community curation.
Collapse
Affiliation(s)
- Rodger Gwiazdowski
- Department of Environmental Conservation, University of Massachusetts, Amherst, MA, USA.
| |
Collapse
|
3
|
Melis R, Di Crescenzo S, Cariani A, Ferrari A, Crobe V, Bellodi A, Mulas A, Carugati L, Coluccia E, Follesa MC, Cannas R. I Like This New Me: Unravelling Population Structure of Mediterranean Electric Rays and Taxonomic Uncertainties within Torpediniformes. Animals (Basel) 2023; 13:2899. [PMID: 37760300 PMCID: PMC10525375 DOI: 10.3390/ani13182899] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 09/04/2023] [Accepted: 09/07/2023] [Indexed: 09/29/2023] Open
Abstract
The present study focused on the three species of electric rays known to occur in the Mediterranean Sea: Torpedo torpedo, Torpedo marmorata and Tetronarce nobiliana. Correct identification of specimens is needed to properly assess the impact of fisheries on populations and species. Unfortunately, torpedoes share high morphological similarities, boosting episodes of field misidentification. In this context, genetic data was used (1) to identify specimens caught during fishing operations, (2) to measure the diversity among and within these species, and (3) to shed light on the possible occurrence of additional hidden species in the investigated area. New and already published sequences of COI and NADH2 mitochondrial genes were analyzed, both at a small scale along the Sardinian coasts (Western Mediterranean) and at a large scale in the whole Mediterranean Sea. High levels of genetic diversity were found in Sardinian populations, being significantly different from other areas of the Eastern Mediterranean Sea due to the biotic and abiotic factors here discussed. Sardinian torpedoes can hence be indicated as priority populations/areas to be protected within the Mediterranean Sea. Moreover, sequence data confirmed that only the three species occur in the investigated area. The application of several 'species-delimitation' methods found evidence of cryptic species in the three species outside the Mediterranean Sea, as well as in other genera/families, suggesting the urgent need for future studies and a comprehensive revision of the order Torpediniformes for its effective conservation.
Collapse
Affiliation(s)
- Riccardo Melis
- Department of Life and Environmental Sciences, University of Cagliari, 09126 Cagliari, Italy; (R.M.); (S.D.C.); (A.B.); (A.M.); (L.C.); (E.C.); (M.C.F.)
| | - Simone Di Crescenzo
- Department of Life and Environmental Sciences, University of Cagliari, 09126 Cagliari, Italy; (R.M.); (S.D.C.); (A.B.); (A.M.); (L.C.); (E.C.); (M.C.F.)
| | - Alessia Cariani
- Department of Biological, Geological and Environmental Sciences, University of Bologna, 40126 Bologna, Italy; (A.C.); (A.F.); (V.C.)
| | - Alice Ferrari
- Department of Biological, Geological and Environmental Sciences, University of Bologna, 40126 Bologna, Italy; (A.C.); (A.F.); (V.C.)
| | - Valentina Crobe
- Department of Biological, Geological and Environmental Sciences, University of Bologna, 40126 Bologna, Italy; (A.C.); (A.F.); (V.C.)
| | - Andrea Bellodi
- Department of Life and Environmental Sciences, University of Cagliari, 09126 Cagliari, Italy; (R.M.); (S.D.C.); (A.B.); (A.M.); (L.C.); (E.C.); (M.C.F.)
| | - Antonello Mulas
- Department of Life and Environmental Sciences, University of Cagliari, 09126 Cagliari, Italy; (R.M.); (S.D.C.); (A.B.); (A.M.); (L.C.); (E.C.); (M.C.F.)
| | - Laura Carugati
- Department of Life and Environmental Sciences, University of Cagliari, 09126 Cagliari, Italy; (R.M.); (S.D.C.); (A.B.); (A.M.); (L.C.); (E.C.); (M.C.F.)
| | - Elisabetta Coluccia
- Department of Life and Environmental Sciences, University of Cagliari, 09126 Cagliari, Italy; (R.M.); (S.D.C.); (A.B.); (A.M.); (L.C.); (E.C.); (M.C.F.)
| | - Maria Cristina Follesa
- Department of Life and Environmental Sciences, University of Cagliari, 09126 Cagliari, Italy; (R.M.); (S.D.C.); (A.B.); (A.M.); (L.C.); (E.C.); (M.C.F.)
| | - Rita Cannas
- Department of Life and Environmental Sciences, University of Cagliari, 09126 Cagliari, Italy; (R.M.); (S.D.C.); (A.B.); (A.M.); (L.C.); (E.C.); (M.C.F.)
| |
Collapse
|
4
|
Gold Z, Koch MQ, Schooler NK, Emery KA, Dugan JE, Miller RJ, Page HM, Schroeder DM, Hubbard DM, Madden JR, Whitaker SG, Barber PH. A comparison of biomonitoring methodologies for surf zone fish communities. PLoS One 2023; 18:e0260903. [PMID: 37314989 DOI: 10.1371/journal.pone.0260903] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2021] [Accepted: 04/25/2023] [Indexed: 06/16/2023] Open
Abstract
Surf zones are highly dynamic marine ecosystems that are subject to increasing anthropogenic and climatic pressures, posing multiple challenges for biomonitoring. Traditional methods such as seines and hook and line surveys are often labor intensive, taxonomically biased, and can be physically hazardous. Emerging techniques, such as baited remote underwater video (BRUV) and environmental DNA (eDNA) are promising nondestructive tools for assessing marine biodiversity in surf zones of sandy beaches. Here we compare the relative performance of beach seines, BRUV, and eDNA in characterizing community composition of bony (teleost) and cartilaginous (elasmobranch) fishes of surf zones at 18 open coast sandy beaches in southern California. Seine and BRUV surveys captured overlapping, but distinct fish communities with 50% (18/36) of detected species shared. BRUV surveys more frequently detected larger species (e.g. sharks and rays) while seines more frequently detected one of the most abundant species, barred surfperch (Amphistichus argenteus). In contrast, eDNA metabarcoding captured 88.9% (32/36) of all fishes observed in seine and BRUV surveys plus 57 additional species, including 15 that frequent surf zone habitats. On average, eDNA detected over 5 times more species than BRUVs and 8 times more species than seine surveys at a given site. eDNA approaches also showed significantly higher sensitivity than seine and BRUV methods and more consistently detected 31 of the 32 (96.9%) jointly observed species across beaches. The four species detected by BRUV/seines, but not eDNA were only resolved at higher taxonomic ranks (e.g. Embiotocidae surfperches and Sygnathidae pipefishes). In frequent co-detection of species between methods limited comparisons of richness and abundance estimates, highlighting the challenge of comparing biomonitoring approaches. Despite potential for improvement, results overall demonstrate that eDNA can provide a cost-effective tool for long-term surf zone monitoring that complements data from seine and BRUV surveys, allowing more comprehensive surveys of vertebrate diversity in surf zone habitats.
Collapse
Affiliation(s)
- Zachary Gold
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, United States of America
| | - McKenzie Q Koch
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, United States of America
| | - Nicholas K Schooler
- Marine Science Institute, University of California, Santa Barbara, Santa Barbara, CA, United States of America
| | - Kyle A Emery
- Marine Science Institute, University of California, Santa Barbara, Santa Barbara, CA, United States of America
| | - Jenifer E Dugan
- Marine Science Institute, University of California, Santa Barbara, Santa Barbara, CA, United States of America
| | - Robert J Miller
- Marine Science Institute, University of California, Santa Barbara, Santa Barbara, CA, United States of America
| | - Henry M Page
- Marine Science Institute, University of California, Santa Barbara, Santa Barbara, CA, United States of America
| | - Donna M Schroeder
- Bureau of Ocean Energy Management, Camarillo, CA, United States of America
| | - David M Hubbard
- Marine Science Institute, University of California, Santa Barbara, Santa Barbara, CA, United States of America
| | - Jessica R Madden
- Marine Science Institute, University of California, Santa Barbara, Santa Barbara, CA, United States of America
| | - Stephen G Whitaker
- Marine Science Institute, University of California, Santa Barbara, Santa Barbara, CA, United States of America
- Channel Islands National Park, Ventura, CA, United States of America
| | - Paul H Barber
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, United States of America
| |
Collapse
|
5
|
Jeunen GJ, Dowle E, Edgecombe J, von Ammon U, Gemmell NJ, Cross H. crabs-A software program to generate curated reference databases for metabarcoding sequencing data. Mol Ecol Resour 2023; 23:725-738. [PMID: 36437603 DOI: 10.1111/1755-0998.13741] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 10/30/2022] [Accepted: 11/15/2022] [Indexed: 11/29/2022]
Abstract
The measurement of biodiversity is an integral aspect of life science research. With the establishment of second- and third-generation sequencing technologies, an increasing amount of metabarcoding data is being generated as we seek to describe the extent and patterns of biodiversity in multiple contexts. The reliability and accuracy of taxonomically assigning metabarcoding sequencing data have been shown to be critically influenced by the quality and completeness of reference databases. Custom, curated, eukaryotic reference databases, however, are scarce, as are the software programs for generating them. Here, we present crabs (Creating Reference databases for Amplicon-Based Sequencing), a software package to create custom reference databases for metabarcoding studies. crabs includes tools to download sequences from multiple online repositories (i.e., NCBI, BOLD, EMBL, MitoFish), retrieve amplicon regions through in silico PCR analysis and pairwise global alignments, curate the database through multiple filtering parameters (e.g., dereplication, sequence length, sequence quality, unresolved taxonomy, inclusion/exclusion filter), export the reference database in multiple formats for immediate use in taxonomy assignment software, and investigate the reference database through implemented visualizations for diversity, primer efficiency, reference sequence length, database completeness and taxonomic resolution. crabs is a versatile tool for generating curated reference databases of user-specified genetic markers to aid taxonomy assignment from metabarcoding sequencing data. crabs can be installed via docker and is available for download as a conda package and via GitHub (https://github.com/gjeunen/reference_database_creator).
Collapse
Affiliation(s)
- Gert-Jan Jeunen
- Department of Anatomy, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| | - Eddy Dowle
- Department of Anatomy, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| | - Jonika Edgecombe
- Department of Anatomy, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| | - Ulla von Ammon
- Coastal and Freshwater Group, Cawthron Institute, Nelson, New Zealand
| | - Neil J Gemmell
- Department of Anatomy, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| | - Hugh Cross
- Department of Anatomy, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand.,National Ecological Observatory Network, Boulder, Colorado, USA
| |
Collapse
|
6
|
Bourret A, Nozères C, Parent E, Parent GJ. Maximizing the reliability and the number of species assignments in metabarcoding studies using a curated regional library and a public repository. METABARCODING AND METAGENOMICS 2023. [DOI: 10.3897/mbmg.7.98539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2023] Open
Abstract
Biodiversity assessments relying on DNA have increased rapidly over the last decade. However, the reliability of taxonomic assignments in metabarcoding studies is variable and affected by the reference databases and the assignment methods used. Species level assignments are usually considered as reliable using regional libraries but unreliable using public repositories. In this study, we aimed to test this assumption for metazoan species detected in the Gulf of St. Lawrence in the Northwest Atlantic. We first created a regional library (GSL-rl) by data mining COI barcode sequences from BOLD, and included a reliability ranking system for species assignments. We then estimated 1) the accuracy and precision of the public repository NCBI-nt for species assignments using sequences from the regional library and 2) compared the detection and reliability of species assignments of a metabarcoding dataset using either NCBI-nt or the regional library and popular assignment methods. With NCBI-nt and sequences from the regional library, the BLAST-LCA (least common ancestor) method was the most precise method for species assignments, but the accuracy was higher with the BLAST-TopHit method (>80% over all taxa, between 70% and 90% amongst taxonomic groups). With the metabarcoding dataset, the reliability of species assignments was greater using GSL-rl compared to NCBI-nt. However, we also observed that the total number of reliable species assignments could be maximized using both GSL-rl and NCBI-nt with different optimized assignment methods. The use of a two-step approach for species assignments, i.e., using a regional library and a public repository, could improve the reliability and the number of detected species in metabarcoding studies.
Collapse
|
7
|
Mugnai F, Costantini F, Chenuil A, Leduc M, Gutiérrez Ortega JM, Meglécz E. Be positive: customized reference databases and new, local barcodes balance false taxonomic assignments in metabarcoding studies. PeerJ 2023; 11:e14616. [PMID: 36643652 PMCID: PMC9835706 DOI: 10.7717/peerj.14616] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Accepted: 12/01/2022] [Indexed: 01/11/2023] Open
Abstract
Background In metabarcoding analyses, the taxonomic assignment is crucial to place sequencing data in biological and ecological contexts. This fundamental step depends on a reference database, which should have a good taxonomic coverage to avoid unassigned sequences. However, this goal is rarely achieved in many geographic regions and for several taxonomic groups. On the other hand, more is not necessarily better, as sequences in reference databases belonging to taxonomic groups out of the studied region/environment context might lead to false assignments. Methods We investigated the effect of using several subsets of a cytochrome c oxidase subunit I (COI) reference database on taxonomic assignment. Published metabarcoding sequences from the Mediterranean Sea were assigned to taxa using COInr, which is a comprehensive, non-redundant and recent database of COI sequences obtained both from BOLD and NCBI, and two of its subsets: (i) all sequences except insects (COInr-WO-Insecta), which represent the overwhelming majority of COInr database, but are irrelevant for marine samples, and (ii) all sequences from taxonomic families present in the Mediterranean Sea (COInr-Med). Four different algorithms for taxonomic assignment were employed in parallel to evaluate differences in their output and data consistency. Results The reduction of the database to more specific custom subsets increased the number of unassigned sequences. Nevertheless, since most of them were incorrectly assigned by the less specific databases, this is a positive outcome. Moreover, the taxonomic resolution (the lowest taxonomic level to which a sequence is attributed) of several sequences tended to increase when using customized databases. These findings clearly indicated the need for customized databases adapted to each study. However, the very high proportion of unassigned sequences points to the need to enrich the local database with new barcodes specifically obtained from the studied region and/or taxonomic group. Including novel local barcodes to the COI database proved to be very profitable: by adding only 116 new barcodes sequenced in our laboratory, thus increasing the reference database by only 0.04%, we were able to improve the resolution for ca. 0.6-1% of the Amplicon Sequence Variants (ASVs).
Collapse
Affiliation(s)
- Francesco Mugnai
- Department of Biological, Geological and Environmental Sciences (BiGeA), University of Bologna, Ravenna, Italy
| | - Federica Costantini
- Department of Biological, Geological and Environmental Sciences (BiGeA), University of Bologna, Ravenna, Italy,Consorzio Nazionale Interuniversitario per le Scienze del Mare (CoNISMa), Roma, Italy
| | - Anne Chenuil
- Aix Marseille Univ, Avignon Université, CNRS, IRD, IMBE, Marseille, France
| | | | | | - Emese Meglécz
- Aix Marseille Univ, Avignon Université, CNRS, IRD, IMBE, Marseille, France
| |
Collapse
|
8
|
Gold Z, Shelton AO, Casendino HR, Duprey J, Gallego R, Van Cise A, Fisher M, Jensen AJ, D'Agnese E, Andruszkiewicz Allan E, Ramón-Laca A, Garber-Yonts M, Labare M, Parsons KM, Kelly RP. Signal and noise in metabarcoding data. PLoS One 2023; 18:e0285674. [PMID: 37167310 PMCID: PMC10174484 DOI: 10.1371/journal.pone.0285674] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 04/27/2023] [Indexed: 05/13/2023] Open
Abstract
Metabarcoding is a powerful molecular tool for simultaneously surveying hundreds to thousands of species from a single sample, underpinning microbiome and environmental DNA (eDNA) methods. Deriving quantitative estimates of underlying biological communities from metabarcoding is critical for enhancing the utility of such approaches for health and conservation. Recent work has demonstrated that correcting for amplification biases in genetic metabarcoding data can yield quantitative estimates of template DNA concentrations. However, a major source of uncertainty in metabarcoding data stems from non-detections across technical PCR replicates where one replicate fails to detect a species observed in other replicates. Such non-detections are a special case of variability among technical replicates in metabarcoding data. While many sampling and amplification processes underlie observed variation in metabarcoding data, understanding the causes of non-detections is an important step in distinguishing signal from noise in metabarcoding studies. Here, we use both simulated and empirical data to 1) suggest how non-detections may arise in metabarcoding data, 2) outline steps to recognize uninformative data in practice, and 3) identify the conditions under which amplicon sequence data can reliably detect underlying biological signals. We show with both simulations and empirical data that, for a given species, the rate of non-detections among technical replicates is a function of both the template DNA concentration and species-specific amplification efficiency. Consequently, we conclude metabarcoding datasets are strongly affected by (1) deterministic amplification biases during PCR and (2) stochastic sampling of amplicons during sequencing-both of which we can model-but also by (3) stochastic sampling of rare molecules prior to PCR, which remains a frontier for quantitative metabarcoding. Our results highlight the importance of estimating species-specific amplification efficiencies and critically evaluating patterns of non-detection in metabarcoding datasets to better distinguish environmental signal from the noise inherent in molecular detections of rare targets.
Collapse
Affiliation(s)
- Zachary Gold
- Cooperative Institute for Climate, Ocean, & Ecosystem Studies, UW, Seattle, Washington, United States of America
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Andrew Olaf Shelton
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Helen R Casendino
- School of Marine and Environmental Affairs, UW, Seattle, Washington, United States of America
| | - Joe Duprey
- School of Marine and Environmental Affairs, UW, Seattle, Washington, United States of America
| | - Ramón Gallego
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Amy Van Cise
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Mary Fisher
- School of Aquatic Fisheries Science, UW, Seattle, Washington, United States of America
| | - Alexander J Jensen
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Erin D'Agnese
- School of Marine and Environmental Affairs, UW, Seattle, Washington, United States of America
| | | | - Ana Ramón-Laca
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Maya Garber-Yonts
- School of Marine and Environmental Affairs, UW, Seattle, Washington, United States of America
| | - Michaela Labare
- Scripps Institution of Oceanography, UCSD, La Jolla, California, United States of America
| | - Kim M Parsons
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Ryan P Kelly
- School of Marine and Environmental Affairs, UW, Seattle, Washington, United States of America
| |
Collapse
|
9
|
Yao M, Zhang S, Lu Q, Chen X, Zhang SY, Kong Y, Zhao J. Fishing for fish environmental DNA: Ecological applications, methodological considerations, surveying designs, and ways forward. Mol Ecol 2022; 31:5132-5164. [PMID: 35972241 DOI: 10.1111/mec.16659] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 07/20/2022] [Accepted: 07/27/2022] [Indexed: 12/15/2022]
Abstract
Vast global declines of freshwater and marine fish diversity and population abundance pose serious threats to both ecosystem sustainability and human livelihoods. Environmental DNA (eDNA)-based biomonitoring provides robust, efficient, and cost-effective assessment of species occurrences and population trends in diverse aquatic environments. Thus, it holds great potential for improving conventional surveillance frameworks to facilitate fish conservation and fisheries management. However, the many technical considerations and rapid developments underway in the eDNA arena can overwhelm researchers and practitioners new to the field. Here, we systematically analysed 416 fish eDNA studies to summarize research trends in terms of investigated targets, research aims, and study systems, and reviewed the applications, rationales, methodological considerations, and limitations of eDNA methods with an emphasis on fish and fisheries research. We highlighted how eDNA technology may advance our knowledge of fish behaviour, species distributions, population genetics, community structures, and ecological interactions. We also synthesized the current knowledge of several important methodological concerns, including the qualitative and quantitative power eDNA has to recover fish biodiversity and abundance, and the spatial and temporal representations of eDNA with respect to its sources. To facilitate ecological applications implementing fish eDNA techniques, recent literature was summarized to generate guidelines for effective sampling in lentic, lotic, and marine habitats. Finally, we identified current gaps and limitations, and pointed out newly emerging research avenues for fish eDNA. As methodological optimization and standardization improve, eDNA technology should revolutionize fish monitoring and promote biodiversity conservation and fisheries management that transcends geographic and temporal boundaries.
Collapse
Affiliation(s)
- Meng Yao
- Institute of Ecology, College of Urban and Environmental Sciences, Peking University, Beijing, China.,School of Life Sciences, Peking University, Beijing, China
| | - Shan Zhang
- Institute of Ecology, College of Urban and Environmental Sciences, Peking University, Beijing, China.,School of Life Sciences, Peking University, Beijing, China
| | - Qi Lu
- Institute of Ecology, College of Urban and Environmental Sciences, Peking University, Beijing, China.,School of Life Sciences, Peking University, Beijing, China
| | - Xiaoyu Chen
- Institute of Ecology, College of Urban and Environmental Sciences, Peking University, Beijing, China.,School of Life Sciences, Peking University, Beijing, China
| | - Si-Yu Zhang
- Institute of Ecology, College of Urban and Environmental Sciences, Peking University, Beijing, China.,School of Life Sciences, Peking University, Beijing, China
| | - Yueqiao Kong
- Institute of Ecology, College of Urban and Environmental Sciences, Peking University, Beijing, China.,School of Life Sciences, Peking University, Beijing, China
| | - Jindong Zhao
- Institute of Ecology, College of Urban and Environmental Sciences, Peking University, Beijing, China.,School of Life Sciences, Peking University, Beijing, China
| |
Collapse
|
10
|
Czech L, Stamatakis A, Dunthorn M, Barbera P. Metagenomic Analysis Using Phylogenetic Placement-A Review of the First Decade. FRONTIERS IN BIOINFORMATICS 2022; 2:871393. [PMID: 36304302 PMCID: PMC9580882 DOI: 10.3389/fbinf.2022.871393] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Accepted: 04/11/2022] [Indexed: 12/20/2022] Open
Abstract
Phylogenetic placement refers to a family of tools and methods to analyze, visualize, and interpret the tsunami of metagenomic sequencing data generated by high-throughput sequencing. Compared to alternative (e. g., similarity-based) methods, it puts metabarcoding sequences into a phylogenetic context using a set of known reference sequences and taking evolutionary history into account. Thereby, one can increase the accuracy of metagenomic surveys and eliminate the requirement for having exact or close matches with existing sequence databases. Phylogenetic placement constitutes a valuable analysis tool per se, but also entails a plethora of downstream tools to interpret its results. A common use case is to analyze species communities obtained from metagenomic sequencing, for example via taxonomic assignment, diversity quantification, sample comparison, and identification of correlations with environmental variables. In this review, we provide an overview over the methods developed during the first 10 years. In particular, the goals of this review are 1) to motivate the usage of phylogenetic placement and illustrate some of its use cases, 2) to outline the full workflow, from raw sequences to publishable figures, including best practices, 3) to introduce the most common tools and methods and their capabilities, 4) to point out common placement pitfalls and misconceptions, 5) to showcase typical placement-based analyses, and how they can help to analyze, visualize, and interpret phylogenetic placement data.
Collapse
Affiliation(s)
- Lucas Czech
- Department of Plant Biology, Carnegie Institution for Science, Stanford, CA, United States
| | - Alexandros Stamatakis
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
- Institute for Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Micah Dunthorn
- Natural History Museum, University of Oslo, Oslo, Norway
| | | |
Collapse
|