1
|
Ramesh S, Rapp S, Tapias Gomez J, Levine B, Tapias-Gomez D, Chung D, Truong Z. Reference Sequence Browser: An R application with a user-friendly GUI to rapidly query sequence databases. PLoS One 2024; 19:e0309707. [PMID: 39480818 PMCID: PMC11527216 DOI: 10.1371/journal.pone.0309707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Accepted: 08/16/2024] [Indexed: 11/02/2024] Open
Abstract
Land managers, researchers, and regulators increasingly utilize environmental DNA (eDNA) techniques to monitor species richness, presence, and absence. In order to properly develop a biological assay for eDNA metabarcoding or quantitative PCR, scientists must be able to find not only reference sequences (previously identified sequences in a genomics database) that match their target taxa but also reference sequences that match non-target taxa. Determining which taxa have publicly available sequences in a time-efficient and accurate manner currently requires computational skills to search, manipulate, and parse multiple unconnected DNA sequence databases. Our team iteratively designed a Graphic User Interface (GUI) Shiny application called the Reference Sequence Browser (RSB) that provides users efficient and intuitive access to multiple genetic databases regardless of computer programming expertise. The application returns the number of publicly accessible barcode markers per organism in the NCBI Nucleotide, BOLD, or CALeDNA CRUX Metabarcoding Reference Databases. Depending on the database, we offer various search filters such as min and max sequence length or country of origin. Users can then download the FASTA/GenBank files from the RSB web tool, view statistics about the data, and explore results to determine details about the availability or absence of reference sequences.
Collapse
Affiliation(s)
- Sriram Ramesh
- Department of Computer Science and Engineering, University of California Santa Cruz, Santa Cruz, CA, United States of America
| | - Samuel Rapp
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA, United States of America
| | - Jorge Tapias Gomez
- Department of Computing and Information Science, Cornell University, Ithaca, NY, United States of America
| | - Benjamin Levine
- Department Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, United States of America
| | - Daniel Tapias-Gomez
- Department of Cell and Molecular Biology, University of Texas Southwestern, Dallas, TX, United States of America
| | - Dickson Chung
- Department of Computer Science and Engineering, University of California Santa Cruz, Santa Cruz, CA, United States of America
| | - Zia Truong
- Department Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, United States of America
| |
Collapse
|
2
|
Pipes L, Nielsen R. A rapid phylogeny-based method for accurate community profiling of large-scale metabarcoding datasets. eLife 2024; 13:e85794. [PMID: 39145536 PMCID: PMC11377034 DOI: 10.7554/elife.85794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 08/14/2024] [Indexed: 08/16/2024] Open
Abstract
Environmental DNA (eDNA) is becoming an increasingly important tool in diverse scientific fields from ecological biomonitoring to wastewater surveillance of viruses. The fundamental challenge in eDNA analyses has been the bioinformatical assignment of reads to taxonomic groups. It has long been known that full probabilistic methods for phylogenetic assignment are preferable, but unfortunately, such methods are computationally intensive and are typically inapplicable to modern next-generation sequencing data. We present a fast approximate likelihood method for phylogenetic assignment of DNA sequences. Applying the new method to several mock communities and simulated datasets, we show that it identifies more reads at both high and low taxonomic levels more accurately than other leading methods. The advantage of the method is particularly apparent in the presence of polymorphisms and/or sequencing errors and when the true species is not represented in the reference database.
Collapse
Affiliation(s)
- Lenore Pipes
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| | - Rasmus Nielsen
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
- GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
3
|
Saenz‐Agudelo P, Ramirez P, Beldade R, Campoy AN, Garmendia V, Search FV, Fernández M, Wieters EA, Navarrete SA, Landaeta MF, Pérez‐Matus A. Environmental DNA reveals temporal variation in mesophotic reefs of the Humboldt upwelling ecosystems of central Chile: Toward a baseline for biodiversity monitoring of unexplored marine habitats. Ecol Evol 2024; 14:e10999. [PMID: 38390005 PMCID: PMC10881902 DOI: 10.1002/ece3.10999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 01/12/2024] [Accepted: 01/26/2024] [Indexed: 02/24/2024] Open
Abstract
Temperate mesophotic reef ecosystems (TMREs) are among the least known marine habitats. Information on their diversity and ecology is geographically and temporally scarce, especially in highly productive large upwelling ecosystems. Lack of information remains an obstacle to understanding the importance of TMREs as habitats, biodiversity reservoirs and their connections with better-studied shallow reefs. Here, we use environmental DNA (eDNA) from water samples to characterize the community composition of TMREs on the central Chilean coast, generating the first baseline for monitoring the biodiversity of these habitats. We analyzed samples from two depths (30 and 60 m) over four seasons (spring, summer, autumn, and winter) and at two locations approximately 16 km apart. We used a panel of three metabarcodes, two that target all eukaryotes (18S rRNA and mitochondrial COI) and one specifically targeting fishes (16S rRNA). All panels combined encompassed eDNA assigned to 42 phyla, 90 classes, 237 orders, and 402 families. The highest family richness was found for the phyla Arthropoda, Bacillariophyta, and Chordata. Overall, family richness was similar between depths but decreased during summer, a pattern consistent at both locations. Our results indicate that the structure (composition) of the mesophotic communities varied predominantly with seasons. We analyzed further the better-resolved fish assemblage and compared eDNA with other visual methods at the same locations and depths. We recovered eDNA from 19 genera of fish, six of these have also been observed on towed underwater videos, while 13 were unique to eDNA. We discuss the potential drivers of seasonal differences in community composition and richness. Our results suggest that eDNA can provide valuable insights for monitoring TMRE communities but highlight the necessity of completing reference DNA databases available for this region.
Collapse
Affiliation(s)
- Pablo Saenz‐Agudelo
- Instituto de Ciencias Ambientales y Evolutivas, Universidad Austral de ChileValdiviaChile
- Millennium Nucleus for Ecology and Conservation of Temperate Marine Ecosystems, NUTMELas CrucesChile
| | - Paula Ramirez
- Instituto de Ciencias Ambientales y Evolutivas, Universidad Austral de ChileValdiviaChile
- Millennium Nucleus for Ecology and Conservation of Temperate Marine Ecosystems, NUTMELas CrucesChile
| | - Ricardo Beldade
- Millennium Nucleus for Ecology and Conservation of Temperate Marine Ecosystems, NUTMELas CrucesChile
- Estación Costera de Investigaciones MarinasPontificia Universidad CatólicaLas CrucesChile
| | - Ana N. Campoy
- Center of Marine Sciences (CCMAR‐CIMAR)University of the AlgarveFaroPortugal
| | - Vladimir Garmendia
- Millennium Nucleus for Ecology and Conservation of Temperate Marine Ecosystems, NUTMELas CrucesChile
- Estación Costera de Investigaciones MarinasPontificia Universidad CatólicaLas CrucesChile
| | - Francesca V. Search
- Millennium Nucleus for Ecology and Conservation of Temperate Marine Ecosystems, NUTMELas CrucesChile
| | - Miriam Fernández
- Millennium Nucleus for Ecology and Conservation of Temperate Marine Ecosystems, NUTMELas CrucesChile
- Estación Costera de Investigaciones MarinasPontificia Universidad CatólicaLas CrucesChile
| | - Evie A. Wieters
- Millennium Nucleus for Ecology and Conservation of Temperate Marine Ecosystems, NUTMELas CrucesChile
- Estación Costera de Investigaciones MarinasPontificia Universidad CatólicaLas CrucesChile
| | - Sergio A. Navarrete
- Millennium Nucleus for Ecology and Conservation of Temperate Marine Ecosystems, NUTMELas CrucesChile
- Estación Costera de Investigaciones MarinasPontificia Universidad CatólicaLas CrucesChile
- Center for Applied Ecology and Sustainability (CAPES) and Coastal Socio‐Ecological Millennium Institute (SECOS)Pontificia Universidad Católica de ChileSantiagoChile
- Center for Oceanographic Research COASTAL‐COASTALUniversidad de ConcepciónConcepciónChile
| | - Mauricio F. Landaeta
- Millennium Nucleus for Ecology and Conservation of Temperate Marine Ecosystems, NUTMELas CrucesChile
- Laboratorio de Ictiología e Interacciones Biofísicas (LABITI)Instituto de Biología, Facultad de Ciencias, Universidad de ValparaísoValparaísoChile
| | - Alejandro Pérez‐Matus
- Millennium Nucleus for Ecology and Conservation of Temperate Marine Ecosystems, NUTMELas CrucesChile
- Estación Costera de Investigaciones MarinasPontificia Universidad CatólicaLas CrucesChile
| |
Collapse
|
4
|
Quaresma A, Ankenbrand MJ, Garcia CAY, Rufino J, Honrado M, Amaral J, Brodschneider R, Brusbardis V, Gratzer K, Hatjina F, Kilpinen O, Pietropaoli M, Roessink I, van der Steen J, Vejsnæs F, Pinto MA, Keller A. Semi-automated sequence curation for reliable reference datasets in ITS2 vascular plant DNA (meta-)barcoding. Sci Data 2024; 11:129. [PMID: 38272945 PMCID: PMC10810873 DOI: 10.1038/s41597-024-02962-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 01/12/2024] [Indexed: 01/27/2024] Open
Abstract
One of the most critical steps for accurate taxonomic identification in DNA (meta)-barcoding is to have an accurate DNA reference sequence dataset for the marker of choice. Therefore, developing such a dataset has been a long-term ambition, especially in the Viridiplantae kingdom. Typically, reference datasets are constructed with sequences downloaded from general public databases, which can carry taxonomic and other relevant errors. Herein, we constructed a curated (i) global dataset, (ii) European crop dataset, and (iii) 27 datasets for the EU countries for the ITS2 barcoding marker of vascular plants. To that end, we first developed a pipeline script that entails (i) an automated curation stage comprising five filters, (ii) manual taxonomic correction for misclassified taxa, and (iii) manual addition of newly sequenced species. The pipeline allows easy updating of the curated datasets. With this approach, 13% of the sequences, corresponding to 7% of species originally imported from GenBank, were discarded. Further, 259 sequences were manually added to the curated global dataset, which now comprises 307,977 sequences of 111,382 plant species.
Collapse
Affiliation(s)
- Andreia Quaresma
- Centro de Investigação de Montanha (CIMO), Instituto Politécnico de Bragança, Campus de Santa Apolónia, 5300-253, Bragança, Portugal
- Laboratório Associado para a Sustentabilidade e Tecnologia em Regiões de Montanha (SusTEC), Instituto Politécnico de Bragança, Campus de Santa Apolónia, 5300-253, Bragança, Portugal
- Departamento de Biologia, Faculdade de Ciências da Universidade do Porto, Rua do Campo Alegre, S/N, Edifício FC4, 4169-007, Porto, Portugal
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, 4485-661, Vairão, Vila do Conde, Portugal
- BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, 4485-661, Vairão, Vila do Conde, Portugal
| | - Markus J Ankenbrand
- Center for Computational and Theoretical Biology, Faculty of Biology, Julius-Maximilians-Universität Würzburg, Klara-Oppenheimer-Weg 32, 97074, Würzburg, Germany
| | - Carlos Ariel Yadró Garcia
- Centro de Investigação de Montanha (CIMO), Instituto Politécnico de Bragança, Campus de Santa Apolónia, 5300-253, Bragança, Portugal
- Laboratório Associado para a Sustentabilidade e Tecnologia em Regiões de Montanha (SusTEC), Instituto Politécnico de Bragança, Campus de Santa Apolónia, 5300-253, Bragança, Portugal
| | - José Rufino
- Laboratório Associado para a Sustentabilidade e Tecnologia em Regiões de Montanha (SusTEC), Instituto Politécnico de Bragança, Campus de Santa Apolónia, 5300-253, Bragança, Portugal
- Research Centre in Digitalization and Intelligent Robotics (CeDRI), Instituto Politécnico de Bragança, Bragança, Portugal
| | - Mónica Honrado
- Centro de Investigação de Montanha (CIMO), Instituto Politécnico de Bragança, Campus de Santa Apolónia, 5300-253, Bragança, Portugal
- Laboratório Associado para a Sustentabilidade e Tecnologia em Regiões de Montanha (SusTEC), Instituto Politécnico de Bragança, Campus de Santa Apolónia, 5300-253, Bragança, Portugal
| | - Joana Amaral
- Centro de Investigação de Montanha (CIMO), Instituto Politécnico de Bragança, Campus de Santa Apolónia, 5300-253, Bragança, Portugal
- Laboratório Associado para a Sustentabilidade e Tecnologia em Regiões de Montanha (SusTEC), Instituto Politécnico de Bragança, Campus de Santa Apolónia, 5300-253, Bragança, Portugal
| | - Robert Brodschneider
- Institute of Biology, University of Graz, Universitätsplatz 2, 8010, Graz, Austria
| | - Valters Brusbardis
- Latvian Beekeepers' Association (LBA), Rigas iela 22, LV-3004, Jelgava, Latvia
| | - Kristina Gratzer
- Institute of Biology, University of Graz, Universitätsplatz 2, 8010, Graz, Austria
| | - Fani Hatjina
- Ellinikos Georgikos Organismos DIMITRA (ELGO- DIMITRA), Kourtidou 56-58, GR-11145, Athina, Greece
| | - Ole Kilpinen
- Danish Beekeepers Association (DBF), Fulbyvej 15, DK-4180, Sorø, Denmark
| | - Marco Pietropaoli
- Istituto Zooprofilattico Sperimentale del Lazio e della Toscana "M. Aleandri" (IZSLT), Via Appia Nuova 1411, IT-00178, Roma, Italy
| | - Ivo Roessink
- Wageningen Environmental Research, WageningenUniversity&Research, Droevendaalsesteeg 3, 6700 AA, Wageningen, Netherlands
| | | | - Flemming Vejsnæs
- Danish Beekeepers Association (DBF), Fulbyvej 15, DK-4180, Sorø, Denmark
| | - M Alice Pinto
- Centro de Investigação de Montanha (CIMO), Instituto Politécnico de Bragança, Campus de Santa Apolónia, 5300-253, Bragança, Portugal
- Laboratório Associado para a Sustentabilidade e Tecnologia em Regiões de Montanha (SusTEC), Instituto Politécnico de Bragança, Campus de Santa Apolónia, 5300-253, Bragança, Portugal
| | - Alexander Keller
- Cellular and Organismic Interactions, Biocenter, Faculty of Biology, Ludwig-Maximilians-Universität München, Großhaderner Str. 2-4, 82152, Planegg-Martinsried, Germany.
| |
Collapse
|
5
|
Curd EE, Gal L, Gallego R, Silliman K, Nielsen S, Gold Z. rCRUX: A Rapid and Versatile Tool for Generating Metabarcoding Reference libraries in R. ENVIRONMENTAL DNA (HOBOKEN, N.J.) 2024; 6:e489. [PMID: 38370872 PMCID: PMC10871694 DOI: 10.1002/edn3.489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 10/19/2023] [Indexed: 02/20/2024]
Abstract
The sequencing revolution requires accurate taxonomic classification of DNA sequences. Key to making accurate taxonomic assignments are curated, comprehensive reference barcode databases. However, the generation and curation of such databases has remained challenging given the large and continuously growing volumes of both DNA sequence data and novel reference barcode targets. Monitoring and research applications require a greater diversity of specialized gene regions and targeted taxa then are currently curated by professional staff. Thus there is a growing need for an easy to implement computational tool that can generate comprehensive metabarcoding reference libraries for any bespoke locus. We address this need by reimagining CRUX from the Anacapa Toolkit and present the rCRUX package in R which, like it's predecessor, relies on sequence homology and PCR primer compatibility instead of keyword-searches to avoid limitations of user-defined metadata. The typical workflow involves searching for plausible seed amplicons (get_seeds_local() or get_seeds_remote()) by simulating in silico PCR to acquire a set of sequences analogous to PCR products containing a user-defined set of primer sequences. Next, these seeds are used to iteratively blast search seed sequences against a local copy of the National Center for Biotechnology Information (NCBI) formatted nt database using a taxonomic-rank based stratified random sampling approach ( blast_seeds() ). This results in a comprehensive set of sequence matches. This database is dereplicated and cleaned (derep_and_clean_db()) by identifying identical reference sequences and collapsing the taxonomic path to the lowest taxonomic agreement across all matching reads. This results in a curated, comprehensive database of primer-specific reference barcode sequences from NCBI. Databases can then be compared (compare_db()) to determine read and taxonomic overlap. We demonstrate that rCRUX provides more comprehensive reference databases for the MiFish Universal Teleost 12S, Taberlet trnl, fungal ITS, and Leray CO1 loci than CRABS, MetaCurator, RESCRIPt, and ecoPCR reference databases. We then further demonstrate the utility of rCRUX by generating 24 reference databases for 20 metabarcoding loci, many of which lack dedicated reference database curation efforts. The rCRUX package provides a simple to use tool for the generation of curated, comprehensive reference databases for user-defined loci, facilitating accurate and effective taxonomic classification of metabarcoding and DNA sequence efforts broadly.
Collapse
Affiliation(s)
- Emily E. Curd
- Vermont Biomedical Research Network, University of Vermont, VT, USA
| | - Luna Gal
- Landmark College, VT, USA
- California Cooperative Oceanic Fisheries Investigations (CalCOFI), Scripps Institution of Oceanography, University of California San Diego (UCSD), La Jolla, CA, USA
| | - Ramon Gallego
- Departamento de Biología, Universidad Autónoma de Madrid, Cantoblanco, Madrid, Spain
| | - Katherine Silliman
- Northern Gulf Institute, Mississippi State University, Starkville, MS, USA
- NOAA Atlantic Oceanographic and Meteorological Laboratory, Miami, FL, USA
| | | | - Zachary Gold
- California Cooperative Oceanic Fisheries Investigations (CalCOFI), Scripps Institution of Oceanography, University of California San Diego (UCSD), La Jolla, CA, USA
- NOAA Pacific Marine Environmental Laboratory, Seattle, WA, USA
| |
Collapse
|
6
|
Gwiazdowski R. Principles for Constructing DNA Barcode Reference Libraries. Methods Mol Biol 2024; 2744:491-502. [PMID: 38683337 DOI: 10.1007/978-1-0716-3581-0_29] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/01/2024]
Abstract
All DNA barcode methods rely on reference sequences linked to well-curated voucher specimens. Definitions for and locations of DNA barcode reference libraries are not standardized, and vary throughout the literature. Standardizing, and centralizing reference specimens would provide an unambiguous source, analogous to reference genomes, to reproduce identifications and improve a library. This chapter proposes a working definition of a DNA barcode reference library, consistent with DNA barcode data standards, along with principles and methods to consider when producing or using such a library. These methods allow explicit traceback to sequence-sources which elevate the value of voucher specimens, and create a potential for community curation.
Collapse
Affiliation(s)
- Rodger Gwiazdowski
- Department of Environmental Conservation, University of Massachusetts, Amherst, MA, USA.
| |
Collapse
|
7
|
Duprey J, Gallego R, Klinger T, Kelly RP. Environmental DNA reveals patterns of biological invasion in an inland sea. PLoS One 2023; 18:e0281525. [PMID: 38150426 PMCID: PMC10752502 DOI: 10.1371/journal.pone.0281525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 11/28/2023] [Indexed: 12/29/2023] Open
Abstract
Non-native species have the potential to cause ecological and economic harm to coastal and estuarine ecosystems. Understanding which habitat types are most vulnerable to biological invasions, where invasions originate, and the vectors by which they arrive can help direct limited resources to prevent or mitigate ecological and socio-economic harm. Information about the occurrence of non-native species can help guide interventions at all stages of invasion, from first introduction, to naturalization and invasion. However, monitoring at relevant scales requires considerable investment of time, resources, and taxonomic expertise. Environmental DNA (eDNA) metabarcoding methods sample coastal ecosystems at broad spatial and temporal scales to augment established monitoring methods. We use COI mtDNA eDNA sampling to survey a diverse assemblage of species across distinct habitats in the Salish Sea in Washington State, USA, and classify each as non-native, native, or indeterminate in origin. The non-native species detected include both well-documented invaders and species not previously reported within the Salish Sea. We find a non-native assemblage dominated by shellfish and algae with native ranges in the temperate western Pacific, and find more-retentive estuarine habitats to be invaded at far higher levels than better-flushed rocky shores. Furthermore, we find an increase in invasion level with higher water temperatures in spring and summer across habitat types. This analysis contributes to a growing understanding of the biotic and abiotic factors that influence invasion level, and underscores the utility of eDNA surveys to monitor biological invasions and to better understand the factors that drive these invasions.
Collapse
Affiliation(s)
- Joe Duprey
- School of Marine and Environmental Affairs, University of Washington, Seattle, WA, United States of America
| | - Ramón Gallego
- School of Marine and Environmental Affairs, University of Washington, Seattle, WA, United States of America
- Universidad Autónoma de Madrid—Unidad de Genética, Madrid, Spain
| | - Terrie Klinger
- School of Marine and Environmental Affairs, University of Washington, Seattle, WA, United States of America
| | - Ryan P. Kelly
- School of Marine and Environmental Affairs, University of Washington, Seattle, WA, United States of America
| |
Collapse
|
8
|
Harning DJ, Sacco S, Anamthawat-Jónsson K, Ardenghi N, Thordarson T, Raberg JH, Sepúlveda J, Geirsdóttir Á, Shapiro B, Miller GH. Delayed postglacial colonization of Betula in Iceland and the circum North Atlantic. eLife 2023; 12:RP87749. [PMID: 37955570 PMCID: PMC10642962 DOI: 10.7554/elife.87749] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2023] Open
Abstract
As the Arctic continues to warm, woody shrubs are expected to expand northward. This process, known as 'shrubification,' has important implications for regional biodiversity, food web structure, and high-latitude temperature amplification. While the future rate of shrubification remains poorly constrained, past records of plant immigration to newly deglaciated landscapes in the Arctic may serve as useful analogs. We provide one new postglacial Holocene sedimentary ancient DNA (sedaDNA) record of vascular plants from Iceland and place a second Iceland postglacial sedaDNA record on an improved geochronology; both show Salicaceae present shortly after deglaciation, whereas Betulaceae first appears more than 1000 y later. We find a similar pattern of delayed Betulaceae colonization in eight previously published postglacial sedaDNA records from across the glaciated circum North Atlantic. In nearly all cases, we find that Salicaceae colonizes earlier than Betulaceae and that Betulaceae colonization is increasingly delayed for locations farther from glacial-age woody plant refugia. These trends in Salicaceae and Betulaceae colonization are consistent with the plant families' environmental tolerances, species diversity, reproductive strategies, seed sizes, and soil preferences. As these reconstructions capture the efficiency of postglacial vascular plant migration during a past period of high-latitude warming, a similarly slow response of some woody shrubs to current warming in glaciated regions, and possibly non-glaciated tundra, may delay Arctic shrubification and future changes in the structure of tundra ecosystems and temperature amplification.
Collapse
Affiliation(s)
- David J Harning
- Institute of Arctic and Alpine Research, University of Colorado BoulderBoulderUnited States
| | - Samuel Sacco
- Department of Ecology and Evolutionary Biology, University of California Santa CruzSanta CruzUnited States
| | | | - Nicolò Ardenghi
- Institute of Arctic and Alpine Research, University of Colorado BoulderBoulderUnited States
| | - Thor Thordarson
- Faculty of Earth Sciences, University of IcelandReykjavikIceland
| | - Jonathan H Raberg
- Institute of Arctic and Alpine Research, University of Colorado BoulderBoulderUnited States
| | - Julio Sepúlveda
- Institute of Arctic and Alpine Research, University of Colorado BoulderBoulderUnited States
- Department of Geological Sciences, University of Colorado BoulderBoulderUnited States
| | | | - Beth Shapiro
- Department of Ecology and Evolutionary Biology, University of California Santa CruzSanta CruzUnited States
| | - Gifford H Miller
- Institute of Arctic and Alpine Research, University of Colorado BoulderBoulderUnited States
- Department of Geological Sciences, University of Colorado BoulderBoulderUnited States
| |
Collapse
|
9
|
Waters T, Gold Z, Obaza A, Ambrose RF, Eagle RA. Environmental DNA metabarcoding reveals distinct fish assemblages supported by seagrass (Zostera marina and Zostera pacifica) beds in different geographic settings in Southern California. PLoS One 2023; 18:e0286228. [PMID: 37796915 PMCID: PMC10553302 DOI: 10.1371/journal.pone.0286228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Accepted: 09/18/2023] [Indexed: 10/07/2023] Open
Abstract
Seagrass beds are disappearing at a record pace despite their known value to our oceans and coastal communities. Simultaneously, our coastlines are under the constant pressure of climate change which is impacting their chemical, physical and biological characteristics. It is thus pertinent to evaluate and record habitat use so we can understand how these different environments contribute to local biodiversity. This study evaluates the assemblages of fish found at five Zostera beds in Southern California using environmental DNA (eDNA) metabarcoding. eDNA is a powerful biodiversity monitoring tool that offers key advantages to conventional monitoring. Results from our eDNA study found 78 species of fish that inhabit these five beds around Southern California representing embayment, open coastal mainland and open coastal island settings. While each bed had the same average number of species found throughout the year, the composition of these fish assemblages was strongly site dependent. There were 35 fish that were found at both open coast and embayment seagrass beds, while embayment seagrass sites had 20 unique fish and open coast sites had 23 unique fish. These results demonstrate that seagrass fish assemblages are heterogenous based on their geographic positioning and that marine managers must take this into account for holistic conservation and restoration efforts.
Collapse
Affiliation(s)
- Tanner Waters
- Institute of the Environment and Sustainability, University of California, Los Angeles, CA, United States of America
- Center for Diverse Leadership in Science, University of California, Los Angeles, CA, United States of America
| | - Zachary Gold
- NOAA Pacific Marine Environmental Laboratory, Seattle, WA, United States of America
| | - Adam Obaza
- Paua Marine Research Group, Long Beach, CA, United States of America
| | - Richard F. Ambrose
- Institute of the Environment and Sustainability, University of California, Los Angeles, CA, United States of America
- Department of Environmental Health Sciences, Jonathan and Karen Fielding School of Public Health, University of California, Los Angeles, CA, United States of America
| | - Robert A. Eagle
- Institute of the Environment and Sustainability, University of California, Los Angeles, CA, United States of America
- Center for Diverse Leadership in Science, University of California, Los Angeles, CA, United States of America
- Atmospheric and Oceanic Sciences Department, University of California, Los Angeles, CA, United States of America
| |
Collapse
|
10
|
Curd EE, Gal L, Gallego R, Nielsen S, Gold Z. rCRUX: A Rapid and Versatile Tool for Generating Metabarcoding Reference libraries in R. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.31.543005. [PMID: 37397980 PMCID: PMC10312559 DOI: 10.1101/2023.05.31.543005] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Key to making accurate taxonomic assignments are curated, comprehensive reference barcode databases. However, the generation and curation of such databases has remained challenging given the large and continuously growing volumes of DNA sequence data and novel reference barcode targets. Monitoring and research applications require a greater diversity of specialized gene regions and targeted taxa to meet taxonomic classification goals then are currently curated by professional staff. Thus, there is a growing need for an easy to implement tool that can generate comprehensive metabarcoding reference libraries for any bespoke locus. We address this need by reimagining CRUX from the Anacapa Toolkit and present the rCRUX package in R. The typical workflow involves searching for plausible seed amplicons (get_seeds_local() or get_seeds_remote()) by simulating in silico PCR to acquire seed sequences containing a user-defined primer set. Next these seeds are used to iteratively blast search seed sequences against a local NCBI formatted database using a taxonomic rank based stratified random sampling approach (blast_seeds()) that results in a comprehensive set of sequence matches. This database is dereplicated and cleaned (derep_and_clean_db()) by identifying identical reference sequences and collapsing the taxonomic path to the lowest taxonomic agreement across all matching reads. This results in a curated, comprehensive database of primer specific reference barcode sequences from NCBI. We demonstrate that rCRUX provides more comprehensive reference databases for the MiFish Universal Teleost 12S, Taberlet trnl, and fungal ITS locus than CRABS, METACURATOR, RESCRIPt, and ECOPCR reference databases. We then further demonstrate the utility of rCRUX by generating 16 reference databases for metabarcoding loci that lack dedicated reference database curation efforts. The rCRUX package provides a simple to use tool for the generation of curated, comprehensive reference databases for user-defined loci, facilitating accurate and effective taxonomic classification of metabarcoding and DNA sequence efforts broadly.
Collapse
Affiliation(s)
- Emily E. Curd
- Vermont Biomedical Research Network, University of Vermont, VT, USA
| | - Luna Gal
- Landmark College, VT, USA
- California Cooperative Oceanic Fisheries Investigations (CalCOFI), Scripps Institution of Oceanography, University of California San Diego (UCSD), La Jolla, CA, USA
| | - Ramon Gallego
- Universidad Autónoma de Madrid - Unidad de Genética, Spain
| | | | - Zachary Gold
- California Cooperative Oceanic Fisheries Investigations (CalCOFI), Scripps Institution of Oceanography, University of California San Diego (UCSD), La Jolla, CA, USA
- NOAA Pacific Marine Environmental Laboratory, Seattle, WA, USA
| |
Collapse
|
11
|
Wilson Rankin EE, Knowlton JL, Shmerling AJ, Hoey-Chamberlain R. Diets of two non-native praying mantids (Tenodera sinensis and Mantis religiosa) show consumption of arthropods across all ecological roles. FOOD WEBS 2023. [DOI: 10.1016/j.fooweb.2023.e00280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/03/2023]
|
12
|
Mejia MP, Rojas CA, Curd E, Renshaw MA, Edalati K, Shih B, Vincent N, Lin M, Nguyen PH, Wayne R, Jessup K, Parker SS. Soil Microbial Community Composition and Tolerance to Contaminants in an Urban Brownfield Site. MICROBIAL ECOLOGY 2023; 85:998-1012. [PMID: 35802172 PMCID: PMC10156844 DOI: 10.1007/s00248-022-02061-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Accepted: 06/21/2022] [Indexed: 05/04/2023]
Abstract
Brownfields are unused sites that contain hazardous substances due to previous commercial or industrial use. The sites are inhospitable for many organisms, but some fungi and microbes can tolerate and thrive in the nutrient-depleted and contaminated soils. However, few studies have characterized the impacts of long-term contamination on soil microbiome composition and diversity at brownfields. This study focuses on an urban brownfield-a former rail yard in Los Angeles that is contaminated with heavy metals, volatile organic compounds, and petroleum-derived pollutants. We anticipate that heavy metals and organic pollutants will shape soil microbiome diversity and that several candidate fungi and bacteria will be tolerant to the contaminants. We sequence three gene markers (16S ribosomal RNA, 18S ribosomal RNA, and the fungal internal transcribed spacer (FITS)) in 55 soil samples collected at five depths to (1) profile the composition of the soil microbiome across depths; (2) determine the extent to which hazardous chemicals predict microbiome variation; and (3) identify microbial taxonomic groups that may metabolize these contaminants. Detected contaminants in the samples included heavy metals, petroleum hydrocarbons, polycyclic aromatic hydrocarbons, and volatile organic compounds. Bacterial, eukaryotic, and fungal communities all varied with depth and with concentrations of arsenic, chromium, cobalt, and lead. 18S rRNA microbiome richness and fungal richness were positively correlated with lead and cobalt levels, respectively. Furthermore, bacterial Paenibacillus and Iamia, eukaryotic Actinochloris, and fungal Alternaria were enriched in contaminated soils compared to uncontaminated soils and represent taxa of interest for future bioremediation research. Based on our results, we recommend incorporating DNA-based multi-marker microbial community profiling at multiple sites and depths in brownfield site assessment standard methods and restoration.
Collapse
Affiliation(s)
- Maura Palacios Mejia
- Ecology & Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, USA.
| | - Connie A Rojas
- Ecology, Evolution, and Behavior Program, Michigan State University, Lansing, MI, USA
| | - Emily Curd
- Natural Science, Landmark College, Putney, VT, USA
| | - Mark A Renshaw
- Cherokee Federal, USGS Wetland and Aquatic Research Center, Gainesville, FL, USA
| | - Kiumars Edalati
- Ecology & Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Beverly Shih
- Ecology & Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Nitin Vincent
- Ecology & Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Meixi Lin
- Ecology & Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Peggy H Nguyen
- Institute of the Environment and Sustainability, University of California, Los Angeles, Los Angeles, CA, USA
| | - Robert Wayne
- Ecology & Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, USA
| | | | | |
Collapse
|
13
|
Gold Z, Shelton AO, Casendino HR, Duprey J, Gallego R, Van Cise A, Fisher M, Jensen AJ, D'Agnese E, Andruszkiewicz Allan E, Ramón-Laca A, Garber-Yonts M, Labare M, Parsons KM, Kelly RP. Signal and noise in metabarcoding data. PLoS One 2023; 18:e0285674. [PMID: 37167310 PMCID: PMC10174484 DOI: 10.1371/journal.pone.0285674] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 04/27/2023] [Indexed: 05/13/2023] Open
Abstract
Metabarcoding is a powerful molecular tool for simultaneously surveying hundreds to thousands of species from a single sample, underpinning microbiome and environmental DNA (eDNA) methods. Deriving quantitative estimates of underlying biological communities from metabarcoding is critical for enhancing the utility of such approaches for health and conservation. Recent work has demonstrated that correcting for amplification biases in genetic metabarcoding data can yield quantitative estimates of template DNA concentrations. However, a major source of uncertainty in metabarcoding data stems from non-detections across technical PCR replicates where one replicate fails to detect a species observed in other replicates. Such non-detections are a special case of variability among technical replicates in metabarcoding data. While many sampling and amplification processes underlie observed variation in metabarcoding data, understanding the causes of non-detections is an important step in distinguishing signal from noise in metabarcoding studies. Here, we use both simulated and empirical data to 1) suggest how non-detections may arise in metabarcoding data, 2) outline steps to recognize uninformative data in practice, and 3) identify the conditions under which amplicon sequence data can reliably detect underlying biological signals. We show with both simulations and empirical data that, for a given species, the rate of non-detections among technical replicates is a function of both the template DNA concentration and species-specific amplification efficiency. Consequently, we conclude metabarcoding datasets are strongly affected by (1) deterministic amplification biases during PCR and (2) stochastic sampling of amplicons during sequencing-both of which we can model-but also by (3) stochastic sampling of rare molecules prior to PCR, which remains a frontier for quantitative metabarcoding. Our results highlight the importance of estimating species-specific amplification efficiencies and critically evaluating patterns of non-detection in metabarcoding datasets to better distinguish environmental signal from the noise inherent in molecular detections of rare targets.
Collapse
Affiliation(s)
- Zachary Gold
- Cooperative Institute for Climate, Ocean, & Ecosystem Studies, UW, Seattle, Washington, United States of America
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Andrew Olaf Shelton
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Helen R Casendino
- School of Marine and Environmental Affairs, UW, Seattle, Washington, United States of America
| | - Joe Duprey
- School of Marine and Environmental Affairs, UW, Seattle, Washington, United States of America
| | - Ramón Gallego
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Amy Van Cise
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Mary Fisher
- School of Aquatic Fisheries Science, UW, Seattle, Washington, United States of America
| | - Alexander J Jensen
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Erin D'Agnese
- School of Marine and Environmental Affairs, UW, Seattle, Washington, United States of America
| | | | - Ana Ramón-Laca
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Maya Garber-Yonts
- School of Marine and Environmental Affairs, UW, Seattle, Washington, United States of America
| | - Michaela Labare
- Scripps Institution of Oceanography, UCSD, La Jolla, California, United States of America
| | - Kim M Parsons
- Northwest Fisheries Science Center, NMFS/NOAA, Seattle, Washington, United States of America
| | - Ryan P Kelly
- School of Marine and Environmental Affairs, UW, Seattle, Washington, United States of America
| |
Collapse
|
14
|
Dubois B, Debode F, Hautier L, Hulin J, Martin GS, Delvaux A, Janssen E, Mingeot D. A detailed workflow to develop QIIME2-formatted reference databases for taxonomic analysis of DNA metabarcoding data. BMC Genom Data 2022; 23:53. [PMID: 35804326 PMCID: PMC9264521 DOI: 10.1186/s12863-022-01067-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Accepted: 07/01/2022] [Indexed: 11/10/2022] Open
Abstract
Background The DNA metabarcoding approach has become one of the most used techniques to study the taxa composition of various sample types. To deal with the high amount of data generated by the high-throughput sequencing process, a bioinformatics workflow is required and the QIIME2 platform has emerged as one of the most reliable and commonly used. However, only some pre-formatted reference databases dedicated to a few barcode sequences are available to assign taxonomy. If users want to develop a new custom reference database, several bottlenecks still need to be addressed and a detailed procedure explaining how to develop and format such a database is currently missing. In consequence, this work is aimed at presenting a detailed workflow explaining from start to finish how to develop such a curated reference database for any barcode sequence. Results We developed DB4Q2, a detailed workflow that allowed development of plant reference databases dedicated to ITS2 and rbcL, two commonly used barcode sequences in plant metabarcoding studies. This workflow addresses several of the main bottlenecks connected with the development of a curated reference database. The detailed and commented structure of DB4Q2 offers the possibility of developing reference databases even without extensive bioinformatics skills, and avoids ‘black box’ systems that are sometimes encountered. Some filtering steps have been included to discard presumably fungal and misidentified sequences. The flexible character of DB4Q2 allows several key sequence processing steps to be included or not, and downloading issues can be avoided. Benchmarking the databases developed using DB4Q2 revealed that they performed well compared to previously published reference datasets. Conclusion This study presents DB4Q2, a detailed procedure to develop custom reference databases in order to carry out taxonomic analyses with QIIME2, but also with other bioinformatics platforms if desired. This work also provides ready-to-use plant ITS2 and rbcL databases for which the prediction accuracy has been assessed and compared to that of other published databases. Supplementary Information The online version contains supplementary material available at 10.1186/s12863-022-01067-5.
Collapse
|
15
|
Gold Z, Wall AR, Schweizer TM, Pentcheff ND, Curd EE, Barber PH, Meyer RS, Wayne R, Stolzenbach K, Prickett K, Luedy J, Wetzer R. A manager's guide to using eDNA metabarcoding in marine ecosystems. PeerJ 2022; 10:e14071. [PMID: 36405018 PMCID: PMC9673773 DOI: 10.7717/peerj.14071] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Accepted: 08/27/2022] [Indexed: 11/16/2022] Open
Abstract
Environmental DNA (eDNA) metabarcoding is a powerful tool that can enhance marine ecosystem/biodiversity monitoring programs. Here we outline five important steps managers and researchers should consider when developing eDNA monitoring program: (1) select genes and primers to target taxa; (2) assemble or develop comprehensive barcode reference databases; (3) apply rigorous site occupancy based decontamination pipelines; (4) conduct pilot studies to define spatial and temporal variance of eDNA; and (5) archive samples, extracts, and raw sequence data. We demonstrate the importance of each of these considerations using a case study of eDNA metabarcoding in the Ports of Los Angeles and Long Beach. eDNA metabarcoding approaches detected 94.1% (16/17) of species observed in paired trawl surveys while identifying an additional 55 native fishes, providing more comprehensive biodiversity inventories. Rigorous benchmarking of eDNA metabarcoding results improved ecological interpretation and confidence in species detections while providing archived genetic resources for future analyses. Well designed and validated eDNA metabarcoding approaches are ideally suited for biomonitoring applications that rely on the detection of species, including mapping invasive species fronts and endangered species habitats as well as tracking range shifts in response to climate change. Incorporating these considerations will enhance the utility and efficacy of eDNA metabarcoding for routine biomonitoring applications.
Collapse
Affiliation(s)
- Zachary Gold
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, United States of America
| | - Adam R. Wall
- Diversity Initiative for the Southern California Ocean (DISCO), Natural History Museum of Los Angeles County, Los Angeles, CA, United States of America
| | - Teia M. Schweizer
- Department of Fish and Wildlife Conservation Biology, Colorado State University, Fort Collins, CO, United States of America
| | - N. Dean Pentcheff
- Diversity Initiative for the Southern California Ocean (DISCO), Natural History Museum of Los Angeles County, Los Angeles, CA, United States of America
| | - Emily E. Curd
- Department of Natural Sciences, Landmark College, Putney, VT, United States of America
| | - Paul H. Barber
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, United States of America
| | - Rachel S. Meyer
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, United States of America,Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, CA, United States of America
| | - Robert Wayne
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, United States of America
| | - Kevin Stolzenbach
- Wood Environment and Infrastructure, Inc., San Diego, CA, United States of America
| | - Kat Prickett
- Port of Los Angeles, Los Angeles, CA, United States of America
| | - Justin Luedy
- Port of Long Beach, Long Beach, CA, United States of America
| | - Regina Wetzer
- Diversity Initiative for the Southern California Ocean (DISCO), Natural History Museum of Los Angeles County, Los Angeles, CA, United States of America
| |
Collapse
|
16
|
Senn S, Bhattacharyya S, Presley G, Taylor AE, Nash B, Enke RA, Barnard-Kubow KB, Ford J, Jasinski B, Badalova Y. The Functional Biogeography of eDNA Metacommunities in the Post-Fire Landscape of the Angeles National Forest. Microorganisms 2022; 10:microorganisms10061218. [PMID: 35744735 PMCID: PMC9229275 DOI: 10.3390/microorganisms10061218] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 05/29/2022] [Accepted: 05/30/2022] [Indexed: 02/01/2023] Open
Abstract
Wildfires have continued to increase in frequency and severity in Southern California due in part to climate change. To gain a further understanding of microbial soil communities’ response to fire and functions that may enhance post-wildfire resilience, soil fungal and bacterial microbiomes were studied from different wildfire areas in the Gold Creek Preserve within the Angeles National Forest using 16S, FITS, 18S, 12S, PITS, and COI amplicon sequencing. Sequencing datasets from December 2020 and June 2021 samplings were analyzed using QIIME2, ranacapa, stats, vcd, EZBioCloud, and mixomics. Significant differences were found among bacterial and fungal taxa associated with different fire areas in the Gold Creek Preserve. There was evidence of seasonal shifts in the alpha diversity of the bacterial communities. In the sparse partial least squares analysis, there were strong associations (r > 0.8) between longitude, elevation, and a defined cluster of Amplicon Sequence Variants (ASVs). The Chi-square test revealed differences in fungi−bacteria (F:B) proportions between different trails (p = 2 × 10−16). sPLS results focused on a cluster of Green Trail samples with high elevation and longitude. Analysis revealed the cluster included the post-fire pioneer fungi Pyronema and Tremella. Chlorellales algae and possibly pathogenic Fusarium sequences were elevated. Bacterivorous Corallococcus, which secretes antimicrobials, and bacterivorous flagellate Spumella were associated with the cluster. There was functional redundancy in clusters that were differently composed but shared similar ecological functions. These results implied a set of traits for post-fire resiliency. These included photo-autotrophy, mineralization of pyrolyzed organic matter and aromatic/oily compounds, potential pathogenicity and parasitism, antimicrobials, and N-metabolism.
Collapse
Affiliation(s)
- Savanah Senn
- Department of Agriculture Sciences, Los Angeles Pierce College, 6201 Winnetka Avenue, PMB 553, Woodland Hills, CA 91304, USA; (J.F.); (B.J.); (Y.B.)
- Environmental Sciences Graduate Program, Oregon State University, Corvallis, OR 97331, USA; (S.B.); (G.P.); (A.E.T.)
- Correspondence:
| | - Sharmodeep Bhattacharyya
- Environmental Sciences Graduate Program, Oregon State University, Corvallis, OR 97331, USA; (S.B.); (G.P.); (A.E.T.)
- Department of Statistics, Oregon State University, Corvallis, OR 97331, USA
| | - Gerald Presley
- Environmental Sciences Graduate Program, Oregon State University, Corvallis, OR 97331, USA; (S.B.); (G.P.); (A.E.T.)
- Department of Wood Science & Engineering, Oregon State University, Corvallis, OR 97331, USA
| | - Anne E. Taylor
- Environmental Sciences Graduate Program, Oregon State University, Corvallis, OR 97331, USA; (S.B.); (G.P.); (A.E.T.)
- Department of Crop and Soil Sciences, Oregon State University, Corvallis, OR 97331, USA
| | - Bruce Nash
- DNA Learning Center, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA;
| | - Ray A. Enke
- Department of Biology, Center for Genome & Metagenome Studies, James Madison University, Harrisonburg, VA 22807, USA; (R.A.E.); (K.B.B.-K.)
| | - Karen B. Barnard-Kubow
- Department of Biology, Center for Genome & Metagenome Studies, James Madison University, Harrisonburg, VA 22807, USA; (R.A.E.); (K.B.B.-K.)
| | - Jillian Ford
- Department of Agriculture Sciences, Los Angeles Pierce College, 6201 Winnetka Avenue, PMB 553, Woodland Hills, CA 91304, USA; (J.F.); (B.J.); (Y.B.)
| | - Brandon Jasinski
- Department of Agriculture Sciences, Los Angeles Pierce College, 6201 Winnetka Avenue, PMB 553, Woodland Hills, CA 91304, USA; (J.F.); (B.J.); (Y.B.)
| | - Yekaterina Badalova
- Department of Agriculture Sciences, Los Angeles Pierce College, 6201 Winnetka Avenue, PMB 553, Woodland Hills, CA 91304, USA; (J.F.); (B.J.); (Y.B.)
| |
Collapse
|
17
|
Pipes L, Nielsen R. AncestralClust: clustering of divergent nucleotide sequences by ancestral sequence reconstruction using phylogenetic trees. Bioinformatics 2022; 38:663-670. [PMID: 34668516 PMCID: PMC8756197 DOI: 10.1093/bioinformatics/btab723] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 09/30/2021] [Accepted: 10/15/2021] [Indexed: 02/04/2023] Open
Abstract
MOTIVATION Clustering is a fundamental task in the analysis of nucleotide sequences. Despite the exponential increase in the size of sequence databases of homologous genes, few methods exist to cluster divergent sequences. Traditional clustering methods have mostly focused on optimizing high speed clustering of highly similar sequences. We develop a phylogenetic clustering method which infers ancestral sequences for a set of initial clusters and then uses a greedy algorithm to cluster sequences. RESULTS We describe a clustering program AncestralClust, which is developed for clustering divergent sequences. We compare this method with other state-of-the-art clustering methods using datasets of homologous sequences from different species. We show that, in divergent datasets, AncestralClust has higher accuracy and more even cluster sizes than current popular methods. AVAILABILITY AND IMPLEMENTATION AncestralClust is an Open Source program available at https://github.com/lpipes/ancestralclust. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Lenore Pipes
- Department of Integrative Biology, University of California-Berkeley, Berkeley, CA 94707, USA
| | - Rasmus Nielsen
- Department of Integrative Biology, University of California-Berkeley, Berkeley, CA 94707, USA
- Department of Statistics, University of California-Berkeley, Berkeley, CA 94707, USA
- Globe Institute, University of Copenhagen, 1350 København K, Copenhagen, Denmark
| |
Collapse
|
18
|
Shirazi S, Meyer RS, Shapiro B. Revisiting the effect of PCR replication and sequencing depth on biodiversity metrics in environmental DNA metabarcoding. Ecol Evol 2021; 11:15766-15779. [PMID: 34824788 PMCID: PMC8601883 DOI: 10.1002/ece3.8239] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 09/22/2021] [Accepted: 09/27/2021] [Indexed: 11/30/2022] Open
Abstract
Environmental DNA (eDNA) metabarcoding is an increasingly popular tool for measuring and cataloguing biodiversity. Because the environments and substrates in which DNA is preserved differ considerably, eDNA research often requires bespoke approaches to generating eDNA data. Here, we explore how two experimental choices in eDNA study design-the number of PCR replicates and the depth of sequencing of PCR replicates-influence the composition and consistency of taxa recovered from eDNA extracts. We perform 24 PCR replicates from each of six soil samples using two of the most common metabarcodes for Fungi and Viridiplantae (ITS1 and ITS2), and sequence each replicate to an average depth of ~84,000 reads. We find that PCR replicates are broadly consistent in composition and relative abundance of dominant taxa, but that low abundance taxa are often unique to one or a few PCR replicates. Taxa observed in one out of 24 PCR replicates make up 21-29% of the total taxa detected. We also observe that sequencing depth or rarefaction influences alpha diversity and beta diversity estimates. Read sampling depth influences local contribution to beta diversity, placement in ordinations, and beta dispersion in ordinations. Our results suggest that, because common taxa drive some alpha diversity estimates, few PCR replicates and low read sampling depths may be sufficient for many biological applications of eDNA metabarcoding. However, because rare taxa are recovered stochastically, eDNA metabarcoding may never fully recover the true amplifiable alpha diversity in an eDNA extract. Rare taxa drive PCR replicate outliers of alpha and beta diversity and lead to dispersion differences at different read sampling depths. We conclude that researchers should consider the complexity and unevenness of a community when choosing analytical approaches, read sampling depths, and filtering thresholds to arrive at stable estimates.
Collapse
Affiliation(s)
- Sabrina Shirazi
- Department of Ecology and Evolutionary BiologyUniversity of California Santa CruzSanta CruzCaliforniaUSA
| | - Rachel S. Meyer
- Department of Ecology and Evolutionary BiologyUniversity of California Santa CruzSanta CruzCaliforniaUSA
| | - Beth Shapiro
- Department of Ecology and Evolutionary BiologyUniversity of California Santa CruzSanta CruzCaliforniaUSA
- Howard Hughes Medical InstituteUniversity of California Santa CruzSanta CruzCaliforniaUSA
| |
Collapse
|
19
|
Collins RA, Trauzzi G, Maltby KM, Gibson TI, Ratcliffe FC, Hallam J, Rainbird S, Maclaine J, Henderson PA, Sims DW, Mariani S, Genner MJ. Meta-Fish-Lib: A generalised, dynamic DNA reference library pipeline for metabarcoding of fishes. JOURNAL OF FISH BIOLOGY 2021; 99:1446-1454. [PMID: 34269417 DOI: 10.1111/jfb.14852] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 07/06/2021] [Accepted: 07/12/2021] [Indexed: 06/13/2023]
Abstract
The accuracy and reliability of DNA metabarcoding analyses depend on the breadth and quality of the reference libraries that underpin them. However, there are limited options available to obtain and curate the huge volumes of sequence data that are available on public repositories such as NCBI and BOLD. Here, we provide a pipeline to download, clean and annotate mitochondrial DNA sequence data for a given list of fish species. Features of this pipeline include (a) support for multiple metabarcode markers; (b) searches on species synonyms and taxonomic name validation; (c) phylogeny assisted quality control for identification and removal of misannotated sequences; (d) automatically generated coverage reports for each new GenBank release update; and (e) citable, versioned DOIs. As an example we provide a ready-to-use curated reference library for the marine and freshwater fishes of the U.K. To augment this reference library for environmental DNA metabarcoding specifically, we generated 241 new MiFish-12S sequences for 88 U.K. marine species, and make available new primer sets useful for sequencing these. This brings the coverage of common U.K. species for the MiFish-12S fragment to 93%, opening new avenues for scaling up fish metabarcoding across wide spatial gradients. The Meta-Fish-Lib reference library and pipeline is hosted at https://github.com/genner-lab/meta-fish-lib.
Collapse
Affiliation(s)
- Rupert A Collins
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Giulia Trauzzi
- School of Biological Sciences, University of Bristol, Bristol, UK
- School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
| | - Katherine M Maltby
- Centre for Environment, Fisheries and Aquaculture Science, Lowestoft, UK
| | - Thomas I Gibson
- Molecular Ecology and Fisheries Genetics Laboratory, Bangor University School of Natural Sciences, Environment Centre Wales, Bangor, UK
| | | | - Jane Hallam
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
| | - Sophie Rainbird
- Marine Biological Association of the United Kingdom, Plymouth, UK
| | - James Maclaine
- Department of Life Sciences, The Natural History Museum, London, UK
| | | | - David W Sims
- Marine Biological Association of the United Kingdom, Plymouth, UK
- Ocean and Earth Science, University of Southampton, National Oceanography Centre Southampton, Southampton, UK
| | - Stefano Mariani
- Ecosystems & Environment Research Centre, School of Environment & Life Sciences, University of Salford, Salford, UK
- School of Biological & Environmental Sciences, Liverpool John Moores University, Liverpool, UK
| | - Martin J Genner
- School of Biological Sciences, University of Bristol, Bristol, UK
| |
Collapse
|
20
|
Lin M, Simons AL, Harrigan RJ, Curd EE, Schneider FD, Ruiz-Ramos DV, Gold Z, Osborne MG, Shirazi S, Schweizer TM, Moore TN, Fox EA, Turba R, Garcia-Vedrenne AE, Helman SK, Rutledge K, Mejia MP, Marwayana O, Munguia Ramos MN, Wetzer R, Pentcheff ND, McTavish EJ, Dawson MN, Shapiro B, Wayne RK, Meyer RS. Landscape analyses using eDNA metabarcoding and Earth observation predict community biodiversity in California. ECOLOGICAL APPLICATIONS : A PUBLICATION OF THE ECOLOGICAL SOCIETY OF AMERICA 2021; 31:e02379. [PMID: 34013632 PMCID: PMC9297316 DOI: 10.1002/eap.2379] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Revised: 12/23/2020] [Accepted: 02/04/2021] [Indexed: 05/15/2023]
Abstract
Ecosystems globally are under threat from ongoing anthropogenic environmental change. Effective conservation management requires more thorough biodiversity surveys that can reveal system-level patterns and that can be applied rapidly across space and time. Using modern ecological models and community science, we integrate environmental DNA and Earth observations to produce a time snapshot of regional biodiversity patterns and provide multi-scalar community-level characterization. We collected 278 samples in spring 2017 from coastal, shrub, and lowland forest sites in California, a complex ecosystem and biodiversity hotspot. We recovered 16,118 taxonomic entries from eDNA analyses and compiled associated traditional observations and environmental data to assess how well they predicted alpha, beta, and zeta diversity. We found that local habitat classification was diagnostic of community composition and distinct communities and organisms in different kingdoms are predicted by different environmental variables. Nonetheless, gradient forest models of 915 families recovered by eDNA analysis and using BIOCLIM variables, Sentinel-2 satellite data, human impact, and topographical features as predictors, explained 35% of the variance in community turnover. Elevation, sand percentage, and photosynthetic activities (NDVI32) were the top predictors. In addition to this signal of environmental filtering, we found a positive relationship between environmentally predicted families and their numbers of biotic interactions, suggesting environmental change could have a disproportionate effect on community networks. Together, these analyses show that coupling eDNA with environmental predictors including remote sensing data has capacity to test proposed Essential Biodiversity Variables and create new landscape biodiversity baselines that span the tree of life.
Collapse
Affiliation(s)
- Meixi Lin
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
| | - Ariel Levi Simons
- Department of Marine and Environmental Biology, University of Southern California, Los Angeles, California 90089 USA
- Institute of the Environment and Sustainability, University of California-Los Angeles, Los Angeles, California 90095 USA
| | - Ryan J. Harrigan
- Center for Tropical Research, Institute of the Environment and Sustainability, University of California-Los Angeles, Los Angeles, California 90095 USA
| | - Emily E. Curd
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
| | - Fabian D. Schneider
- Jet Propulsion Laboratory, California Institute of Technology, 4800 Oak Grove Drive, Pasadena, California 91009 USA
| | - Dannise V. Ruiz-Ramos
- Columbia Environmental Research Center, U.S. Geological Survey, Columbia, Missouri 65201 USA
- Department of Life & Environmental Sciences, University of California-Merced, Merced, California 95343 USA
| | - Zack Gold
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
| | - Melisa G. Osborne
- Department of Molecular and Computational Biology, University of Southern California, Los Angeles, California 90089 USA
| | - Sabrina Shirazi
- Department of Ecology and Evolutionary Biology, University of California-Santa Cruz, Santa Cruz, California 95064 USA
| | - Teia M. Schweizer
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
- Department of Biology, Colorado State University, Fort Collins, Colorado 80523 USA
| | - Tiara N. Moore
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
- School of Environmental and Forestry Sciences, University of Washington, Seattle, Washington 98195 USA
| | - Emma A. Fox
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
| | - Rachel Turba
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
| | - Ana E. Garcia-Vedrenne
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
| | - Sarah K. Helman
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
| | - Kelsi Rutledge
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
| | - Maura Palacios Mejia
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
| | - Onny Marwayana
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
- Museum Zoologicum Bogoriense, Research Center for Biology, Indonesian Institute of Sciences (LIPI), Cibinong, Bogor 16911 Indonesia
| | - Miroslava N. Munguia Ramos
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
| | - Regina Wetzer
- Research and Collections, Natural History Museum of Los Angeles County, Los Angeles, California 90007 USA
- Biological Sciences, University of Southern California, Los Angeles, California 90089 USA
| | - N. Dean Pentcheff
- Research and Collections, Natural History Museum of Los Angeles County, Los Angeles, California 90007 USA
| | - Emily Jane McTavish
- Department of Life & Environmental Sciences, University of California-Merced, Merced, California 95343 USA
| | - Michael N. Dawson
- Department of Life & Environmental Sciences, University of California-Merced, Merced, California 95343 USA
| | - Beth Shapiro
- Department of Ecology and Evolutionary Biology, University of California-Santa Cruz, Santa Cruz, California 95064 USA
- Howard Hughes Medical Institute, University of California-Santa Cruz, Santa Cruz, California 95064 USA
| | - Robert K. Wayne
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
| | - Rachel S. Meyer
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, California 90095 USA
- Department of Ecology and Evolutionary Biology, University of California-Santa Cruz, Santa Cruz, California 95064 USA
| |
Collapse
|
21
|
Gold Z, Curd EE, Goodwin KD, Choi ES, Frable BW, Thompson AR, Walker HJ, Burton RS, Kacev D, Martz LD, Barber PH. Improving metabarcoding taxonomic assignment: A case study of fishes in a large marine ecosystem. Mol Ecol Resour 2021; 21:2546-2564. [PMID: 34235858 DOI: 10.1111/1755-0998.13450] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Revised: 05/25/2021] [Accepted: 06/03/2021] [Indexed: 01/08/2023]
Abstract
DNA metabarcoding is an important tool for molecular ecology. However, its effectiveness hinges on the quality of reference sequence databases and classification parameters employed. Here we evaluate the performance of MiFish 12S taxonomic assignments using a case study of California Current Large Marine Ecosystem fishes to determine best practices for metabarcoding. Specifically, we use a taxonomy cross-validation by identity framework to compare classification performance between a global database comprised of all available sequences and a curated database that only includes sequences of fishes from the California Current Large Marine Ecosystem. We demonstrate that the regional database provides higher assignment accuracy than the comprehensive global database. We also document a tradeoff between accuracy and misclassification across a range of taxonomic cutoff scores, highlighting the importance of parameter selection for taxonomic classification. Furthermore, we compared assignment accuracy with and without the inclusion of additionally generated reference sequences. To this end, we sequenced tissue from 597 species using the MiFish 12S primers, adding 252 species to GenBank's existing 550 California Current Large Marine Ecosystem fish sequences. We then compared species and reads identified from seawater environmental DNA samples using global databases with and without our generated references, and the regional database. The addition of new references allowed for the identification of 16 additional native taxa representing 17.0% of total reads from eDNA samples, including species with vast ecological and economic value. Together these results demonstrate the importance of comprehensive and curated reference databases for effective metabarcoding and the need for locus-specific validation efforts.
Collapse
Affiliation(s)
- Zachary Gold
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, California, USA
| | - Emily E Curd
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, California, USA
| | - Kelly D Goodwin
- Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Stationed at Southwest Fisheries Science Center, La Jolla, California, USA
| | - Emma S Choi
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, California, USA
| | - Benjamin W Frable
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, California, USA
| | - Andrew R Thompson
- Southwest Fisheries Science Center, National Oceanic and Atmospheric Administration, La Jolla, California, USA
| | - Harold J Walker
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, California, USA
| | - Ronald S Burton
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, California, USA
| | - Dovi Kacev
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, California, USA
| | - Lucas D Martz
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, California, USA
| | - Paul H Barber
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, California, USA
| |
Collapse
|
22
|
|
23
|
Characterizing Industrial and Artisanal Fishing Vessel Catch Composition Using Environmental DNA and Satellite-Based Tracking Data. Foods 2021; 10:foods10061425. [PMID: 34205462 PMCID: PMC8235475 DOI: 10.3390/foods10061425] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Revised: 06/13/2021] [Accepted: 06/15/2021] [Indexed: 01/10/2023] Open
Abstract
The decline in wild-caught fisheries paired with increasing global seafood demand is pushing the need for seafood sustainability to the forefront of national and regional priorities. Validation of species identity is a crucial early step, yet conventional monitoring and surveillance tools are limited in their effectiveness because they are extremely time-consuming and require expertise in fish identification. DNA barcoding methods are a versatile tool for the genetic monitoring of wildlife products; however, they are also limited by requiring individual tissue samples from target specimens which may not always be possible given the speed and scale of seafood operations. To circumvent the need to individually sample organisms, we pilot an approach that uses forensic environmental DNA (eDNA) metabarcoding to profile fish species composition from the meltwater in fish holds on industrial and artisanal fishing vessels in Ecuador. Fish identified genetically as present were compared to target species reported by each vessel’s crew. Additionally, we contrasted the geographic range of identified species against the satellite-based fishing route data of industrial vessels to determine if identified species could be reasonably expected in the catch.
Collapse
|
24
|
Ely T, Barber PH, Man L, Gold Z. Short-lived detection of an introduced vertebrate eDNA signal in a nearshore rocky reef environment. PLoS One 2021; 16:e0245314. [PMID: 34086697 PMCID: PMC8177635 DOI: 10.1371/journal.pone.0245314] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Accepted: 05/21/2021] [Indexed: 11/18/2022] Open
Abstract
Environmental DNA (eDNA) is increasingly used to measure biodiversity of marine ecosystems, yet key aspects of the temporal dynamics of eDNA remain unknown. Of particular interest is in situ persistence of eDNA signals in dynamic marine environments, as eDNA degradation rates have predominantly been quantified through mesocosm studies. To determine in situ eDNA residence times, we introduced an eDNA signal from a non-native fish into a protected bay of a Southern California rocky reef ecosystem, and then measured changes in both introduced and background eDNA signals across a fixed transect over 96 hours. Foreign eDNA signal was no longer detected only 7.5 hours after introduction, a time substantially shorter than the multi-day persistence times in laboratory studies. Moreover, the foreign eDNA signal spread along the entire 38 m transect within 1.5 hours after introduction, indicating that transport and diffusion play a role in eDNA detectability even in protected low energy marine environments. Similarly, native vertebrate eDNA signals varied greatly over the 96 hours of observation as well as within two additional nearby fixed transects sampled over 120 hours. While community structure did significantly change across time of day and tidal direction, neither accounted for the majority of observed variation. Combined, results show that both foreign and native eDNA signatures can exhibit substantial temporal heterogeneity, even on hourly time scales. Further work exploring eDNA decay from lagrangian perspective and quantifying effects of sample and technical replication are needed to better understand temporal variation of eDNA signatures in nearshore marine environments.
Collapse
Affiliation(s)
- Taylor Ely
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, California, United States of America
| | - Paul H. Barber
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, California, United States of America
| | - Lauren Man
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, California, United States of America
| | - Zachary Gold
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, California, United States of America
| |
Collapse
|
25
|
Mathon L, Valentini A, Guérin PE, Normandeau E, Noel C, Lionnet C, Boulanger E, Thuiller W, Bernatchez L, Mouillot D, Dejean T, Manel S. Benchmarking bioinformatic tools for fast and accurate eDNA metabarcoding species identification. Mol Ecol Resour 2021; 21:2565-2579. [PMID: 34002951 DOI: 10.1111/1755-0998.13430] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Revised: 05/10/2021] [Accepted: 05/11/2021] [Indexed: 12/01/2022]
Abstract
Bioinformatic analysis of eDNA metabarcoding data is a crucial step toward rigorously assessing biodiversity. Many programs are now available for each step of the required analyses, but their relative abilities at providing fast and accurate species lists have seldom been evaluated. We used simulated mock communities and real fish eDNA metabarcoding data to evaluate the performance of 13 bioinformatic programs and pipelines to retrieve fish occurrence and read abundance using the 12S mt rRNA gene marker. We used four indices to compare the outputs of each program with the simulated samples: sensitivity, F-measure, root-mean-square error (RMSE) on read relative abundances, and execution time. We found marked differences among programs only for the taxonomic assignment step, both in terms of sensitivity, F-measure and RMSE. Running time was highly different between programs for each step. The fastest programs with best indices for each step were assembled into a pipeline. We compared this pipeline to pipelines constructed from existing toolboxes (OBITools, Barque, and QIIME 2). Our pipeline and Barque obtained the best performance for all indices and appear to be better alternatives to highly used pipelines for analysing fish eDNA metabarcoding data when a complete reference database is available. Analysis on real eDNA metabarcoding data also indicated differences for taxonomic assignment and execution time only. This study reveals major differences between programs during the taxonomic assignment step. The choice of algorithm for the taxonomic assignment can have a significant impact on diversity estimates and should be made according to the objectives of the study.
Collapse
Affiliation(s)
- Laetitia Mathon
- CEFE, Univ. Montpellier, CNRS, EPHE-PSL University, IRD, Montpellier, France.,SPYGEN, Savoie Technolac, Le Bourget du Lac, France
| | | | | | - Eric Normandeau
- Université Laval, IBIS (Institut de Biologie Intégrative et des Systèmes), Québec, QC, Canada
| | - Cyril Noel
- IFREMER - IRSI - Service de Bioinformatique (SeBiMER), Plouzané, France
| | - Clément Lionnet
- Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, Laboratoire d'Ecologie Alpine, Grenoble, France
| | - Emilie Boulanger
- CEFE, Univ. Montpellier, CNRS, EPHE-PSL University, IRD, Montpellier, France.,MARBEC, Univ. Montpellier, CNRS, IRD, Ifremer, Montpellier, France
| | - Wilfried Thuiller
- Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, Laboratoire d'Ecologie Alpine, Grenoble, France
| | - Louis Bernatchez
- Université Laval, IBIS (Institut de Biologie Intégrative et des Systèmes), Québec, QC, Canada
| | - David Mouillot
- MARBEC, Univ. Montpellier, CNRS, IRD, Ifremer, Montpellier, France.,Institut Universitaire de France, IUF, Paris, France
| | - Tony Dejean
- SPYGEN, Savoie Technolac, Le Bourget du Lac, France
| | - Stéphanie Manel
- CEFE, Univ. Montpellier, CNRS, EPHE-PSL University, IRD, Montpellier, France
| |
Collapse
|
26
|
Jacobs-Palmer E, Gallego R, Cribari K, Keller AG, Kelly RP. Environmental DNA Metabarcoding for Simultaneous Monitoring and Ecological Assessment of Many Harmful Algae. Front Ecol Evol 2021. [DOI: 10.3389/fevo.2021.612107] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Harmful algae can have profound economic, environmental, and social consequences. As the timing, frequency, and severity of harmful algal blooms (HABs) change alongside global climate, efficient tools to monitor and understand the current ecological context of these taxa are increasingly important. Here we employ environmental DNA metabarcoding to identify patterns in a wide variety of potentially harmful algae and associated ecological communities in the Hood Canal of Puget Sound in Washington State, USA. Tracking trends of occurrence in a series of water samples over a period of 19 months, we find algal sequences from genera with harmful members in a majority of samples, suggesting that these groups are routinely present in local waters. We report patterns in variants of the economically important genus Pseudo-nitzschia (of which some members produce domoic acid; family Bacillariaceae), as well as multiple potentially harmful algal taxa previously unknown or poorly documented in the region, including a cold-water variant from the genus Alexandrium (of which some members produce saxitoxin; family Gonyaulacaceae), two variants from the genus Karlodinium (of which some members produce karlotoxins; family Kareniaceae), and one variant from the parasitic genus Hematodinium (family Syndiniaceae). We then use data on environmental variables and the biological community surrounding each algal taxon to illustrate the ecological context in which they are commonly found. Environmental DNA metabarcoding thus simultaneously (1) alerts us to potential new or cryptic occurrences of algae from harmful genera, (2) expands our knowledge of the co-occurring conditions and species associated with the growth of these organisms in changing marine environments, and (3) suggests a pathway for multispecies monitoring and management moving forward.
Collapse
|
27
|
Mousavi-Derazmahalleh M, Stott A, Lines R, Peverley G, Nester G, Simpson T, Zawierta M, De La Pierre M, Bunce M, Christophersen CT. eDNAFlow, an automated, reproducible and scalable workflow for analysis of environmental DNA sequences exploiting Nextflow and Singularity. Mol Ecol Resour 2021; 21:1697-1704. [PMID: 33580619 DOI: 10.1111/1755-0998.13356] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Revised: 01/29/2021] [Accepted: 02/08/2021] [Indexed: 12/18/2022]
Abstract
Metabarcoding of environmental DNA (eDNA) when coupled with high throughput sequencing is revolutionising the way biodiversity can be monitored across a wide range of applications. However, the large number of tools deployed in downstream bioinformatic analyses often places a challenge in configuration and maintenance of a workflow, and consequently limits the research reproducibility. Furthermore, scalability needs to be considered to handle the growing amount of data due to increase in sequence output and the scale of project. Here, we describe eDNAFlow, a fully automated workflow that employs a number of state-of-the-art applications to process eDNA data from raw sequences (single-end or paired-end) to generation of curated and noncurated zero-radius operational taxonomic units (ZOTUs) and their abundance tables. This pipeline is based on Nextflow and Singularity which enable a scalable, portable and reproducible workflow using software containers on a local computer, clouds and high-performance computing (HPC) clusters. Finally, we present an in-house Python script to assign taxonomy to ZOTUs based on user specified thresholds for assigning lowest common ancestor (LCA). We demonstrate the utility and efficiency of the pipeline using an example of a published coral diversity biomonitoring study. Our results were congruent with the aforementioned study. The scalability of the pipeline is also demonstrated through analysis of a large data set containing 154 samples. To our knowledge, this is the first automated bioinformatic pipeline for eDNA analysis using two powerful tools: Nextflow and Singularity. This pipeline addresses two major challenges in the analysis of eDNA data; scalability and reproducibility.
Collapse
Affiliation(s)
- Mahsa Mousavi-Derazmahalleh
- Trace and Environmental DNA (TrEnD) Laboratory, School of Molecular and Life Sciences, Curtin University, Bentley, WA, Australia.,WA Human Microbiome Collaboration Centre (WAHMCC), School of Molecular and Life Sciences, Curtin University, Bentley, WA, Australia.,eDNA Frontiers, School of Molecular and Life Sciences, Curtin University, Bentley, WA, Australia
| | - Audrey Stott
- Pawsey Supercomputing Centre, Kensington, WA, Australia
| | - Rose Lines
- Trace and Environmental DNA (TrEnD) Laboratory, School of Molecular and Life Sciences, Curtin University, Bentley, WA, Australia.,WA Human Microbiome Collaboration Centre (WAHMCC), School of Molecular and Life Sciences, Curtin University, Bentley, WA, Australia.,eDNA Frontiers, School of Molecular and Life Sciences, Curtin University, Bentley, WA, Australia
| | - Georgia Peverley
- eDNA Frontiers, School of Molecular and Life Sciences, Curtin University, Bentley, WA, Australia
| | - Georgia Nester
- Trace and Environmental DNA (TrEnD) Laboratory, School of Molecular and Life Sciences, Curtin University, Bentley, WA, Australia
| | - Tiffany Simpson
- Trace and Environmental DNA (TrEnD) Laboratory, School of Molecular and Life Sciences, Curtin University, Bentley, WA, Australia.,eDNA Frontiers, School of Molecular and Life Sciences, Curtin University, Bentley, WA, Australia
| | - Michal Zawierta
- Department of Electrical, Electronic and Computer Engineering, The University of Western Australia, Nedlands, WA, Australia
| | | | - Michael Bunce
- Trace and Environmental DNA (TrEnD) Laboratory, School of Molecular and Life Sciences, Curtin University, Bentley, WA, Australia.,Environmental Protection Authority, Wellington, New Zealand
| | - Claus T Christophersen
- Trace and Environmental DNA (TrEnD) Laboratory, School of Molecular and Life Sciences, Curtin University, Bentley, WA, Australia.,WA Human Microbiome Collaboration Centre (WAHMCC), School of Molecular and Life Sciences, Curtin University, Bentley, WA, Australia.,School of Medical & Health Sciences, Edith Cowan University, Joondalup, WA, Australia
| |
Collapse
|
28
|
Hazlehurst J, Rankin D, Clark C, McFrederick Q, Wilson-Rankin E. Macroecological patterns of resource use in resident and migratory hummingbirds. Basic Appl Ecol 2021. [DOI: 10.1016/j.baae.2021.01.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
29
|
Gold Z, Sprague J, Kushner DJ, Zerecero Marin E, Barber PH. eDNA metabarcoding as a biomonitoring tool for marine protected areas. PLoS One 2021; 16:e0238557. [PMID: 33626067 PMCID: PMC7904164 DOI: 10.1371/journal.pone.0238557] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Accepted: 01/04/2021] [Indexed: 01/04/2023] Open
Abstract
Monitoring of marine protected areas (MPAs) is critical for marine ecosystem management, yet current protocols rely on SCUBA-based visual surveys that are costly and time consuming, limiting their scope and effectiveness. Environmental DNA (eDNA) metabarcoding is a promising alternative for marine ecosystem monitoring, but more direct comparisons to visual surveys are needed to understand the strengths and limitations of each approach. This study compares fish communities inside and outside the Scorpion State Marine Reserve off Santa Cruz Island, CA using eDNA metabarcoding and underwater visual census surveys. Results from eDNA captured 76% (19/25) of fish species and 95% (19/20) of fish genera observed during pairwise underwater visual census. Species missed by eDNA were due to the inability of MiFish 12S barcodes to differentiate species of rockfishes (Sebastes, n = 4) or low site occupancy rates of crevice-dwelling Lythrypnus gobies. However, eDNA detected an additional 23 fish species not recorded in paired visual surveys, but previously reported from prior visual surveys, highlighting the sensitivity of eDNA. Significant variation in eDNA signatures by location (50 m) and site (~1000 m) demonstrates the sensitivity of eDNA to address key questions such as community composition inside and outside MPAs. Results demonstrate the utility of eDNA metabarcoding for monitoring marine ecosystems, providing an important complementary tool to visual methods.
Collapse
Affiliation(s)
- Zachary Gold
- Department of Ecology and Evolutionary Biology, University of California–Los Angeles, Los Angeles, California, United States of America
| | - Joshua Sprague
- Channel Islands National Park Service, Ventura, California, United States of America
| | - David J. Kushner
- Channel Islands National Park Service, Ventura, California, United States of America
| | - Erick Zerecero Marin
- Department of Ecology and Evolutionary Biology, University of California–Los Angeles, Los Angeles, California, United States of America
| | - Paul H. Barber
- Department of Ecology and Evolutionary Biology, University of California–Los Angeles, Los Angeles, California, United States of America
| |
Collapse
|
30
|
Lake Sedimentary DNA Research on Past Terrestrial and Aquatic Biodiversity: Overview and Recommendations. QUATERNARY 2021. [DOI: 10.3390/quat4010006] [Citation(s) in RCA: 53] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The use of lake sedimentary DNA to track the long-term changes in both terrestrial and aquatic biota is a rapidly advancing field in paleoecological research. Although largely applied nowadays, knowledge gaps remain in this field and there is therefore still research to be conducted to ensure the reliability of the sedimentary DNA signal. Building on the most recent literature and seven original case studies, we synthesize the state-of-the-art analytical procedures for effective sampling, extraction, amplification, quantification and/or generation of DNA inventories from sedimentary ancient DNA (sedaDNA) via high-throughput sequencing technologies. We provide recommendations based on current knowledge and best practises.
Collapse
|
31
|
Lafferty KD, Garcia-Vedrenne AE, McLaughlin JP, Childress JN, Morse MF, Jerde CL. At Palmyra Atoll, the fish-community environmental DNA signal changes across habitats but not with tides. JOURNAL OF FISH BIOLOGY 2021; 98:415-425. [PMID: 32441343 PMCID: PMC9300262 DOI: 10.1111/jfb.14403] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/25/2019] [Revised: 04/24/2020] [Accepted: 05/20/2020] [Indexed: 05/24/2023]
Abstract
At Palmyra Atoll, the environmental DNA (eDNA) signal on tidal sand flats was associated with fish biomass density and captured 98%-100% of the expected species diversity there. Although eDNA spilled over across habitats, species associated with reef habitat contributed more eDNA to reef sites than to sand-flat sites, and species associated with sand-flat habitat contributed more eDNA to sand-flat sites than to reef sites. Tides did not disrupt the sand-flat habitat signal. At least 25 samples give a coverage >97.5% at this diverse, tropical, marine system.
Collapse
Affiliation(s)
- Kevin D. Lafferty
- Western Ecological Research Center, U.S. Geological Survey, Reston, Virginia
- Marine Science Institute, University of California, Santa Barbara, Santa Barbara, California
| | - Ana E. Garcia-Vedrenne
- Ecology, Evolution and Marine Biology, University of California, Los Angeles, Los Angeles, California
| | - John P. McLaughlin
- Ecology, Evolution and Marine Biology, University of California, Santa Barbara, Santa Barbara, California
| | - Jasmine N. Childress
- Ecology, Evolution and Marine Biology, University of California, Santa Barbara, Santa Barbara, California
| | - Marisa F. Morse
- Ecology, Evolution and Marine Biology, University of California, Santa Barbara, Santa Barbara, California
| | - Christopher L. Jerde
- Marine Science Institute, University of California, Santa Barbara, Santa Barbara, California
| |
Collapse
|
32
|
Gallego R, Jacobs-Palmer E, Cribari K, Kelly RP. Environmental DNA metabarcoding reveals winners and losers of global change in coastal waters. Proc Biol Sci 2020; 287:20202424. [PMID: 33290686 DOI: 10.1098/rspb.2020.2424] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Studies of the ecological effects of global change often focus on one or a few species at a time. Consequently, we know relatively little about the changes underway at real-world scales of biological communities, which typically have hundreds or thousands of interacting species. Here, we use COI mtDNA amplicons from monthly samples of environmental DNA to survey 221 planktonic taxa along a gradient of temperature, salinity, dissolved oxygen and carbonate chemistry in nearshore marine habitat. The result is a high-resolution picture of changes in ecological communities using a technique replicable across a wide variety of ecosystems. We estimate community-level differences associated with time, space and environmental variables, and use these results to forecast near-term community changes due to warming and ocean acidification. We find distinct communities in warmer and more acidified conditions, with overall reduced richness in diatom assemblages and increased richness in dinoflagellates. Individual taxa finding more suitable habitat in near-future waters are more taxonomically varied and include the ubiquitous coccolithophore Emiliania huxleyi and the harmful dinoflagellate Alexandrium sp. These results suggest foundational changes for nearshore food webs under near-future conditions.
Collapse
Affiliation(s)
- Ramón Gallego
- School of Marine and Environmental Affairs, University of Washington, 3707 Brooklyn Avenue NE, Seattle, WA 98105, USA
| | - Emily Jacobs-Palmer
- School of Marine and Environmental Affairs, University of Washington, 3707 Brooklyn Avenue NE, Seattle, WA 98105, USA
| | - Kelly Cribari
- School of Marine and Environmental Affairs, University of Washington, 3707 Brooklyn Avenue NE, Seattle, WA 98105, USA
| | - Ryan P Kelly
- School of Marine and Environmental Affairs, University of Washington, 3707 Brooklyn Avenue NE, Seattle, WA 98105, USA
| |
Collapse
|
33
|
Mitchell K, Ronas J, Dao C, Freise AC, Mangul S, Shapiro C, Moberg Parker J. PUMAA: A Platform for Accessible Microbiome Analysis in the Undergraduate Classroom. Front Microbiol 2020; 11:584699. [PMID: 33123113 PMCID: PMC7573227 DOI: 10.3389/fmicb.2020.584699] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Accepted: 09/14/2020] [Indexed: 12/22/2022] Open
Abstract
Improvements in high-throughput sequencing makes targeted amplicon analysis an ideal method for the study of human and environmental microbiomes by undergraduates. Multiple bioinformatics programs are available to process and interpret raw microbial diversity datasets, and the choice of programs to use in curricula is largely determined by student learning goals. Many of the most commonly used microbiome bioinformatics platforms offer end-to-end data processing and data analysis using a command line interface (CLI), but the downside for novice microbiome researchers is the steep learning curve often required. Alternatively, some sequencing providers include processing of raw data and taxonomy assignments as part of their pipelines. This, when coupled with available web-based or graphical user interface (GUI) analysis and visualization tools, eliminates the need for students or instructors to have extensive CLI experience. However, lack of universal data formats can make integration of these tools challenging. For example, tools for upstream and downstream analyses frequently use multiple different data formats which then require writing custom scripts or hours of manual work to make the files compatible. Here, we describe a microbial ecology bioinformatics curriculum that focuses on data analysis, visualization, and statistical reasoning by taking advantage of existing web-based and GUI tools. We created the Program for Unifying Microbiome Analysis Applications (PUMAA), which solves the problem of inconsistent files by formatting the output files from several raw data processing programs to seamlessly transition to a suite of GUI programs for analysis and visualization of microbiome taxonomic and inferred functional profiles. Additionally, we created a series of tutorials to accompany each of the microbiome analysis curricular modules. From pre- and post-course surveys, students in this curriculum self-reported conceptual and confidence gains in bioinformatics and data analysis skills. Students also demonstrated gains in biologically relevant statistical reasoning based on rubric-guided evaluations of open-ended survey questions and the Statistical Reasoning in Biology Concept Inventory. The PUMAA program and associated analysis tutorials enable students and researchers with no computational experience to effectively analyze real microbiome datasets to investigate real-world research questions.
Collapse
Affiliation(s)
- Keith Mitchell
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA, United States
| | - Jiem Ronas
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA, United States
| | - Christopher Dao
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA, United States
| | - Amanda C Freise
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA, United States
| | - Serghei Mangul
- Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, Los Angeles, CA, United States
| | - Casey Shapiro
- Center for Educational Assessment, Center for the Advancement of Teaching, University of California, Los Angeles, Los Angeles, CA, United States
| | - Jordan Moberg Parker
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA, United States
| |
Collapse
|
34
|
Gold Z, Wall AR, Curd EE, Kelly RP, Pentcheff ND, Ripma L, Barber PH, Wetzer R. eDNA metabarcoding bioassessment of endangered fairy shrimp (Branchinecta spp.). CONSERV GENET RESOUR 2020. [DOI: 10.1007/s12686-020-01161-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
35
|
Arranz V, Pearman WS, Aguirre JD, Liggins L. MARES, a replicable pipeline and curated reference database for marine eukaryote metabarcoding. Sci Data 2020; 7:209. [PMID: 32620910 PMCID: PMC7334202 DOI: 10.1038/s41597-020-0549-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Accepted: 05/27/2020] [Indexed: 11/09/2022] Open
Abstract
The use of DNA metabarcoding to characterise the biodiversity of environmental and community samples has exploded in recent years. However, taxonomic inferences from these studies are contingent on the quality and completeness of the sequence reference database used to characterise sample species-composition. In response, studies often develop custom reference databases to improve species assignment. The disadvantage of this approach is that it limits the potential for database re-use, and the transferability of inferences across studies. Here, we present the MARine Eukaryote Species (MARES) reference database for use in marine metabarcoding studies, created using a transparent and reproducible pipeline. MARES includes all COI sequences available in GenBank and BOLD for marine taxa, unified into a single taxonomy. Our pipeline facilitates the curation of sequences, synonymization of taxonomic identifiers used by different repositories, and formatting these data for use in taxonomic assignment tools. Overall, MARES provides a benchmark COI reference database for marine eukaryotes, and a standardised pipeline for (re)producing reference databases enabling integration and fair comparison of marine DNA metabarcoding results. Measurement(s) | DNA | Technology Type(s) | bioinformatics analysis | Factor Type(s) | DNA sequence | Sample Characteristic - Organism | Eukaryota | Sample Characteristic - Environment | marine environment |
Machine-accessible metadata file describing the reported data: 10.6084/m9.figshare.12324122
Collapse
Affiliation(s)
- Vanessa Arranz
- School of Natural and Computational Sciences, Massey University Auckland, Albany, Auckland, 0745, New Zealand.
| | - William S Pearman
- School of Natural and Computational Sciences, Massey University Auckland, Albany, Auckland, 0745, New Zealand
| | - J David Aguirre
- School of Natural and Computational Sciences, Massey University Auckland, Albany, Auckland, 0745, New Zealand
| | - Libby Liggins
- School of Natural and Computational Sciences, Massey University Auckland, Albany, Auckland, 0745, New Zealand
| |
Collapse
|
36
|
Experimental DNA Demethylation Associates with Changes in Growth and Gene Expression of Oak Tree Seedlings. G3-GENES GENOMES GENETICS 2020; 10:1019-1028. [PMID: 31941723 PMCID: PMC7056980 DOI: 10.1534/g3.119.400770] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Epigenetic modifications such as DNA methylation, where methyl groups are added to cytosine base pairs, have the potential to impact phenotypic variation and gene expression, and could influence plant response to changing environments. One way to test this impact is through the application of chemical demethylation agents, such as 5-Azacytidine, which inhibit DNA methylation and lead to a partial reduction in DNA methylation across the genome. In this study, we treated 5-month-old seedlings of the tree, Quercus lobata, with foliar application of 5-Azacytidine to test whether a reduction in genome-wide methylation would cause differential gene expression and change phenotypic development. First, we demonstrate that demethylation treatment led to 3–6% absolute reductions and 6.7–43.2% relative reductions in genome-wide methylation across CG, CHG, and CHH sequence contexts, with CHH showing the strongest relative reduction. Seedlings treated with 5-Azacytidine showed a substantial reduction in new growth, which was less than half that of control seedlings. We tested whether this result could be due to impact of the treatment on the soil microbiome and found minimal differences in the soil microbiome between two groups, although with limited sample size. We found no significant differences in leaf fluctuating asymmetry (i.e., deviations from bilateral symmetry), which has been found in other studies. Nonetheless, treated seedlings showed differential expression of a total of 23 genes. Overall, this study provides initial evidence that DNA methylation is involved in gene expression and phenotypic variation in seedlings and suggests that removal of DNA methylation affects plant development.
Collapse
|
37
|
Richardson RT, Sponsler DB, McMinn‐Sauder H, Johnson RM. MetaCurator: A hidden Markov model‐based toolkit for extracting and curating sequences from taxonomically‐informative genetic markers. Methods Ecol Evol 2019. [DOI: 10.1111/2041-210x.13314] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
| | - Douglas B. Sponsler
- Department of Entomology Pennsylvania State University University Park PA USA
- Department of Botany The Academy of Natural Sciences of Drexel University Philadelphia PA USA
| | | | - Reed M. Johnson
- Department of Entomology The Ohio State University Columbus OH USA
| |
Collapse
|