1
|
Simović P, Milosavljević A, Stojanović K, Radenković M, Savić-Zdravković D, Predić B, Petrović A, Božanić M, Milošević D. Automated identification of aquatic insects: A case study using deep learning and computer vision techniques. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 935:172877. [PMID: 38740196 DOI: 10.1016/j.scitotenv.2024.172877] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Accepted: 04/27/2024] [Indexed: 05/16/2024]
Abstract
Deep learning techniques have recently found application in biodiversity research. Mayflies (Ephemeroptera), stoneflies (Plecoptera) and caddisflies (Trichoptera), often abbreviated as EPT, are frequently used for freshwater biomonitoring due to their large numbers and sensitivity to environmental changes. However, the morphological identification of EPT species is a challenging but fundamental task. Morphological identification of these freshwater insects is therefore not only extremely time-consuming and costly, but also often leads to misjudgments or generates datasets with low taxonomic resolution. Here, we investigated the application of deep learning to increase the efficiency and taxonomic resolution of biomonitoring programs. Our database contains 90 EPT taxa (genus or species level), with the number of images per category ranging from 21 to 300 (16,650 in total). Upon completion of training, a CNN (Convolutional Neural Network) model was created, capable of automatically classifying these taxa into their appropriate taxonomic categories with an accuracy of 98.7 %. Our model achieved a perfect classification rate of 100 % for 68 of the taxa in our dataset. We achieved noteworthy classification accuracy with morphologically closely related taxa within the training data (e.g., species of the genus Baetis, Hydropsyche, Perla). Gradient-weighted Class Activation Mapping (Grad-CAM) visualized the morphological features responsible for the classification of the treated species in the CNN models. Within Ephemeroptera, the head was the most important feature, while the thorax and abdomen were equally important for the classification of Plecoptera taxa. For the order Trichoptera, the head and thorax were almost equally important. Our database is recognized as the most extensive aquatic insect database, notably distinguished by its wealth of included categories (taxa). Our approach can help solve long-standing challenges in biodiversity research and address pressing issues in monitoring programs by saving time in sample identification.
Collapse
Affiliation(s)
- Predrag Simović
- Department of Biology and Ecology, Faculty of Science, University of Kragujevac, Radoja Domanovića 12, 34000 Kragujevac, Serbia.
| | - Aleksandar Milosavljević
- Faculty of Electronic Engineering, University of Niš, Aleksandra Medvedeva 14, 18000 Niš, Serbia.
| | - Katarina Stojanović
- Department of Zoology, Faculty of Biology, University of Belgrade, Studentski trg 16, Belgrade, Serbia.
| | - Milena Radenković
- Department of Biology and Ecology, Faculty of Science, University of Kragujevac, Radoja Domanovića 12, 34000 Kragujevac, Serbia.
| | - Dimitrija Savić-Zdravković
- Department of Biology and Ecology, Faculty of Sciences and Mathematics, University of Niš, Višegradska 33, 18000 Niš, Serbia.
| | - Bratislav Predić
- Faculty of Electronic Engineering, University of Niš, Aleksandra Medvedeva 14, 18000 Niš, Serbia.
| | - Ana Petrović
- Department of Biology and Ecology, Faculty of Science, University of Kragujevac, Radoja Domanovića 12, 34000 Kragujevac, Serbia.
| | - Milenka Božanić
- Department of Zoology, Faculty of Biology, University of Belgrade, Studentski trg 16, Belgrade, Serbia.
| | - Djuradj Milošević
- Department of Biology and Ecology, Faculty of Sciences and Mathematics, University of Niš, Višegradska 33, 18000 Niš, Serbia.
| |
Collapse
|
2
|
Kilian IC, Espeland M, Mey W, Wowor D, Hadiaty RK, von Rintelen T, Herder F. DNA barcoding unveils a high diversity of caddisflies (Trichoptera) in the Mount Halimun Salak National Park (West Java; Indonesia). PeerJ 2022; 10:e14182. [PMID: 36530410 PMCID: PMC9753737 DOI: 10.7717/peerj.14182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2021] [Accepted: 09/14/2022] [Indexed: 12/14/2022] Open
Abstract
Background Trichoptera are one of the most diverse groups of freshwater insects worldwide and one of the main bioindicators for freshwater quality. However, in many areas, caddisflies remain understudied due to lack of taxonomic expertise. Meanwhile, globally increasing anthropogenic stress on freshwater streams also threatens Trichoptera diversity. Methods To assess the Trichoptera diversity of the area within and around the Mount Halimun Salak National Park (MHSNP or Taman Nasional Gunung Halimun Salak) in West Java (Indonesia), we conducted a molecular-morphological study on Trichoptera diversity using larvae from a benthic survey and adults from hand-netting. In addition to morphological identification, we applied four different molecular taxon delimitation approaches (Generalized Mixed Yule Coalescent, Bayesian Poisson Tree Processes, Automatic Barcode Gap Discovery and Assemble Species by Automatic Partitioning) based on DNA barcoding of Cytochrome-C-Oxidase I (COI). Results The molecular delimitation detected 72 to 81 Operational Taxonomic Units (OTU). Only five OTUs could be identified to species level by comparing sequences against the BOLD database using BLAST, and four more to the genus level. Adults and larvae could be successfully associated in 18 cases across six families. The high diversity of Trichoptera in this area highlights their potential as bioindicators for water quality assessment. Conclusions This study provides an example of how molecular approaches can benefit the exploration of hidden diversity in unexplored areas and can be a valuable tool to link life stages. However, our study also highlights the need to improve DNA barcode reference libraries of Trichoptera for the Oriental region.
Collapse
Affiliation(s)
- Isabel C. Kilian
- Museum Koenig, Leibniz Institute for the Analysis of Biodiversity Change, Bonn, Germany
| | - Marianne Espeland
- Museum Koenig, Leibniz Institute for the Analysis of Biodiversity Change, Bonn, Germany
| | - Wolfram Mey
- Museum für Naturkunde, Leibniz Institute for Evolution and Biodiversity Science, Berlin, Germany
| | - Daisy Wowor
- Museum Zoologicum Bogoriense, Research Center for Biosystematics and Evolution, National Research and Innovation Agency, Cibinong, Indonesia
| | - Renny K. Hadiaty
- Museum Zoologicum Bogoriense, Research Center for Biosystematics and Evolution, National Research and Innovation Agency, Cibinong, Indonesia
| | - Thomas von Rintelen
- Museum für Naturkunde, Leibniz Institute for Evolution and Biodiversity Science, Berlin, Germany
| | - Fabian Herder
- Museum Koenig, Leibniz Institute for the Analysis of Biodiversity Change, Bonn, Germany
| |
Collapse
|
3
|
Weigand AM, Desquiotz N, Weigand H, Szucsich N. Application of propylene glycol in DNA-based studies of invertebrates. METABARCODING AND METAGENOMICS 2021. [DOI: 10.3897/mbmg.5.57278] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
High-throughput sequencing (HTS) studies on invertebrates commonly use ethanol as the main sample fixative (upon collection) and preservative (for storage and curation). However, alternative agents exists, which should not be automatically neglected when studies are newly designed. This review provides an overview of the application of propylene glycol (PG) in DNA-based studies of invertebrates, thus to stimulate an evidence-based discussion.
The use of PG in DNA-based studies of invertebrates is still limited (n = 79), but a steady increase has been visible since 2011. Most studies used PG as a fixative for passive trapping (73%) and performed Sanger sequencing (66%; e.g. DNA barcoding). More recently, HTS setups joined the field (11%). Terrestrial Coleoptera (30%) and Diptera (20%) were the most studied groups. Very often, information on the grade of PG used (75%) or storage conditions (duration, temperature) were lacking. This rendered direct comparisons of study results difficult, and highlight the need for further systematic studies on these subjects.
When compared to absolute ethanol, PG can be more widely and cheaply acquired (e.g. as an antifreeze, 13% of studies). It also enables longer trapping intervals, being especially relevant at remote or hard-to-reach places. Shipping of PG-conserved samples is regarded as risk-free and is authorised, pinpointing its potential for larger trapping programs or citizen science projects. Its property to retain flexibility of morphological characters as well as to lead to a reduced shrinkage effect was especially appraised by integrative study designs. Finally, the so far limited application of PG in the context of HTS showed promising results for short read amplicon sequencing and reduced representation methods. Knowledge of the influence of PG fixation and storage for long(er) read HTS setups is currently unavailable.
Given our review results and taking difficulties of direct methodological comparisons into account, future DNA-based studies of invertebrates should on a case-by-case basis critically scrutinise if the application of PG in their anticipated study design can be of benefit.
Collapse
|
4
|
A DNA Barcoding Survey of an Arctic Arthropod Community: Implications for Future Monitoring. INSECTS 2020; 11:insects11010046. [PMID: 31936447 PMCID: PMC7023425 DOI: 10.3390/insects11010046] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/02/2019] [Revised: 12/20/2019] [Accepted: 01/06/2020] [Indexed: 12/02/2022]
Abstract
Accurate and cost-effective methods for tracking changes in arthropod communities are needed to develop integrative environmental monitoring programs in the Arctic. To date, even baseline data on their species composition at established ecological monitoring sites are severely lacking. We present the results of a pilot assessment of non-marine arthropod diversity in a middle arctic tundra area near Ikaluktutiak (Cambridge Bay), Victoria Island, Nunavut, undertaken in 2018 using DNA barcodes. A total of 1264 Barcode Index Number (BIN) clusters, used as a proxy for species, were recorded. The efficacy of widely used sampling methods was assessed. Yellow pan traps captured 62% of the entire BIN diversity at the study sites. When complemented with soil and leaf litter sifting, the coverage rose up to 74.6%. Combining community-based data collection with high-throughput DNA barcoding has the potential to overcome many of the logistic, financial, and taxonomic obstacles for large-scale monitoring of the Arctic arthropod fauna.
Collapse
|
5
|
Huber DPW, Shrimpton CM, Erasmus DJ. Eight New Provincial Species Records of Mayflies (Ephemeroptera) from One Arctic Watershed River in British Columbia. WEST N AM NATURALIST 2019. [DOI: 10.3398/064.079.0101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Affiliation(s)
- Dezene P. W. Huber
- Ecosystem Science and Management Program, University of Northern British Columbia, 3333 University Way, Prince George, British Columbia, Canada V2N 4Z9
| | - Claire M. Shrimpton
- Ecosystem Science and Management Program, University of Northern British Columbia, 3333 University Way, Prince George, British Columbia, Canada V2N 4Z9
| | - Daniel J. Erasmus
- Biochemistry and Molecular Biology, University of Northern British Columbia, 3333 University Way, Prince George, British Columbia, Canada V2N 4Z9
| |
Collapse
|
6
|
Sheffield CS, deWaard JR, Morse JC, Rasmussen AK. Trichoptera of Canada. Zookeys 2019; 819:507-520. [PMID: 30713460 PMCID: PMC6355732 DOI: 10.3897/zookeys.819.31140] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2018] [Accepted: 12/09/2018] [Indexed: 11/12/2022] Open
Abstract
Trichoptera, or caddisflies, are common members of freshwater ecosystems as larvae and are important indicators of aquatic system health. As such, the species are relatively well studied, with keys available for larvae and adults of many of the taxa occurring in Canada. The number of species recorded from Canada since 1979 (Wiggins 1979) has increased from 546 to 636, an increase of 16.4%. Of those species newly recorded, 17 represent newly described taxa since 1979. Taking into consideration the species likely to be subsequently found in Canada based on records in adjacent parts of the United States and results from DNA barcoding, an estimated 129-181 species remain to be documented in Canada.
Collapse
Affiliation(s)
- Cory S. Sheffield
- Royal Saskatchewan Museum, 2340 Albert Street, Regina, Saskatchewan, S4P 2V7, CanadaRoyal Saskatchewan MuseumReginaCanada
| | - Jeremy R. deWaard
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario, N1G 2W1, CanadaUniversity of GuelphGuelphCanada
| | - John C. Morse
- Department of Plant & Environmental Sciences, Clemson University, E-143 Poole Agricultural Center, Clemson, South Carolina 29634-0310, USAClemson UniversityClemsonUnited States of America
| | - Andrew K. Rasmussen
- Center for Water Resources, College of Agriculture and Food Sciences, Florida A&M University, 113 South Perry-Paige Bldg., Tallahassee, Florida 32307-4100, USAFlorida A&M UniversityTallahasseeUnited States of America
| |
Collapse
|
7
|
Abstract
Thus far, 335 currently valid species in 82 genera and 21 families of mayflies (Ephemeroptera) have been documented from Canada, remarkably representing a little more than half of the combined species richness of Canada, Mexico and the USA. The current known species richness for Canada represents an increase of 11.3% as compared to that reported in 1979. Species richness is greatest in the families Heptageniidae (83), Baetidae (76) and Ephemerellidae (45). A total of 328 DNA Barcode Index Numbers (BINs) are available for Canadian mayfly species. The greatest net gains anticipated for future species tallies are for Baetidae (25), Heptageniidae (10) and Leptophlebiidae (10). A total of 66 more species overall is anticipated for Canada, with greatest gains potentially coming from lentic habitats across Canada and from far eastern and far western areas in general. However, even metropolitan areas should not be overlooked for the potential of discovery.
Collapse
Affiliation(s)
- Luke M. Jacobus
- Division of Science, Indiana University Purdue University Columbus, 4601 Central Avenue, Columbus, Indiana 47203, USAIndiana University Purdue University ColumbusColumbusUnited States of America
| |
Collapse
|
8
|
Ants in Australia’s Monsoonal Tropics: CO1 Barcoding Reveals Extensive Unrecognised Diversity. DIVERSITY-BASEL 2018. [DOI: 10.3390/d10020036] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
9
|
Erasmus DJ, Yurkowski EA, Huber DPW. DNA barcode-based survey of Trichoptera in the Crooked River reveals three new species records for British Columbia. PeerJ 2018; 6:e4221. [PMID: 29340236 PMCID: PMC5768167 DOI: 10.7717/peerj.4221] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Accepted: 12/12/2017] [Indexed: 01/27/2023] Open
Abstract
Anthropogenic pressures on aquatic systems have placed a renewed focus on biodiversity of aquatic macroinvertebrates. By combining classical taxonomy and DNA barcoding we identified 39 species of caddisflies from the Crooked River, a unique and sensitive system in the southernmost arctic watershed in British Columbia. Our records include three species never before recorded in British Columbia: Lepidostoma togatum (Lepidostomatidae), Ceraclea annulicornis (Leptoceridae), and possibly Cheumatopsyche harwoodi (Hydropsychidae). Three other specimens may represent new occurrence records and a number of other records seem to be substantial observed geographic range expansions within British Columbia.
Collapse
Affiliation(s)
- Daniel J Erasmus
- Department of Biochemistry and Molecular Biology, University of Northern British Columbia, Prince George, British Columbia, Canada
| | - Emily A Yurkowski
- Department of Biochemistry and Molecular Biology, University of Northern British Columbia, Prince George, British Columbia, Canada
| | - Dezene P W Huber
- Department of Ecosystem Science and Management Program, University of Northern British Columbia, Prince George, British Columbia, Canada
| |
Collapse
|