51
|
Penev L, Agosti D, Georgiev T, Catapano T, Miller J, Blagoderov V, Roberts D, Smith VS, Brake I, Ryrcroft S, Scott B, Johnson NF, Morris RA, Sautter G, Chavan V, Robertson T, Remsen D, Stoev P, Parr C, Knapp S, Kress WJ, Thompson CF, Erwin T. Semantic tagging of and semantic enhancements to systematics papers: ZooKeys working examples. Zookeys 2010:1-16. [PMID: 21594113 PMCID: PMC3088020 DOI: 10.3897/zookeys.50.538] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2010] [Accepted: 06/22/2010] [Indexed: 11/29/2022] Open
Abstract
The concept of semantic tagging and its potential for semantic enhancements to taxonomic papers is outlined and illustrated by four exemplar papers published in the present issue of ZooKeys. The four papers were created in different ways: (i) written in Microsoft Word and submitted as non-tagged manuscript (doi: 10.3897/zookeys.50.504); (ii) generated from Scratchpads and submitted as XML-tagged manuscripts (doi: 10.3897/zookeys.50.505 and doi: 10.3897/zookeys.50.506); (iii) generated from an author’s database (doi: 10.3897/zookeys.50.485) and submitted as XML-tagged manuscript. XML tagging and semantic enhancements were implemented during the editorial process of ZooKeys using the Pensoft Mark Up Tool (PMT), specially designed for this purpose. The XML schema used was TaxPub, an extension to the Document Type Definitions (DTD) of the US National Library of Medicine Journal Archiving and Interchange Tag Suite (NLM). The following innovative methods of tagging, layout, publishing and disseminating the content were tested and implemented within the ZooKeys editorial workflow: (1) highly automated, fine-grained XML tagging based on TaxPub; (2) final XML output of the paper validated against the NLM DTD for archiving in PubMedCentral; (3) bibliographic metadata embedded in the PDF through XMP (Extensible Metadata Platform); (4) PDF uploaded after publication to the Biodiversity Heritage Library (BHL); (5) taxon treatments supplied through XML to Plazi; (6) semantically enhanced HTML version of the paper encompassing numerous internal and external links and linkouts, such as: (i) vizualisation of main tag elements within the text (e.g., taxon names, taxon treatments, localities, etc.); (ii) internal cross-linking between paper sections, citations, references, tables, and figures; (iii) mapping of localities listed in the whole paper or within separate taxon treatments; (v) taxon names autotagged, dynamically mapped and linked through the Pensoft Taxon Profile (PTP) to large international database services and indexers such as Global Biodiversity Information Facility (GBIF), National Center for Biotechnology Information (NCBI), Barcode of Life (BOLD), Encyclopedia of Life (EOL), ZooBank, Wikipedia, Wikispecies, Wikimedia, and others; (vi) GenBank accession numbers autotagged and linked to NCBI; (vii) external links of taxon names to references in PubMed, Google Scholar, Biodiversity Heritage Library and other sources. With the launching of the working example, ZooKeys becomes the first taxonomic journal to provide a complete XML-based editorial, publication and dissemination workflow implemented as a routine and cost-efficient practice. It is anticipated that XML-based workflow will also soon be implemented in botany through PhytoKeys, a forthcoming partner journal of ZooKeys. The semantic markup and enhancements are expected to greatly extend and accelerate the way taxonomic information is published, disseminated and used.
Collapse
Affiliation(s)
- Lyubomir Penev
- Bulgarian Academy of Sciences & Pensoft Publishers, 13a Geo Milev Str., Sofia, Bulgaria
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
52
|
Pires AC, Marinoni L. DNA barcoding and traditional taxonomy unified through Integrative Taxonomy: a view that challenges the debate questioning both methodologies. BIOTA NEOTROPICA 2010. [DOI: 10.1590/s1676-06032010000200035] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The taxonomic crisis, emphasized in recent years, is marked by the lack of popularity (lack of interest in taxonomy) and financial incentives to study biodiversity. This situation, coupled with the issues involved with the necessity of knowing many yet undiscovered species, has meant that new technologies, including the use of DNA, have emerged to revitalize taxonomy. Part of the scientific community, however, has rejected the use of these innovative ideas. DNA barcoding has especially been the target of numerous criticisms regarding its application, as opposed to the use of morphology. This paper aims to highlight the inconsistency of the debate involving DNA versus morphology, since there is a proposal for the integration of traditional taxonomy and DNA barcoding - the integrative taxonomy. The positive and negative points of this proposal will be discussed, as well as its validity and application. From it, the importance of morphology is recognized and the revitalization of traditional taxonomy is achieved by the addition of technologies to overcome the taxonomic impediment.
Collapse
|
53
|
Virgilio M, Backeljau T, Nevado B, De Meyer M. Comparative performances of DNA barcoding across insect orders. BMC Bioinformatics 2010; 11:206. [PMID: 20420717 PMCID: PMC2885370 DOI: 10.1186/1471-2105-11-206] [Citation(s) in RCA: 143] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2009] [Accepted: 04/27/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Previous studies on insect DNA barcoding provide contradictory results and suggest not consistent performances across orders. This work aims at providing a general evaluation of insect DNA barcoding and "mini-barcoding" by performing simulations on a large database of 15,948 DNA barcodes. We compared the proportions of correctly identified queries across a) six insect orders (Coleoptera, Diptera, Hemiptera, Hymenoptera, Lepidoptera and Orthoptera), b) four identification criteria (Best Match: BM; Best Close Match: BCM; All Species Barcodes: ASB; tree-based identification: NJT), and c) reference databases with different taxon coverage (100, 500, 1,000, 1,500 and 1,995 insect species). RESULTS Analysis of variance revealed highly significant differences among ID criteria and insect orders. A posteriori comparisons of means showed that NJT had always a significantly lower identification success (NJT = 0.656, S.D. = 0.118) compared to both BM and BCM (BM = 0.948, S.D. = 0.026; BCM = 0.946, S.D. = 0.031). NJT showed significant variations among orders, with the highest proportion of correctly identified queries in Hymenoptera and Orthoptera and the lowest in Diptera. Conversely, the proportions of correct matches of BM and BCM were consistent across orders but a progressive increase in false identification was observed when larger reference databases were used. CONCLUSIONS Regardless the relatively low proportion of Type I errors (misidentification of queries which are represented in the reference database) of BM and BCM, the lack of reference DNA barcodes for 98% of the known insect species implies that insect DNA barcoding is heavily biased by Type II errors (misidentification of queries without conspecifics in the database). The detrimental effects of Type II errors could be circumvented if insect DNA barcoding is used to verify the lack of correspondence between a query and a list of properly referenced target species (e.g. insect pests). This "negative identification" would only be subjected to Type I errors and could be profitably adopted in insect quarantine procedures.
Collapse
Affiliation(s)
- Massimiliano Virgilio
- Royal Museum for Central Africa, Leuvensesteenweg 13, 3080, Tervuren, Belgium
- Royal Belgian Institute of Natural Sciences, Vautierstraat 29, 1000, Brussels, Belgium
| | - Thierry Backeljau
- Royal Belgian Institute of Natural Sciences, Vautierstraat 29, 1000, Brussels, Belgium
- Department of Biology, University of Antwerp, Groenenborgerlaan 171, B-2020 Antwerp, Belgium
| | - Bruno Nevado
- Royal Belgian Institute of Natural Sciences, Vautierstraat 29, 1000, Brussels, Belgium
| | - Marc De Meyer
- Royal Museum for Central Africa, Leuvensesteenweg 13, 3080, Tervuren, Belgium
| |
Collapse
|
54
|
Menke SB, Booth W, Dunn RR, Schal C, Vargo EL, Silverman J. Is it easy to be urban? Convergent success in urban habitats among lineages of a widespread native ant. PLoS One 2010; 5:e9194. [PMID: 20169204 PMCID: PMC2820551 DOI: 10.1371/journal.pone.0009194] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2009] [Accepted: 01/21/2010] [Indexed: 11/19/2022] Open
Abstract
The most rapidly expanding habitat globally is the urban habitat, yet the origin and life histories of the populations of native species that inhabit this habitat remain poorly understood. We use DNA barcoding of the COI gene in the widespread native pest ant Tapinoma sessile to test two hypotheses regarding the origin of urban populations and traits associated with their success. First, we determine if urban samples of T. sessile have a single origin from natural populations by looking at patterns of haplotype clustering from across their range. Second, we examine whether polygynous colony structure--a trait associated with invasion success--is correlated with urban environments, by studying the lineage dependence of colony structure. Our phylogenetic analysis of 49 samples identified four well supported geographic clades. Within clades, Kimura-2 parameter pairwise genetic distances revealed <2.3% variation; however, between clade genetic distances were 7.5-10.0%, suggesting the possibility of the presence of cryptic species. Our results indicate that T. sessile has successfully colonized urban environments multiple times. Additionally, polygynous colony structure is a highly plastic trait across habitat, clade, and haplotype. In short, T. sessile has colonized urban habitats repeatedly and appears to do so using life history strategies already present in more natural populations. Whether similar results hold for other species found in urban habitats has scarcely begun to be considered.
Collapse
Affiliation(s)
- Sean B Menke
- Department of Entomology, North Carolina State University, Raleigh, North Carolina, United States of America.
| | | | | | | | | | | |
Collapse
|
55
|
Riedel A, Daawia D, Balke M. Deepcox1divergence and hyperdiversity ofTrigonopterusweevils in a New Guinea mountain range (Coleoptera, Curculionidae). ZOOL SCR 2010. [DOI: 10.1111/j.1463-6409.2009.00404.x] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
56
|
Schlick-Steiner BC, Steiner FM, Seifert B, Stauffer C, Christian E, Crozier RH. Integrative taxonomy: a multisource approach to exploring biodiversity. ANNUAL REVIEW OF ENTOMOLOGY 2010; 55:421-38. [PMID: 19737081 DOI: 10.1146/annurev-ento-112408-085432] [Citation(s) in RCA: 490] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Good alpha taxonomy is central to biology. On the basis of a survey of arthropod studies that used multiple disciplines for species delimitation, we evaluated the performance of single disciplines. All included disciplines had a considerable failure rate. Rigor in species delimitation can thus be increased when several disciplines chosen for complementarity are used. We present a flexible procedure and stopping rule for integrative taxonomy that uses the information from different disciplines separately. Disagreement among disciplines over the number and demarcation of species is resolved by elucidating and invoking evolutionary explanations for disagreement. With the identification of further promising study organisms and of new questions for in-depth analysis, evolutionary biology should profit from integrative taxonomy. An important rationale is clarity in researcher bias in the decision-making process. The success of integrative taxonomy will further increase through methodological progress, taxonomic training of evolutionary biologists, and balanced resource allocation.
Collapse
|
57
|
Smith MA, Fisher BL. Invasions, DNA barcodes, and rapid biodiversity assessment using ants of Mauritius. Front Zool 2009; 6:31. [PMID: 20003263 PMCID: PMC2804717 DOI: 10.1186/1742-9994-6-31] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2009] [Accepted: 12/10/2009] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Using an understudied taxon (Hymenoptera, Formicidae) found on a tropical island (Mauritius) where native flora and fauna have been threatened by 400 years of habitat modification and introduced species, we tested whether estimated incidences of diversity and complementarity were similar when measured by standard morphological alpha-taxonomy or phylogenetic diversity (PD) based on a standardized mitochondrial barcode and corroborating nuclear marker. RESULTS We found that costs related to site loss (considered loss of evolutionary history measured as loss of barcode PD) were not significantly different from predictions made either a) using standard morphology-based taxonomy, or b) measured using a nuclear marker. Integrating morphology and barcode results permitted us to identify a case of initially morphologically-cryptic variation as a new and endemic candidate species. However, barcode estimates of the relative importance of each site or network of sites were dramatically affected when the species in question was known to be indigenous or introduced. CONCLUSION This study goes beyond a mere demonstration of the rapid gains possible for diversity assessment using a standardized DNA barcode. Contextualization of these gains with ecological and natural history information is necessary to calibrate this wealth of standardized information. Without such an integrative approach, critical opportunities to advance knowledge will be missed.
Collapse
Affiliation(s)
- M Alex Smith
- Biodiversity Institute of Ontario, Department of Integrative Biology, University of Guelph, Guelph Ontario, N1G 2W1, Canada
| | - Brian L Fisher
- Department of Entomology, California Academy of Sciences, San Francisco, USA
| |
Collapse
|
58
|
|
59
|
Vences M, Wollenberg KC, Vieites DR, Lees DC. Madagascar as a model region of species diversification. Trends Ecol Evol 2009; 24:456-65. [DOI: 10.1016/j.tree.2009.03.011] [Citation(s) in RCA: 244] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2008] [Revised: 03/09/2009] [Accepted: 03/11/2009] [Indexed: 12/18/2022]
|
60
|
Agosti D, Egloff W. Taxonomic information exchange and copyright: the Plazi approach. BMC Res Notes 2009; 2:53. [PMID: 19331688 PMCID: PMC2673227 DOI: 10.1186/1756-0500-2-53] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2009] [Accepted: 03/30/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND A large part of our knowledge on the world's species is recorded in the corpus of biodiversity literature with well over hundred million pages, and is represented in natural history collections estimated at 2 - 3 billion specimens. But this body of knowledge is almost entirely in paper-print form and is not directly accessible through the Internet. For the digitization of this literature, new territories have to be chartered in the fields of technical, legal and social issues that presently impede its advance. The taxonomic literature seems especially destined for such a transformation. DISCUSSION Plazi was founded as an association with the primary goal of transforming both the printed and, more recently, "born-digital" taxonomic literature into semantically enabled, enhanced documents. This includes the creation of a test body of literature, an XML schema modeling its logic content (TaxonX), the development of a mark-up editor (GoldenGATE) allowing also the enhancement of documents with links to external resources via Life Science Identifiers (LSID), a repository for publications and issuance of bibliographic identifiers, a dedicated server to serve the marked up content (the Plazi Search and Retrieval Server, SRS) and semantic tools to mine information. Plazi's workflow is designed to respect copyright protection and achieves extraction by observing exceptions and limitations existent in international copyright law. CONCLUSION The information found in Plazi's databases - taxonomic treatments as well as the metadata of the publications - are in the public domain and can therefore be used for further scientific research without any restriction, whether or not contained in copyrighted publications.
Collapse
Affiliation(s)
- Donat Agosti
- Plazi, Zinggstrasse 16, CH-3007, Bern, Switzerland
| | - Willi Egloff
- Advocomplex, Zinggstrasse 16, CH-3007, Bern, Switzerland
| |
Collapse
|
61
|
Extreme diversity of tropical parasitoid wasps exposed by iterative integration of natural history, DNA barcoding, morphology, and collections. Proc Natl Acad Sci U S A 2008; 105:12359-64. [PMID: 18716001 DOI: 10.1073/pnas.0805319105] [Citation(s) in RCA: 316] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We DNA barcoded 2,597 parasitoid wasps belonging to 6 microgastrine braconid genera reared from parapatric tropical dry forest, cloud forest, and rain forest in Area de Conservación Guanacaste (ACG) in northwestern Costa Rica and combined these data with records of caterpillar hosts and morphological analyses. We asked whether barcoding and morphology discover the same provisional species and whether the biological entities revealed by our analysis are congruent with wasp host specificity. Morphological analysis revealed 171 provisional species, but barcoding exposed an additional 142 provisional species; 95% of the total is likely to be undescribed. These 313 provisional species are extraordinarily host specific; more than 90% attack only 1 or 2 species of caterpillars out of more than 3,500 species sampled. The most extreme case of overlooked diversity is the morphospecies Apanteles leucostigmus. This minute black wasp with a distinctive white wing stigma was thought to parasitize 32 species of ACG hesperiid caterpillars, but barcoding revealed 36 provisional species, each attacking one or a very few closely related species of caterpillars. When host records and/or within-ACG distributions suggested that DNA barcoding had missed a species-pair, or when provisional species were separated only by slight differences in their barcodes, we examined nuclear sequences to test hypotheses of presumptive species boundaries and to further probe host specificity. Our iterative process of combining morphological analysis, ecology, and DNA barcoding and reiteratively using specimens maintained in permanent collections has resulted in a much more fine-scaled understanding of parasitoid diversity and host specificity than any one of these elements could have produced on its own.
Collapse
|