1
|
Sandall EL, Maureaud AA, Guralnick R, McGeoch MA, Sica YV, Rogan MS, Booher DB, Edwards R, Franz N, Ingenloff K, Lucas M, Marsh CJ, McGowan J, Pinkert S, Ranipeta A, Uetz P, Wieczorek J, Jetz W. A globally integrated structure of taxonomy to support biodiversity science and conservation. Trends Ecol Evol 2023; 38:1143-1153. [PMID: 37684131 DOI: 10.1016/j.tree.2023.08.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 07/30/2023] [Accepted: 08/04/2023] [Indexed: 09/10/2023]
Abstract
All aspects of biodiversity research, from taxonomy to conservation, rely on data associated with species names. Effective integration of names across multiple fields is paramount and depends on the coordination and organization of taxonomic data. We assess current efforts and find that even key applications for well-studied taxa still lack commonality in taxonomic information required for integration. We identify essential taxonomic elements from our interoperability assessment to support improved access and integration of taxonomic data. A stronger focus on these elements has the potential to involve taxonomic communities in biodiversity science and overcome broken linkages currently limiting research capacity. We encourage a community effort to democratize taxonomic expertise and language in order to facilitate maximum interoperability and integration.
Collapse
Affiliation(s)
- Emily L Sandall
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Center for Biodiversity & Global Change, Yale University, New Haven, CT 06520, USA.
| | - Aurore A Maureaud
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Center for Biodiversity & Global Change, Yale University, New Haven, CT 06520, USA; Department of Ecology, Evolution, and Natural Resources, Rutgers University, New Brunswick, NJ, USA.
| | - Robert Guralnick
- Department of Natural History, Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
| | - Melodie A McGeoch
- Securing Antarctica's Environmental Future, Department of Environment and Genetics, LaTrobe University, Melbourne, Australia
| | - Yanina V Sica
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Center for Biodiversity & Global Change, Yale University, New Haven, CT 06520, USA
| | - Matthew S Rogan
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Center for Biodiversity & Global Change, Yale University, New Haven, CT 06520, USA
| | - Douglas B Booher
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Center for Biodiversity & Global Change, Yale University, New Haven, CT 06520, USA
| | - Robert Edwards
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Center for Biodiversity & Global Change, Yale University, New Haven, CT 06520, USA; Cleveland Museum of Natural History, Cleveland, OH, USA
| | - Nico Franz
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Kate Ingenloff
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Center for Biodiversity & Global Change, Yale University, New Haven, CT 06520, USA
| | - Maisha Lucas
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Center for Biodiversity & Global Change, Yale University, New Haven, CT 06520, USA
| | - Charles J Marsh
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Center for Biodiversity & Global Change, Yale University, New Haven, CT 06520, USA
| | - Jennifer McGowan
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Center for Biodiversity & Global Change, Yale University, New Haven, CT 06520, USA; The Nature Conservancy, Arlington, VA, USA
| | - Stefan Pinkert
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Center for Biodiversity & Global Change, Yale University, New Haven, CT 06520, USA; Department of Conservation Ecology, University of Marburg, Marburg, Germany
| | - Ajay Ranipeta
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Center for Biodiversity & Global Change, Yale University, New Haven, CT 06520, USA
| | - Peter Uetz
- Center for Biological Data Science, Virginia Commonwealth University, Richmond, VA, USA
| | - John Wieczorek
- Museum of Vertebrate Zoology, University of California, Berkeley, CA, USA
| | - Walter Jetz
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Center for Biodiversity & Global Change, Yale University, New Haven, CT 06520, USA; E.O. Wilson Biodiversity Foundation, Durham, NC, USA
| |
Collapse
|
2
|
Sterner B, Elliott S, Gilbert EE, Franz NM. Unified and pluralistic ideals for data sharing and reuse in biodiversity. Database (Oxford) 2023; 2023:baad048. [PMID: 37465916 PMCID: PMC10354506 DOI: 10.1093/database/baad048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Revised: 05/30/2023] [Accepted: 06/27/2023] [Indexed: 07/20/2023]
Abstract
How should billions of species observations worldwide be shared and made reusable? Many biodiversity scientists assume the ideal solution is to standardize all datasets according to a single, universal classification and aggregate them into a centralized, global repository. This ideal has known practical and theoretical limitations, however, which justifies investigating alternatives. To support better community deliberation and normative evaluation, we develop a novel conceptual framework showing how different organizational models, regulative ideals and heuristic strategies are combined to form shared infrastructures supporting data reuse. The framework is anchored in a general definition of data pooling as an activity of making a taxonomically standardized body of information available for community reuse via digital infrastructure. We describe and illustrate unified and pluralistic ideals for biodiversity data pooling and show how communities may advance toward these ideals using different heuristic strategies. We present evidence for the strengths and limitations of the unification and pluralistic ideals based on systemic relationships of power, responsibility and benefit they establish among stakeholders, and we conclude the pluralistic ideal is better suited for biodiversity data.
Collapse
Affiliation(s)
- Beckett Sterner
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ 85281, USA
| | - Steve Elliott
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ 85281, USA
| | - Edward E Gilbert
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ 85281, USA
| | - Nico M Franz
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ 85281, USA
| |
Collapse
|
3
|
Sterner B, Upham N, Gupta P, Powell C, Franz N. Wanted: Standards for FAIR taxonomic concept representations and relationships. BIODIVERSITY INFORMATION SCIENCE AND STANDARDS 2021; 5. [PMID: 35462676 PMCID: PMC9028594 DOI: 10.3897/biss.5.75587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
Making the most of biodiversity data requires linking observations of biological species from multiple sources both efficiently and accurately (Bisby 2000, Franz et al. 2016). Aggregating occurrence records using taxonomic names and synonyms is computationally efficient but known to experience significant limitations on accuracy when the assumption of one-to-one relationships between names and biological entities breaks down (Remsen 2016, Franz and Sterner 2018). Taxonomic treatments and checklists provide authoritative information about the correct usage of names for species, including operational representations of the meanings of those names in the form of range maps, reference genetic sequences, or diagnostic traits. They increasingly provide taxonomic intelligence in the form of precise description of the semantic relationships between different published names in the literature. Making this authoritative information Findable, Accessible, Interoperable, and Reusable (FAIR; Wilkinson et al. 2016) would be a transformative advance for biodiversity data sharing and help drive adoption and novel extensions of existing standards such as the Taxonomic Concept Schema and the OpenBiodiv Ontology (Kennedy et al. 2006, Senderov et al. 2018). We call for the greater, global Biodiversity Information Standards (TDWG) and taxonomy community to commit to extending and expanding on how FAIR applies to biodiversity data and include practical targets and criteria for the publication and digitization of taxonomic concept representations and alignments in taxonomic treatments, checklists, and backbones.
Collapse
|
4
|
Sterner BW, Gilbert EE, Franz NM. Decentralized but Globally Coordinated Biodiversity Data. Front Big Data 2021; 3:519133. [PMID: 33693407 PMCID: PMC7931950 DOI: 10.3389/fdata.2020.519133] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Accepted: 08/31/2020] [Indexed: 11/22/2022] Open
Abstract
Centralized biodiversity data aggregation is too often failing societal needs due to pervasive and systemic data quality deficiencies. We argue for a novel approach that embodies the spirit of the Web (“small pieces loosely joined”) through the decentralized coordination of data across scientific languages and communities. The upfront cost of decentralization can be offset by the long-term benefit of achieving sustained expert engagement, higher-quality data products, and ultimately more societal impact for biodiversity data. Our decentralized approach encourages the emergence and evolution of multiple self-identifying communities of practice that are regionally, taxonomically, or institutionally localized. Each community is empowered to control the social and informational design and versioning of their local data infrastructures and signals. With no single aggregator to exert centralized control over biodiversity data, decentralization generates loosely connected networks of mid-level aggregators. Global coordination is nevertheless feasible through automatable data sharing agreements that enable efficient propagation and translation of biodiversity data across communities. The decentralized model also poses novel integration challenges, among which the explicit and continuous articulation of conflicting systematic classifications and phylogenies remain the most challenging. We discuss the development of available solutions, challenges, and outline next steps: the global effort of coordination should focus on developing shared languages for data signal translation, as opposed to homogenizing the data signal itself.
Collapse
Affiliation(s)
- Beckett W Sterner
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| | - Edward E Gilbert
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| | - Nico M Franz
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| |
Collapse
|
5
|
Sterner B, Elliott S, Upham N, Franz N. Bats, objectivity, and viral spillover risk. HISTORY AND PHILOSOPHY OF THE LIFE SCIENCES 2021; 43:7. [PMID: 33439354 PMCID: PMC7805256 DOI: 10.1007/s40656-021-00366-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 01/07/2021] [Indexed: 06/12/2023]
Abstract
What should the best practices be for modeling zoonotic disease risks, e.g. to anticipate the next pandemic, when background assumptions are unsettled or evolving rapidly? This challenge runs deeper than one might expect, all the way into how we model the robustness of contemporary phylogenetic inference and taxonomic classifications. Different and legitimate taxonomic assumptions can destabilize the putative objectivity of zoonotic risk assessments, thus potentially supporting inconsistent and overconfident policy decisions.
Collapse
Affiliation(s)
- Beckett Sterner
- School of Life Sciences, Arizona State University, Tempe, USA.
| | - Steve Elliott
- Center for Gender Equity in Science and Technology, Arizona State University, Tempe, USA
| | - Nate Upham
- School of Life Sciences, Arizona State University, Tempe, USA
| | - Nico Franz
- School of Life Sciences, Arizona State University, Tempe, USA
| |
Collapse
|
6
|
Sterner B, Witteveen J, Franz N. Coordinating dissent as an alternative to consensus classification: insights from systematics for bio-ontologies. HISTORY AND PHILOSOPHY OF THE LIFE SCIENCES 2020; 42:8. [PMID: 32030540 DOI: 10.1007/s40656-020-0300-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Accepted: 01/17/2020] [Indexed: 06/10/2023]
Abstract
The collection and classification of data into meaningful categories is a key step in the process of knowledge making. In the life sciences, the design of data discovery and integration tools has relied on the premise that a formal classificatory system for expressing a body of data should be grounded in consensus definitions for classifications. On this approach, exemplified by the realist program of the Open Biomedical Ontologies Foundry, progress is maximized by grounding the representation and aggregation of data on settled knowledge. We argue that historical practices in systematic biology provide an important and overlooked alternative approach to classifying and disseminating data, based on a principle of coordinative rather than definitional consensus. Systematists have developed a robust system for referring to taxonomic entities that can deliver high quality data discovery and integration without invoking consensus about reality or "settled" science.
Collapse
Affiliation(s)
- Beckett Sterner
- School of Life Sciences, Arizona State University, Tempe, USA.
| | - Joeri Witteveen
- Department of Science Education, Section for History and Philosophy of Science, University of Copenhagen, Copenhagen, Denmark
| | - Nico Franz
- School of Life Sciences, Arizona State University, Tempe, USA
| |
Collapse
|
7
|
Franz NM, Musher LJ, Brown JW, Yu S, Ludäscher B. Verbalizing phylogenomic conflict: Representation of node congruence across competing reconstructions of the neoavian explosion. PLoS Comput Biol 2019; 15:e1006493. [PMID: 30768597 PMCID: PMC6395011 DOI: 10.1371/journal.pcbi.1006493] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2017] [Revised: 02/28/2019] [Accepted: 09/10/2018] [Indexed: 11/24/2022] Open
Abstract
Phylogenomic research is accelerating the publication of landmark studies that aim to resolve deep divergences of major organismal groups. Meanwhile, systems for identifying and integrating the products of phylogenomic inference-such as newly supported clade concepts-have not kept pace. However, the ability to verbalize node concept congruence and conflict across multiple, in effect simultaneously endorsed phylogenomic hypotheses, is a prerequisite for building synthetic data environments for biological systematics and other domains impacted by these conflicting inferences. Here we develop a novel solution to the conflict verbalization challenge, based on a logic representation and reasoning approach that utilizes the language of Region Connection Calculus (RCC-5) to produce consistent alignments of node concepts endorsed by incongruent phylogenomic studies. The approach employs clade concept labels to individuate concepts used by each source, even if these carry identical names. Indirect RCC-5 modeling of intensional (property-based) node concept definitions, facilitated by the local relaxation of coverage constraints, allows parent concepts to attain congruence in spite of their differentially sampled children. To demonstrate the feasibility of this approach, we align two recent phylogenomic reconstructions of higher-level avian groups that entail strong conflict in the "neoavian explosion" region. According to our representations, this conflict is constituted by 26 instances of input "whole concept" overlap. These instances are further resolvable in the output labeling schemes and visualizations as "split concepts", which provide the labels and relations needed to build truly synthetic phylogenomic data environments. Because the RCC-5 alignments fundamentally reflect the trained, logic-enabled judgments of systematic experts, future designs for such environments need to promote a culture where experts routinely assess the intensionalities of node concepts published by our peers-even and especially when we are not in agreement with each other.
Collapse
Affiliation(s)
- Nico M. Franz
- School of Life Sciences, Arizona State University, Tempe, Arizona, United States of America
| | - Lukas J. Musher
- Richard Gilder Graduate School and Department of Ornithology, American Museum of Natural History, New York, New York, United States of America
| | - Joseph W. Brown
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Shizhuo Yu
- Department of Computer Science, University of California at Davis, Davis, California, United States of America
| | - Bertram Ludäscher
- School of Information Sciences, University of Illinois at Urbana-Champaign, Champaign, Illinois, United States of America
| |
Collapse
|
8
|
Jansen MA, Franz NM. Descriptions of four new species of Minyomerus Horn, 1876 sec. Jansen & Franz, 2018 (Coleoptera: Curculionidae), with notes on their distribution and phylogeny. PeerJ 2018; 6:e5633. [PMID: 30356935 PMCID: PMC6195114 DOI: 10.7717/peerj.5633] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2018] [Accepted: 08/17/2018] [Indexed: 11/20/2022] Open
Abstract
This contribution adopts the taxonomic concept approach, including the use of taxonomic concept labels (name sec. [according to] source) and region connection calculus-5 (RCC-5) articulations and alignments. Prior to this study, the broad-nosed weevil genus Minyomerus Horn, 1876 sec. Jansen & Franz, 2015 (Curculionidae [non-focal]: Entiminae [non-focal]: Tanymecini [non-focal]) contained 17 species distributed throughout the desert and plains regions of North America. In this review of Minyomerus sec. Jansen & Franz, 2018, we describe the following four species as new to science: Minyomerus ampullaceus sec. Jansen & Franz, 2018 (henceforth: [JF2018]), new species, Minyomerus franko [JF2018], new species, Minyomerus sculptilis [JF2018], new species, and Minyomerus tylotos [JF2018], new species. The four new species are added to, and integrated with, the preceding revision, and an updated key and phylogeny of Minyomerus [JF2018] are presented. A cladistic analysis using 52 morphological characters of 26 terminal taxa (5/21 outgroup/ingroup) yielded a single most-parsimonious cladogram (Length = 99 steps, consistency index = 60, retention index = 80). The analysis reaffirms the monophyly of Minyomerus [JF2018] with eight unreversed synapomorphies. The species-group placements, possible biogeographic origins, and natural history of the new species are discussed in detail.
Collapse
Affiliation(s)
- M. Andrew Jansen
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Nico M. Franz
- ASU Natural History Collections, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
9
|
Morphological and Molecular Perspectives on the Phylogeny, Evolution, and Classification of Weevils (Coleoptera: Curculionoidea): Proceedings from the 2016 International Weevil Meeting. DIVERSITY 2018. [DOI: 10.3390/d10030064] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The 2016 International Weevil Meeting was held immediately after the International Congress of Entomology (ICE). It built on the topics and content of the 2016 ICE weevil symposium Phylogeny and Evolution of Weevils (Coleoptera: Curculionoidea): A Symposium in Honor of Dr. Guillermo "Willy” Kuschel. Beyond catalyzing research and collaboration, the meeting was intended to serve as a forum for identifying priorities and goals for those who study weevils. The meeting consisted of 46 invited and contributed lectures, discussion sessions and introductory remarks presented by 23 speakers along with eight contributed research posters. These were organized into three convened sessions, each lasting one day: (1) weevil morphology; (2) weevil fossils, biogeography and host/habitat associations; and (3) molecular phylogenetics and classification of weevils. Some of the topics covered included the 1K Weevils Project, major morphological character systems of adult and larval weevils, weevil morphological terminology, prospects for future morphological character discovery, phylogenetic analysis of morphological character data, the current status of weevil molecular phylogenetics and evolution, resources available for phylogenetic and comparative genomic studies of weevils, the weevil fossil record, weevil biogeography and evolution, weevil host plants, evolutionary development of the weevil rostrum, resources available for weevil identification and the current status of and challenges in weevil classification.
Collapse
|
10
|
Franz NM, Zhang C, Lee J. A logic approach to modelling nomenclatural change. Cladistics 2018; 34:336-357. [PMID: 34645079 DOI: 10.1111/cla.12201] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/10/2017] [Indexed: 11/27/2022] Open
Abstract
We utilize an Answer Set Programming (ASP) approach to show that the principles of nomenclature are tractable in computational logic. To this end we design a hypothetical, 20 nomenclatural taxon use case, with starting conditions that embody several overarching principles of the International Code of Zoological Nomenclature, including Binomial Nomenclature, Priority, Coordination, Homonymy, Typification and the structural requirement of Gender Agreement. The use case ending conditions are triggered by the reinterpretation of the diagnostic features of one of 12 type specimens anchoring the corresponding species-level epithets. Permutations of this child-to-parent reassignment action lead to 36 alternative scenarios, where each scenario requires a set of 1-14 logically contingent nomenclatural emendations. We show that an ASP transition system approach can correctly infer the Code-mandated changes for each scenario, and visually output the ending conditions. The results provide a foundation for further developing logic-based nomenclatural change optimization and validation services, which could be applied in global nomenclatural registries. More generally, logic explorations of nomenclatural and taxonomic change scenarios provide a novel means of assessing design biases inherent in the principles of nomenclature, and can therefore inform the design of future, big data-compatible identifier systems that recognize and mitigate these constraints.
Collapse
Affiliation(s)
- Nico M Franz
- School of Life Sciences, Arizona State University, PO Box 874501, Tempe, AZ, 85287-4501, USA
| | - Chao Zhang
- School of Computing, Informatics, and Decision Systems Engineering, Arizona State University, PO Box 878809, Tempe, AZ, 85287-8809, USA
| | - Joohyung Lee
- School of Computing, Informatics, and Decision Systems Engineering, Arizona State University, PO Box 878809, Tempe, AZ, 85287-8809, USA
| |
Collapse
|
11
|
Vaidya G, Lepage D, Guralnick R. The tempo and mode of the taxonomic correction process: How taxonomists have corrected and recorrected North American bird species over the last 127 years. PLoS One 2018; 13:e0195736. [PMID: 29672539 PMCID: PMC5909608 DOI: 10.1371/journal.pone.0195736] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2017] [Accepted: 03/28/2018] [Indexed: 11/19/2022] Open
Abstract
While studies of taxonomy usually focus on species description, there is also a taxonomic correction process that retests and updates existing species circumscriptions on the basis of new evidence. These corrections may themselves be subsequently retested and recorrected. We studied this correction process by using the Check-List of North and Middle American Birds, a well-known taxonomic checklist that spans 130 years. We identified 142 lumps and 95 splits across sixty-three versions of the Check-List and found that while lumping rates have markedly decreased since the 1970s, splitting rates are accelerating. We found that 74% of North American bird species recognized today have never been corrected (i.e., lumped or split) over the period of the checklist, while 16% have been corrected exactly once and 10% have been corrected twice or more. Since North American bird species are known to have been extensively lumped in the first half of the 20th century with the advent of the biological species concept, we determined whether most splits seen today were the result of those lumps being recorrected. We found that 5% of lumps and 23% of splits fully reverted previous corrections, while a further 3% of lumps and 13% of splits are partial reversions. These results show a taxonomic correction process with moderate levels of recorrection, particularly of previous lumps. However, 81% of corrections do not revert any previous corrections, suggesting that the majority result in novel circumscriptions not previously recognized by the Check-List. We could find no order or family with a significantly higher rate of correction than any other, but twenty-two genera as currently recognized by the AOU do have significantly higher rates than others. Given the currently accelerating rate of splitting, prediction of the end-point of the taxonomic recorrection process is difficult, and many entirely new taxonomic concepts are still being, and likely will continue to be, proposed and further tested.
Collapse
Affiliation(s)
- Gaurav Vaidya
- Department of Ecology and Evolutionary Biology, University of Colorado Boulder, Boulder, Colorado, United States of America
- * E-mail:
| | - Denis Lepage
- Bird Studies Canada, Port Rowan, Ontario, Canada
| | - Robert Guralnick
- Department of Natural History and the Florida Museum of Natural History, University of Florida, Gainesville, Florida, United States of America
| |
Collapse
|
12
|
Senderov V, Simov K, Franz N, Stoev P, Catapano T, Agosti D, Sautter G, Morris RA, Penev L. OpenBiodiv-O: ontology of the OpenBiodiv knowledge management system. J Biomed Semantics 2018; 9:5. [PMID: 29347997 PMCID: PMC5774086 DOI: 10.1186/s13326-017-0174-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 12/28/2017] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND The biodiversity domain, and in particular biological taxonomy, is moving in the direction of semantization of its research outputs. The present work introduces OpenBiodiv-O, the ontology that serves as the basis of the OpenBiodiv Knowledge Management System. Our intent is to provide an ontology that fills the gaps between ontologies for biodiversity resources, such as DarwinCore-based ontologies, and semantic publishing ontologies, such as the SPAR Ontologies. We bridge this gap by providing an ontology focusing on biological taxonomy. RESULTS OpenBiodiv-O introduces classes, properties, and axioms in the domains of scholarly biodiversity publishing and biological taxonomy and aligns them with several important domain ontologies (FaBiO, DoCO, DwC, Darwin-SW, NOMEN, ENVO). By doing so, it bridges the ontological gap across scholarly biodiversity publishing and biological taxonomy and allows for the creation of a Linked Open Dataset (LOD) of biodiversity information (a biodiversity knowledge graph) and enables the creation of the OpenBiodiv Knowledge Management System. A key feature of the ontology is that it is an ontology of the scientific process of biological taxonomy and not of any particular state of knowledge. This feature allows it to express a multiplicity of scientific opinions. The resulting OpenBiodiv knowledge system may gain a high level of trust in the scientific community as it does not force a scientific opinion on its users (e.g. practicing taxonomists, library researchers, etc.), but rather provides the tools for experts to encode different views as science progresses. CONCLUSIONS OpenBiodiv-O provides a conceptual model of the structure of a biodiversity publication and the development of related taxonomic concepts. It also serves as the basis for the OpenBiodiv Knowledge Management System.
Collapse
Affiliation(s)
- Viktor Senderov
- Pensoft Publishers, Prof. Georgi Zlatarski 12, Sofia, 1700 Bulgaria
- Institute of Biodiversity and Ecosystems Research, Bulgarian Academy of Sciences, Sofia, Bulgaria
| | - Kiril Simov
- Institute of Information and Communication Technologies, Bulgarian Academy of Sciences, Sofia, Bulgaria
| | - Nico Franz
- Arizona State University, School of Life Sciences, Tempe Campus, Tempe, 4501 AZ USA
| | - Pavel Stoev
- Pensoft Publishers, Prof. Georgi Zlatarski 12, Sofia, 1700 Bulgaria
- National Museum of Natural History, 1 Tsar Osvoboditel Blvd., Sofia, 1000 Bulgaria
| | | | | | | | | | - Lyubomir Penev
- Pensoft Publishers, Prof. Georgi Zlatarski 12, Sofia, 1700 Bulgaria
- Institute of Biodiversity and Ecosystems Research, Bulgarian Academy of Sciences, Sofia, Bulgaria
| |
Collapse
|
13
|
Franz NM, Sterner BW. To increase trust, change the social design behind aggregated biodiversity data. Database (Oxford) 2018; 2018:4791171. [PMID: 29315357 PMCID: PMC7206650 DOI: 10.1093/database/bax100] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2017] [Revised: 12/05/2017] [Accepted: 12/06/2017] [Indexed: 01/07/2023]
Abstract
Growing concerns about the quality of aggregated biodiversity data are lowering trust in large-scale data networks. Aggregators frequently respond to quality concerns by recommending that biologists work with original data providers to correct errors 'at the source.' We show that this strategy falls systematically short of a full diagnosis of the underlying causes of distrust. In particular, trust in an aggregator is not just a feature of the data signal quality provided by the sources to the aggregator, but also a consequence of the social design of the aggregation process and the resulting power balance between individual data contributors and aggregators. The latter have created an accountability gap by downplaying the authorship and significance of the taxonomic hierarchies-frequently called 'backbones'-they generate, and which are in effect novel classification theories that operate at the core of data-structuring process. The Darwin Core standard for sharing occurrence records plays an under-appreciated role in maintaining the accountability gap, because this standard lacks the syntactic structure needed to preserve the taxonomic coherence of data packages submitted for aggregation, potentially leading to inferences that no individual source would support. Since high-quality data packages can mirror competing and conflicting classifications, i.e. unsettled systematic research, this plurality must be accommodated in the design of biodiversity data integration. Looking forward, a key directive is to develop new technical pathways and social incentives for experts to contribute directly to the validation of taxonomically coherent data packages as part of a greater, trustworthy aggregation process.
Collapse
Affiliation(s)
- Nico M Franz
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Beckett W Sterner
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| |
Collapse
|
14
|
Rees JA, Cranston K. Automated assembly of a reference taxonomy for phylogenetic data synthesis. Biodivers Data J 2017:e12581. [PMID: 28765728 PMCID: PMC5515096 DOI: 10.3897/bdj.5.e12581] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2017] [Accepted: 05/12/2017] [Indexed: 12/24/2022] Open
|
15
|
Franz N, Zhang G. Three new species of entimine weevils in Early Miocene amber from the Dominican Republic (Coleoptera: Curculionidae). Biodivers Data J 2017; 5:e10469. [PMID: 28325975 PMCID: PMC5345054 DOI: 10.3897/bdj.5.e10469] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2016] [Accepted: 11/29/2016] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Using syntactic and semantic conventions of the taxonomic concept approach (Franz et al. 2015), we describe three newly recognized fossil broad-nosed weevils (Coleoptera: Curculionidae: Entiminae) preserved in Early Miocene amber (ca. 20.4-16.0 mya) from the Dominican Republic: Scelianoma compactasp. n. sec. Franz & Zhang (2017) (henceforth abbreviated as [FZ2017]), Tropirhinus palpebratussp. n. [FZ2017], and Diaprepes anticussp. n. [FZ2017]. The taxonomic assignment of the amber inclusions is grounded in a preceding phylogenetic analysis by Franz (2012). As many as 88 of the 143 therein identified characters were coded for the fossils, whose traits are largely congruent with those present in extant congeners while also differing in ways that justify their new nomenclatural and taxonomic status. NEW INFORMATION We present detailed images, descriptions, and phylogenetically informed diagnoses for the three new species-level entities, along with logically consistent Region Connection Calculus (RCC-5) alignments of the amended genus-level classifications for Scelianoma Franz and Girón 2009 [FZ2017], Tropirhinus Schoenherr 1823 [FZ2017], and Diaprepes Schoenherr 1823 [FZ2017] - in relation to 2-4 preceding classifications published in 1982-2012. The description of Scelianoma compacta [FZ2017] from Hispaniola is indicative of a more widespread historical range of Scelianoma [FZ2017] than reflected in the extant, southwestern Puerto Rican Scelianoma elydimorpha Franz and Girón 2009 sec. Franz and Girón (2009). The presence of Diaprepes anticus [FZ2017] in Hispaniola during the Early Miocene suggests an eastward directed process of island colonization and likely speciation of members of Diaprepes [FZ2017], given that most extant relatives occur throughout the Lesser Antilles. The herein presented data will facilitate more reliable reconstructions of historical biographic processes thought to have played a prominent role in the diversification of the West Indian and Neotropical mainland broad-nosed weevil lineages.
Collapse
Affiliation(s)
- Nico Franz
- Arizona State University, Tempe, United States of America
| | - Guanyang Zhang
- Arizona State University, Tempe, United States of America
| |
Collapse
|
16
|
Franz N, Gilbert E, Ludäscher B, Weakley A. Controlling the taxonomic variable: Taxonomic concept resolution for a southeastern United States herbarium portal. RESEARCH IDEAS AND OUTCOMES 2016. [DOI: 10.3897/rio.2.e10610] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Overview. Taxonomic names are imperfect identifiers of specific and sometimes conflicting taxonomic perspectives in aggregated biodiversity data environments. The inherent ambiguities of names can be mitigated using syntactic and semantic conventions developed under the taxonomic concept approach. These include: (1) representation of taxonomic concept labels (TCLs: name sec. source) to precisely identify name usages and meanings, (2) use of parent/child relationships to assemble separate taxonomic perspectives, and (3) expert provision of Region Connection Calculus articulations (RCC–5: congruence, [inverse] inclusion, overlap, exclusion) that specify how data identified to different-sourced TCLs can be integrated. Application of these conventions greatly increases trust in biodiversity data networks, most of which promote unitary taxonomic 'syntheses' that obscure the actual diversity of expert-held views. Better design solutions allow users to control the taxonomic variable and thereby assess the robustness of their biological inferences under different perspectives. A unique constellation of prior efforts – including the powerful Symbiota collections software platform, the Euler/X multi-taxonomy alignment toolkit, and the "Weakley Flora" which entails 7,000 concepts and more than 75,000 RCC–5 articulations – provides the opportunity to build a first full-scale concept resolution service for SERNEC, the SouthEast Regional Network of Expertise and Collections, currently with 60 member herbaria and 2 million occurrence records.
Intellectual merit. We have developed a multi-dimensional, step-wise plan to transition SERNEC's data culture from name- to concept-based practices. (1) We will engage SERNEC experts through annual, regional workshops and follow-up interactions that will foster buy-in and ultimately the completion of 12 community-identified use cases. (2). We will leverage RCC–5 data from the Weakley Flora and further development of the Euler/X logic reasoning toolkit to provide comprehensive genus- to variety-level concept alignments for at least 10 major flora treatments with highest relevance to SERNEC. The visualizations and estimated > 1 billion inferred concept-to-concept relations will effectively drive specimen data integration in the transformed portal. (3) We will expand Symbiota's taxonomy and occurrence schemas and related user interfaces to support the new concept data, including novel batch and map-based specimen determination modules, with easy output options in Darwin Core Archive format. (4) Through combinations of the new technology, enlisted taxonomic expertise, and SERNEC's large image resources, we will upgrade minimally 80% of all SERNEC specimen identifications from names to the narrowest suitable TCLs, or add "uncertainty" flags to specimens needing further study. (5) We will utilize the novel tools and data to demonstrate how controlling for the taxonomic variable in 12 use cases variously drives the outcomes of evolutionary, ecological, and conservation-based research hypotheses.
Broader impacts. Our project is focused on just one herbarium network, but the potential impact is as wide as Darwin Core or even comparative biology. We believe that trust in networked biodiversity data depends on open and dynamic system designs, allowing expert access and resolution of multiple conflicting views that reflect the complex realities of ongoing taxonomic research. Taking well over 1 million SERNEC records from name- to TCL-resolution will show that "big" specimen data can pass the credibility threshold needed to validate the substantive data mobilization investment. We will mentor one postdoctoral researcher (UNC), two Ph.D. students (ASU, UIUC), and at least 15 undergraduate students (ASU). Each of our workshops will capacitate 10-15 SERNEC experts, who in turn can recruit colleagues and students at their home collections. We will incorporate the project theme and use cases into undergraduate courses taught at six institutions and reaching an estimated 300-500 students annually (10-40% minority students). At each institution, project members will make a systematic effort to recruit new students from underrepresented groups. Our group's leadership of Symbiota (with close ties to iDigBio), SERNEC, and local biodiversity projects and centers will further promote the new data culture. We will create a feature story "Where do plant species occur?" for ASU's popular "Ask A Biologist" website, and a series of undergraduate student-led "How-To" videos that illustrate the use case workflows, including the creation of multi-taxonomy alignments.
Collapse
|