1
|
Wang M, Zhu M, Qian J, Yang Z, Shang F, Egan AN, Li P, Liu L. Phylogenomics of mulberries (Morus, Moraceae) inferred from plastomes and single copy nuclear genes. Mol Phylogenet Evol 2024; 197:108093. [PMID: 38740145 DOI: 10.1016/j.ympev.2024.108093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 04/30/2024] [Accepted: 05/06/2024] [Indexed: 05/16/2024]
Abstract
Mulberries (genus Morus), belonging to the order Rosales, family Moraceae, are important woody plants due to their economic values in sericulture, as well as for nutritional benefits and medicinal values. However, the taxonomy and phylogeny of Morus, especially for the Asian species, remains challenging due to its wide geographical distribution, morphological plasticity, and interspecific hybridization. To better understand the evolutionary history of Morus, we combined plastomes and a large-scale nuclear gene analyses to investigate their phylogenetic relationships. We assembled the plastomes and screened 211 single-copy nuclear genes from 13 Morus species and related taxa. The plastomes of Morus species were relatively conserved in terms of genome size, gene content, synteny, IR boundary and codon usage. Using nuclear data, our results elucidated identical topologies based on coalescent and concatenation methods. The genus Morus was supported as monophyletic, with M. notabilis as the first diverging lineage and the two North American Morus species, M. microphylla and M. rubra, as sister to the other Asian species. In the Asian Morus species, interspecific relationships were completely resolved. However, cyto-nuclear discordances and gene tree-species tree conflicts were detected in the phylogenies of Morus, with multiple evidences supporting hybridization/introgression as the main cause of discordances between nuclear and plastid phylogenies, while gene tree-species tree conflicts were mainly caused by ILS.
Collapse
Affiliation(s)
- Meizhen Wang
- College of Life Sciences, Henan Normal University, Xinxiang 453000, China; Systematic & Evolutionary Botany and Biodiversity Group, MOE Key Laboratory of Biosystems Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | - Mengmeng Zhu
- Laboratory of Plant Germplasm and Genetic Engineering, School of Life Sciences, Henan University, Kaifeng 475001, China
| | - Jiayi Qian
- College of Life Sciences, Henan Agricultural University, Zhengzhou 450002, China
| | - Zhaoping Yang
- College of Life Sciences and Technologies, Tarim University, Alar 843300, China
| | - Fude Shang
- Laboratory of Plant Germplasm and Genetic Engineering, School of Life Sciences, Henan University, Kaifeng 475001, China; College of Life Sciences, Henan Agricultural University, Zhengzhou 450002, China.
| | - Ashley N Egan
- Department of Biology, Utah Valley University, Orem, UT 84058, United States.
| | - Pan Li
- Systematic & Evolutionary Botany and Biodiversity Group, MOE Key Laboratory of Biosystems Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Luxian Liu
- College of Life Sciences, Henan Normal University, Xinxiang 453000, China; Laboratory of Plant Germplasm and Genetic Engineering, School of Life Sciences, Henan University, Kaifeng 475001, China.
| |
Collapse
|
2
|
Byerly PA, Kearns AM, Welch A, Ochirbat ME, Marra PP, Wilson A, Campana MG, Fleischer RC. Museum genomics provide insight into the extinction of a specialist North American warbler species. Sci Rep 2024; 14:17047. [PMID: 39048633 PMCID: PMC11269716 DOI: 10.1038/s41598-024-67595-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Accepted: 07/12/2024] [Indexed: 07/27/2024] Open
Abstract
Museum genomics provide an opportunity to investigate population demographics of extinct species, especially valuable when research prior to extinction was minimal. The Bachman's warbler (Vermivora bachmanii) is hypothesized to have gone extinct due to loss of its specialized habitat. However, little is known about other potential contributing factors such as natural rarity or changes to connectivity following habitat fragmentation. We examined mitochondrial DNA (mtDNA) and genome-wide SNPs using specimens collected from breeding and migration sites across the range of the Bachman's warbler. We found no signals of strong population structuring across the breeding range of Bachman's warblers in both mtDNA and genome-wide SNPs. Thus, long-term population isolation did not appear to be a significant contributor to the extinction of the Bachman's warbler. Instead, our findings support the theory that Bachman's warblers underwent a rapid decline likely driven by habitat destruction, which may have been exacerbated by the natural rarity, habitat specificity and low genetic diversity of the species.
Collapse
Affiliation(s)
- Paige A Byerly
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA.
| | - Anna M Kearns
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA
- Australian National Wildlife Collection, CSIRO National Research Collections Australia, Canberra, Australia
| | - Andreanna Welch
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA
- Department of Biosciences, Durham University, South Road, Durham, UK
| | - Margad-Erdene Ochirbat
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA
| | - Peter P Marra
- Department of Biology and McCourt School of Public Policy, Georgetown University, 37th and O Streets NW, Washington, DC, 20057, USA
| | - Amy Wilson
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA
- Department of Forest and Conservation Sciences, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Michael G Campana
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA
| | - Robert C Fleischer
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA
| |
Collapse
|
3
|
Quek ZBR, Ng SH. Hybrid-Capture Target Enrichment in Human Pathogens: Identification, Evolution, Biosurveillance, and Genomic Epidemiology. Pathogens 2024; 13:275. [PMID: 38668230 PMCID: PMC11054155 DOI: 10.3390/pathogens13040275] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 03/11/2024] [Accepted: 03/18/2024] [Indexed: 04/29/2024] Open
Abstract
High-throughput sequencing (HTS) has revolutionised the field of pathogen genomics, enabling the direct recovery of pathogen genomes from clinical and environmental samples. However, pathogen nucleic acids are often overwhelmed by those of the host, requiring deep metagenomic sequencing to recover sufficient sequences for downstream analyses (e.g., identification and genome characterisation). To circumvent this, hybrid-capture target enrichment (HC) is able to enrich pathogen nucleic acids across multiple scales of divergences and taxa, depending on the panel used. In this review, we outline the applications of HC in human pathogens-bacteria, fungi, parasites and viruses-including identification, genomic epidemiology, antimicrobial resistance genotyping, and evolution. Importantly, we explored the applicability of HC to clinical metagenomics, which ultimately requires more work before it is a reliable and accurate tool for clinical diagnosis. Relatedly, the utility of HC was exemplified by COVID-19, which was used as a case study to illustrate the maturity of HC for recovering pathogen sequences. As we unravel the origins of COVID-19, zoonoses remain more relevant than ever. Therefore, the role of HC in biosurveillance studies is also highlighted in this review, which is critical in preparing us for the next pandemic. We also found that while HC is a popular tool to study viruses, it remains underutilised in parasites and fungi and, to a lesser extent, bacteria. Finally, weevaluated the future of HC with respect to bait design in the eukaryotic groups and the prospect of combining HC with long-read HTS.
Collapse
Affiliation(s)
- Z. B. Randolph Quek
- Defence Medical & Environmental Research Institute, DSO National Laboratories, Singapore 117510, Singapore
| | | |
Collapse
|
4
|
Liu L, Chen M, Folk RA, Wang M, Zhao T, Shang F, Soltis DE, Li P. Phylogenomic and syntenic data demonstrate complex evolutionary processes in early radiation of the rosids. Mol Ecol Resour 2023; 23:1673-1688. [PMID: 37449554 DOI: 10.1111/1755-0998.13833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 06/16/2023] [Accepted: 06/30/2023] [Indexed: 07/18/2023]
Abstract
Some of the most vexing problems of deep level relationship that remain in angiosperms involve the superrosids. The superrosid clade contains a quarter of all angiosperm species, with 18 orders in three subclades (Vitales, Saxifragales and core rosids) exhibiting remarkable morphological and ecological diversity. To help resolve deep-level relationships, we constructed a high-quality chromosome-level genome assembly for Tiarella polyphylla (Saxifragaceae) thus providing broader genomic representation of Saxifragales. Whole genome microsynteny analysis of superrosids showed that Saxifragales shared more synteny clusters with core rosids than Vitales, further supporting Saxifragales as more closely related with core rosids. To resolve the ordinal phylogeny of superrosids, we screened 122 single copy nuclear genes from genomes of 36 species, representing all 18 superrosid orders. Vitales were recovered as sister to all other superrosids (Saxifragales + core rosids). Our data suggest dramatic differences in relationships compared to earlier studies within core rosids. Fabids should be restricted to the nitrogen-fixing clade, while Picramniales, the Celastrales-Malpighiales (CM) clade, Huerteales, Oxalidales, Sapindales, Malvales and Brassicales formed an "expanded" malvid clade. The Celastrales-Oxalidales-Malpighiales (COM) clade (sensu APG IV) was not monophyletic. Crossosomatales, Geraniales, Myrtales and Zygophyllales did not belong to either of our well-supported malvids or fabids. There is strong discordance between nuclear and plastid phylogenetic hypotheses for superrosid relationships; we show that this is best explained by a combination of incomplete lineage sorting and ancient reticulation.
Collapse
Affiliation(s)
- Luxian Liu
- Laboratory of Plant Germplasm and Genetic Engineering, School of Life Sciences, Henan University, Kaifeng, Henan, China
- Systematic & Evolutionary Botany and Biodiversity Group, MOE Key Laboratory of Biosystems Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China
| | - Mengzhen Chen
- Laboratory of Plant Germplasm and Genetic Engineering, School of Life Sciences, Henan University, Kaifeng, Henan, China
| | - Ryan A Folk
- Department of Biological Sciences, Mississippi State University, Starkville, Mississippi, USA
| | - Meizhen Wang
- Systematic & Evolutionary Botany and Biodiversity Group, MOE Key Laboratory of Biosystems Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China
| | - Tao Zhao
- State Key Laboratory of Crop Stress Biology for Arid Areas/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, Shaanxi, China
| | - Fude Shang
- Laboratory of Plant Germplasm and Genetic Engineering, School of Life Sciences, Henan University, Kaifeng, Henan, China
- Henan Engineering Research Center for Osmanthus Germplasm Innovation and Resource Utilization, Henan Agricultural University, Zhengzhou, Henan, China
| | - Douglas E Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, Florida, USA
- Department of Biology, University of Florida, Gainesville, Florida, USA
| | - Pan Li
- Systematic & Evolutionary Botany and Biodiversity Group, MOE Key Laboratory of Biosystems Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China
| |
Collapse
|
5
|
Prado NA, Armstrong EE, Brown JL, Goldenberg SZ, Leimgruber P, Pearson VR, Maldonado JE, Campana MG. Genomic resources for Asian (Elephas maximus) and African savannah elephant (Loxodonta africana) conservation and health research. J Hered 2023; 114:529-538. [PMID: 37246890 DOI: 10.1093/jhered/esad034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Accepted: 05/26/2023] [Indexed: 05/30/2023] Open
Abstract
We provide novel genomic resources to help understand the genomic traits involved in elephant health and to aid conservation efforts. We sequence 11 elephant genomes (5 African savannah, 6 Asian) from North American zoos, including 9 de novo assemblies. We estimate elephant germline mutation rates and reconstruct demographic histories. Finally, we provide an in-solution capture assay to genotype Asian elephants. This assay is suitable for analyzing degraded museum and noninvasive samples, such as feces and hair. The elephant genomic resources we present here should allow for more detailed and uniform studies in the future to aid elephant conservation efforts and disease research.
Collapse
Affiliation(s)
- Natalia A Prado
- Biology Department, College of Arts and Sciences, Adelphi University, Garden City, NY, United States
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, United States
- Endocrinology Research Laboratory, Center for Species Survival, Smithsonian's National Zoo and Conservation Biology Institute, Front Royal, VA, United States
| | - Ellie E Armstrong
- Department of Biology, Stanford University, Stanford, CA, United States
- School of Biological Sciences, Washington State University, Pullman, WA, United States
| | - Janine L Brown
- Endocrinology Research Laboratory, Center for Species Survival, Smithsonian's National Zoo and Conservation Biology Institute, Front Royal, VA, United States
| | - Shifra Z Goldenberg
- Conservation Ecology Center, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, United States
- Conservation Science and Wildlife Health, San Diego Zoo Wildlife Alliance, Escondido, CA, United States
| | - Peter Leimgruber
- Conservation Ecology Center, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, United States
| | - Virginia R Pearson
- Glenn Rall Laboratory, Fox Chase Cancer Center, Philadelphia, PA, United States
| | - Jesús E Maldonado
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, United States
| | - Michael G Campana
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, United States
| |
Collapse
|
6
|
Nunes R, Storer C, Doleck T, Kawahara AY, Pierce NE, Lohman DJ. Predictors of sequence capture in a large-scale anchored phylogenomics project. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.943361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2023] Open
Abstract
Next-generation sequencing (NGS) technologies have revolutionized phylogenomics by decreasing the cost and time required to generate sequence data from multiple markers or whole genomes. Further, the fragmented DNA of biological specimens collected decades ago can be sequenced with NGS, reducing the need for collecting fresh specimens. Sequence capture, also known as anchored hybrid enrichment, is a method to produce reduced representation libraries for NGS sequencing. The technique uses single-stranded oligonucleotide probes that hybridize with pre-selected regions of the genome that are sequenced via NGS, culminating in a dataset of numerous orthologous loci from multiple taxa. Phylogenetic analyses using these sequences have the potential to resolve deep and shallow phylogenetic relationships. Identifying the factors that affect sequence capture success could save time, money, and valuable specimens that might be destructively sampled despite low likelihood of sequencing success. We investigated the impacts of specimen age, preservation method, and DNA concentration on sequence capture (number of captured sequences and sequence quality) while accounting for taxonomy and extracted tissue type in a large-scale butterfly phylogenomics project. This project used two probe sets to extract 391 loci or a subset of 13 loci from over 6,000 butterfly specimens. We found that sequence capture is a resilient method capable of amplifying loci in samples of varying age (0–111 years), preservation method (alcohol, papered, pinned), and DNA concentration (0.020 ng/μl - 316 ng/ul). Regression analyses demonstrate that sequence capture is positively correlated with DNA concentration. However, sequence capture and DNA concentration are negatively correlated with sample age and preservation method. Our findings suggest that sequence capture projects should prioritize the use of alcohol-preserved samples younger than 20 years old when available. In the absence of such specimens, dried samples of any age can yield sequence data, albeit with returns that diminish with increasing age.
Collapse
|
7
|
Alanko JN, Slizovskiy IB, Lokshtanov D, Gagie T, Noyes NR, Boucher C. Syotti: scalable bait design for DNA enrichment. Bioinformatics 2022; 38:i177-i184. [PMID: 35758776 PMCID: PMC9235489 DOI: 10.1093/bioinformatics/btac226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Bait enrichment is a protocol that is becoming increasingly ubiquitous as it has been shown to successfully amplify regions of interest in metagenomic samples. In this method, a set of synthetic probes ('baits') are designed, manufactured and applied to fragmented metagenomic DNA. The probes bind to the fragmented DNA and any unbound DNA is rinsed away, leaving the bound fragments to be amplified for sequencing. Metsky et al. demonstrated that bait-enrichment is capable of detecting a large number of human viral pathogens within metagenomic samples. RESULTS We formalize the problem of designing baits by defining the Minimum Bait Cover problem, show that the problem is NP-hard even under very restrictive assumptions, and design an efficient heuristic that takes advantage of succinct data structures. We refer to our method as Syotti. The running time of Syotti shows linear scaling in practice, running at least an order of magnitude faster than state-of-the-art methods, including the method of Metsky et al. At the same time, our method produces bait sets that are smaller than the ones produced by the competing methods, while also leaving fewer positions uncovered. Lastly, we show that Syotti requires only 25 min to design baits for a dataset comprised of 3 billion nucleotides from 1000 related bacterial substrains, whereas the method of Metsky et al. shows clearly super-linear running time and fails to process even a subset of 17% of the data in 72 h. AVAILABILITY AND IMPLEMENTATION https://github.com/jnalanko/syotti. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jarno N Alanko
- Department of Computer Science, University of Helsinki, Helsinki, Finland
- Faculty of Computer Science, Dalhousie University, Halifax, Canada
| | - Ilya B Slizovskiy
- Department of Veterinary Population Medicine, University of Minnesota, St. Paul, MN, USA
| | - Daniel Lokshtanov
- Department of Computer Science, University of California, Santa Barbara, CA, USA
| | - Travis Gagie
- Faculty of Computer Science, Dalhousie University, Halifax, Canada
| | - Noelle R Noyes
- Department of Veterinary Population Medicine, University of Minnesota, St. Paul, MN, USA
| | - Christina Boucher
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL, USA
| |
Collapse
|
8
|
Baier C, Huang J, Reumann K, Indenbirken D, Thol F, Koenecke C, Ebadi E, Heim A, Bange FC, Haid S, Pietschmann T, Fischer N. Target capture sequencing reveals a monoclonal outbreak of respiratory syncytial virus B infections among adult hematologic patients. Antimicrob Resist Infect Control 2022; 11:88. [PMID: 35729671 PMCID: PMC9210056 DOI: 10.1186/s13756-022-01120-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2022] [Accepted: 05/24/2022] [Indexed: 11/10/2022] Open
Abstract
Background Respiratory syncytial virus (RSV) causes community-acquired respiratory tract infections during winter. However, outbreaks in hospitals also occur repeatedly. In particular, patients with hematologic malignancies are at an increased risk for a severe and potentially fatal course of RSV infection. Here we present the investigation of an RSV outbreak in a hematology ward for adults following the ORION statement. Methods An epidemiologic and molecular outbreak analysis was performed. We developed and employed a minimal oligonucleotide probe set in target capture probe sequencing that allows cost-effective RSV-A or -B capturing to reconstruct RSV genomes from clinical samples. Results Four adult patients were involved in the outbreak caused by RSV-B in March 2019. The enforcement of the pre-existing infection control measures by effective training of hospital staff contributed to a successful containment. PCR-based RSV screening on the ward enabled early detection of new cases and rapid isolation measures. The molecular analysis demonstrated that the outbreak sequences were highly related and distinct to other RSV-B strains circulating at the same time. Conclusions A multimodal infection control concept is essential for the timely detection and control of RSV outbreaks in patients with hematological disease. Among other measures, preventive screening for respiratory viruses is recommended. Furthermore, the integration of conventional and molecular epidemiology, such as whole-genome sequencing and variant calling, significantly contributes to the understanding of transmission pathways. Based on this, appropriate conclusions can be drawn for targeted prevention measures that have prepared us for the COVID-19 pandemic beyond the RSV approach described here. Supplementary Information The online version contains supplementary material available at 10.1186/s13756-022-01120-z.
Collapse
|
9
|
Kearns AM, Campana MG, Slikas B, Berry L, Saitoh T, Cibois A, Fleischer RC. Conservation genomics and systematics of a near-extinct island radiation. Mol Ecol 2022; 31:1995-2012. [PMID: 35119154 DOI: 10.1111/mec.16382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Revised: 01/16/2022] [Accepted: 01/24/2022] [Indexed: 11/27/2022]
Abstract
Conservation benefits from incorporating genomics to explore the impacts of population declines, inbreeding, loss of genetic variation and hybridization. Here we use the near-extinct Mariana Islands reedwarbler radiation to showcase how ancient DNA approaches can allow insights into the population dynamics of extinct species and threatened populations for which historical museum specimens or material with low DNA yield (e.g., scats, feathers) are the only sources for DNA. Despite their having paraphyletic mtDNA, nuclear SNPs support the distinctiveness of critically endangered Acrocephalus hiwae and the other three species in the radiation that went extinct between the 1960s and 1990s. Two extinct species, A. yamashinae and A. luscinius, were deeply divergent from each other and from a third less differentiated lineage containing A. hiwae and extinct A. nijoi. Both mtDNA and SNPs suggest that the two isolated populations of A. hiwae from Saipan and Alamagan Islands are sufficiently distinct to warrant subspecies recognition and separate conservation management. We detected no significant differences in genetic diversity or inbreeding between Saipan and Alamagan, nor strong signatures of geographic structuring within either island. However, the implications of possible signatures of inbreeding in both Saipan and Alamagan, and long-term population declines in A. hiwae that predate modern anthropogenic threats require further study with denser population sampling. Our study highlights the value conservation genomics studies of island radiations have as windows onto the possible future for the world's biota as climate change and habitat destruction increasingly fragments their ranges and contributes to rapid declines in population abundances.
Collapse
Affiliation(s)
- Anna M Kearns
- Center for Conservation Genomics, Smithsonian's National Zoological Park and Conservation Biology Institute, Washington, DC, 20008, USA
| | - Michael G Campana
- Center for Conservation Genomics, Smithsonian's National Zoological Park and Conservation Biology Institute, Washington, DC, 20008, USA
| | - Beth Slikas
- Center for Conservation Genomics, Smithsonian's National Zoological Park and Conservation Biology Institute, Washington, DC, 20008, USA.,Center for Evolution & Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, 85287, USA
| | - Lainie Berry
- Department of Lands and Natural Resources, Commonwealth of the Northern Mariana Islands, Saipan, MP, 96950, USA.,Hawaii Department of Land and Natural Resources-Division of Forestry and Wildlife, Honolulu, HI, 96813, USA
| | - Takema Saitoh
- Yamashina Institute for Ornithology, 115 Konoyama, Abiko, Chiba, 270-1145, Japan
| | - Alice Cibois
- Natural History Museum of Geneva, CP, 6434, 1211, Geneva, Switzerland
| | - Robert C Fleischer
- Center for Conservation Genomics, Smithsonian's National Zoological Park and Conservation Biology Institute, Washington, DC, 20008, USA
| |
Collapse
|
10
|
Parker LD, Campana MG, Quinta JD, Cypher B, Rivera I, Fleischer RC, Ralls K, Wilbert TR, Boarman R, Boarman WI, Maldonado JE. An efficient method for simultaneous species, individual, and sex identification via in-solution single nucleotide polymorphism capture from low-quality scat samples. Mol Ecol Resour 2021; 22:1345-1361. [PMID: 34779133 DOI: 10.1111/1755-0998.13552] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 09/24/2021] [Accepted: 10/27/2021] [Indexed: 12/01/2022]
Abstract
Understanding predator population dynamics is important for conservation management because of the critical roles predators play within ecosystems. Noninvasive genetic sampling methods are useful for the study of predators like canids that can be difficult to capture or directly observe. Here, we introduce the FAECES* method (Fast and Accurate Enrichment of Canid Excrement for Species* and other analyses) which expands the toolbox for canid researchers and conservationists by using in-solution hybridization sequence capture to produce single nucleotide polymorphism (SNP) genotypes for multiple canid species from scat-derived DNA using a single enrichment. We designed a set of hybridization probes to genotype both coyotes (Canis latrans) and kit foxes (Vulpes macrotis) at hundreds of polymorphic SNP loci and we tested the probes on both tissues and field-collected scat samples. We enriched and genotyped by sequencing 52 coyote and 70 kit fox scats collected in and around a conservation easement in the Nevada Mojave Desert. We demonstrate that the FAECES* method produces genotypes capable of differentiating coyotes and kit foxes, identifying individuals and their sex, and estimating genetic diversity and effective population sizes, even using highly degraded, low-quantity DNA extracted from scat. We found that the study area harbours a large and diverse population of kit foxes and a relatively smaller population of coyotes. By replicating our methods in the future, conservationists can assess the impacts of management decisions on canid populations. The method can also be adapted and applied more broadly to enrich and sequence multiple loci from any species of interest using scat or other noninvasive genetic samples.
Collapse
Affiliation(s)
- Lillian D Parker
- Center for Conservation Genomics, Smithsonian Conservation Biology Institute and National Zoological Park, Washington, District of Columbia, USA.,School of Systems Biology, George Mason University, Fairfax, Virginia, USA
| | - Michael G Campana
- Center for Conservation Genomics, Smithsonian Conservation Biology Institute and National Zoological Park, Washington, District of Columbia, USA.,School of Systems Biology, George Mason University, Fairfax, Virginia, USA.,Department of Environmental Science and Policy, George Mason University, Fairfax, Virginia, USA
| | - Jessica D Quinta
- Center for Conservation Genomics, Smithsonian Conservation Biology Institute and National Zoological Park, Washington, District of Columbia, USA
| | - Brian Cypher
- Endangered Species Recovery Program, California State University, Turlock, California, USA
| | - Isabel Rivera
- Center for Conservation Genomics, Smithsonian Conservation Biology Institute and National Zoological Park, Washington, District of Columbia, USA
| | - Robert C Fleischer
- Center for Conservation Genomics, Smithsonian Conservation Biology Institute and National Zoological Park, Washington, District of Columbia, USA
| | - Katherine Ralls
- Center for Conservation Genomics, Smithsonian Conservation Biology Institute and National Zoological Park, Washington, District of Columbia, USA
| | - Tammy R Wilbert
- Center for Conservation Genomics, Smithsonian Conservation Biology Institute and National Zoological Park, Washington, District of Columbia, USA
| | - Ryan Boarman
- Conservation Science Research and Consultation, Spring Valley, California, USA
| | - William I Boarman
- Conservation Science Research and Consultation, Spring Valley, California, USA
| | - Jesús E Maldonado
- Center for Conservation Genomics, Smithsonian Conservation Biology Institute and National Zoological Park, Washington, District of Columbia, USA.,School of Systems Biology, George Mason University, Fairfax, Virginia, USA.,Department of Environmental Science and Policy, George Mason University, Fairfax, Virginia, USA
| |
Collapse
|
11
|
McLay TGB, Birch JL, Gunn BF, Ning W, Tate JA, Nauheimer L, Joyce EM, Simpson L, Schmidt‐Lebuhn AN, Baker WJ, Forest F, Jackson CJ. New targets acquired: Improving locus recovery from the Angiosperms353 probe set. APPLICATIONS IN PLANT SCIENCES 2021; 9:APS311420. [PMID: 34336399 PMCID: PMC8312740 DOI: 10.1002/aps3.11420] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Accepted: 03/15/2021] [Indexed: 05/10/2023]
Abstract
PREMISE Universal target enrichment kits maximize utility across wide evolutionary breadth while minimizing the number of baits required to create a cost-efficient kit. The Angiosperms353 kit has been successfully used to capture loci throughout the angiosperms, but the default target reference file includes sequence information from only 6-18 taxa per locus. Consequently, reads sequenced from on-target DNA molecules may fail to map to references, resulting in fewer on-target reads for assembly, and reducing locus recovery. METHODS We expanded the Angiosperms353 target file, incorporating sequences from 566 transcriptomes to produce a 'mega353' target file, with each locus represented by 17-373 taxa. This mega353 file is a drop-in replacement for the original Angiosperms353 file in HybPiper analyses. We provide tools to subsample the file based on user-selected taxon groups, and to incorporate other transcriptome or protein-coding gene data sets. RESULTS Compared to the default Angiosperms353 file, the mega353 file increased the percentage of on-target reads by an average of 32%, increased locus recovery at 75% length by 49%, and increased the total length of the concatenated loci by 29%. DISCUSSION Increasing the phylogenetic density of the target reference file results in improved recovery of target capture loci. The mega353 file and associated scripts are available at: https://github.com/chrisjackson-pellicle/NewTargets.
Collapse
Affiliation(s)
- Todd G. B. McLay
- National Herbarium of VictoriaRoyal Botanic Gardens VictoriaMelbourneAustralia
- School of BiosciencesUniversity of MelbourneMelbourneAustralia
- Centre for Australian National Biodiversity ResearchCSIROCanberraAustralia
| | - Joanne L. Birch
- School of BiosciencesUniversity of MelbourneMelbourneAustralia
| | - Bee F. Gunn
- National Herbarium of VictoriaRoyal Botanic Gardens VictoriaMelbourneAustralia
- School of BiosciencesUniversity of MelbourneMelbourneAustralia
| | - Weixuan Ning
- School of Fundamental SciencesMassey UniversityPalmerston NorthNew Zealand
| | - Jennifer A. Tate
- School of Fundamental SciencesMassey UniversityPalmerston NorthNew Zealand
| | - Lars Nauheimer
- James Cook UniversityCairnsAustralia
- Australian Tropical HerbariumJames Cook UniversityCairnsAustralia
| | - Elizabeth M. Joyce
- James Cook UniversityCairnsAustralia
- Australian Tropical HerbariumJames Cook UniversityCairnsAustralia
| | - Lalita Simpson
- James Cook UniversityCairnsAustralia
- Australian Tropical HerbariumJames Cook UniversityCairnsAustralia
| | | | | | - Félix Forest
- Royal Botanic Gardens, KewRichmondSurreyTW9 3AEUnited Kingdom
| | - Chris J. Jackson
- National Herbarium of VictoriaRoyal Botanic Gardens VictoriaMelbourneAustralia
| |
Collapse
|
12
|
Joubran SS, Cassin-Sackett L. Genomic resources for an ecologically important rodent, Gunnison’s prairie dogs (Cynomys gunnisoni). CONSERV GENET RESOUR 2021. [DOI: 10.1007/s12686-021-01192-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
13
|
Gustafson GT, Alexander A, Sproul JS, Pflug JM, Maddison DR, Short AEZ. Ultraconserved element (UCE) probe set design: Base genome and initial design parameters critical for optimization. Ecol Evol 2019; 9:6933-6948. [PMID: 31312430 PMCID: PMC6617817 DOI: 10.1002/ece3.5260] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Revised: 04/26/2019] [Accepted: 04/28/2019] [Indexed: 01/10/2023] Open
Abstract
Targeted capture and enrichment approaches have proven effective for phylogenetic study. Ultraconserved elements (UCEs) in particular have exhibited great utility for phylogenomic analyses, with the software package phyluce being among the most utilized pipelines for UCE phylogenomics, including probe design. Despite the success of UCEs, it is becoming increasing apparent that diverse lineages require probe sets tailored to focal taxa in order to improve locus recovery. However, factors affecting probe design and methods for optimizing probe sets to focal taxa remain underexplored. Here, we use newly available beetle (Coleoptera) genomic resources to investigate factors affecting UCE probe set design using phyluce. In particular, we explore the effects of stringency during initial design steps, as well as base genome choice on resulting probe sets and locus recovery. We found that both base genome choice and initial bait design stringency parameters greatly alter the number of resultant probes included in final probe sets and strongly affect the number of loci detected and recovered during in silico testing of these probe sets. In addition, we identify attributes of base genomes that correlated with high performance in probe design. Ultimately, we provide a recommended workflow for using phyluce to design an optimized UCE probe set that will work across a targeted lineage, and use our findings to develop a new, open-source UCE probe set for beetles of the suborder Adephaga.
Collapse
Affiliation(s)
- Grey T. Gustafson
- Department of Ecology and Evolutionary BiologyUniversity of KansasLawrenceKansas
- Biodiversity InstituteUniversity of KansasLawrenceKansas
| | - Alana Alexander
- Biodiversity InstituteUniversity of KansasLawrenceKansas
- Department of Anatomy, School of Biomedical SciencesUniversity of OtagoDunedinNew Zealand
| | - John S. Sproul
- Department of Integrative BiologyOregon State UniversityCorvallisOregon
- Department of BiologyUniversity of RochesterRochesterNew York
| | - James M. Pflug
- Department of Integrative BiologyOregon State UniversityCorvallisOregon
| | - David R. Maddison
- Department of Integrative BiologyOregon State UniversityCorvallisOregon
| | - Andrew E. Z. Short
- Department of Ecology and Evolutionary BiologyUniversity of KansasLawrenceKansas
- Biodiversity InstituteUniversity of KansasLawrenceKansas
| |
Collapse
|
14
|
Jiang J, Yuan H, Zheng X, Wang Q, Kuang T, Li J, Liu J, Song S, Wang W, Cheng F, Li H, Huang J, Li C. Gene markers for exon capture and phylogenomics in ray-finned fishes. Ecol Evol 2019; 9:3973-3983. [PMID: 31015981 PMCID: PMC6468074 DOI: 10.1002/ece3.5026] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2018] [Revised: 02/11/2019] [Accepted: 02/12/2019] [Indexed: 12/30/2022] Open
Abstract
Gene capture coupled with the next-generation sequencing has become one of the preferred methods of subsampling genomes for phylogenomic studies. Many exon markers have been developed in plants, sharks, frogs, reptiles, fishes, and others, but no universal exon markers have been tested in ray-finned fishes. Here, we identified a suite of "single-copy" protein-coding sequence (CDS) markers through comparing eight fish genomes, and tested them empirically in 83 species (33 families and nine orders or higher clades: Acipenseriformes, Lepisosteiformes, Elopomorpha, Osteoglossomorpha, Clupeiformes, Cypriniformes, Gobiaria, Carangaria, and Eupercaria; sensu Betancur et al. 2013). Sorting the markers according to their completeness and phylogenetic decisiveness in taxa tested resulted in a selection of 4,434 markers, which were proven to be useful in reconstructing phylogenies of the ray-finned fishes at different taxonomic levels. We also proposed a strategy of refining baits (probes) design a posteriori based on empirical data. The markers that we have developed may greatly enrich the batteries of exon markers for phylogenomic study in ray-finned fishes.
Collapse
Affiliation(s)
- Jiamei Jiang
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Shanghai Ocean University), Ministry of Education, ShanghaiNational Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University)ShanghaiChina
| | - Hao Yuan
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Shanghai Ocean University), Ministry of Education, ShanghaiNational Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University)ShanghaiChina
| | - Xin Zheng
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Shanghai Ocean University), Ministry of Education, ShanghaiNational Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University)ShanghaiChina
| | - Qian Wang
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Shanghai Ocean University), Ministry of Education, ShanghaiNational Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University)ShanghaiChina
| | - Ting Kuang
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Shanghai Ocean University), Ministry of Education, ShanghaiNational Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University)ShanghaiChina
| | - Jingyan Li
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Shanghai Ocean University), Ministry of Education, ShanghaiNational Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University)ShanghaiChina
| | - Junning Liu
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Shanghai Ocean University), Ministry of Education, ShanghaiNational Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University)ShanghaiChina
| | - Shuli Song
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Shanghai Ocean University), Ministry of Education, ShanghaiNational Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University)ShanghaiChina
| | - Weicai Wang
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Shanghai Ocean University), Ministry of Education, ShanghaiNational Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University)ShanghaiChina
| | - Fangyuan Cheng
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Shanghai Ocean University), Ministry of Education, ShanghaiNational Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University)ShanghaiChina
| | - Hongjie Li
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Shanghai Ocean University), Ministry of Education, ShanghaiNational Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University)ShanghaiChina
| | - Junman Huang
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Shanghai Ocean University), Ministry of Education, ShanghaiNational Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University)ShanghaiChina
| | - Chenhong Li
- Shanghai Universities Key Laboratory of Marine Animal Taxonomy and Evolution, Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Shanghai Ocean University), Ministry of Education, ShanghaiNational Demonstration Center for Experimental Fisheries Science Education (Shanghai Ocean University)ShanghaiChina
| |
Collapse
|
15
|
Population Genomics and Structure of the Critically Endangered Mariana Crow ( Corvus kubaryi). Genes (Basel) 2019; 10:genes10030187. [PMID: 30832245 PMCID: PMC6471520 DOI: 10.3390/genes10030187] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Revised: 02/16/2019] [Accepted: 02/19/2019] [Indexed: 11/29/2022] Open
Abstract
The Mariana Crow, or Åga (Corvus kubaryi), is a critically endangered species (IUCN -International Union for Conservation of Nature), endemic to the islands of Guam and Rota in the Mariana Archipelago. It is locally extinct on Guam, and numbers have declined dramatically on Rota to a historical low of less than 55 breeding pairs throughout the island in 2013. Because of its extirpation on Guam and population decline on Rota, it is of critical importance to assess the genetic variation among individuals to assist ongoing recovery efforts. We conducted a population genomics analysis comparing the Guam and Rota populations and studied the genetic structure of the Rota population. We used blood samples from five birds from Guam and 78 birds from Rota. We identified 145,552 candidate single nucleotide variants (SNVs) from a genome sequence of an individual from Rota and selected a subset of these to develop an oligonucleotide in-solution capture assay. The Guam and Rota populations were genetically differentiated from each other. Crow populations sampled broadly across their range on Rota showed significant genetic structuring – a surprising result given the small size of this island and the good flight capabilities of the species. Knowledge of its genetic structure will help improve management strategies to help with its recovery.
Collapse
|
16
|
Li X, Hao B, Pan D, Schneeweiss GM. Marker Development for Phylogenomics: The Case of Orobanchaceae, a Plant Family with Contrasting Nutritional Modes. FRONTIERS IN PLANT SCIENCE 2017; 8:1973. [PMID: 29218053 PMCID: PMC5704539 DOI: 10.3389/fpls.2017.01973] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Accepted: 11/01/2017] [Indexed: 05/02/2023]
Abstract
Phylogenomic approaches, employing next-generation sequencing (NGS) techniques, have revolutionized systematic and evolutionary biology. Target enrichment is an efficient and cost-effective method in phylogenomics and is becoming increasingly popular. Depending on availability and quality of reference data as well as on biological features of the study system, (semi-)automated identification of suitable markers will require specific bioinformatic pipelines. Here, we established a highly flexible bioinformatic pipeline, BaitsFinder, to identify putative orthologous single copy genes (SCGs) and to construct bait sequences in a single workflow. Additionally, this pipeline has been constructed to be able to cope with challenging data sets, such as the nutritionally heterogeneous plant family Orobanchaceae. To this end, we used transcriptome data of differing quality available for four Orobanchaceae species and, as reference, SCG data from monkeyflower (Erythranthe guttata, syn. Mimulus g.; 1,915 genes) and tomato (Solanum lycopersicum; 391 genes). Depending on whether gaps were permitted in initial blast searches of the four Orobanchaceae species against the reference, our pipeline identified 1,307 and 981 SCGs with average length of 994 bp and 775 bp, respectively. Automated bait sequence construction (using 2× tiling) resulted in 38,170 and 21,856 bait sequences, respectively. In comparison to the recently published MarkerMiner 1.0 pipeline BaitsFinder identified about 1.6 times as many SCGs (of at least 900 bp length). Skipping steps specific to analyses of Orobanchaceae, BaitsFinder was successfully used in a group of non-parasitic plants (three Asteraceae species and, as reference, SCG data from Arabidopsis thaliana based on previously compiled SCGs). Thus, BaitsFinder is expected to be broadly applicable in groups, where only transcriptomes or partial genome data of differing quality are available.
Collapse
|