1
|
Jackson AC, White OW, Carine M, Chapman MA. The role of geography, ecology, and hybridization in the evolutionary history of Canary Island Descurainia. AMERICAN JOURNAL OF BOTANY 2023; 110:e16162. [PMID: 36990083 DOI: 10.1002/ajb2.16162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 03/15/2023] [Accepted: 03/15/2023] [Indexed: 05/31/2023]
Abstract
PREMISE Oceanic islands offer the opportunity to understand evolutionary processes underlying rapid diversification. Along with geographic isolation and ecological shifts, a growing body of genomic evidence has suggested that hybridization can play an important role in island evolution. Here we use genotyping-by-sequencing (GBS) to understand the roles of hybridization, ecology, and geographic isolation in the radiation of Canary Island Descurainia (Brassicaceae). METHODS We carried out GBS for multiple individuals of all Canary Island species and two outgroups. Phylogenetic analyses of the GBS data were performed using both supermatrix and gene tree approaches and hybridization events were examined using D-statistics and Approximate Bayesian Computation. Climatic data were analyzed to examine the relationship between ecology and diversification. RESULTS Analysis of the supermatrix data set resulted in a fully resolved phylogeny. Species networks suggest a hybridization event has occurred for D. gilva, with these results being supported by Approximate Bayesian Computation analysis. Strong phylogenetic signals for temperature and precipitation indicate one major ecological shift within Canary Island Descurainia. CONCLUSIONS Inter-island dispersal played a significant role in the diversification of Descurainia, with evidence of only one major shift in climate preferences. Despite weak reproductive barriers and the occurrence of hybrids, hybridization appears to have played only a limited role in the diversification of the group with a single instance detected. The results highlight the need to use phylogenetic network approaches that can simultaneously accommodate incomplete lineage sorting and gene flow when studying groups prone to hybridization; patterns that might otherwise be obscured in species trees.
Collapse
Affiliation(s)
- Amy C Jackson
- Biological Sciences, University of Southampton, Southampton, SO17 1BJ, United Kingdom
- Algae, Fungi and Plants Division, Department of Life Sciences, The Natural History Museum, Cromwell Road, London, SW7 5BD, United Kingdom
| | - Oliver W White
- Biological Sciences, University of Southampton, Southampton, SO17 1BJ, United Kingdom
- Algae, Fungi and Plants Division, Department of Life Sciences, The Natural History Museum, Cromwell Road, London, SW7 5BD, United Kingdom
| | - Mark Carine
- Algae, Fungi and Plants Division, Department of Life Sciences, The Natural History Museum, Cromwell Road, London, SW7 5BD, United Kingdom
| | - Mark A Chapman
- Biological Sciences, University of Southampton, Southampton, SO17 1BJ, United Kingdom
| |
Collapse
|
2
|
Genome-wide SSR markers in bottle gourd: development, characterization, utilization in assessment of genetic diversity of National Genebank of India and synteny with other related cucurbits. J Appl Genet 2022; 63:237-263. [PMID: 35106708 DOI: 10.1007/s13353-022-00684-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Revised: 01/06/2022] [Accepted: 01/15/2022] [Indexed: 10/19/2022]
Abstract
Lagenaria siceraria (Molina) Standley is an important cultivated crop with its immense importance in pharmaceutical industry and as vegetable. Its seed, root, stem, leaves, flower, and fruit are used as an ointment for ailment of various diseases throughout Asia. Despite its worldwide importance, informative co-dominant microsatellite markers in the bottle gourd crop are very restricted, impeding genetic improvement, cultivar identification, and phylogenetic studies. Next-generation sequencing has revolutionized the approaches for discovery, assessment, and validation of molecular markers. We conducted a genome-wide analysis, for developing SSR markers by utilizing restriction site-associated DNA sequencing (RAD-Seq) data obtained from NCBI. By performing in silico mining of microsatellite repeat motifs, we developed 45,066 perfect SSR markers. Of which 207 markers were successfully validated and 120 (57.97%) polymorphic primer pairs were utilized for an in-depth genetic diversity and population structure analysis of 96 accessions from the National Genebank of India. Tetranucleotide repeats (∼34.3%) were the most prevalent followed by trinucleotide repeats (∼30.73%), further 21.03%, 9.6%, and 4.3% of di-, penta-, and hexa-nucleotide repeats in the bottle gourd genome, respectively. Synteny of SSR markers on 11 bottle gourd linkage groups was correlated with the 7 chromosomes of cucumber (93.2%), 12 chromosomes of melon (87.4%), and 11 of watermelon (90.8%). The generated SSR markers provide a valuable tool for germplasm characterization, genetic linkage map construction, studying synteny, gene discovery, and for breeding in bottle gourd and other cucurbits species. KEY MESSAGE: Development of 45,066 perfect microsatellite markers as a valuable tool for marker assisted selection (MAS) in plant breeding.
Collapse
|
3
|
How to Choose a Good Marker to Analyze the Olive Germplasm ( Olea europaea L.) and Derived Products. Genes (Basel) 2021; 12:genes12101474. [PMID: 34680869 PMCID: PMC8535536 DOI: 10.3390/genes12101474] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 09/08/2021] [Accepted: 09/16/2021] [Indexed: 12/30/2022] Open
Abstract
The olive tree (Olea europaea L.) is one of the most cultivated crops in the Mediterranean basin. Its economic importance is mainly due to the intense production of table olives and oil. Cultivated varieties are characterized by high morphological and genetic variability and present a large number of synonyms and homonyms. This necessitates the introduction of a rapid and accurate system for varietal identification. In the past, the recognition of olive cultivars was based solely on analysis of the morphological traits, however, these are highly influenced by environmental conditions. Therefore, over the years, several methods based on DNA analysis were developed, allowing a more accurate and reliable varietal identification. This review aims to investigate the evolving history of olive tree characterization approaches, starting from the earlier morphological methods to the latest technologies based on molecular markers, focusing on the main applications of each approach. Furthermore, we discuss the impact of the advent of next generation sequencing and the recent sequencing of the olive genome on the strategies used for the development of new molecular markers.
Collapse
|
4
|
Song Z, Li X. Recent Advances in Molecular Marker-Assisted Breeding for Quality Improvement of Traditional Chinese Medicine. Curr Pharm Biotechnol 2021; 22:867-875. [PMID: 32351179 DOI: 10.2174/1389201021666200430121013] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Revised: 04/01/2020] [Accepted: 04/02/2020] [Indexed: 11/22/2022]
Abstract
BACKGROUND The quality of Traditional Chinese Medicine (TCM), reflected by its bioactive compounds and associated contents, is directly linked to its clinical efficacy. Therefore, it is of great importance to improve the quality of TCM by increasing the bioactive compound content. METHODS Mapping the active component content-associated QTLs in TCM and further markerassisted breeding has enabled us to rapidly and effectively cultivate new varieties with high bioactive compound contents, which has opened the door for genetic breeding studies on medicinal plants. RESULTS In this paper, a strategy and technical molecular breeding method for TCM are discussed. The development of four methods and progress in functional marker development, as well as the applications of such markers in TCM, are reviewed. CONCLUSION The progress in, challenges of, and future of marker-assisted breeding for quality improvement of TCM are discussed, which provide valuable scientific references for future molecular breeding.
Collapse
Affiliation(s)
- Zhenqiao Song
- State Key Laboratory of Crop Biology, Shandong Key Laboratory of Crop Biology, Shandong Agricultural University, Tai'an 271018, China
| | - Xingfeng Li
- State Key Laboratory of Crop Biology, Shandong Key Laboratory of Crop Biology, Shandong Agricultural University, Tai'an 271018, China
| |
Collapse
|
5
|
Phylogeny, Taxonomy, and Biogeography of Pterocarya (Juglandaceae). PLANTS 2020; 9:plants9111524. [PMID: 33182441 PMCID: PMC7696814 DOI: 10.3390/plants9111524] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Revised: 11/02/2020] [Accepted: 11/06/2020] [Indexed: 12/11/2022]
Abstract
Relict species play an important role in understanding the biogeography of intercontinental disjunctions. Pterocarya (a relict genus) is the valuable model taxon for studying the biogeography of East Asian versus southern European/West Asian disjunct patterns. This disjunction has not been as well studied as others (e.g., between Eastern Asia and North America). Several phylogenetic studies on Pterocarya have been conducted, but none have provided a satisfactory phylogenetic resolution. Here, we report the first well-resolved phylogeny of Pterocarya using restriction site-associated DNA sequencing data based on the sampling of all taxa across the entire distribution area of the genus. Taxonomic treatments were also clarified by combining morphological traits. Furthermore, fossil-calibrated phylogeny was used to explore the biogeography of Pterocarya. Our results support the existence of two sections in Pterocarya, which is in accordance with morphological taxonomy. Section Platyptera comprises three species: P. rhoifolia, P. macroptera, and P. delavayi. Section Pterocarya also comprises three species: P. fraxinifolia, P. hupehensis, and P. stenoptera. The divergence between the two sections took place during the early Miocene (20.5 Ma). The formation of the Gobi Desert and climate cooling of northern Siberia in the Middle Miocene (15.7 Ma) might have caused the split of the continuous distribution of this genus and the formation of the East Asian versus southern European/West Asian disjunct pattern. Lastly, the divergence between P. hupehensis and P. stenoptera as well as between P. rhoifolia and P. macroptera/P. delavayi (10.0 Ma) supports the late Miocene diversification hypothesis in East Asia.
Collapse
|
6
|
Ghirotto S, Vizzari MT, Tassi F, Barbujani G, Benazzo A. Distinguishing among complex evolutionary models using unphased whole-genome data through random forest approximate Bayesian computation. Mol Ecol Resour 2020; 21:2614-2628. [PMID: 33000507 DOI: 10.1111/1755-0998.13263] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Revised: 08/28/2020] [Accepted: 09/07/2020] [Indexed: 01/25/2023]
Abstract
Inferring past demographic histories is crucial in population genetics, and the amount of complete genomes now available should in principle facilitate this inference. In practice, however, the available inferential methods suffer from severe limitations. Although hundreds complete genomes can be simultaneously analysed, complex demographic processes can easily exceed computational constraints, and the procedures to evaluate the reliability of the estimates contribute to increase the computational effort. Here we present an approximate Bayesian computation framework based on the random forest algorithm (ABC-RF), to infer complex past population processes using complete genomes. To this aim, we propose to summarize the data by the full genomic distribution of the four mutually exclusive categories of segregating sites (FDSS), a statistic fast to compute from unphased genome data and that does not require the ancestral state of alleles to be known. We constructed an efficient ABC pipeline and tested how accurately it allows one to recognize the true model among models of increasing complexity, using simulated data and taking into account different sampling strategies in terms of number of individuals analysed, number and size of the genetic loci considered. We also compared the FDSS with the unfolded and folded site frequency spectrum (SFS), and for these statistics we highlighted the experimental conditions maximizing the inferential power of the ABC-RF procedure. We finally analysed real data sets, testing models on the dispersal of anatomically modern humans out of Africa and exploring the evolutionary relationships of the three species of Orangutan inhabiting Borneo and Sumatra.
Collapse
Affiliation(s)
- Silvia Ghirotto
- Department of Mathematics and Computer Science, University of Ferrara, Ferrara, Italy
| | - Maria Teresa Vizzari
- Department of Mathematics and Computer Science, University of Ferrara, Ferrara, Italy
| | - Francesca Tassi
- Department of Life Sciences and Biotechnology, University of Ferrara, Ferrara, Italy
| | - Guido Barbujani
- Department of Life Sciences and Biotechnology, University of Ferrara, Ferrara, Italy
| | - Andrea Benazzo
- Department of Life Sciences and Biotechnology, University of Ferrara, Ferrara, Italy
| |
Collapse
|
7
|
Sui J, Luan S, Dai P, Fu Q, Meng X, Luo K, Cao B, Kong J. High accuracy of pooled DNA genotyping by 2b-RAD sequencing in the Pacific white shrimp, Litopenaeus vannamei. PLoS One 2020; 15:e0236343. [PMID: 32730349 PMCID: PMC7392308 DOI: 10.1371/journal.pone.0236343] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Accepted: 07/04/2020] [Indexed: 11/18/2022] Open
Abstract
Using pooled DNA genotyping to estimate the proportional contributions from multiple families in a pooled sample is of particular interest for selective breeding in aquaculture. We compared different pooled libraries with separate 2b-RAD sequencing of Litopenaeus vannamei individuals to assess the effect of different population structures (different numbers of individuals and families) on pooled DNA sequencing, the accuracy of parent sequencing of the DNA pools and the effect of SNP numbers on pooled DNA sequencing. We demonstrated that small pooled DNA genotyping of up to 53 individuals by 2b-RAD sequencing could provide a highly accurate assessment of population allele frequencies. The accuracy increased as the number of individuals and families increased. The allele frequencies of the parents from each pool were highly correlated with those of the pools or the corresponding individuals in the pool. We chose 500-28,000 SNPs to test the effect of SNP number on the accuracy of pooled sequencing, and no linear relationship was found between them. When the SNP number was fixed, increasing the number of individuals in the mixed pool resulted in higher accuracy of each pooled genotyping. Our data confirmed that pooled DNA genotyping by 2b-RAD sequencing could achieve higher accuracy than that of individual-based genotyping. The results will provide important information for shrimp breeding programs.
Collapse
Affiliation(s)
- Juan Sui
- Key Laboratory for Sustainable Utilization of Marine Fisheries Resources, Ministry of Agriculture, Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, China
- Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Sheng Luan
- Key Laboratory for Sustainable Utilization of Marine Fisheries Resources, Ministry of Agriculture, Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, China
- Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Ping Dai
- Key Laboratory for Sustainable Utilization of Marine Fisheries Resources, Ministry of Agriculture, Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, China
- Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Qiang Fu
- Key Laboratory for Sustainable Utilization of Marine Fisheries Resources, Ministry of Agriculture, Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, China
- Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Xianhong Meng
- Key Laboratory for Sustainable Utilization of Marine Fisheries Resources, Ministry of Agriculture, Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, China
- Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Kun Luo
- Key Laboratory for Sustainable Utilization of Marine Fisheries Resources, Ministry of Agriculture, Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, China
- Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Baoxiang Cao
- Key Laboratory for Sustainable Utilization of Marine Fisheries Resources, Ministry of Agriculture, Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, China
- Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Jie Kong
- Key Laboratory for Sustainable Utilization of Marine Fisheries Resources, Ministry of Agriculture, Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, China
- Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- * E-mail:
| |
Collapse
|
8
|
Loureiro L, Engstrom M, Lim B. Optimization of Genotype by Sequencing data for phylogenetic purposes. MethodsX 2020; 7:100892. [PMID: 32373482 PMCID: PMC7195544 DOI: 10.1016/j.mex.2020.100892] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2019] [Accepted: 04/03/2020] [Indexed: 12/19/2022] Open
Abstract
• Herein we propose a framework for assembling and analyzing Genotype by Sequencing (GBS) data to better understand evolutionary relationships within a group of closely related species using the mastiff bats (Molossus) as our model system. Many species within this genus have low-levels of genetic variation within and between morphologically distinct species, and the relationships among them remain unresolved using traditional Sanger sequencing methods. Given that both de novo and reference genome pipelines can be used to assemble next generation sequences, and that several tree inference methodologies have been proposed for single nucleotide polymorphism (SNP) data, we test whether different alignments and phylogenetic approaches produce similar results. We also examined how the process of SNP identification and mapping can affect the consistency of the analyses. Different alignments and phylogenetic inferences produced consistent results, supporting the GBS approach for answering evolutionary questions on a macroevolutionary scale when the genetic distance among phenotypically identifiable clades is low. We highlight the importance of exploring the relationships among groups using different assembly assumptions and also distinct phylogenetic inference methods, particularly when addressing phylogenetic questions in genetic and morphologically conservative taxa. • The method uses the comparison of several filter settings, alignments, and tree inference approaches on Genotype by Sequencing data. • Consistent results were found among several approaches. • The methodology successfully recovered well supported species boundaries and phylogenetic relationships among species of mastiff bats not hypothesized by previous methods.
Collapse
Affiliation(s)
- L.O. Loureiro
- University of Toronto, Canada
- Royal Ontario Museum, Canada
| | - M.D. Engstrom
- University of Toronto, Canada
- Royal Ontario Museum, Canada
| | | |
Collapse
|
9
|
Bresadola L, Link V, Buerkle CA, Lexer C, Wegmann D. Estimating and accounting for genotyping errors in RAD‐seq experiments. Mol Ecol Resour 2020; 20:856-870. [DOI: 10.1111/1755-0998.13153] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Revised: 02/21/2020] [Accepted: 02/26/2020] [Indexed: 12/26/2022]
Affiliation(s)
- Luisa Bresadola
- Department of Biology University of Fribourg Fribourg Switzerland
| | - Vivian Link
- Department of Biology University of Fribourg Fribourg Switzerland
- Swiss Institute of Bioinformatics Fribourg Switzerland
| | | | - Christian Lexer
- Department of Botany and Biodiversity Research University of Vienna Vienna Austria
| | - Daniel Wegmann
- Department of Biology University of Fribourg Fribourg Switzerland
- Swiss Institute of Bioinformatics Fribourg Switzerland
| |
Collapse
|
10
|
Phelps MP, Seeb LW, Seeb JE. Transforming ecology and conservation biology through genome editing. CONSERVATION BIOLOGY : THE JOURNAL OF THE SOCIETY FOR CONSERVATION BIOLOGY 2020; 34:54-65. [PMID: 30693970 DOI: 10.1111/cobi.13292] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2018] [Revised: 12/23/2018] [Accepted: 01/24/2019] [Indexed: 06/09/2023]
Abstract
As the conservation challenges increase, new approaches are needed to help combat losses in biodiversity and slow or reverse the decline of threatened species. Genome-editing technology is changing the face of modern biology, facilitating applications that were unimaginable only a decade ago. The technology has the potential to make significant contributions to the fields of evolutionary biology, ecology, and conservation, yet the fear of unintended consequences from designer ecosystems containing engineered organisms has stifled innovation. To overcome this gap in the understanding of what genome editing is and what its capabilities are, more research is needed to translate genome-editing discoveries into tools for ecological research. Emerging and future genome-editing technologies include new clustered regularly interspaced short palindromic repeats (CRISPR) targeted sequencing and nucleic acid detection approaches as well as species genetic barcoding and somatic genome-editing technologies. These genome-editing tools have the potential to transform the environmental sciences by providing new noninvasive methods for monitoring threatened species or for enhancing critical adaptive traits. A pioneering effort by the conservation community is required to apply these technologies to real-world conservation problems.
Collapse
Affiliation(s)
- Michael P Phelps
- Department of Pathology, University of Washington, Box 357705, Seattle, WA, 98195, U.S.A
| | - Lisa W Seeb
- School of Aquatic and Fisheries Sciences, University of Washington, Seattle, WA, 98195, U.S.A
| | - James E Seeb
- School of Aquatic and Fisheries Sciences, University of Washington, Seattle, WA, 98195, U.S.A
| |
Collapse
|
11
|
Gould AL, Dunlap PV. Shedding Light on Specificity: Population Genomic Structure of a Symbiosis Between a Coral Reef Fish and Luminous Bacterium. Front Microbiol 2019; 10:2670. [PMID: 31824455 PMCID: PMC6879551 DOI: 10.3389/fmicb.2019.02670] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Accepted: 11/01/2019] [Indexed: 11/13/2022] Open
Abstract
All organisms depend on symbiotic associations with bacteria for their success, yet how these interspecific interactions influence the population structure, ecology, and evolution of microbial symbionts is not well understood. Additionally, patterns of genetic variation in interacting species can reveal ecological traits that are important to gene flow and co-evolution. In this study, we define patterns of spatial and temporal genetic variation of a coral reef fish, Siphamia tubifer, and its luminous bacterial symbiont, Photobacterium mandapamensis in the Okinawa Islands, Japan. Using restriction site-associated sequencing (RAD-Seq) methods, we show that populations of the facultative light organ symbiont of S. tubifer exhibit genetic structure at fine spatial scales of tens of kilometers despite the absence of physical barriers to dispersal and in contrast to populations of the host fish. These results suggest that the host’s behavioral ecology and environmental interactions between host and symbiont help to structure symbiont populations in the region, consequently fostering the specificity of the association between host generations. Our approach also revealed several symbiont genes that were divergent between host populations, including hfq and a homolog of varS, both of which play a role in host association in Vibrio cholerae. Overall, this study highlights the important role that a host animal can play in structuring the distribution of its bacterial symbiont, particularly in highly connected marine environments, thereby promoting specificity of the symbiosis between host generations.
Collapse
Affiliation(s)
- Alison L Gould
- Department of Ichthyology, California Academy of Sciences, San Francisco, CA, United States.,Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, United States
| | - Paul V Dunlap
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, United States
| |
Collapse
|
12
|
Martín-Hernanz S, Aparicio A, Fernández-Mazuecos M, Rubio E, Reyes-Betancort JA, Santos-Guerra A, Olangua-Corral M, Albaladejo RG. Maximize Resolution or Minimize Error? Using Genotyping-By-Sequencing to Investigate the Recent Diversification of Helianthemum (Cistaceae). FRONTIERS IN PLANT SCIENCE 2019; 10:1416. [PMID: 31781140 PMCID: PMC6859804 DOI: 10.3389/fpls.2019.01416] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Accepted: 10/11/2019] [Indexed: 05/27/2023]
Abstract
A robust phylogenetic framework, in terms of extensive geographical and taxonomic sampling, well-resolved species relationships and high certainty of tree topologies and branch length estimations, is critical in the study of macroevolutionary patterns. Whereas Sanger sequencing-based methods usually recover insufficient phylogenetic signal, especially in recently diversified lineages, reduced-representation sequencing methods tend to provide well-supported phylogenetic relationships, but usually entail remarkable bioinformatic challenges due to the inherent trade-off between the number of SNPs and the magnitude of associated error rates. The genus Helianthemum (Cistaceae) is a species-rich and taxonomically complex Palearctic group of plants that diversified mainly since the Upper Miocene. It is a challenging case study since previous attempts using Sanger sequencing were unable to resolve the intrageneric phylogenetic relationships. Aiming to obtain a robust phylogenetic reconstruction based on genotyping-by-sequencing (GBS), we established a rigorous methodological workflow in which we i) explored how variable settings during dataset assembly have an impact on error rates and on the degree of resolution under concatenation and coalescent approaches, ii) assessed the effect of two extreme parameter configurations (minimizing error rates vs. maximizing phylogenetic resolution) on tree topology and branch lengths, and iii) evaluated the effects of these two configurations on estimates of divergence times and diversification rates. Our analyses produced highly supported topologically congruent phylogenetic trees for both configurations. However, minimizing error rates did produce more reliable branch lengths, critically affecting the accuracy of downstream analyses (i.e. divergence times and diversification rates). In addition to recommending a revision of intrageneric systematics, our results enabled us to identify three highly diversified lineages in Helianthemum in contrasting geographical areas and ecological conditions, which started radiating in the Upper Miocene.
Collapse
Affiliation(s)
- Sara Martín-Hernanz
- Departamento de Biología Vegetal y Ecología, Universidad de Sevilla, Sevilla, Spain
| | - Abelardo Aparicio
- Departamento de Biología Vegetal y Ecología, Universidad de Sevilla, Sevilla, Spain
| | | | - Encarnación Rubio
- Departamento de Biología Vegetal y Ecología, Universidad de Sevilla, Sevilla, Spain
| | - J. Alfredo Reyes-Betancort
- Jardín de Aclimatación de la Orotava, Instituto Canario de Investigaciones Agrarias (ICIA), Santa Cruz de Tenerife, Spain
| | - Arnoldo Santos-Guerra
- Jardín de Aclimatación de la Orotava, Instituto Canario de Investigaciones Agrarias (ICIA), Santa Cruz de Tenerife, Spain
| | - María Olangua-Corral
- Departamento de Biología Reproductiva y Micro-morfología, Jardín Botánico Canario ‘Viera y Clavijo’—Unidad Asociada CSIC (Cabildo de Gran Canaria), Las Palmas de Gran Canaria, Spain
| | - Rafael G. Albaladejo
- Departamento de Biología Vegetal y Ecología, Universidad de Sevilla, Sevilla, Spain
| |
Collapse
|
13
|
Li Z, Tian C, Huang Y, Lin X, Wang Y, Jiang D, Zhu C, Chen H, Li G. A First Insight into a Draft Genome of Silver Sillago ( Sillago sihama) via Genome Survey Sequencing. Animals (Basel) 2019; 9:ani9100756. [PMID: 31581597 PMCID: PMC6827152 DOI: 10.3390/ani9100756] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Accepted: 09/27/2019] [Indexed: 11/16/2022] Open
Abstract
Sillago sihama has high economic value and is one of the most attractive aquaculture species in China. Despite its economic importance, studies of its genome have barely been performed. In this study, we conducted a first genomic survey of S. sihama using next-generation sequencing (NGS). In total, 45.063 Gb of high-quality sequence data were obtained. For the 17-mer frequency distribution, the genome size was estimated to be 508.50 Mb. The sequence repeat ratio was calculated to be 21.25%, and the heterozygosity ratio was 0.92%. Reads were assembled into 1,009,363 contigs, with a N50 length of 1362 bp, and then into 814,219 scaffolds, with a N50 length of 2173 bp. The average Guanine and Cytosine (GC) content was 45.04%. Dinucleotide repeats (56.55%) were the dominant form of simple sequence repeats (SSR).
Collapse
Affiliation(s)
- Zhiyuan Li
- Guangdong Research Center on Reproductive Control and Breeding Technology of Indigenous Valuable Fish Species, Fisheries College, Guangdong Ocean University, Zhanjiang 524088, China.
| | - Changxu Tian
- Guangdong Research Center on Reproductive Control and Breeding Technology of Indigenous Valuable Fish Species, Fisheries College, Guangdong Ocean University, Zhanjiang 524088, China.
- Southern Marine Science and Engineering Guangdong Laboratory, Zhanjiang 524025, China.
| | - Yang Huang
- Guangdong Research Center on Reproductive Control and Breeding Technology of Indigenous Valuable Fish Species, Fisheries College, Guangdong Ocean University, Zhanjiang 524088, China.
- Southern Marine Science and Engineering Guangdong Laboratory, Zhanjiang 524025, China.
| | - Xinghua Lin
- Guangdong Research Center on Reproductive Control and Breeding Technology of Indigenous Valuable Fish Species, Fisheries College, Guangdong Ocean University, Zhanjiang 524088, China.
| | - Yaorong Wang
- Guangdong Research Center on Reproductive Control and Breeding Technology of Indigenous Valuable Fish Species, Fisheries College, Guangdong Ocean University, Zhanjiang 524088, China.
| | - Dongneng Jiang
- Guangdong Research Center on Reproductive Control and Breeding Technology of Indigenous Valuable Fish Species, Fisheries College, Guangdong Ocean University, Zhanjiang 524088, China.
- Southern Marine Science and Engineering Guangdong Laboratory, Zhanjiang 524025, China.
| | - Chunhua Zhu
- Guangdong Research Center on Reproductive Control and Breeding Technology of Indigenous Valuable Fish Species, Fisheries College, Guangdong Ocean University, Zhanjiang 524088, China.
- Southern Marine Science and Engineering Guangdong Laboratory, Zhanjiang 524025, China.
| | - Huapu Chen
- Guangdong Research Center on Reproductive Control and Breeding Technology of Indigenous Valuable Fish Species, Fisheries College, Guangdong Ocean University, Zhanjiang 524088, China.
- Southern Marine Science and Engineering Guangdong Laboratory, Zhanjiang 524025, China.
| | - Guangli Li
- Guangdong Research Center on Reproductive Control and Breeding Technology of Indigenous Valuable Fish Species, Fisheries College, Guangdong Ocean University, Zhanjiang 524088, China.
- Southern Marine Science and Engineering Guangdong Laboratory, Zhanjiang 524025, China.
| |
Collapse
|
14
|
Roques S, Chancerel E, Boury C, Pierre M, Acolas M. From microsatellites to single nucleotide polymorphisms for the genetic monitoring of a critically endangered sturgeon. Ecol Evol 2019; 9:7017-7029. [PMID: 31380030 PMCID: PMC6662312 DOI: 10.1002/ece3.5268] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2018] [Revised: 02/12/2019] [Accepted: 04/04/2019] [Indexed: 01/06/2023] Open
Abstract
The use of genetic information is crucial in conservation programs for the establishment of breeding plans and for the evaluation of restocking success. Short tandem repeats (STRs) have been the most widely used molecular markers in such programs, but next-generation sequencing approaches have prompted the transition to genome-wide markers such as single nucleotide polymorphisms (SNPs). Until now, most sturgeon species have been monitored using STRs. The low diversity found in the critically endangered European sturgeon (Acipenser sturio), however, makes its future genetic monitoring challenging, and the current resolution needs to be increased. Here, we describe the discovery of a highly informative set of 79 SNPs using double-digest restriction-associated DNA (ddRAD) sequencing and its validation by genotyping using the MassARRAY system. Comparing with STRs, the SNP panel proved to be highly efficient and reproducible, allowing for more accurate parentage and kinship assignments' on 192 juveniles of known pedigree and 40 wild-born adults. We explore the effectiveness of both markers to estimated relatedness and inbreeding, using simulated and empirical datasets. Interestingly, we found significant correlations between STRs and SNPs at individual heterozygosity and inbreeding that give support to a reasonable representation of whole genome diversity for both markers. These results are useful for the conservation program of A. sturio in building a comprehensive studbook, which will optimize conservation strategies. This approach also proves suitable for other case studies in which highly discriminatory genetic markers are needed to assess parentage and kinship.
Collapse
Affiliation(s)
- Séverine Roques
- Aquatic Ecosystems and Global ChangesIRSTEA, EABX URCestasFrance
| | | | | | - Maud Pierre
- Aquatic Ecosystems and Global ChangesIRSTEA, EABX URCestasFrance
| | | |
Collapse
|
15
|
Monteiro WP, Veiga JC, Silva AR, Carvalho CDS, Lanes ÉCM, Rico Y, Jaffé R. Everything you always wanted to know about gene flow in tropical landscapes (but were afraid to ask). PeerJ 2019; 7:e6446. [PMID: 30783576 PMCID: PMC6377592 DOI: 10.7717/peerj.6446] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2018] [Accepted: 01/15/2019] [Indexed: 11/30/2022] Open
Abstract
The bulk of the world’s biodiversity is found in tropical regions, which are increasingly threatened by the human-led degradation of natural habitats. Yet, little is known about tropical biodiversity responses to habitat loss and fragmentation. Here we review all available literature assessing landscape effects on gene flow in tropical species, aiming to help unravel the factors underpinning functional connectivity in the tropics. We map and classify studies by focus species, the molecular markers employed, statistical approaches to assess landscape effects on gene flow, and the evaluated landscape and environmental variables. We then compare qualitatively and quantitatively landscape effects on gene flow across species and units of analysis. We found 69 articles assessing landscape effects on gene flow in tropical organisms, most of which were published in the last five years, were concentrated in the Americas, and focused on amphibians or mammals. Most studies employed population-level approaches, microsatellites were the preferred type of markers, and Mantel and partial Mantel tests the most common statistical approaches used. While elevation, land cover and forest cover were the most common gene flow predictors assessed, habitat suitability was found to be a common predictor of gene flow. A third of all surveyed studies explicitly assessed the effect of habitat degradation, but only 14 of these detected a reduced gene flow with increasing habitat loss. Elevation was responsible for most significant microsatellite-based isolation by resistance effects and a single study reported significant isolation by non-forested areas in an ant. Our study reveals important knowledge gaps on the study of landscape effects on gene flow in tropical organisms, and provides useful guidelines on how to fill them.
Collapse
Affiliation(s)
| | - Jamille Costa Veiga
- Instituto de Ciências Biológicas, Universidade Federal do Pará, Belém, Pará, Brazil
| | - Amanda Reis Silva
- Departamento de Botânica, Museu Paraense Emílio Goeldi, Belém, Pará, Brazil
| | | | | | - Yessica Rico
- CONACYT, Red de Diversidad Biológica del Occidente Mexicano, Instituto de Ecología, A.C., Michoacán, Mexico
| | - Rodolfo Jaffé
- Instituto Tecnológico Vale, Belém, PA, Brazil.,Departamento de Ecologia, Universidade de São Paulo, São Paulo, Brazil
| |
Collapse
|
16
|
Perrier C, Lozano del Campo A, Szulkin M, Demeyrier V, Gregoire A, Charmantier A. Great tits and the city: Distribution of genomic diversity and gene-environment associations along an urbanization gradient. Evol Appl 2018; 11:593-613. [PMID: 29875805 PMCID: PMC5979639 DOI: 10.1111/eva.12580] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Accepted: 11/19/2017] [Indexed: 01/02/2023] Open
Abstract
Urbanization is a growing concern challenging the evolutionary potential of wild populations by reducing genetic diversity and imposing new selection regimes affecting many key fitness traits. However, genomic footprints of urbanization have received little attention so far. Using RAD sequencing, we investigated the genomewide effects of urbanization on neutral and adaptive genomic diversity in 140 adult great tits Parus major collected in locations with contrasted urbanization levels (from a natural forest to highly urbanized areas of a city; Montpellier, France). Heterozygosity was slightly lower in the more urbanized sites compared to the more rural ones. Low but significant effect of urbanization on genetic differentiation was found, at the site level but not at the nest level, indicative of the geographic scale of urbanization impact and of the potential for local adaptation despite gene flow. Gene-environment association tests identified numerous SNPs with small association scores to urbanization, distributed across the genome, from which a subset of 97 SNPs explained up to 81% of the variance in urbanization, overall suggesting a polygenic response to selection in the urban environment. These findings open stimulating perspectives for broader applications of high-resolution genomic tools on other cities and larger sample sizes to investigate the consistency of the effects of urbanization on the spatial distribution of genetic diversity and the polygenic nature of gene-urbanization association.
Collapse
Affiliation(s)
- Charles Perrier
- Centre d'Ecologie Fonctionnelle et Evolutive, CEFE UMR 5175, Campus CNRS, Université de MontpellierMontpellier Cedex 5France
| | - Ana Lozano del Campo
- Centre d'Ecologie Fonctionnelle et Evolutive, CEFE UMR 5175, Campus CNRS, Université de MontpellierMontpellier Cedex 5France
| | - Marta Szulkin
- Centre d'Ecologie Fonctionnelle et Evolutive, CEFE UMR 5175, Campus CNRS, Université de MontpellierMontpellier Cedex 5France
- Wild Urban Evolution and Ecology LaboratoryCentre of New TechnologiesUniversity of WarsawWarsawPoland
| | - Virginie Demeyrier
- Centre d'Ecologie Fonctionnelle et Evolutive, CEFE UMR 5175, Campus CNRS, Université de MontpellierMontpellier Cedex 5France
| | - Arnaud Gregoire
- Centre d'Ecologie Fonctionnelle et Evolutive, CEFE UMR 5175, Campus CNRS, Université de MontpellierMontpellier Cedex 5France
| | - Anne Charmantier
- Centre d'Ecologie Fonctionnelle et Evolutive, CEFE UMR 5175, Campus CNRS, Université de MontpellierMontpellier Cedex 5France
| |
Collapse
|
17
|
Molecular genetic diversity and population structure of Ethiopian white lupin landraces: Implications for breeding and conservation. PLoS One 2017; 12:e0188696. [PMID: 29190792 PMCID: PMC5708786 DOI: 10.1371/journal.pone.0188696] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2017] [Accepted: 11/11/2017] [Indexed: 01/22/2023] Open
Abstract
White lupin is one of the four economically important species of the Lupinus genus and is an important grain legume in the Ethiopian farming system. However, there has been limited research effort to characterize the Ethiopian white lupin landraces. Fifteen polymorphic simple sequence repeat (SSR) markers were used to assess the genetic diversity and population structure of 212 Ethiopian white lupin (Lupinus albus) landraces and two genotypes from different species (Lupinus angustifolius and Lupinus mutabilis) were used as out-group. The SSR markers revealed 108 different alleles, 98 of them from 212 landraces and 10 from out-group genotypes, with an average of 6.5 alleles per locus. The average gene diversity was 0.31. Twenty eight landraces harbored one or more private alleles from the total of 28 private alleles identified in the 212 white lupin accessions. Seventy-seven rare alleles with a frequency of less than 5% were identified and accounted for 78.6% of the total alleles detected. Analysis of molecular variance (AMOVA) showed that 92% of allelic diversity was attributed to individual accessions within populations while only 8% was distributed among populations. At 70% similarity level, the UPGMA dendrogram resulted in the formation of 13 clusters comprised of 2 to 136 landraces, with the out-group genotypes and five landraces remaining distinct and ungrouped. Population differentiation and genetic distance were relatively high between Gondar and Ethiopian white lupin populations collected by Australians. A model-based population structure analysis divided the white lupin landraces into two populations. All Ethiopian white lupin landrace populations, except most of the landraces collected by Australians (77%) and about 44% from Awi, were grouped together with significant admixtures. The study also suggested that 34 accessions, as core collections, were sufficient to retain 100% of SSR diversity. These accessions (core G-34) represent 16% of the whole 212 Ethiopian white lupin accessions and populations from West Gojam, Awi and Australian collections contributed more accessions to the core collection.
Collapse
|
18
|
Construction of a high-density linkage map and mapping of sex determination and growth-related loci in the mandarin fish (Siniperca chuatsi). BMC Genomics 2017; 18:446. [PMID: 28587594 PMCID: PMC5461734 DOI: 10.1186/s12864-017-3830-3] [Citation(s) in RCA: 56] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2016] [Accepted: 05/30/2017] [Indexed: 11/10/2022] Open
Abstract
Background The mandarin fish (Siniperca chuatsi) is an important and widely cultured fish in China. However, the lack of selective breeding of mandarin fish in previous decades has resulted in a decline in the growth rate of pond-cultured fish, a shortened period of sexual maturity, and reduced disease resistance; these issues seriously affect the quality and safety of the fish products. Therefore, it is necessary to establish a selective breeding program for the mandarin fish to improve the economical traits of the fish and to sustain the development of the mandarin fish industry. Results We constructed a high-density linkage map for it based on double digest restriction site associated DNA sequencing (ddRAD-Sequencing). This map contained 3283 dimorphic single nucleotide polymorphism markers and 24 linkage groups (LGs). The total map-length was 1972.01 cM, with an average interlocus distance of 0.61 cM. One significant quantitative trait locus (QTL) for sex determination trait was detected on LG23, which was supported by five markers, clustered between 60.27 and 68.71 cM. The highest logarithm of odds value (17.73) was located at 60.27 cM, near the marker r1_73194, accounting for 53.3% of the phenotypic variance. Genotypes of all the male fish on r1_33008 were homozygous, whereas those of all females were heterozygous. Thus, LG23 was considered a sex-related linkage group. Eleven significant QTLs, for three growth traits, at two growth stages and the increased values were distributed on four LGs; their contributions to the phenotypic variation were quite low (12.4–17.2%), suggesting that multiple genes affected the growth traits. Conclusion This high-resolution genetic map provides a valuable resource for fine-mapping of important traits and for identification of sex-related markers that should facilitate breeding of all-female mandarin fish for aquaculture and mechanistic studies on sex determination. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3830-3) contains supplementary material, which is available to authorized users.
Collapse
|
19
|
Wan SM, Liu H, Zhao BW, Nie CH, Wang WM, Gao ZX. Construction of a high-density linkage map and fine mapping of QTLs for growth and gonad related traits in blunt snout bream. Sci Rep 2017; 7:46509. [PMID: 28422147 PMCID: PMC5395971 DOI: 10.1038/srep46509] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2017] [Accepted: 03/17/2017] [Indexed: 01/16/2023] Open
Abstract
High-density genetic maps based on SNPs are essential for fine mapping loci controlling specific traits for fish species. Using restriction-site associated DNA tag sequencing (RAD-Seq) technology, we identified 42,784 SNPs evenly distributed across the Megalobrama amblycephala genome. Based on 2 parents and 187 intra-specific hybridization progenies, a total of 14,648 high-confidence SNPs were assigned to 24 consensus linkage groups (LGs) of maternal and paternal map. The total length of the integrated map was 3,258.38 cM with an average distance of 0.57 cM among 5676 effective loci, thereby representing the first high-density genetic map reported for M. amblycephala. A total of eight positive quantitative trait loci (QTLs) were detected in QTL analysis. Of that, five QTL explained ≥35% of phenotypic variation for growth traits and three QTL explained ≥16% phenotypic variation for gonad related traits. A total of 176 mapped markers had significant hits in the zebrafish genome and almost all of the 24 putative-chromosomes of M. amblycephala were in relatively conserved synteny with chromosomes of zebrafish. Almost all M. amblycephala and zebrafish chromosomes had a 1:1 correspondence except for putative-chromosome 4, which mapped to two chromosomes of zebrafish caused by the difference in chromosome numbers between two species.
Collapse
Affiliation(s)
- Shi-Ming Wan
- College of Fisheries, Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education/Key Lab of Freshwater Animal Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan, Hubei 430070, China.,Freshwater Aquaculture Collaborative Innovation Center of Hubei Province, Wuhan 430070, China
| | - Hong Liu
- College of Fisheries, Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education/Key Lab of Freshwater Animal Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan, Hubei 430070, China.,Freshwater Aquaculture Collaborative Innovation Center of Hubei Province, Wuhan 430070, China
| | - Bo-Wen Zhao
- College of Fisheries, Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education/Key Lab of Freshwater Animal Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan, Hubei 430070, China.,Freshwater Aquaculture Collaborative Innovation Center of Hubei Province, Wuhan 430070, China
| | - Chun-Hong Nie
- College of Fisheries, Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education/Key Lab of Freshwater Animal Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan, Hubei 430070, China.,Freshwater Aquaculture Collaborative Innovation Center of Hubei Province, Wuhan 430070, China
| | - Wei-Min Wang
- College of Fisheries, Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education/Key Lab of Freshwater Animal Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan, Hubei 430070, China
| | - Ze-Xia Gao
- College of Fisheries, Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education/Key Lab of Freshwater Animal Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan, Hubei 430070, China.,Freshwater Aquaculture Collaborative Innovation Center of Hubei Province, Wuhan 430070, China.,Hubei Provincial Engineering Laboratory for Pond Aquaculture, Wuhan 430070, China
| |
Collapse
|
20
|
Affiliation(s)
- Josephine R. Paris
- Biosciences College of Life and Environmental Sciences University of Exeter Exeter UK
| | - Jamie R. Stevens
- Biosciences College of Life and Environmental Sciences University of Exeter Exeter UK
| | - Julian M. Catchen
- Department of Animal Biology University of Illinois at Urbana–Champaign Urbana IL 61801 USA
| |
Collapse
|
21
|
Construction of a High-Density Genetic Map and Quantitative Trait Locus Mapping in the Manila clam Ruditapes philippinarum. Sci Rep 2017; 7:229. [PMID: 28331182 PMCID: PMC5427961 DOI: 10.1038/s41598-017-00246-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2016] [Accepted: 02/15/2017] [Indexed: 11/13/2022] Open
Abstract
Genetic linkage maps are indispensable tools in a wide range of genetic and genomic research. With the advancement of genotyping-by-sequencing (GBS) methods, the construction of a high-density linkage maps has become achievable in marine organisms lacking sufficient genomic resources, such as mollusks. In this study, high-density linkage map was constructed for an ecologically and commercially important clam species, Ruditapes philippinarum. For the consensus linkage map, a total of 9658 markers spanning 1926.98 cM were mapped to 18 sex-averaged linkage groups, with an average marker distance of 0.42 cM. Based on the high-density linkage map, ten QTLs for growth-related traits and shell color were detected. The coverage and density of the current map are sufficient for us to effectively detect QTL for segregating traits, and two QTL positions were all coincident with the closest markers. This high-density genetic linkage map reveals basic genomic architecture and will be useful for comparative genomics research, genome assembly and genetic improvement of R. philippinarum and other bivalve molluscan species.
Collapse
|
22
|
İpek A, İpek M, Ercişli S, Tangu NA. Transcriptome-based SNP discovery by GBS and the construction of a genetic map for olive. Funct Integr Genomics 2017; 17:493-501. [PMID: 28213629 DOI: 10.1007/s10142-017-0552-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2016] [Revised: 02/04/2017] [Accepted: 02/07/2017] [Indexed: 10/20/2022]
Abstract
Molecular markers located in the genic regions of plants are valuable tools for the identification of candidate genes of economically important traits and consequent use in marker-assisted selection (MAS). In the past, simple sequence repeat markers (SSRs) and single-nucleotide polymorphisms (SNPs) located in expressed sequence tags (ESTs) were developed by sequencing RNA derived from different plant tissues, which involves laborious RNA extraction, mRNA isolation, and cDNA synthesis. In order to develop SNP markers located in olive transcriptomes, we used the recently developed genotyping-by-sequencing (GBS) technique. An analysis was done for 125 olive DNA samples (123 DNA samples from a cross-pollinated F1 mapping population, and two samples from parents). From 45 to 66% of Illumina reads from GBS analysis were aligned to the olive transcriptome. A total of 22,033 transcriptome-based SNP markers were identified, and 3384 of these were mapped in the olive genome. The genetic linkage map constructed in this study consists of 1 cleaved amplified polymorphic sequence (CAPS), 19 SSR, and 3384 transcriptome-based SNP markers. The map covers 3340.8 cM of the olive genome in 23 linkage groups, with the length of the linkage groups ranging from 55.6 to 248.7 cM. Average map distance between flanking markers was 0.98 cM. This genetic linkage map is a saturated genetic map and will be a useful tool for the localization of quantitative trait loci (QTLs) and gene(s) of interest and for the identification of candidate genes for economically important traits.
Collapse
Affiliation(s)
- Ahmet İpek
- Faculty of Agriculture, Horticulture Department, Uludag University, Bursa, Turkey.
| | - Meryem İpek
- Faculty of Agriculture, Horticulture Department, Uludag University, Bursa, Turkey
| | - Sezai Ercişli
- Faculty of Agriculture, Horticulture Department, Atatürk University, Erzurum, Turkey
| | | |
Collapse
|
23
|
Xiao S, Wang P, Dong L, Zhang Y, Han Z, Wang Q, Wang Z. Whole-genome single-nucleotide polymorphism (SNP) marker discovery and association analysis with the eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content in Larimichthys crocea. PeerJ 2016; 4:e2664. [PMID: 28028455 PMCID: PMC5180582 DOI: 10.7717/peerj.2664] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2016] [Accepted: 10/07/2016] [Indexed: 12/30/2022] Open
Abstract
Whole-genome single-nucleotide polymorphism (SNP) markers are valuable genetic resources for the association and conservation studies. Genome-wide SNP development in many teleost species are still challenging because of the genome complexity and the cost of re-sequencing. Genotyping-By-Sequencing (GBS) provided an efficient reduced representative method to squeeze cost for SNP detection; however, most of recent GBS applications were reported on plant organisms. In this work, we used an EcoRI-NlaIII based GBS protocol to teleost large yellow croaker, an important commercial fish in China and East-Asia, and reported the first whole-genome SNP development for the species. 69,845 high quality SNP markers that evenly distributed along genome were detected in at least 80% of 500 individuals. Nearly 95% randomly selected genotypes were successfully validated by Sequenom MassARRAY assay. The association studies with the muscle eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content discovered 39 significant SNP markers, contributing as high up to ∼63% genetic variance that explained by all markers. Functional genes that involved in fat digestion and absorption pathway were identified, such as APOB, CRAT and OSBPL10. Notably, PPT2 Gene, previously identified in the association study of the plasma n-3 and n-6 polyunsaturated fatty acid level in human, was re-discovered in large yellow croaker. Our study verified that EcoRI-NlaIII based GBS could produce quality SNP markers in a cost-efficient manner in teleost genome. The developed SNP markers and the EPA and DHA associated SNP loci provided invaluable resources for the population structure, conservation genetics and genomic selection of large yellow croaker and other fish organisms.
Collapse
Affiliation(s)
- Shijun Xiao
- Fisheries College, Jimei University, Xiamen, Fujian, China
| | - Panpan Wang
- Fisheries College, Jimei University, Xiamen, Fujian, China
| | - Linsong Dong
- Fisheries College, Jimei University, Xiamen, Fujian, China
| | - Yaguang Zhang
- Fisheries College, Jimei University, Xiamen, Fujian, China
| | - Zhaofang Han
- Fisheries College, Jimei University, Xiamen, Fujian, China
| | - Qiurong Wang
- Fisheries College, Jimei University, Xiamen, Fujian, China
| | - Zhiyong Wang
- Fisheries College, Jimei University, Xiamen, Fujian, China
| |
Collapse
|
24
|
Rodrigues ASB, Silva SE, Pina-Martins F, Loureiro J, Castro M, Gharbi K, Johnson KP, Dietrich CH, Borges PAV, Quartau JA, Jiggins CD, Paulo OS, Seabra SG. Assessing genotype-phenotype associations in three dorsal colour morphs in the meadow spittlebug Philaenus spumarius (L.) (Hemiptera: Aphrophoridae) using genomic and transcriptomic resources. BMC Genet 2016; 17:144. [PMID: 27846816 PMCID: PMC5111214 DOI: 10.1186/s12863-016-0455-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2016] [Accepted: 11/07/2016] [Indexed: 01/08/2023] Open
Abstract
Background Colour polymorphisms are common among animal species. When combined with genetic and ecological data, these polymorphisms can be excellent systems in which to understand adaptation and the molecular changes underlying phenotypic evolution. The meadow spittlebug, Philaenus spumarius (L.) (Hemiptera, Aphrophoridae), a widespread insect species in the Holarctic region, exhibits a striking dorsal colour/pattern balanced polymorphism. Although experimental crosses have revealed the Mendelian inheritance of this trait, its genetic basis remains unknown. In this study we aimed to identify candidate genomic regions associated with the colour balanced polymorphism in this species. Results By using restriction site-associated DNA (RAD) sequencing we were able to obtain a set of 1,837 markers across 33 individuals to test for associations with three dorsal colour phenotypes (typicus, marginellus, and trilineatus). Single and multi-association analyses identified a total of 60 SNPs associated with dorsal colour morphs. The genome size of P. spumarius was estimated by flow cytometry, revealing a 5.3 Gb genome, amongst the largest found in insects. A partial genome assembly, representing 24% of the total size, and an 81.4 Mb transcriptome, were also obtained. From the SNPs found to be associated with colour, 35% aligned to the genome and 10% to the transcriptome. Our data suggested that major loci, consisting of multi-genomic regions, may be involved in dorsal colour variation among the three dorsal colour morphs analysed. However, no homology was found between the associated loci and candidate genes known to be responsible for coloration pattern in other insect species. The associated markers showed stronger differentiation of the trilineatus colour phenotype, which has been shown previously to be more differentiated in several life-history and physiological characteristics as well. It is possible that colour variation and these traits are linked in a complex genetic architecture. Conclusions The loci detected to have an association with colour and the genomic and transcriptomic resources developed here constitute a basis for further research on the genetic basis of colour pattern in the meadow spittlebug P. spumarius. Electronic supplementary material The online version of this article (doi:10.1186/s12863-016-0455-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ana S B Rodrigues
- Computational Biology and Population Genomics Group, cE3c - Centre for Ecology, Evolution and Environmental Changes, Departamento de Biologia Animal, Faculdade de Ciências da Universidade de Lisboa, Campo Grande, Lisbon, P-1749-016, Portugal.
| | - Sara E Silva
- Computational Biology and Population Genomics Group, cE3c - Centre for Ecology, Evolution and Environmental Changes, Departamento de Biologia Animal, Faculdade de Ciências da Universidade de Lisboa, Campo Grande, Lisbon, P-1749-016, Portugal
| | - Francisco Pina-Martins
- Computational Biology and Population Genomics Group, cE3c - Centre for Ecology, Evolution and Environmental Changes, Departamento de Biologia Animal, Faculdade de Ciências da Universidade de Lisboa, Campo Grande, Lisbon, P-1749-016, Portugal.,Centro de Estudos do Ambiente e do Mar (CESAM), DBA/FCUL, Lisbon, Portugal
| | - João Loureiro
- Centre for Functional Ecology, Department of Life Sciences, University of Coimbra, Coimbra, Portugal
| | - Mariana Castro
- Centre for Functional Ecology, Department of Life Sciences, University of Coimbra, Coimbra, Portugal
| | - Karim Gharbi
- Edinburgh Genomics, Ashworth Laboratories, King's Buildings, The University of Edinburgh, Edinburgh, EH9 3JT, UK
| | - Kevin P Johnson
- Illinois Natural History Survey, Prairie Research Institute, University of Illinois, Champaign, IL, USA
| | - Christopher H Dietrich
- Illinois Natural History Survey, Prairie Research Institute, University of Illinois, Champaign, IL, USA
| | - Paulo A V Borges
- cE3c - Centre for Ecology, Evolution and Environmental Changes/Azorean Biodiversity Group and Universidade dos Açores - Departamento de Ciências e Engenharia do Ambiente, Angra do Heroísmo, Açores, Portugal
| | - José A Quartau
- Computational Biology and Population Genomics Group, cE3c - Centre for Ecology, Evolution and Environmental Changes, Departamento de Biologia Animal, Faculdade de Ciências da Universidade de Lisboa, Campo Grande, Lisbon, P-1749-016, Portugal
| | - Chris D Jiggins
- Department of Zoology, University of Cambridge, Downing Street, Cambridge, CB2 3EJ, UK
| | - Octávio S Paulo
- Computational Biology and Population Genomics Group, cE3c - Centre for Ecology, Evolution and Environmental Changes, Departamento de Biologia Animal, Faculdade de Ciências da Universidade de Lisboa, Campo Grande, Lisbon, P-1749-016, Portugal
| | - Sofia G Seabra
- Computational Biology and Population Genomics Group, cE3c - Centre for Ecology, Evolution and Environmental Changes, Departamento de Biologia Animal, Faculdade de Ciências da Universidade de Lisboa, Campo Grande, Lisbon, P-1749-016, Portugal
| |
Collapse
|
25
|
Hodel RGJ, Segovia-Salcedo MC, Landis JB, Crowl AA, Sun M, Liu X, Gitzendanner MA, Douglas NA, Germain-Aubrey CC, Chen S, Soltis DE, Soltis PS. The report of my death was an exaggeration: A review for researchers using microsatellites in the 21st century. APPLICATIONS IN PLANT SCIENCES 2016; 4:apps1600025. [PMID: 27347456 PMCID: PMC4915923 DOI: 10.3732/apps.1600025] [Citation(s) in RCA: 91] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Accepted: 05/25/2016] [Indexed: 05/19/2023]
Abstract
Microsatellites, or simple sequence repeats (SSRs), have long played a major role in genetic studies due to their typically high polymorphism. They have diverse applications, including genome mapping, forensics, ascertaining parentage, population and conservation genetics, identification of the parentage of polyploids, and phylogeography. We compare SSRs and newer methods, such as genotyping by sequencing (GBS) and restriction site associated DNA sequencing (RAD-Seq), and offer recommendations for researchers considering which genetic markers to use. We also review the variety of techniques currently used for identifying microsatellite loci and developing primers, with a particular focus on those that make use of next-generation sequencing (NGS). Additionally, we review software for microsatellite development and report on an experiment to assess the utility of currently available software for SSR development. Finally, we discuss the future of microsatellites and make recommendations for researchers preparing to use microsatellites. We argue that microsatellites still have an important place in the genomic age as they remain effective and cost-efficient markers.
Collapse
Affiliation(s)
- Richard G. J. Hodel
- Department of Biology, University of Florida, Gainesville, Florida 32611 USA
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
- Author for correspondence:
| | | | - Jacob B. Landis
- Department of Biology, University of Florida, Gainesville, Florida 32611 USA
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
| | - Andrew A. Crowl
- Department of Biology, University of Florida, Gainesville, Florida 32611 USA
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
| | - Miao Sun
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
| | - Xiaoxian Liu
- Department of Biology, University of Florida, Gainesville, Florida 32611 USA
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
| | | | - Norman A. Douglas
- Department of Biology, University of Florida, Gainesville, Florida 32611 USA
| | | | - Shichao Chen
- College of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Douglas E. Soltis
- Department of Biology, University of Florida, Gainesville, Florida 32611 USA
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
- The Genetics Institute, University of Florida, Gainesville, Florida 32611 USA
| | - Pamela S. Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
- The Genetics Institute, University of Florida, Gainesville, Florida 32611 USA
| |
Collapse
|
26
|
Massatti R, Reznicek AA, Knowles LL. Utilizing RADseq data for phylogenetic analysis of challenging taxonomic groups: A case study in Carex sect. Racemosae. AMERICAN JOURNAL OF BOTANY 2016; 103:337-347. [PMID: 26851268 DOI: 10.3732/ajb.1500315] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/05/2015] [Accepted: 12/29/2015] [Indexed: 06/05/2023]
Abstract
PREMISE OF THE STUDY Relationships among closely related and recently diverged taxa can be especially difficult to resolve. Here we use both Sanger sequencing and next-generation RADseq data sets to estimate phylogenetic relationships among species of Carex section Racemosae (Cyperaceae), a clade largely restricted to high latitudes and elevations. Interest in relationships among these taxa derives from questions about the species' biogeographic histories and possible links between diversification and Pleistocene glaciations. METHODS A combination of approaches and molecular markers were used to estimate relationships among Carex species within sect. Racemosae and taxa from closely related sections. Nuclear and chloroplast loci generated by Sanger sequencing were analyzed with *BEAST, and SNP data from RADseq loci were analyzed as a concatenated data set using maximum likelihood and as independent loci using SVDquartets. KEY RESULTS Sanger sequencing data sets resolved relationships among taxa at intermediate phylogenetic depths (albeit with low levels of support). Only the RADseq data resolved relationships with strong support at all phylogenetic depths. Moreover, different methods and data partitions of the RADseq data resulted in nearly identical topologies. Carex sect. Racemosae is a strongly supported clade, although a handful of species were found to group with closely related sections. Herbarium specimens up to 35 yr old successfully produced informative RADseq data. CONCLUSIONS Despite the short read lengths of RADseq data, they nevertheless resolved relationships that Sanger sequencing data did not. Resolution of the phylogenetic relationships among recently and rapidly diversifying taxa within sect. Racemosae clades suggest a role for the Pleistocene glaciations in clade diversification.
Collapse
Affiliation(s)
- Rob Massatti
- Department of Ecology and Evolutionary Biology, The University of Michigan, Ann Arbor, Michigan, 41809-1079 USA
| | - Anton A Reznicek
- Department of Ecology and Evolutionary Biology, The University of Michigan, Ann Arbor, Michigan, 41809-1079 USA
| | - L Lacey Knowles
- Department of Ecology and Evolutionary Biology, The University of Michigan, Ann Arbor, Michigan, 41809-1079 USA
| |
Collapse
|
27
|
Bielenberg DG, Rauh B, Fan S, Gasic K, Abbott AG, Reighard GL, Okie WR, Wells CE. Genotyping by Sequencing for SNP-Based Linkage Map Construction and QTL Analysis of Chilling Requirement and Bloom Date in Peach [Prunus persica (L.) Batsch]. PLoS One 2015; 10:e0139406. [PMID: 26430886 PMCID: PMC4592218 DOI: 10.1371/journal.pone.0139406] [Citation(s) in RCA: 86] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2015] [Accepted: 09/11/2015] [Indexed: 12/25/2022] Open
Abstract
Low-cost, high throughput genotyping methods are crucial to marker discovery and marker-assisted breeding efforts, but have not been available for many ‘specialty crops’ such as fruit and nut trees. Here we apply the Genotyping-By-Sequencing (GBS) method developed for cereals to the discovery of single nucleotide polymorphisms (SNPs) in a peach F2 mapping population. Peach is a genetic and genomic model within the Rosaceae and will provide a template for the use of this method with other members of this family. Our F2 mapping population of 57 genotypes segregates for bloom time (BD) and chilling requirement (CR) and we have extensively phenotyped this population. The population derives from a selfed F1 progeny of a cross between ‘Hakuho’ (high CR) and ‘UFGold’ (low CR). We were able to successfully employ GBS and the TASSEL GBS pipeline without modification of the original methodology using the ApeKI restriction enzyme and multiplexing at an equivalent of 96 samples per Illumina HiSeq 2000 lane. We obtained hundreds of SNP markers which were then used to construct a genetic linkage map and identify quantitative trait loci (QTL) for BD and CR.
Collapse
Affiliation(s)
- Douglas Gary Bielenberg
- Department of Biological Sciences, College of Agriculture, Forestry & Life Sciences, Clemson University, Clemson, South Carolina, 29634–0314, United States of America
- * E-mail:
| | - Bradley Rauh
- Advanced Plant Technology Program, Clemson University, Clemson, South Carolina, 29634, United States of America
| | - Shenghua Fan
- Department of Genetics & Biochemistry, College of Agriculture, Forestry & Life Sciences, Clemson University, Clemson, South Carolina, 29634, United States of America
| | - Ksenija Gasic
- Department of Agricultural and Environmental Sciences, College of Agriculture, Forestry & Life Sciences, Clemson University, Clemson, South Carolina, 29634, United States of America
| | - Albert Glenn Abbott
- Department of Genetics & Biochemistry, College of Agriculture, Forestry & Life Sciences, Clemson University, Clemson, South Carolina, 29634, United States of America
| | - Gregory Lynn Reighard
- Department of Agricultural and Environmental Sciences, College of Agriculture, Forestry & Life Sciences, Clemson University, Clemson, South Carolina, 29634, United States of America
| | - William R. Okie
- Southeastern Fruit and Tree Nut Research Laboratory (retired), USDA-ARS, Byron, Georgia, 31008, United States of America
| | - Christina Elizabeth Wells
- Department of Biological Sciences, College of Agriculture, Forestry & Life Sciences, Clemson University, Clemson, South Carolina, 29634–0314, United States of America
| |
Collapse
|
28
|
Identification of Laying-Related SNP Markers in Geese Using RAD Sequencing. PLoS One 2015; 10:e0131572. [PMID: 26181055 PMCID: PMC4504669 DOI: 10.1371/journal.pone.0131572] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2015] [Accepted: 06/03/2015] [Indexed: 12/18/2022] Open
Abstract
Laying performance is an important economical trait of goose production. As laying performance is of low heritability, it is of significance to develop a marker-assisted selection (MAS) strategy for this trait. Definition of sequence variation related to the target trait is a prerequisite of quantitating MAS, but little is presently known about the goose genome, which greatly hinders the identification of genetic markers for the laying traits of geese. Recently developed restriction site-associated DNA (RAD) sequencing is a possible approach for discerning large-scale single nucleotide polymorphism (SNP) and reducing the complexity of a genome without having reference genomic information available. In the present study, we developed a pooled RAD sequencing strategy for detecting geese laying-related SNP. Two DNA pools were constructed, each consisting of equal amounts of genomic DNA from 10 individuals with either high estimated breeding value (HEBV) or low estimated breeding value (LEBV). A total of 139,013 SNP were obtained from 42,291,356 sequences, of which 18,771,943 were for LEBV and 23,519,413 were for HEBV cohorts. Fifty-five SNP which had different allelic frequencies in the two DNA pools were further validated by individual-based AS-PCR genotyping in the LEBV and HEBV cohorts. Ten out of 55 SNP exhibited distinct allele distributions in these two cohorts. These 10 SNP were further genotyped in a goose population of 492 geese to verify the association with egg numbers. The result showed that 8 of 10 SNP were associated with egg numbers. Additionally, liner regression analysis revealed that SNP Record-111407, 106975 and 112359 were involved in a multiplegene network affecting laying performance. We used IPCR to extend the unknown regions flanking the candidate RAD tags. The obtained sequences were subjected to BLAST to retrieve the orthologous genes in either ducks or chickens. Five novel genes were cloned for geese which harbored the candidate laying-related SNP, including membrane associated guanylate kinase (MAGI-1), KIAA1462, Rho GTPase activating protein 21 (ARHGAP21), acyl-CoA synthetase family member 2 (ACSF2), astrotactin 2 (ASTN2). Collectively, our data suggests that 8 SNP and 5 genes might be promising candidate markers or targets for marker-assisted selection of egg numbers in geese.
Collapse
|
29
|
Genome Assembly Improvement and Mapping Convergently Evolved Skeletal Traits in Sticklebacks with Genotyping-by-Sequencing. G3-GENES GENOMES GENETICS 2015; 5:1463-72. [PMID: 26044731 PMCID: PMC4502380 DOI: 10.1534/g3.115.017905] [Citation(s) in RCA: 80] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Marine populations of the threespine stickleback (Gasterosteus aculeatus) have repeatedly colonized and rapidly adapted to freshwater habitats, providing a powerful system to map the genetic architecture of evolved traits. Here, we developed and applied a binned genotyping-by-sequencing (GBS) method to build dense genome-wide linkage maps of sticklebacks using two large marine by freshwater F2 crosses of more than 350 fish each. The resulting linkage maps significantly improve the genome assembly by anchoring 78 new scaffolds to chromosomes, reorienting 40 scaffolds, and rearranging scaffolds in 4 locations. In the revised genome assembly, 94.6% of the assembly was anchored to a chromosome. To assess linkage map quality, we mapped quantitative trait loci (QTL) controlling lateral plate number, which mapped as expected to a 200-kb genomic region containing Ectodysplasin, as well as a chromosome 7 QTL overlapping a previously identified modifier QTL. Finally, we mapped eight QTL controlling convergently evolved reductions in gill raker length in the two crosses, which revealed that this classic adaptive trait has a surprisingly modular and nonparallel genetic basis.
Collapse
|
30
|
Shao C, Niu Y, Rastas P, Liu Y, Xie Z, Li H, Wang L, Jiang Y, Tai S, Tian Y, Sakamoto T, Chen S. Genome-wide SNP identification for the construction of a high-resolution genetic map of Japanese flounder (Paralichthys olivaceus): applications to QTL mapping of Vibrio anguillarum disease resistance and comparative genomic analysis. DNA Res 2015; 22:161-70. [PMID: 25762582 PMCID: PMC4401326 DOI: 10.1093/dnares/dsv001] [Citation(s) in RCA: 75] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2014] [Accepted: 02/01/2015] [Indexed: 12/18/2022] Open
Abstract
High-resolution genetic maps are essential for fine mapping of complex traits, genome assembly, and comparative genomic analysis. Single-nucleotide polymorphisms (SNPs) are the primary molecular markers used for genetic map construction. In this study, we identified 13,362 SNPs evenly distributed across the Japanese flounder (Paralichthys olivaceus) genome. Of these SNPs, 12,712 high-confidence SNPs were subjected to high-throughput genotyping and assigned to 24 consensus linkage groups (LGs). The total length of the genetic linkage map was 3,497.29 cM with an average distance of 0.47 cM between loci, thereby representing the densest genetic map currently reported for Japanese flounder. Nine positive quantitative trait loci (QTLs) forming two main clusters for Vibrio anguillarum disease resistance were detected. All QTLs could explain 5.1-8.38% of the total phenotypic variation. Synteny analysis of the QTL regions on the genome assembly revealed 12 immune-related genes, among them 4 genes strongly associated with V. anguillarum disease resistance. In addition, 246 genome assembly scaffolds with an average size of 21.79 Mb were anchored onto the LGs; these scaffolds, comprising 522.99 Mb, represented 95.78% of assembled genomic sequences. The mapped assembly scaffolds in Japanese flounder were used for genome synteny analyses against zebrafish (Danio rerio) and medaka (Oryzias latipes). Flounder and medaka were found to possess almost one-to-one synteny, whereas flounder and zebrafish exhibited a multi-syntenic correspondence. The newly developed high-resolution genetic map, which will facilitate QTL mapping, scaffold assembly, and genome synteny analysis of Japanese flounder, marks a milestone in the ongoing genome project for this species.
Collapse
Affiliation(s)
- Changwei Shao
- Ministry of Agriculture, Yellow Sea Fisheries Research Institute, CAFS, Key Lab for Sustainable Development of Marine Fisheries, Qingdao 266071, China Function Laboratory for Marine Fisheries Science and Food Production Processes, National Lab for Ocean Science and Technology, Qingdao 266071, China Faculty of Marine Science, Tokyo University of Marine Science and Technology, Minato, Tokyo 108-8477, Japan
| | | | - Pasi Rastas
- Department of Biosciences, Metapopulation Research Group, University of Helsinki, Helsinki FI-00014, Finland
| | - Yang Liu
- Ministry of Agriculture, Yellow Sea Fisheries Research Institute, CAFS, Key Lab for Sustainable Development of Marine Fisheries, Qingdao 266071, China Function Laboratory for Marine Fisheries Science and Food Production Processes, National Lab for Ocean Science and Technology, Qingdao 266071, China
| | | | - Hengde Li
- Chinese Academy of Fisheries Science, Beijing 100039, China
| | - Lei Wang
- Ministry of Agriculture, Yellow Sea Fisheries Research Institute, CAFS, Key Lab for Sustainable Development of Marine Fisheries, Qingdao 266071, China Function Laboratory for Marine Fisheries Science and Food Production Processes, National Lab for Ocean Science and Technology, Qingdao 266071, China
| | - Yong Jiang
- National Oceanographic Center, Qingdao 266071, China
| | | | - Yongsheng Tian
- Ministry of Agriculture, Yellow Sea Fisheries Research Institute, CAFS, Key Lab for Sustainable Development of Marine Fisheries, Qingdao 266071, China Function Laboratory for Marine Fisheries Science and Food Production Processes, National Lab for Ocean Science and Technology, Qingdao 266071, China
| | - Takashi Sakamoto
- Faculty of Marine Science, Tokyo University of Marine Science and Technology, Minato, Tokyo 108-8477, Japan
| | - Songlin Chen
- Ministry of Agriculture, Yellow Sea Fisheries Research Institute, CAFS, Key Lab for Sustainable Development of Marine Fisheries, Qingdao 266071, China Function Laboratory for Marine Fisheries Science and Food Production Processes, National Lab for Ocean Science and Technology, Qingdao 266071, China
| |
Collapse
|
31
|
López ME, Neira R, Yáñez JM. Applications in the search for genomic selection signatures in fish. Front Genet 2015; 5:458. [PMID: 25642239 PMCID: PMC4294200 DOI: 10.3389/fgene.2014.00458] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2014] [Accepted: 12/15/2014] [Indexed: 11/25/2022] Open
Abstract
Selection signatures are genomic regions harboring DNA sequences functionally involved in the genetic variation of traits subject to selection. Selection signatures have been intensively studied in recent years because of their relevance to evolutionary biology and their potential association with genes that control phenotypes of interest in wild and domestic populations. Selection signature research in fish has been confined to a smaller scale, due in part to the relatively recent domestication of fish species and limited genomic resources such as molecular markers, genetic mapping, DNA sequences, and reference genomes. However, recent genomic technology advances are paving the way for more studies that may contribute to the knowledge of genomic regions underlying phenotypes of biological and productive interest in fish.
Collapse
Affiliation(s)
- María E López
- Faculty of Agricultural Sciences, University of Chile Santiago, Chile ; Aquainnovo, Puerto Montt Chile
| | - Roberto Neira
- Faculty of Agricultural Sciences, University of Chile Santiago, Chile
| | - José M Yáñez
- Aquainnovo, Puerto Montt Chile ; Faculty of Veterinary and Animal Sciences, University of Chile Santiago, Chile
| |
Collapse
|
32
|
Carlson BM, Onusko SW, Gross JB. A high-density linkage map for Astyanax mexicanus using genotyping-by-sequencing technology. G3 (BETHESDA, MD.) 2014; 5:241-51. [PMID: 25520037 PMCID: PMC4321032 DOI: 10.1534/g3.114.015438] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2014] [Accepted: 12/11/2014] [Indexed: 12/17/2022]
Abstract
The Mexican tetra, Astyanax mexicanus, is a unique model system consisting of cave-adapted and surface-dwelling morphotypes that diverged >1 million years (My) ago. This remarkable natural experiment has enabled powerful genetic analyses of cave adaptation. Here, we describe the application of next-generation sequencing technology to the creation of a high-density linkage map. Our map comprises more than 2200 markers populating 25 linkage groups constructed from genotypic data generated from a single genotyping-by-sequencing project. We leveraged emergent genomic and transcriptomic resources to anchor hundreds of anonymous Astyanax markers to the genome of the zebrafish (Danio rerio), the most closely related model organism to our study species. This facilitated the identification of 784 distinct connections between our linkage map and the Danio rerio genome, highlighting several regions of conserved genomic architecture between the two species despite ~150 My of divergence. Using a Mendelian cave-associated trait as a proof-of-principle, we successfully recovered the genomic position of the albinism locus near the gene Oca2. Further, our map successfully informed the positions of unplaced Astyanax genomic scaffolds within particular linkage groups. This ability to identify the relative location, orientation, and linear order of unaligned genomic scaffolds will facilitate ongoing efforts to improve on the current early draft and assemble future versions of the Astyanax physical genome. Moreover, this improved linkage map will enable higher-resolution genetic analyses and catalyze the discovery of the genetic basis for cave-associated phenotypes.
Collapse
Affiliation(s)
- Brian M Carlson
- Department of Biological Sciences, University of Cincinnati, Cincinnati, Ohio 45221
| | - Samuel W Onusko
- Department of Biological Sciences, University of Cincinnati, Cincinnati, Ohio 45221
| | - Joshua B Gross
- Department of Biological Sciences, University of Cincinnati, Cincinnati, Ohio 45221
| |
Collapse
|
33
|
Putman AI, Carbone I. Challenges in analysis and interpretation of microsatellite data for population genetic studies. Ecol Evol 2014; 4:4399-428. [PMID: 25540699 PMCID: PMC4267876 DOI: 10.1002/ece3.1305] [Citation(s) in RCA: 237] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2014] [Revised: 10/02/2014] [Accepted: 10/03/2014] [Indexed: 12/14/2022] Open
Abstract
Advancing technologies have facilitated the ever-widening application of genetic markers such as microsatellites into new systems and research questions in biology. In light of the data and experience accumulated from several years of using microsatellites, we present here a literature review that synthesizes the limitations of microsatellites in population genetic studies. With a focus on population structure, we review the widely used fixation (F ST) statistics and Bayesian clustering algorithms and find that the former can be confusing and problematic for microsatellites and that the latter may be confounded by complex population models and lack power in certain cases. Clustering, multivariate analyses, and diversity-based statistics are increasingly being applied to infer population structure, but in some instances these methods lack formalization with microsatellites. Migration-specific methods perform well only under narrow constraints. We also examine the use of microsatellites for inferring effective population size, changes in population size, and deeper demographic history, and find that these methods are untested and/or highly context-dependent. Overall, each method possesses important weaknesses for use with microsatellites, and there are significant constraints on inferences commonly made using microsatellite markers in the areas of population structure, admixture, and effective population size. To ameliorate and better understand these constraints, researchers are encouraged to analyze simulated datasets both prior to and following data collection and analysis, the latter of which is formalized within the approximate Bayesian computation framework. We also examine trends in the literature and show that microsatellites continue to be widely used, especially in non-human subject areas. This review assists with study design and molecular marker selection, facilitates sound interpretation of microsatellite data while fostering respect for their practical limitations, and identifies lessons that could be applied toward emerging markers and high-throughput technologies in population genetics.
Collapse
Affiliation(s)
- Alexander I Putman
- Department of Plant Pathology, North Carolina State University Raleigh, North Carolina, 27695-7616
| | - Ignazio Carbone
- Department of Plant Pathology, North Carolina State University Raleigh, North Carolina, 27695-7616
| |
Collapse
|
34
|
Puritz JB, Hollenbeck CM, Gold JR. dDocent: a RADseq, variant-calling pipeline designed for population genomics of non-model organisms. PeerJ 2014; 2:e431. [PMID: 24949246 PMCID: PMC4060032 DOI: 10.7717/peerj.431] [Citation(s) in RCA: 259] [Impact Index Per Article: 25.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2014] [Accepted: 05/27/2014] [Indexed: 12/14/2022] Open
Abstract
Restriction-site associated DNA sequencing (RADseq) has become a powerful and useful approach for population genomics. Currently, no software exists that utilizes both paired-end reads from RADseq data to efficiently produce population-informative variant calls, especially for non-model organisms with large effective population sizes and high levels of genetic polymorphism. dDocent is an analysis pipeline with a user-friendly, command-line interface designed to process individually barcoded RADseq data (with double cut sites) into informative SNPs/Indels for population-level analyses. The pipeline, written in BASH, uses data reduction techniques and other stand-alone software packages to perform quality trimming and adapter removal, de novo assembly of RAD loci, read mapping, SNP and Indel calling, and baseline data filtering. Double-digest RAD data from population pairings of three different marine fishes were used to compare dDocent with Stacks, the first generally available, widely used pipeline for analysis of RADseq data. dDocent consistently identified more SNPs shared across greater numbers of individuals and with higher levels of coverage. This is due to the fact that dDocent quality trims instead of filtering, incorporates both forward and reverse reads (including reads with INDEL polymorphisms) in assembly, mapping, and SNP calling. The pipeline and a comprehensive user guide can be found at http://dDocent.wordpress.com.
Collapse
Affiliation(s)
- Jonathan B Puritz
- Marine Genomics Laboratory, Harte Research Institute, Texas A&M University-Corpus Christi , Corpus Christi, TX , USA
| | - Christopher M Hollenbeck
- Marine Genomics Laboratory, Harte Research Institute, Texas A&M University-Corpus Christi , Corpus Christi, TX , USA
| | - John R Gold
- Marine Genomics Laboratory, Harte Research Institute, Texas A&M University-Corpus Christi , Corpus Christi, TX , USA
| |
Collapse
|
35
|
Zhang Q, Li L, VanBuren R, Liu Y, Yang M, Xu L, Bowers JE, Zhong C, Han Y, Li S, Ming R. Optimization of linkage mapping strategy and construction of a high-density American lotus linkage map. BMC Genomics 2014; 15:372. [PMID: 24885335 PMCID: PMC4045970 DOI: 10.1186/1471-2164-15-372] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2014] [Accepted: 05/08/2014] [Indexed: 01/22/2023] Open
Abstract
Background Lotus is a diploid plant with agricultural, medicinal, and ecological significance. Genetic linkage maps are fundamental resources for genome and genetic study, and also provide molecular markers for breeding in agriculturally important species. Genotyping by sequencing revolutionized genetic mapping, the restriction-site associated DNA sequencing (RADseq) allowed rapid discovery of thousands of SNPs markers, and a crucial aspect of the sequence based mapping strategy is the reference sequences used for marker identification. Results We assessed the effectiveness of linkage mapping using three types of references for scoring markers: the unmasked genome, repeat masked genome, and gene models. Overall, the repeat masked genome produced the optimal genetic maps. A high-density genetic map of American lotus was constructed using an F1 population derived from a cross between Nelumbo nucifera ‘China Antique’ and N. lutea ‘AL1’. A total of 4,098 RADseq markers were used to construct the American lotus ‘AL1’ genetic map, and 147 markers were used to construct the Chinese lotus ‘China Antique’ genetic map. The American lotus map has 9 linkage groups, and spans 494.3 cM, with an average distance of 0.7 cM between adjacent markers. The American lotus map was used to anchor scaffold sequences in the N. nucifera ‘China Antique’ draft genome. 3,603 RADseq markers anchored 234 individual scaffold sequences into 9 megascaffolds spanning 67% of the 804 Mb draft genome. Conclusions Among the unmasked genome, repeat masked genome and gene models, the optimal reference sequences to call RADseq markers for map construction is repeat masked genome. This high density genetic map is a valuable resource for genomic research and crop improvement in lotus.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - Ray Ming
- Key Laborary of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, P,R, China.
| |
Collapse
|
36
|
Toonen RJ, Puritz JB, Forsman ZH, Whitney JL, Fernandez-Silva I, Andrews KR, Bird CE. ezRAD: a simplified method for genomic genotyping in non-model organisms. PeerJ 2013; 1:e203. [PMID: 24282669 PMCID: PMC3840413 DOI: 10.7717/peerj.203] [Citation(s) in RCA: 127] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2013] [Accepted: 10/13/2013] [Indexed: 12/17/2022] Open
Abstract
Here, we introduce ezRAD, a novel strategy for restriction site–associated DNA (RAD) that requires little technical expertise or investment in laboratory equipment, and demonstrate its utility for ten non-model organisms across a wide taxonomic range. ezRAD differs from other RAD methods primarily through its use of standard Illumina TruSeq library preparation kits, which makes it possible for any laboratory to send out to a commercial genomic core facility for library preparation and next-generation sequencing with virtually no additional investment beyond the cost of the service itself. This simplification opens RADseq to any lab with the ability to extract DNA and perform a restriction digest. ezRAD also differs from others in its flexibility to use any restriction enzyme (or combination of enzymes) that cuts frequently enough to generate fragments of the desired size range, without requiring the purchase of separate adapters for each enzyme or a sonication step, which can further decrease the cost involved in choosing optimal enzymes for particular species and research questions. We apply this method across a wide taxonomic diversity of non-model organisms to demonstrate the utility and flexibility of our approach. The simplicity of ezRAD makes it particularly useful for the discovery of single nucleotide polymorphisms and targeted amplicon sequencing in natural populations of non-model organisms that have been historically understudied because of lack of genomic information.
Collapse
Affiliation(s)
- Robert J Toonen
- Hawai'i Institute of Marine Biology, School of Ocean & Earth Sciences & Technology, University of Hawai'i at Mānoa , Coconut Island, Kāne'ohe, HI , United States
| | | | | | | | | | | | | |
Collapse
|
37
|
Evaluating adaptive divergence between migratory and nonmigratory ecotypes of a salmonid fish, Oncorhynchus mykiss. G3-GENES GENOMES GENETICS 2013; 3:1273-85. [PMID: 23797103 PMCID: PMC3737167 DOI: 10.1534/g3.113.006817] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Next-generation sequencing and the application of population genomic and association approaches have made it possible to detect selection and unravel the genetic basis to variable phenotypic traits. The use of these two approaches in parallel is especially attractive in nonmodel organisms that lack a sequenced and annotated genome, but only works well when population structure is not confounded with the phenotype of interest. Herein, we use population genomics in a nonmodel fish species, rainbow trout (Oncorhynchus mykiss), to better understand adaptive divergence between migratory and nonmigratory ecotypes and to further our understanding about the genetic basis of migration. Restriction site-associated DNA (RAD) tag sequencing was used to identify single-nucleotide polymorphisms (SNPs) in migrant and resident O. mykiss from two systems, one in Alaska and the other in Oregon. A total of 7920 and 6755 SNPs met filtering criteria in the Alaska and Oregon data sets, respectively. Population genetic tests determined that 1423 SNPs were candidates for selection when loci were compared between resident and migrant samples. Previous linkage mapping studies that used RAD DNA tag SNPs were available to determine the position of 1990 markers. Several significant SNPs are located in genome regions that contain quantitative trait loci for migratory-related traits, reinforcing the importance of these regions in the genetic basis of migration/residency. Annotation of genome regions linked to significant SNPs revealed genes involved in processes known to be important in migration (such as osmoregulatory function). This study adds to our growing knowledge on adaptive divergence between migratory and nonmigratory ecotypes of this species; across studies, this complex trait appears to be controlled by many loci of small effect, with some in common, but many loci not shared between populations studied.
Collapse
|
38
|
Caballero A, García-Pereira MJ, Quesada H. Genomic distribution of AFLP markers relative to gene locations for different eukaryotic species. BMC Genomics 2013; 14:528. [PMID: 24060007 PMCID: PMC3750350 DOI: 10.1186/1471-2164-14-528] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2013] [Accepted: 07/30/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Amplified fragment length polymorphism (AFLP) markers are frequently used for a wide range of studies, such as genome-wide mapping, population genetic diversity estimation, hybridization and introgression studies, phylogenetic analyses, and detection of signatures of selection. An important issue to be addressed for some of these fields is the distribution of the markers across the genome, particularly in relation to gene sequences. RESULTS Using in-silico restriction fragment analysis of the genomes of nine eukaryotic species we characterise the distribution of AFLP fragments across the genome and, particularly, in relation to gene locations. First, we identify the physical position of markers across the chromosomes of all species. An observed accumulation of fragments around (peri) centromeric regions in some species is produced by repeated sequences, and this accumulation disappears when AFLP bands rather than fragments are considered. Second, we calculate the percentage of AFLP markers positioned within gene sequences. For the typical EcoRI/MseI enzyme pair, this ranges between 28 and 87% and is usually larger than that expected by chance because of the higher GC content of gene sequences relative to intergenic ones. In agreement with this, the use of enzyme pairs with GC-rich restriction sites substantially increases the above percentages. For example, using the enzyme system SacI/HpaII, 86% of AFLP markers are located within gene sequences in A. thaliana, and 100% of markers in Plasmodium falciparun. We further find that for a typical trait controlled by 50 genes of average size, if 1000 AFLPs are used in a study, the number of those within 1 kb distance from any of the genes would be only about 1-2, and only about 50% of the genes would have markers within that distance. CONCLUSIONS The high coverage of AFLP markers across the genomes and the high proportion of markers within or close to gene sequences make them suitable for genome scans and detecting large islands of differentiation in the genome. However, for specific traits, the percentage of AFLP markers close to genes can be rather small. Therefore, genome scans directed towards the search of markers closely linked to selected loci can be a difficult task in many instances.
Collapse
Affiliation(s)
- Armando Caballero
- Departamento de Bioquímica, Genética e Inmunología, Facultad de Biología, Universidade de Vigo, 36310 Vigo, Spain.
| | | | | |
Collapse
|
39
|
Catchen J, Hohenlohe PA, Bassham S, Amores A, Cresko WA. Stacks: an analysis tool set for population genomics. Mol Ecol 2013; 22:3124-40. [PMID: 23701397 DOI: 10.1111/mec.12354] [Citation(s) in RCA: 2139] [Impact Index Per Article: 194.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2012] [Revised: 04/16/2013] [Accepted: 04/16/2013] [Indexed: 02/06/2023]
Abstract
Massively parallel short-read sequencing technologies, coupled with powerful software platforms, are enabling investigators to analyse tens of thousands of genetic markers. This wealth of data is rapidly expanding and allowing biological questions to be addressed with unprecedented scope and precision. The sizes of the data sets are now posing significant data processing and analysis challenges. Here we describe an extension of the Stacks software package to efficiently use genotype-by-sequencing data for studies of populations of organisms. Stacks now produces core population genomic summary statistics and SNP-by-SNP statistical tests. These statistics can be analysed across a reference genome using a smoothed sliding window. Stacks also now provides several output formats for several commonly used downstream analysis packages. The expanded population genomics functions in Stacks will make it a useful tool to harness the newest generation of massively parallel genotyping data for ecological and evolutionary genetics.
Collapse
Affiliation(s)
- Julian Catchen
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403-5289, USA
| | | | | | | | | |
Collapse
|
40
|
Eaton DAR, Ree RH. Inferring phylogeny and introgression using RADseq data: an example from flowering plants (Pedicularis: Orobanchaceae). Syst Biol 2013; 62:689-706. [PMID: 23652346 PMCID: PMC3739883 DOI: 10.1093/sysbio/syt032] [Citation(s) in RCA: 326] [Impact Index Per Article: 29.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Phylogenetic relationships among recently diverged species are often difficult to resolve due to insufficient phylogenetic signal in available markers and/or conflict among gene trees. Here we explore the use of reduced-representation genome sequencing, specifically in the form of restriction-site associated DNA (RAD), for phylogenetic inference and the detection of ancestral hybridization in non-model organisms. As a case study, we investigate Pedicularis section Cyathophora, a systematically recalcitrant clade of flowering plants in the broomrape family (Orobanchaceae). Two methods of phylogenetic inference, maximum likelihood and Bayesian concordance, were applied to data sets that included as many as 40,000 RAD loci. Both methods yielded similar topologies that included two major clades: a "rex-thamnophila" clade, composed of two species and several subspecies with relatively low floral diversity, and geographically widespread distributions at lower elevations, and a "superba" clade, composed of three species characterized by relatively high floral diversity and isolated geographic distributions at higher elevations. Levels of molecular divergence between subspecies in the rex-thamnophila clade are similar to those between species in the superba clade. Using Patterson's D-statistic test, including a novel extension of the method that enables finer-grained resolution of introgression among multiple candidate taxa by removing the effect of their shared ancestry, we detect significant introgression among nearly all taxa in the rex-thamnophila clade, but not between clades or among taxa within the superba clade. These results suggest an important role for geographic isolation in the emergence of species barriers, by facilitating local adaptation and differentiation in the absence of homogenizing gene flow.
Collapse
Affiliation(s)
- Deren A R Eaton
- Committee on Evolutionary Biology, University of Chicago, Chicago, IL 60637, USA; and Botany Department, Field Museum of Natural History, Chicago, IL 60605, USA
| | | |
Collapse
|
41
|
Cariou M, Duret L, Charlat S. Is RAD-seq suitable for phylogenetic inference? An in silico assessment and optimization. Ecol Evol 2013; 3:846-52. [PMID: 23610629 PMCID: PMC3631399 DOI: 10.1002/ece3.512] [Citation(s) in RCA: 108] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2012] [Revised: 01/10/2013] [Accepted: 01/17/2013] [Indexed: 02/03/2023] Open
Abstract
INFERRING PHYLOGENETIC RELATIONSHIPS BETWEEN CLOSELY RELATED TAXA CAN BE HINDERED BY THREE FACTORS: (1) the lack of informative molecular variation at short evolutionary timescale; (2) the lack of established markers in poorly studied taxa; and (3) the potential phylogenetic conflicts among different genomic regions due to incomplete lineage sorting or introgression. In this context, Restriction site Associated DNA sequencing (RAD-seq) seems promising as this technique can generate sequence data from numerous DNA fragments scattered throughout the genome, from a large number of samples, and without preliminary knowledge on the taxa under study. However, divergence beyond the within-species level will necessarily reduce the number of conserved and non-duplicated restriction sites, and therefore the number of loci usable for phylogenetic inference. Here, we assess the suitability of RAD-seq for phylogeny using a simulated experiment on the 12 Drosophila genomes, with divergence times ranging from 5 to 63 million years. These simulations show that RAD-seq allows the recovery of the known Drosophila phylogeny with strong statistical support, even for relatively ancient nodes. Notably, this conclusion is robust to the potentially confounding effects of sequencing errors, heterozygosity, and low coverage. We further show that clustering RAD-seq data using the BLASTN and SiLiX programs significantly improves the recovery of orthologous RAD loci compared with previously proposed approaches, especially for distantly related species. This study therefore validates the view that RAD sequencing is a powerful tool for phylogenetic inference.
Collapse
Affiliation(s)
- Marie Cariou
- Université de Lyon, Université Lyon 1, CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive 43 boulevard du 11 novembre 1918, Villeurbanne, F-69622, France
| | | | | |
Collapse
|
42
|
Richards PM, Liu MM, Lowe N, Davey JW, Blaxter ML, Davison A. RAD-Seq derived markers flank the shell colour and banding loci of the Cepaea nemoralis supergene. Mol Ecol 2013; 22:3077-89. [PMID: 23496771 PMCID: PMC3712483 DOI: 10.1111/mec.12262] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2012] [Revised: 01/11/2013] [Accepted: 01/15/2013] [Indexed: 01/24/2023]
Abstract
Studies on the classic shell colour and banding polymorphism of the land snail Cepaea played a crucial role in establishing the importance of natural selection in maintaining morphological variation. Cepaea is also a pre-eminent model for ecological genetics because the outward colour and banding phenotype is entirely genetically determined, primarily by a 'supergene' of at least five loci. Unfortunately, progress in understanding the evolution and maintenance of the Cepaea polymorphism stalled, partly because of a lack of genetic markers. With a view to re-establish Cepaea as a prominent model of molecular ecology, we made six laboratory crosses of Cepaea nemoralis, five of which segregated for shell ground colour (C) and the presence or absence of bands (B). First, scoring of colour and banding in 323 individuals found no recombination between the C and B loci of the supergene. Second, using restriction site-associated DNA sequencing (RAD-Seq) of two parents and 22 offspring, we identified 44 anonymous markers putatively linked to the colour (C) and banding (B) loci. The genotype of eleven of the most promising RAD-Seq markers was independently validated in the same 22 offspring, then up to a further 146 offspring were genotyped. The closest RAD-Seq markers scored are within ~0.6 centimorgan (cM) of the C-B supergene linkage group, with the combined loci together forming a 35.8 cM linkage map of markers that flank both sides of the Cepaea C-B supergene.
Collapse
Affiliation(s)
- Paul M Richards
- Centre for Genetics and Genomics, School of Biology, University of Nottingham, Nottingham, NG7 2RD, UK
| | | | | | | | | | | |
Collapse
|
43
|
Reitzel AM, Herrera S, Layden MJ, Martindale MQ, Shank TM. Going where traditional markers have not gone before: utility of and promise for RAD sequencing in marine invertebrate phylogeography and population genomics. Mol Ecol 2013; 22:2953-70. [PMID: 23473066 DOI: 10.1111/mec.12228] [Citation(s) in RCA: 117] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2012] [Accepted: 12/11/2012] [Indexed: 12/20/2022]
Abstract
Characterization of large numbers of single-nucleotide polymorphisms (SNPs) throughout a genome has the power to refine the understanding of population demographic history and to identify genomic regions under selection in natural populations. To this end, population genomic approaches that harness the power of next-generation sequencing to understand the ecology and evolution of marine invertebrates represent a boon to test long-standing questions in marine biology and conservation. We employed restriction-site-associated DNA sequencing (RAD-seq) to identify SNPs in natural populations of the sea anemone Nematostella vectensis, an emerging cnidarian model with a broad geographic range in estuarine habitats in North and South America, and portions of England. We identified hundreds of SNP-containing tags in thousands of RAD loci from 30 barcoded individuals inhabiting four locations from Nova Scotia to South Carolina. Population genomic analyses using high-confidence SNPs resulted in a highly-resolved phylogeography, a result not achieved in previous studies using traditional markers. Plots of locus-specific FST against heterozygosity suggest that a majority of polymorphic sites are neutral, with a smaller proportion suggesting evidence for balancing selection. Loci inferred to be under balancing selection were mapped to the genome, where 90% were located in gene bodies, indicating potential targets of selection. The results from analyses with and without a reference genome supported similar conclusions, further highlighting RAD-seq as a method that can be efficiently applied to species lacking existing genomic resources. We discuss the utility of RAD-seq approaches in burgeoning Nematostella research as well as in other cnidarian species, particularly corals and jellyfishes, to determine phylogeographic relationships of populations and identify regions of the genome undergoing selection.
Collapse
Affiliation(s)
- A M Reitzel
- Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA 02543, USA.
| | | | | | | | | |
Collapse
|
44
|
Hancock‐Hanser BL, Frey A, Leslie MS, Dutton PH, Archer FI, Morin PA. Targeted multiplex next‐generation sequencing: advances in techniques of mitochondrial and nuclear
DNA
sequencing for population genomics. Mol Ecol Resour 2013; 13:254-68. [DOI: 10.1111/1755-0998.12059] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2012] [Revised: 11/30/2012] [Accepted: 12/06/2012] [Indexed: 11/27/2022]
Affiliation(s)
- Brittany L. Hancock‐Hanser
- Protected Resources Division Southwest Fisheries Science Center National Marine Fisheries Service, NOAA 8901 La Jolla Shores Drive La Jolla CA 92037 USA
| | - Amy Frey
- Protected Resources Division Southwest Fisheries Science Center National Marine Fisheries Service, NOAA 8901 La Jolla Shores Drive La Jolla CA 92037 USA
| | - Matthew S. Leslie
- Scripps Institution of Oceanography University of California San Diego 9500 Gilman Drive, MC 0202 La Jolla CA 92093 USA
| | - Peter H. Dutton
- Protected Resources Division Southwest Fisheries Science Center National Marine Fisheries Service, NOAA 8901 La Jolla Shores Drive La Jolla CA 92037 USA
| | - Frederick I. Archer
- Protected Resources Division Southwest Fisheries Science Center National Marine Fisheries Service, NOAA 8901 La Jolla Shores Drive La Jolla CA 92037 USA
| | - Phillip A. Morin
- Protected Resources Division Southwest Fisheries Science Center National Marine Fisheries Service, NOAA 8901 La Jolla Shores Drive La Jolla CA 92037 USA
| |
Collapse
|
45
|
Zhou L, Holliday JA. Targeted enrichment of the black cottonwood (Populus trichocarpa) gene space using sequence capture. BMC Genomics 2012; 13:703. [PMID: 23241106 PMCID: PMC3542275 DOI: 10.1186/1471-2164-13-703] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2012] [Accepted: 12/05/2012] [Indexed: 01/04/2023] Open
Abstract
Background High-throughput re-sequencing is rapidly becoming the method of choice for studies of neutral and adaptive processes in natural populations across taxa. As re-sequencing the genome of large numbers of samples is still cost-prohibitive in many cases, methods for genome complexity reduction have been developed in attempts to capture most ecologically-relevant genetic variation. One of these approaches is sequence capture, in which oligonucleotide baits specific to genomic regions of interest are synthesized and used to retrieve and sequence those regions. Results We used sequence capture to re-sequence most predicted exons, their upstream regulatory regions, as well as numerous random genomic intervals in a panel of 48 genotypes of the angiosperm tree Populus trichocarpa (black cottonwood, or ‘poplar’). A total of 20.76Mb (5%) of the poplar genome was targeted, corresponding to 173,040 baits. With 12 indexed samples run in each of four lanes on an Illumina HiSeq instrument (2x100 paired-end), 86.8% of the bait regions were on average sequenced at a depth ≥10X. Few off-target regions (>250bp away from any bait) were present in the data, but on average ~80bp on either side of the baits were captured and sequenced to an acceptable depth (≥10X) to call heterozygous SNPs. Nucleotide diversity estimates within and adjacent to protein-coding genes were similar to those previously reported in Populus spp., while intergenic regions had higher values consistent with a relaxation of selection. Conclusions Our results illustrate the efficiency and utility of sequence capture for re-sequencing highly heterozygous tree genomes, and suggest design considerations to optimize the use of baits in future studies.
Collapse
Affiliation(s)
- Lecong Zhou
- Department of Forest Resources and Environmental Conservation, Virginia Polytechnic Institute and State University, 304 Cheatham Hall, Blacksburg, VA 24061, USA
| | | |
Collapse
|
46
|
Two different high throughput sequencing approaches identify thousands of de novo genomic markers for the genetically depleted Bornean elephant. PLoS One 2012. [PMID: 23185354 PMCID: PMC3504023 DOI: 10.1371/journal.pone.0049533] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
High throughput sequencing technologies are being applied to an increasing number of model species with a high-quality reference genome. The application and analyses of whole-genome sequence data in non-model species with no prior genomic information are currently under way. Recent sequencing technologies provide new opportunities for gathering genomic data in natural populations, laying the empirical foundation for future research in the field of conservation and population genomics. Here we present the case study of the Bornean elephant, which is the most endangered subspecies of Asian elephant and exhibits very low genetic diversity. We used two different sequencing platforms, the Roche 454 FLX (shotgun) and Illumina, GAIIx (Restriction site associated DNA, RAD) to evaluate the feasibility of the two methodologies for the discovery of de novo markers (single nucleotide polymorphism, SNPs and microsatellites) using low coverage data. Approximately, 6,683 (shotgun) and 14,724 (RAD) SNPs were detected within our elephant sequence dataset. Genotyping of a representative sample of 194 SNPs resulted in a SNP validation rate of ∼ 83 to 94% and 17% of the loci were polymorphic with a low diversity (Ho = 0.057). Different numbers of microsatellites were identified through shotgun (27,226) and RAD (868) techniques. Out of all di-, tri-, and tetra-microsatellite loci, 1,706 loci had sufficient flanking regions (shotgun) while only 7 were found with RAD. All microsatellites were monomorphic in the Bornean but polymorphic in another elephant subspecies. Despite using different sample sizes, and the well known differences in the two platforms used regarding sequence length and throughput, the two approaches showed high validation rate. The approaches used here for marker development in a threatened species demonstrate the utility of high throughput sequencing technologies as a starting point for the development of genomic tools in a non-model species and in particular for a species with low genetic diversity.
Collapse
|
47
|
Challenges and pitfalls in the characterization of anonymous outlier AFLP markers in non-model species: lessons from an ocellated lizard genome scan. Heredity (Edinb) 2012; 109:340-8. [PMID: 22892639 DOI: 10.1038/hdy.2012.48] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
In the last few years, dozens of studies have documented the detection of loci influenced by selection from genome scans in a wide range of non-model species. Many of those studies used amplified fragment length polymorphism (AFLP) markers, which became popular for being easily applicable to any organism. However, because they are anonymous markers, AFLPs impose many challenges for their isolation and identification. Most recent AFLP genome scans used capillary electrophoresis (CE), which adds even more obstacles to the isolation of bands with a specific size for sequencing. These caveats might explain the extremely low number of studies that moved from the detection of outlier AFLP markers to their actual isolation and characterization. We document our efforts to characterize a set of outlier AFLP markers from a previous genome scan with CE in ocellated lizards (Lacerta lepida). Seven outliers were successfully isolated, cloned and sequenced. Their sequences are noncoding and show internal indels or polymorphic repetitive elements (microsatellites). Three outliers were converted into codominant markers by using specific internal primers to sequence and screen population variability from undigested DNA. Amplification in closely related lizard species was also achieved, revealing remarkable interspecific conservation in outlier loci sequences. We stress the importance of following up AFLP genome scans to validate selection signatures of outlier loci, but also report the main challenges and pitfalls that may be faced during the process.
Collapse
|
48
|
Funk WC, McKay JK, Hohenlohe PA, Allendorf FW. Harnessing genomics for delineating conservation units. Trends Ecol Evol 2012; 27:489-96. [PMID: 22727017 DOI: 10.1016/j.tree.2012.05.012] [Citation(s) in RCA: 527] [Impact Index Per Article: 43.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2012] [Revised: 05/25/2012] [Accepted: 05/25/2012] [Indexed: 01/07/2023]
Abstract
Genomic data have the potential to revolutionize the delineation of conservation units (CUs) by allowing the detection of adaptive genetic variation, which is otherwise difficult for rare, endangered species. In contrast to previous recommendations, we propose that the use of neutral versus adaptive markers should not be viewed as alternatives. Rather, neutral and adaptive markers provide different types of information that should be combined to make optimal management decisions. Genetic patterns at neutral markers reflect the interaction of gene flow and genetic drift that affects genome-wide variation within and among populations. This population genetic structure is what natural selection operates on to cause adaptive divergence. Here, we provide a new framework to integrate data on neutral and adaptive markers to protect biodiversity.
Collapse
Affiliation(s)
- W Chris Funk
- Department of Biology, Graduate Degree Program in Ecology, Colorado State University, Fort Collins, CO 80523, USA.
| | | | | | | |
Collapse
|
49
|
Coding Gene SNP Mapping Reveals QTL Linked to Growth and Stress Response in Brook Charr (Salvelinus fontinalis). G3-GENES GENOMES GENETICS 2012; 2:707-20. [PMID: 22690380 PMCID: PMC3362300 DOI: 10.1534/g3.112.001990] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/11/2012] [Accepted: 04/16/2012] [Indexed: 11/27/2022]
Abstract
Growth performance and reduced stress response are traits of major interest in fish production. Growth and stress-related quantitative trait loci (QTL) have been already identified in several salmonid species, but little effort has been devoted to charrs (genus Salvelinus). Moreover, most QTL studies to date focused on one or very few traits, and little investigation has been devoted to QTL identification for gene expression. Here, our objective was to identify QTL for 27 phenotypes related to growth and stress responses in brook charr (Salvelinus fontinalis), which is one of the most economically important freshwater aquaculture species in Canada. Phenotypes included 12 growth parameters, six blood and plasma variables, three hepatic variables, and one plasma hormone level as well as the relative expression measurements of five genes of interest linked to growth regulation. QTL analysis relied on a linkage map recently built from S. fontinalis consisting of both single-nucleotide polymorphism (SNP, n = 266) and microsatellite (n =81) markers in an F2 interstrain hybrid population (n = 171). We identified 63 growth-related QTL and four stress-related QTL across 18 of the 40 linkage groups of the brook charr linkage map. Percent variance explained, confidence interval, and allelic QTL effects also were investigated to provide insight into the genetic architecture of growth- and stress-related QTL. QTL related to growth performance and stress response that were identified could be classified into two groups: (1) a group composed of the numerous, small-effect QTL associated with some traits related to growth (i.e., weight) that may be under the control of a large number of genes or pleiotropic genes, and (2) a group of less numerous QTL associated with growth (i.e., gene expression) and with stress-related QTL that display a larger effect, suggesting that these QTL are under the control of a limited number of genes of major effect. This study represents a first step toward the identification of genes potentially linked to phenotypic variation of growth and stress response in brook charr. The ultimate goal is to provide new tools for developing Molecular Assisted Selection for this species.
Collapse
|
50
|
Angeloni F, Wagemaker N, Vergeer P, Ouborg J. Genomic toolboxes for conservation biologists. Evol Appl 2011; 5:130-43. [PMID: 25568036 PMCID: PMC3353346 DOI: 10.1111/j.1752-4571.2011.00217.x] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2011] [Accepted: 10/18/2011] [Indexed: 12/01/2022] Open
Abstract
Conservation genetics is expanding its research horizon with a genomic approach, by incorporating the modern techniques of next-generation sequencing (NGS). Application of NGS overcomes many limitations of conservation genetics. First, NGS allows for genome-wide screening of markers, which may lead to a more representative estimation of genetic variation within and between populations. Second, NGS allows for distinction between neutral and non-neutral markers. By screening populations on thousands of single nucleotide polymorphism markers, signals of selection can be found for some markers. Variation in these markers will give insight into functional rather than neutral genetic variation. Third, NGS facilitates the study of gene expression. Conservation genomics will increase our insight in how the environment and genes interact to affect phenotype and fitness. In addition, the NGS approach opens a way to study processes such as inbreeding depression and local adaptation mechanistically. Conservation genetics programs are directed to a fundamental understanding of the processes involved in conservation genetics and should preferably be started in species for which large databases on ecology, demography and genetics are available. Here, we describe and illustrate the connection between the application of NGS technologies and the research questions in conservation. The perspectives of conservation genomics programs are also discussed.
Collapse
Affiliation(s)
- Francesco Angeloni
- Institute for Water and Wetland Research (IWWR), Department of Molecular Ecology, Radboud University Nijmegen AJ Nijmegen, The Netherlands
| | - Niels Wagemaker
- Institute for Water and Wetland Research (IWWR), Department of Molecular Ecology, Radboud University Nijmegen AJ Nijmegen, The Netherlands
| | - Philippine Vergeer
- Institute for Water and Wetland Research (IWWR), Department of Molecular Ecology, Radboud University Nijmegen AJ Nijmegen, The Netherlands
| | - Joop Ouborg
- Institute for Water and Wetland Research (IWWR), Department of Molecular Ecology, Radboud University Nijmegen AJ Nijmegen, The Netherlands
| |
Collapse
|