1
Duval H, Coindre E, Ramos-Onsins SE, Alexiou KG, Rubio-Cabetas MJ, Martínez-García PJ, Wirthensohn M, Dhingra A, Samarina A, Arús P. Development and Evaluation of an Axiom™ 60K SNP Array for Almond (Prunus dulcis). Plants (Basel) 2023; 12:242. [PMID: 36678957 PMCID: PMC9866729 DOI: 10.3390/plants12020242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/02/2022] [Revised: 12/24/2022] [Accepted: 12/29/2022] [Indexed: 06/17/2023]
Abstract
A high-density single nucleotide polymorphism (SNP) array is essential to enable faster progress in plant breeding for new cultivar development. To this end, we developed an Axiom 60K almond SNP array by resequencing 81 almond accessions. To validate the array, a set of 210 accessions was genotyped, and 82.8% of the SNPs fell into the best recommended SNP category. The rate of missing data was between 0.4% and 2.7% for the almond accessions and less than 15.5% for the few peach and wild accessions, suggesting that this array can be used for peach and interspecific peach × almond genetic studies. The values of the two SNPs linked to the RMja (nematode resistance) and SK (bitterness) genes were consistent. We also genotyped 49 hybrids from an almond F2 progeny and built a genetic map with a set of 1159 SNPs. Error rates, evaluated by comparing replicates and by detecting departures from Mendelian inheritance in the F2 progeny, were less than 1%. This almond array is commercially available and should be a cost-effective genotyping tool for the search for new genes and quantitative trait loci (QTL) involved in the control of agronomic traits.
Affiliation(s)
- Henri Duval
  - Unité de Génétique et Amélioration des Fruits et Légumes (GAFL), INRAE (French National Research Institute for Agriculture, Food and Environment), 84143 Montfavet, France
- Eva Coindre
  - Unité de Génétique et Amélioration des Fruits et Légumes (GAFL), INRAE (French National Research Institute for Agriculture, Food and Environment), 84143 Montfavet, France
- Sebastian E. Ramos-Onsins
  - Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Carrer de la Vall Moronta, Edifici CRAG, Campus UAB, Cerdanyola del Valles, 08193 Barcelona, Spain
- Konstantinos G. Alexiou
  - Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Carrer de la Vall Moronta, Edifici CRAG, Campus UAB, Cerdanyola del Valles, 08193 Barcelona, Spain
  - IRTA (Institute of Agrifood Research and Technology), Campus UAB, Edifici CRAG, Cerdanyola del Valles (Bellaterra), 08193 Barcelona, Spain
- Maria J. Rubio-Cabetas
  - CITA (Agrifood Research and Technology Centre of Aragon), Department of Plant Science, Avda. Montañana 930, 50059 Zaragoza, Spain
- Pedro J. Martínez-García
  - CEBAS (Centro de Edafología y Biología Aplicada del Segura), CSIC, Department of Plant Breeding, Campus Universitario de Espinardo, 30100 Espinardo, Spain
- Michelle Wirthensohn
  - Waite Research Institute, University of Adelaide, PMB 1, Glen Osmond, SA 5064, Australia
- Amit Dhingra
  - Department of Horticulture, Washington State University, Pullman, WA 99164-6414, USA
- Anna Samarina
  - Thermo Fisher Scientific, Frankfurter Str. 129B, 64293 Darmstadt, Germany
- Pere Arús
  - Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Carrer de la Vall Moronta, Edifici CRAG, Campus UAB, Cerdanyola del Valles, 08193 Barcelona, Spain
  - IRTA (Institute of Agrifood Research and Technology), Campus UAB, Edifici CRAG, Cerdanyola del Valles (Bellaterra), 08193 Barcelona, Spain
2
Rodríguez-Valera Y, Rocha D, Naves M, Renand G, Pérez-Pineda E, Ramayo-Caldas Y, Ramos-Onsins SE. The Identification of Runs of Homozygosity Gives a Focus on the Genetic Diversity and Adaptation of the "Charolais de Cuba" Cattle. Animals (Basel) 2020; 10:ani10122233. [PMID: 33261195 PMCID: PMC7760288 DOI: 10.3390/ani10122233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/09/2020] [Accepted: 11/23/2020] [Indexed: 11/16/2022]
Abstract
Simple Summary: The Charolais de Cuba cattle is a tropically adapted breed founded in Cuba around 120 years ago from French Charolais specimens. It is still a closed breed and remains a small population. In this work, we analyzed the inbreeding and diversity patterns, as well as the population size, of this recently adapted breed via a run of homozygosity (ROH) analysis. We found that genomic inbreeding levels are higher in the Charolais de Cuba breed than in French and British Charolais populations. Nevertheless, the effective population size declined in a very similar way during the last century in the three Charolais populations studied. Finally, a number of regions with exceptional patterns of long homozygosity were identified in this breed, and these could be related to processes of adaptation to tropical conditions.
Abstract: Inbreeding and effective population size (Ne) are fundamental indicators for the management and conservation of genetic diversity in populations. Genomic inbreeding gives accurate estimates of inbreeding, and Ne determines the rate of loss of genetic variation. The objective of this work was to study the distribution of runs of homozygosity (ROHs) in order to estimate genomic inbreeding (F_ROH) and effective population size using 38,789 Single Nucleotide Polymorphisms (SNPs) from the Illumina Bovine 50K BeadChip in 86 samples from populations of Charolais de Cuba (n = 40) cattle, and to compare this information with French (n = 20) and British Charolais (n = 26) populations. In the Cuban, French, and British Charolais populations, the average genomic inbreeding values estimated with the F_ROH statistic were 5.7%, 3.4%, and 4%, respectively. The dispersion, measured by the coefficient of variation, was high at 43.9%, 37.0%, and 54.2%, respectively. The effective population size experienced a very similar decline during the last century in Charolais de Cuba (from 139 to 23 individuals), in French Charolais (from 142 to 12), and in British Charolais (from 145 to 14) over the last ~20 generations. However, the high variability found in the ROH indicators and F_ROH reveals an opportunity for maintaining the genetic diversity of this breed with an adequate mating strategy, which can be supported by the use of molecular markers. Moreover, the detected ROHs were compared to previous results on the detection of signatures of selection in the same breed. Some of the observed signatures were confirmed by the ROHs, emphasizing the process of adaptation to tropical climate experienced by the Charolais de Cuba population.
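As an illustration of the two quantities reported in this abstract, the sketch below computes a genomic inbreeding coefficient from ROH lengths and the coefficient of variation across animals. It assumes the common definition of F_ROH as the summed ROH length divided by the length of the autosomal genome covered by SNPs; the abstract does not spell out the exact definition, ROH-calling software, or thresholds used, and the function names and the numbers in the example are hypothetical.

```python
from statistics import mean, stdev


def f_roh(roh_lengths_bp, covered_genome_bp):
    """Fraction of the covered autosomal genome that lies inside ROHs.

    Assumed definition: sum of ROH lengths / genome length covered by SNPs.
    """
    if covered_genome_bp <= 0:
        raise ValueError("covered_genome_bp must be positive")
    return sum(roh_lengths_bp) / covered_genome_bp


def coefficient_of_variation(values):
    """Sample standard deviation as a percentage of the mean."""
    return 100.0 * stdev(values) / mean(values)


if __name__ == "__main__":
    # Hypothetical animals: ROH lengths in base pairs (not data from the study).
    genome_bp = 2_500_000_000
    animals = {
        "animal_1": [50_000_000, 75_000_000],
        "animal_2": [40_000_000, 35_000_000],
        "animal_3": [100_000_000, 25_000_000, 50_000_000],
    }
    per_animal = {name: f_roh(rohs, genome_bp) for name, rohs in animals.items()}
    for name, value in per_animal.items():
        print(f"{name}: F_ROH = {value:.3f}")
    print(f"CV = {coefficient_of_variation(list(per_animal.values())):.1f}%")
```

A population-level figure such as the 5.7% quoted for Charolais de Cuba would correspond to the mean of the per-animal values under this definition.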
Affiliation(s)
- Yoel Rodríguez-Valera
  - Faculty of Agricultural Sciences, University of Granma, Bayamo 95100, Cuba
- Dominique Rocha
  - GABI, INRAE, AgroParisTech, University Paris-Saclay, F-78350 Jouy-en-Josas, France
- Michel Naves
  - INRAE, URZ, 97170 Petit Bourg, Guadeloupe, France
- Gilles Renand
  - GABI, INRAE, AgroParisTech, University Paris-Saclay, F-78350 Jouy-en-Josas, France
- Eliecer Pérez-Pineda
  - Faculty of Agricultural Sciences, University of Granma, Bayamo 95100, Cuba
- Yuliaxis Ramayo-Caldas
  - GABI, INRAE, AgroParisTech, University Paris-Saclay, F-78350 Jouy-en-Josas, France
  - Animal Breeding and Genetics Program, Institute for Research and Technology in Food and Agriculture (IRTA), Torre Marimon, 08140 Caldes de Montbui, Spain
  - Correspondence: (Y.R.-C.); (S.E.R.-O.)
- Sebastian E. Ramos-Onsins
  - Plant and Animal Genomics, Centre of Research in Agricultural Genomics (CRAG) Consortium CSIC-IRTA-UAB-UB, Campus UAB, 08193 Bellaterra, Spain
  - Correspondence: (Y.R.-C.); (S.E.R.-O.)
3
Jadhav A, Zhao L, Liu W, Ding C, Nair V, Ramos-Onsins SE, Ferretti L. Genomic Diversity and Evolution of Quasispecies in Newcastle Disease Virus Infections. Viruses 2020; 12:v12111305. [PMID: 33202558 PMCID: PMC7698180 DOI: 10.3390/v12111305] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 09/30/2020] [Revised: 10/31/2020] [Accepted: 11/06/2020] [Indexed: 12/19/2022]
Abstract
Newcastle disease virus (NDV) infections are well known to harbour quasispecies, due to the error-prone nature of the RNA polymerase. Quasispecies variants in the fusion cleavage site of the virus are known to significantly change its virulence. However, little is known about the genomic patterns of diversity and selection in NDV viral swarms. We analyse deep sequencing data from in vitro and in vivo NDV infections to uncover the genomic patterns of diversity and the signatures of selection within NDV swarms. Variants in viruses from in vitro samples are mostly localised in non-coding regions and 3′ and 5′ untranslated regions (3′UTRs or 5′UTRs), while in vivo samples contain an order of magnitude more variants. We find different patterns of genomic divergence and diversity among NDV genotypes, as well as differences in the genomic distribution of intra-host variants among in vitro and in vivo infections of the same strain. The frequency spectrum shows clear signatures of intra-host purifying selection in vivo on the matrix protein (M) coding gene and positive or diversifying selection on nucleocapsid (NP) and haemagglutinin-neuraminidase (HN). The comparison between within-host polymorphisms and phylogenetic divergence reveals complex patterns of selective pressure on the NDV genome at between- and within-host level. The M sequence is strongly constrained both between and within hosts, fusion protein (F) coding gene is under intra-host positive selection, and NP and HN show contrasting patterns: HN RNA sequence is positively selected between hosts while its protein sequence is positively selected within hosts, and NP is under intra-host positive selection at the RNA level and negative selection at the protein level.
Affiliation(s)
- Archana Jadhav
  - Viral Oncogenesis Group, The Pirbright Institute, Pirbright, Woking GU24 0NF, Surrey, UK
- Lele Zhao
  - Nuffield Department of Medicine, Li Ka Shing Centre for Health Information and Discovery, Big Data Institute, University of Oxford, Oxford OX3 7LF, UK
- Weiwei Liu
  - Department of Avian Infectious Diseases, Shanghai Veterinary Research Institute, Chinese Academy of Agricultural Science, Shanghai 200241, China
- Chan Ding
  - Department of Avian Infectious Diseases, Shanghai Veterinary Research Institute, Chinese Academy of Agricultural Science, Shanghai 200241, China
- Venugopal Nair
  - Viral Oncogenesis Group, The Pirbright Institute, Pirbright, Woking GU24 0NF, Surrey, UK
  - UK-China Centre of Excellence on Avian Disease Research, Pirbright, Woking GU24 0NF, Surrey, UK
- Sebastian E. Ramos-Onsins
  - Plant and Animal Genomics, Centre de Recerca en Agrigenòmica (CRAG) CSIC-IRTA-UAB-UB, 08193 Bellaterra, Spain
  - Correspondence: (S.E.R.-O.); (L.F.)
- Luca Ferretti
  - Nuffield Department of Medicine, Li Ka Shing Centre for Health Information and Discovery, Big Data Institute, University of Oxford, Oxford OX3 7LF, UK
  - Correspondence: (S.E.R.-O.); (L.F.)
4
Abstract
The Site Frequency Spectrum (SFS) and the heterozygosity of allelic variants are among the most important summary statistics for population genetic analysis of diploid organisms. We discuss the generalization of these statistics to populations of autopolyploid organisms in terms of the joint Site Frequency/Dosage Spectrum and its expected value for autopolyploid populations that follow the standard neutral model. Based on these results, we present estimators of nucleotide variability from High-Throughput Sequencing (HTS) data of autopolyploids and discuss potential issues related to sequencing errors and variant calling. We use these estimators to generalize Tajima's D and other SFS-based neutrality tests to HTS data from autopolyploid organisms. Finally, we discuss how these approaches fail when the number of individuals is small. In fact, in autopolyploids there are many possible deviations from the Hardy–Weinberg equilibrium, each reflected in a different shape of the individual dosage distribution. The SFS from small samples is often dominated by the shape of these deviations of the dosage distribution from its Hardy–Weinberg expectations.
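For orientation, the sketch below computes the standard (haploid-sample) versions of the statistics that this abstract generalizes: Watterson's estimator, nucleotide diversity, and Tajima's D, all derived from an unfolded site frequency spectrum. It does not implement the autopolyploid estimators or the joint frequency/dosage spectrum described in the paper; the function name and input layout are assumptions made for illustration.

```python
import math


def sfs_statistics(sfs):
    """Return (theta_W, pi, Tajima's D) from an unfolded SFS.

    sfs[i - 1] is the number of sites at which the derived allele is seen
    i times in a sample of n sequences, for i = 1 .. n - 1.
    Values are per region (not per site).
    """
    n = len(sfs) + 1
    if n < 4:
        # For n = 2 or 3 the variance term of Tajima's D is zero.
        raise ValueError("need at least 4 sequences")

    s = sum(sfs)  # number of segregating sites
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))

    theta_w = s / a1
    pi = sum(x * 2.0 * i * (n - i) for i, x in enumerate(sfs, start=1)) / (n * (n - 1))

    # Variance constants from Tajima (1989).
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)

    variance = e1 * s + e2 * s * (s - 1)
    d = (pi - theta_w) / math.sqrt(variance) if variance > 0 else float("nan")
    return theta_w, pi, d


if __name__ == "__main__":
    # A spectrum proportional to 1/i (the neutral expectation) gives D close to 0.
    print(sfs_statistics([12, 6, 4, 3]))
    # An excess of singletons gives a negative D.
    print(sfs_statistics([10, 0, 0, 0]))
```

In an autopolyploid sample the counts would have to be built from allele dosages per individual rather than from independent sequences, which is the complication the abstract discusses for small numbers of individuals.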
5
Guirao-Rico S, Ramirez O, Ojeda A, Amills M, Ramos-Onsins SE. Porcine Y-chromosome variation is consistent with the occurrence of paternal gene flow from non-Asian to Asian populations. Heredity (Edinb) 2017; 120:63-76. [PMID: 29234173 DOI: 10.1038/s41437-017-0002-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Received: 05/11/2017] [Accepted: 06/21/2017] [Indexed: 11/09/2022]
Abstract
Pigs (Sus scrofa) originated in Southeast Asia and expanded to Europe and North Africa approximately 1 MYA. Analyses of porcine Y-chromosome variation have shown the existence of two main haplogroups that are highly divergent, a result that is consistent with previous mitochondrial and autosomal data showing that the Asian and non-Asian pig populations remained geographically isolated until recently. Paradoxically, one of these Y-chromosome haplogroups is extensively shared by pigs and wild boars from Asia and Europe, an observation that is difficult to reconcile with a scenario of prolonged geographic isolation. To shed light on this issue, we genotyped 33 Y-linked SNPs and one indel in a worldwide sample of pigs and wild boars and sequenced a total of 9903 nucleotide sites from seven loci distributed along the Y-chromosome. Notably, the nucleotide diversity per site at the Y-linked loci (0.0015 in Asian pigs) displayed the same order of magnitude as that described for autosomal loci (~0.0023), a finding compatible with a process of sustained and intense isolation. We performed an approximate Bayesian computation analysis focused on the paternal diversity of wild boars and local pig breeds in which we compared three demographic models: two isolation models (I models) differing in the time of isolation and a model of isolation with recent unidirectional migration (IM model). Our results suggest that the most likely explanation for the extensive sharing of one Y-chromosome haplogroup between non-Asian and Asian populations is a recent and unidirectional (non-Asian > Asian) paternal migration event.
Affiliation(s)
- Sara Guirao-Rico
  - Plant and Animal Genomics Program, Center for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus Universitat Autònoma Barcelona, Bellaterra, Spain
- Oscar Ramirez
  - Facultat de Veterinària, Departament de Ciència Animal i dels Aliments, Universitat Autònoma de Barcelona, Bellaterra, Spain
  - Vetgenomics, Edifici Eureka, Campus Universitat Autònoma Barcelona, Bellaterra, Spain
- Ana Ojeda
  - Facultat de Veterinària, Departament de Ciència Animal i dels Aliments, Universitat Autònoma de Barcelona, Bellaterra, Spain
- Marcel Amills
  - Plant and Animal Genomics Program, Center for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus Universitat Autònoma Barcelona, Bellaterra, Spain
  - Facultat de Veterinària, Departament de Ciència Animal i dels Aliments, Universitat Autònoma de Barcelona, Bellaterra, Spain
- Sebastian E Ramos-Onsins
  - Plant and Animal Genomics Program, Center for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus Universitat Autònoma Barcelona, Bellaterra, Spain
6
Abstract
Coalescent theory is a powerful tool for population geneticists as well as molecular biologists interested in understanding the patterns and levels of DNA variation. Using coalescent Monte Carlo simulations it is possible to obtain the empirical distributions for a number of statistics across a wide range of evolutionary models; these distributions can be used to test evolutionary hypotheses using experimental data. The mlcoalsim application presented here (based on a version of the ms program, Hudson, 2002) adds important new features to improve methodology (uncertainty and conditional methods for mutation and recombination), models (including strong positive selection, finite sites and heterogeneity in mutation and recombination rates) and analyses (calculating a number of statistics used in population genetics and P-values for observed data). One of the most important features of mlcoalsim is the analysis of multilocus data in linked and independent regions. In summary, mlcoalsim is an integrated software application aimed at researchers interested in molecular evolution. mlcoalsim is written in ANSI C and is available at: http://www.ub.es/softevol/mlcoalsim .
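The idea of building an empirical distribution by coalescent Monte Carlo simulation and reading a P-value from it can be sketched in a few lines. The toy example below simulates only the number of segregating sites under the standard neutral model (constant size, no recombination, infinite sites); it is not mlcoalsim or ms, and none of the function names or parameters below come from those programs.

```python
import random


def simulate_segregating_sites(n, theta, rng):
    """One neutral coalescent replicate; returns the number of segregating sites.

    Time is measured in units of 2N generations, so while k lineages remain
    the waiting time is exponential with rate k(k-1)/2, and mutations fall on
    the tree at rate theta/2 per unit of branch length (theta = 4*N*mu).
    """
    total_branch_length = 0.0
    for k in range(n, 1, -1):
        waiting_time = rng.expovariate(k * (k - 1) / 2.0)
        total_branch_length += k * waiting_time

    # Poisson number of mutations with mean theta/2 * total length, drawn by
    # counting unit-rate exponential arrivals that fit inside that mean.
    expected_mutations = 0.5 * theta * total_branch_length
    mutations = 0
    elapsed = rng.expovariate(1.0)
    while elapsed < expected_mutations:
        mutations += 1
        elapsed += rng.expovariate(1.0)
    return mutations


def empirical_p_value(observed, simulated):
    """Upper-tail P-value: share of simulated values >= the observed one."""
    return sum(1 for value in simulated if value >= observed) / len(simulated)


if __name__ == "__main__":
    rng = random.Random(1)
    # Hypothetical settings: 10 sequences, theta = 5 for the region.
    null_distribution = [simulate_segregating_sites(10, 5.0, rng) for _ in range(5000)]
    observed_s = 30  # hypothetical observation
    print("P(S >= observed) =", empirical_p_value(observed_s, null_distribution))
```

The same pattern extends to other summary statistics and to richer models: simulate under the null, compute the statistic for each replicate, and compare the observed value with the resulting distribution.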
Affiliation(s)
- Sebastian E. Ramos-Onsins
  - Max-Planck Institute for Chemical Ecology, Hans-Knöll Str. 8, D-07745 Jena, Germany
  - Present address: Departament de Genètica, Universitat de Barcelona, Diagonal 645, Barcelona, Spain
- Thomas Mitchell-Olds
  - Max-Planck Institute for Chemical Ecology, Hans-Knöll Str. 8, D-07745 Jena, Germany
  - Present address: Department of Biology, Duke University, Durham, NC 27708, USA
7
Ramos-Onsins SE, Burgos-Paz W, Manunza A, Amills M. Mining the pig genome to investigate the domestication process. Heredity (Edinb) 2014; 113:471-84. [PMID: 25074569 PMCID: PMC4815588 DOI: 10.1038/hdy.2014.68] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Received: 12/06/2013] [Revised: 05/22/2014] [Accepted: 06/09/2014] [Indexed: 12/11/2022]
Abstract
Pig domestication began around 9000 YBP in the Fertile Crescent and Far East, involving marked morphological and genetic changes that occurred in a relatively short window of time. Identifying the alleles that drove the behavioural and physiological transformation of wild boars into pigs through artificial selection constitutes a formidable challenge that can only be faced from an interdisciplinary perspective. Indeed, although basic facts regarding the demography of pig domestication and dispersal have been uncovered, the biological substrate of these processes remains enigmatic. Considerable hope has been placed on new approaches, based on next-generation sequencing, which allow whole-genome variation to be analyzed at the population level. In this review, we provide an outline of the current knowledge on pig domestication by considering both archaeological and genetic data. Moreover, we discuss several potential scenarios of genome evolution under the complex mixture of demography and selection forces at play during domestication. Finally, we highlight several technical and methodological approaches that may represent significant advances in resolving the conundrum of livestock domestication.
Affiliation(s)
- S E Ramos-Onsins
  - Department of Animal Genetics, Center for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus Universitat Autònoma Barcelona, Bellaterra, Spain
- W Burgos-Paz
  - Department of Animal Genetics, Center for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus Universitat Autònoma Barcelona, Bellaterra, Spain
- A Manunza
  - Department of Animal Genetics, Center for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus Universitat Autònoma Barcelona, Bellaterra, Spain
- M Amills
  - Department of Animal Genetics, Center for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus Universitat Autònoma Barcelona, Bellaterra, Spain
8
Abstract
Although many computer programs can perform population genetics calculations, they are typically limited in the analyses and data input formats they offer; few applications can process the large data sets produced by whole-genome resequencing projects. Furthermore, there is no coherent framework for the easy integration of new statistics into existing pipelines, hindering the development and application of new population genetics and genomics approaches. Here, we present PopGenome, a population genomics package for the R software environment (a de facto standard for statistical analyses). PopGenome can efficiently process genome-scale data as well as large sets of individual loci. It reads DNA alignments and single-nucleotide polymorphism (SNP) data sets in most common formats, including those used by the HapMap, 1000 human genomes, and 1001 Arabidopsis genomes projects. PopGenome also reads associated annotation files in GFF format, enabling users to easily define regions or classify SNPs based on their annotation; all analyses can also be applied to sliding windows. PopGenome offers a wide range of diverse population genetics analyses, including neutrality tests as well as statistics for population differentiation, linkage disequilibrium, and recombination. PopGenome is linked to Hudson’s MS and Ewing’s MSMS programs to assess statistical significance based on coalescent simulations. PopGenome’s integration in R facilitates effortless and reproducible downstream analyses as well as the production of publication-quality graphics. Developers can easily incorporate new analyses methods into the PopGenome framework. PopGenome and R are freely available from CRAN (http://cran.r-project.org/) for all major operating systems under the GNU General Public License.
Affiliation(s)
- Bastian Pfeifer
  - Institute for Computer Science, Heinrich Heine University, Düsseldorf, Germany
- Martin J Lercher
  - Institute for Computer Science, Heinrich Heine University, Düsseldorf, Germany
  - Cluster of Excellence on Plant Sciences, Düsseldorf, Germany
9
Esteve-Codina A, Paudel Y, Ferretti L, Raineri E, Megens HJ, Silió L, Rodríguez MC, Groenen MAM, Ramos-Onsins SE, Pérez-Enciso M. Dissecting structural and nucleotide genome-wide variation in inbred Iberian pigs. BMC Genomics 2013; 14:148. [PMID: 23497037 PMCID: PMC3601988 DOI: 10.1186/1471-2164-14-148] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Received: 10/05/2012] [Accepted: 02/21/2013] [Indexed: 11/16/2022]
Abstract
BACKGROUND: In contrast to international pig breeds, the Iberian breed has not been admixed with Asian germplasm. This makes it an important model to study both domestication and the relevance of Asian genes in the pig. In addition, Iberian pigs exhibit high meat quality as well as appetite and propensity to obesity. Here we provide a genome-wide analysis of nucleotide and structural diversity in a reduced representation library from a pool (n = 9 sows) and shotgun genomic sequence from a single sow of the highly inbred Guadyerbas strain. In the pool, we applied newly developed tools to account for the peculiarities of these data.
RESULTS: A total of 254,106 SNPs in the pool (79.6 Mb covered) and 643,783 in the Guadyerbas sow (1.47 Gb covered) were called. The nucleotide diversity (1.31 × 10⁻³ per bp in autosomes) is very similar to that reported in wild boar. A much lower than expected diversity in the X chromosome was confirmed (1.79 × 10⁻⁴ per bp in the individual and 5.83 × 10⁻⁴ per bp in the pool). A strong (0.70) correlation between recombination and variability was observed, but not with gene density or GC content. Multicopy regions affected about 4% of annotated pig genes in their entirety, and 2% of the genes partially. Genes within the lowest variability windows comprised interferon genes and, in chromosome X, genes involved in behavior like HTR2C or MECP2. A modified Hudson-Kreitman-Aguadé test for pools also indicated an accelerated evolution in genes involved in behavior, as well as in spermatogenesis and in lipid metabolism.
CONCLUSIONS: This work illustrates the strength of current sequencing technologies to picture a comprehensive landscape of variability in livestock species, and to pinpoint regions containing genes potentially under selection. Among those genes, we report genes involved in behavior, including feeding behavior, and lipid metabolism. The pig X chromosome is an outlier in terms of nucleotide diversity, which suggests selective constraints. Our data further confirm the importance of structural variation in the species, including Iberian pigs, and allowed us to identify new paralogs for known gene families.
Affiliation(s)
- Anna Esteve-Codina
  - Center for Research in Agricultural Genomics (CRAG), Campus UAB, Bellaterra, 08193, Spain
  - Departament de Ciència Animal i dels Aliments, Universitat Autònoma de Barcelona, Bellaterra, 08193, Spain
  - Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain
- Yogesh Paudel
  - Animal Breeding and Genomics Centre, Wageningen University, De Elst 1, Wageningen, 6708 WD, The Netherlands
- Luca Ferretti
  - Center for Research in Agricultural Genomics (CRAG), Campus UAB, Bellaterra, 08193, Spain
- Hendrik-Jan Megens
  - Animal Breeding and Genomics Centre, Wageningen University, De Elst 1, Wageningen, 6708 WD, The Netherlands
- Luis Silió
  - Departamento de Mejora Genética Animal, INIA, Madrid, 28040, Spain
- Martien AM Groenen
  - Animal Breeding and Genomics Centre, Wageningen University, De Elst 1, Wageningen, 6708 WD, The Netherlands
- Miguel Pérez-Enciso
  - Center for Research in Agricultural Genomics (CRAG), Campus UAB, Bellaterra, 08193, Spain
  - Departament de Ciència Animal i dels Aliments, Universitat Autònoma de Barcelona, Bellaterra, 08193, Spain
  - Institut Català de Recerca i Estudis Avançats (ICREA), Carrer de Lluís Companys 23, Barcelona, 08010, Spain
10
Amaral AJ, Ferretti L, Megens HJ, Crooijmans RPMA, Nie H, Ramos-Onsins SE, Perez-Enciso M, Schook LB, Groenen MAM. Genome-wide footprints of pig domestication and selection revealed through massive parallel sequencing of pooled DNA. PLoS One 2011; 6:e14782. [PMID: 21483733 PMCID: PMC3070695 DOI: 10.1371/journal.pone.0014782] [Citation(s) in RCA: 99] [Impact Index Per Article: 7.6] [Received: 07/19/2010] [Accepted: 01/29/2011] [Indexed: 12/21/2022]
Abstract
BACKGROUND: Artificial selection has caused rapid evolution in domesticated species. The identification of selection footprints across domesticated genomes can help uncover the genetic basis of phenotypic diversity.
METHODOLOGY/MAIN FINDINGS: Genome-wide footprints of pig domestication and selection were identified using massive parallel sequencing of pooled reduced representation libraries (RRL) representing ∼2% of the genome from wild boar and four domestic pig breeds (Large White, Landrace, Duroc and Pietrain) which have been under strong selection for muscle development, growth, behavior and coat color. Using specifically developed statistical methods that account for DNA pooling, low mean sequencing depth, and sequencing errors, we provide genome-wide estimates of nucleotide diversity and genetic differentiation in pig. Widespread signals suggestive of positive and balancing selection were found, and the strongest signals were observed in Pietrain, one of the breeds most intensively selected for muscle development. Most signals were population-specific but affected genomic regions which harbored genes for common biological categories including coat color, brain development, muscle development, growth, metabolism, olfaction and immunity. Genetic differentiation in regions harboring genes related to muscle development and growth was higher between breeds than between a given breed and the wild boar.
CONCLUSIONS/SIGNIFICANCE: These results suggest that although domesticated breeds have experienced similar selective pressures, selection has acted upon different genes. This might reflect the multiple domestication events of European breeds or could be the result of subsequent introgression of Asian alleles. Overall, it was estimated that approximately 7% of the porcine genome has been affected by selection events. This study illustrates that the massive parallel sequencing of genomic pools is a cost-effective approach to identify footprints of selection.
Affiliation(s)
- Andreia J. Amaral
  - Animal Breeding and Genomics Centre, Wageningen University, Wageningen, The Netherlands
- Luca Ferretti
  - Department of Animal Science and Food Technology, Universitat Autonoma de Barcelona, Bellaterra, Spain
  - Animal Science Department, Centre for Research in Agricultural Genomics, Bellaterra, Spain
- Hendrik-Jan Megens
  - Animal Breeding and Genomics Centre, Wageningen University, Wageningen, The Netherlands
- Haisheng Nie
  - Animal Breeding and Genomics Centre, Wageningen University, Wageningen, The Netherlands
- Sebastian E. Ramos-Onsins
  - Department of Animal Science and Food Technology, Universitat Autonoma de Barcelona, Bellaterra, Spain
  - Animal Science Department, Centre for Research in Agricultural Genomics, Bellaterra, Spain
- Miguel Perez-Enciso
  - Department of Animal Science and Food Technology, Universitat Autonoma de Barcelona, Bellaterra, Spain
  - Animal Science Department, Centre for Research in Agricultural Genomics, Bellaterra, Spain
  - Life and Medical Sciences, Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain
- Lawrence B. Schook
  - Institute for Genomic Biology, University of Illinois, Urbana, Illinois, United States of America
- Martien A. M. Groenen
  - Animal Breeding and Genomics Centre, Wageningen University, Wageningen, The Netherlands
11
Abstract
Arabidopsis halleri is a pseudometallophyte with a patchy distribution in Europe and is often spread by human activity. To determine the population history and whether this history is consistent with potential human effects, we surveyed nucleotide variation using 24 loci from 12 individuals in a large A. halleri population. The means of total and silent nucleotide variation (theta(W)) are within the range expected for the species. The population genetic neutrality tests Tajima's D and Wall's B had significant composite results rejecting panmixia, and Approximate Bayesian Computation analysis revealed that a subdivision model explained the variation better than the standard neutral model, refugia (or admixture), bottleneck or change of population size models. A categorical regression analysis further supports the subdivision model, and under the subdivision model, the neutrality tests are no longer significant. The best support was for two source populations, a situation consistent with the mixing of two populations possibly mediated by human activity. This scenario might limit the genetic diversity and adaptive potential of the population. The non-neutral population variation described here should be considered in bioinformatic searches for adaptation.
Affiliation(s)
- Andrew J Heidel
- Department of Genetics and Evolution, Max Planck Institute for Chemical Ecology, 07745 Jena, Germany.
|
12
|
Ramos-Onsins SE, Puerma E, Balañá-Alcaide D, Salguero D, Aguadé M. Multilocus analysis of variation using a large empirical data set: phenylpropanoid pathway genes in Arabidopsis thaliana. Mol Ecol 2008; 17:1211-23. [PMID: 18221273 DOI: 10.1111/j.1365-294x.2007.03633.x] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Indexed: 11/27/2022]
Abstract
Detecting the signature of adaptation on nucleotide variation is often difficult in species that, like Arabidopsis thaliana, might have a complex demographic history. Recent re-sequencing surveys in this species provided genome-wide information that would mainly reflect its demographic history. We have used a large empirical data set (LED) as well as multilocus coalescent simulations to analyse sequence variation at loci involved in the phenylpropanoid pathway of this species. We surveyed and examined DNA sequence variation at nine of these loci (about 19.7 kb) in 23 accessions of A. thaliana and one accession of its closely related species Arabidopsis lyrata. Nucleotide variation was lower at nonsynonymous sites than at silent sites in all loci, indicating generalized functional constraint at the protein level. No association between variation and position in the metabolic pathway was detected. When the data were contrasted against the standard neutral model, significant deviations for silent variation were detected with Tajima's D, Fu's F(S) and Fay and Wu's H multilocus test statistics. These deviations were in the same direction as in previous large-scale multilocus analyses, suggesting a genome-wide effect. When the nine-locus data set was contrasted against the large empirical data set, the level (Watterson's theta) and pattern of variation (Tajima's D) detected in these loci did not deviate either at the single-locus or multilocus level from the corresponding empirical distributions. These results would support an important role of the demographic history of A. thaliana in shaping nucleotide variation at the nine studied phenylpropanoid loci. The potential and limitations of the empirical distribution approach are discussed.
Affiliation(s)
- S E Ramos-Onsins
- Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, 08028 Barcelona, Spain.
|
13
|
Balañá-Alcaide D, Ramos-Onsins SE, Boone Q, Aguadé M. Highly structured nucleotide variation within and among Arabidopsis lyrata populations at the FAH1 and DFR gene regions. Mol Ecol 2006; 15:2059-68. [PMID: 16780424 DOI: 10.1111/j.1365-294x.2006.02918.x] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Indexed: 11/30/2022]
Abstract
Nucleotide variation at the FAH1 and DFR gene regions was surveyed in four populations of Arabidopsis lyrata (two European A. l. petraea and two North American A. l. lyrata populations). In contrast to previous results, levels of variation were not consistently lower in A. l. lyrata than in A. l. petraea, and similar degrees of genetic differentiation were detected between and within subspecies. These observations and the significant genetic differentiation detected among populations suggest population substructure and no real subdivision between subspecies. For each gene studied, genotypic data were obtained, which allowed comparing nucleotide diversity within individuals (between sequences from the same individual) and within populations (between sequences from the same population). The generally lower level of variation within than among individuals detected in each population yielded a significant deviation from panmixia within populations. In three of the four populations studied, two highly divergent alleles were detected within populations at the highly variable DFR locus. This pattern and the significant excess of derived variants detected in most populations suggest that most variation segregating within populations results from rare migration events between relatively small and isolated populations exhibiting reduced panmixia.
Affiliation(s)
- D Balañá-Alcaide
- Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, 08028 Barcelona, Spain
|
14
|
Ramos-Onsins SE, Stranger BE, Mitchell-Olds T, Aguadé M. Multilocus Analysis of Variation and Speciation in the Closely Related Species Arabidopsis halleri and A. lyrata. Genetics 2004. [DOI: 10.1093/genetics/167.4.2143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/13/2022]
|
15
|
Abstract
A number of statistical tests for detecting population growth are described. We compared the statistical power of these tests with that of others available in the literature. The tests evaluated fall into three categories: those tests based on the distribution of the mutation frequencies, on the haplotype distribution, and on the mismatch distribution. We found that, for an extensive variety of cases, the most powerful tests for detecting population growth are Fu's F(S) test and the newly developed R(2) test. The behavior of the R(2) test is superior for small sample sizes, whereas F(S) is better for large sample sizes. We also show that some popular statistics based on the mismatch distribution are very conservative.
|