1
|
Qiao G, Xu P, Guo T, He X, Yue Y, Yang B. Genome-wide detection of structural variation in some sheep breeds using whole-genome long-read sequencing data. J Anim Breed Genet 2024; 141:403-414. [PMID: 38247268 DOI: 10.1111/jbg.12846] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 12/21/2023] [Accepted: 12/29/2023] [Indexed: 01/23/2024]
Abstract
Genomic structural variants (SVs) constitute a significant proportion of genetic variation in the genome. The rapid development of long-reads sequencing has facilitated the detection of long-fragment SVs. There is no published study to detect SVs using long-read data from sheep. We applied a long-read mapping approach to detect SVs and characterized a total of 30,771 insertions, deletions, inversions and translocations. We identified 716, 916, 842 and 303 specific SVs in Southdown sheep, Alpine merino sheep, Qilian White Tibetan sheep and Oula sheep, respectively. We annotated these SVs and found that these SV-related genes were primarily enriched in the well-established pathways involved in the regulation of the immune system, growth and development and environmental adaptability. We detected and annotated SVs based on NGS resequencing data to validate the accuracy based on third-generation detection. Moreover, five candidate SVs were verified using the PCR method in 50 sheep. Our study is the first to use a long-reads sequencing approach to construct a novel structural variation map in sheep. We have completed a preliminary exploration of the potential effects of SVs on sheep.
Collapse
Affiliation(s)
- Guoyan Qiao
- Lanzhou Institute of Husbandry and Pharmaceutical Sciences of Chinese Academy of Agricultural Sciences, Lanzhou, China
- College of Ecological Agriculture and Animal Husbandry, Qinghai Communications Technical College, Xining, China
| | - Pan Xu
- State Key Laboratory of Grassland Agro-Ecosystems, Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs, Engineering Research Center of Grassland Industry, Ministry of Education, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, China
| | - Tingting Guo
- Lanzhou Institute of Husbandry and Pharmaceutical Sciences of Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Xue He
- Lanzhou Institute of Husbandry and Pharmaceutical Sciences of Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Yaojing Yue
- Lanzhou Institute of Husbandry and Pharmaceutical Sciences of Chinese Academy of Agricultural Sciences, Lanzhou, China
| | - Bohui Yang
- Lanzhou Institute of Husbandry and Pharmaceutical Sciences of Chinese Academy of Agricultural Sciences, Lanzhou, China
| |
Collapse
|
2
|
Assessment of linkage disequilibrium patterns between structural variants and single nucleotide polymorphisms in three commercial chicken populations. BMC Genomics 2022; 23:193. [PMID: 35264116 PMCID: PMC8908679 DOI: 10.1186/s12864-022-08418-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 02/24/2022] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND Structural variants (SV) are causative for some prominent phenotypic traits of livestock as different comb types in chickens or color patterns in pigs. Their effects on production traits are also increasingly studied. Nevertheless, accurately calling SV remains challenging. It is therefore of interest, whether close-by single nucleotide polymorphisms (SNPs) are in strong linkage disequilibrium (LD) with SVs and can serve as markers. Literature comes to different conclusions on whether SVs are in LD to SNPs on the same level as SNPs to other SNPs. The present study aimed to generate a precise SV callset from whole-genome short-read sequencing (WGS) data for three commercial chicken populations and to evaluate LD patterns between the called SVs and surrounding SNPs. It is thereby the first study that assessed LD between SVs and SNPs in chickens. RESULTS The final callset consisted of 12,294,329 bivariate SNPs, 4,301 deletions (DEL), 224 duplications (DUP), 218 inversions (INV) and 117 translocation breakpoints (BND). While average LD between DELs and SNPs was at the same level as between SNPs and SNPs, LD between other SVs and SNPs was strongly reduced (DUP: 40%, INV: 27%, BND: 19% of between-SNP LD). A main factor for the reduced LD was the presence of local minor allele frequency differences, which accounted for 50% of the difference between SNP - SNP and DUP - SNP LD. This was potentially accompanied by lower genotyping accuracies for DUP, INV and BND compared with SNPs and DELs. An evaluation of the presence of tag SNPs (SNP in highest LD to the variant of interest) further revealed DELs to be slightly less tagged by WGS SNPs than WGS SNPs by other SNPs. This difference, however, was no longer present when reducing the pool of potential tag SNPs to SNPs located on four different chicken genotyping arrays. CONCLUSIONS The results implied that genomic variance due to DELs in the chicken populations studied can be captured by different SNP marker sets as good as variance from WGS SNPs, whereas separate SV calling might be advisable for DUP, INV, and BND effects.
Collapse
|
3
|
Wang K, Hu H, Tian Y, Li J, Scheben A, Zhang C, Li Y, Wu J, Yang L, Fan X, Sun G, Li D, Zhang Y, Han R, Jiang R, Huang H, Yan F, Wang Y, Li Z, Li G, Liu X, Li W, Edwards D, Kang X. The chicken pan-genome reveals gene content variation and a promoter region deletion in IGF2BP1 affecting body size. Mol Biol Evol 2021; 38:5066-5081. [PMID: 34329477 PMCID: PMC8557422 DOI: 10.1093/molbev/msab231] [Citation(s) in RCA: 68] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Domestication and breeding have reshaped the genomic architecture of chicken, but the retention and loss of genomic elements during these evolutionary processes remain unclear. We present the first chicken pan-genome constructed using 664 individuals, which identified an additional ∼66.5 Mb sequences that are absent from the reference genome (GRCg6a). The constructed pan-genome encoded 20,491 predicated protein-coding genes, of which higher expression level are observed in conserved genes relative to dispensable genes. Presence/absence variation (PAV) analyses demonstrated that gene PAV in chicken was shaped by selection, genetic drift, and hybridization. PAV-based GWAS identified numerous candidate mutations related to growth, carcass composition, meat quality, or physiological traits. Among them, a deletion in the promoter region of IGF2BP1 affecting chicken body size is reported, which is supported by functional studies and extra samples. This is the first time to report the causal variant of chicken body size QTL located at chromosome 27 which was repeatedly reported. Therefore, the chicken pan-genome is a useful resource for biological discovery and breeding. It improves our understanding of chicken genome diversity and provides materials to unveil the evolution history of chicken domestication.
Collapse
Affiliation(s)
- Kejun Wang
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Haifei Hu
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Crawley, 6009 WA, Australia
| | - Yadong Tian
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Jingyi Li
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, College of Animal Science and Technology, Huazhong Agricultural University, 430070 Wuhan, Hubei, China
| | - Armin Scheben
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Chenxi Zhang
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Yiyi Li
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Junfeng Wu
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Lan Yang
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Xuewei Fan
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Guirong Sun
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Donghua Li
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Yanhua Zhang
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Ruili Han
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Ruirui Jiang
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Hetian Huang
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Fengbin Yan
- Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Yanbin Wang
- Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Zhuanjian Li
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Guoxi Li
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Xiaojun Liu
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - Wenting Li
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| | - David Edwards
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Crawley, 6009 WA, Australia
| | - Xiangtao Kang
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou 450046, China.,Henan Key laboratory for innovation and utilization of chicken germplasm resources,Zhengzhou, 450046, China
| |
Collapse
|
4
|
Identification of Copy Number Variation in Domestic Chicken Using Whole-Genome Sequencing Reveals Evidence of Selection in the Genome. Animals (Basel) 2019; 9:ani9100809. [PMID: 31618984 PMCID: PMC6826909 DOI: 10.3390/ani9100809] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Revised: 10/13/2019] [Accepted: 10/14/2019] [Indexed: 12/12/2022] Open
Abstract
Simple Summary Chickens have been bred for meat and egg production as a source of animal protein. With the increase of productivity as the main purpose of domestication, factors such as metabolism and immunity were boosted, which are detectable signs of selection on the genome. This study focused on copy number variation (CNV) to find evidence of domestication on the genome. CNV was detected from whole-genome sequencing of 65 chickens including Red Jungle Fowl, broilers, and layers. After that, CNV region, the overlapping region of CNV between individuals, was made to identify which genomic regions showed copy number differentiation. The 663 domesticated-specific CNV regions were associated with various functions such as metabolism and organ development. Also, by performing population differentiation analyses such as clustering analysis and ANOVA test, we found that there are a lot of genomic regions with different copy number patterns between broilers and layers. This result indicates that different genetic variations can be found, depending on the purpose of artificial selection and provides considerations for future animal breeding. Abstract Copy number variation (CNV) has great significance both functionally and evolutionally. Various CNV studies are in progress to find the cause of human disease and to understand the population structure of livestock. Recent advances in next-generation sequencing (NGS) technology have made CNV detection more reliable and accurate at whole-genome level. However, there is a lack of CNV studies on chickens using NGS. Therefore, we obtained whole-genome sequencing data of 65 chickens including Red Jungle Fowl, Cornish (broiler), Rhode Island Red (hybrid), and White Leghorn (layer) from the public databases for CNV region (CNVR) detection. Using CNVnator, a read-depth based software, a total of 663 domesticated-specific CNVRs were identified across autosomes. Gene ontology analysis of genes annotated in CNVRs showed that mainly enriched terms involved in organ development, metabolism, and immune regulation. Population analysis revealed that CN and RIR are closer to each other than WL, and many genes (LOC772271, OR52R1, RD3, ADH6, TLR2B, PRSS2, TPK1, POPDC3, etc.) with different copy numbers between breeds found. In conclusion, this study has helped to understand the genetic characteristics of domestic chickens at CNV level, which may provide useful information for the development of breeding systems in chickens.
Collapse
|
5
|
Functional genomics in chicken (Gallus gallus) - status and implications in poultry. WORLD POULTRY SCI J 2019. [DOI: 10.1017/s004393391400004x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
6
|
Liu C, Ran X, Yu C, Xu Q, Niu X, Zhao P, Wang J. Whole-genome analysis of structural variations between Xiang pigs with larger litter sizes and those with smaller litter sizes. Genomics 2018; 111:310-319. [PMID: 29481841 DOI: 10.1016/j.ygeno.2018.02.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2017] [Revised: 02/08/2018] [Accepted: 02/11/2018] [Indexed: 11/30/2022]
Abstract
To gain a better knowledge of structural variations (SVs) in Xiang pig, we used next-generation sequencing to analyze the Xiang pigs with larger (XL) or smaller litter sizes (XS). Our analysis yielded 28,040 putative SVs in the Xiang pig. These SVs distributed throughout all of chromosomes. Some functional regions including exons and untranslated regions were less varied than introns and intergenic regions. We detected 4637 and 4119 specific SVs, which contained 1697 and 1582 genes in XL and XS group, respectively. These genes were mainly enriched in the well-known pathways involved in development and reproduction processes. Population validation was carried out on 50 SVs candidates using PCR method in 144 Xiang pig crowds. All of 50 SVs were confirmed by PCR method and 14 SVs were associated with the litter size of Xiang pigs. These results may be helpful for the elucidation of growth and reproduction regulation in Xiang pig.
Collapse
Affiliation(s)
- Chang Liu
- Institute of Agro-Bioengineering, College of Animal Science, Guizhou University, Guiyang 550025, China
| | - Xueqin Ran
- Institute of Agro-Bioengineering, College of Animal Science, Guizhou University, Guiyang 550025, China.
| | - Changyan Yu
- Institute of Agro-Bioengineering, College of Animal Science, Guizhou University, Guiyang 550025, China
| | - Qian Xu
- Institute of Agro-Bioengineering, College of Animal Science, Guizhou University, Guiyang 550025, China
| | - Xi Niu
- Institute of Agro-Bioengineering, College of Animal Science, Guizhou University, Guiyang 550025, China
| | - Pengju Zhao
- College of Animal Science and Technology, China Agricultural University, Beijing 100083, China
| | - Jiafu Wang
- Institute of Agro-Bioengineering, College of Animal Science, Guizhou University, Guiyang 550025, China; Tongren University, Tongren 554300, China.
| |
Collapse
|
7
|
Wang Y, Cao X, Zhao Y, Fei J, Hu X, Li N. Optimized double-digest genotyping by sequencing (ddGBS) method with high-density SNP markers and high genotyping accuracy for chickens. PLoS One 2017; 12:e0179073. [PMID: 28598985 PMCID: PMC5466311 DOI: 10.1371/journal.pone.0179073] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2017] [Accepted: 05/23/2017] [Indexed: 12/04/2022] Open
Abstract
High-density single nucleotide polymorphism (SNP) markers are crucial to improve the resolution and accuracy of genome-wide association study (GWAS) and genomic selection (GS). Numerous approaches, including whole genome sequencing, genome sampling sequencing, and SNP chips are able to discover or genotype markers at different densities and costs. Achieving an optimal balance between sequencing resolution and budgets, especially in large-scale population genetics research, constitutes a major challenge. Here, we performed improved double-enzyme digestion genotyping by sequencing (ddGBS) on chicken. We evaluated eight double-enzyme digestion combinations, and EcoR I- Mse I was chosen as the optimal combination for the chicken genome. We firstly proposed that two parameters, optimal read-count point (ORP) and saturated read-count point (SRP), could be utilized to determine the optimal sequencing volume. A total of 291,772 high-density SNPs from 824 animals were identified. By validation using the SNP chip, we found that the consistency between ddGBS data and the SNP chip is over 99%. The approach that we developed in chickens, which is high-quality, high-density, cost-effective (300 K, $30/sample), and time-saving (within 48 h), will have broad applications in animal breeding programs.
Collapse
Affiliation(s)
- Yuzhe Wang
- State Key Laboratories of Agro-biotechnology, College of Biological Science, China Agricultural University, Beijing, China
| | - Xuemin Cao
- State Key Laboratories of Agro-biotechnology, College of Biological Science, China Agricultural University, Beijing, China
| | - Yiqiang Zhao
- State Key Laboratories of Agro-biotechnology, College of Biological Science, China Agricultural University, Beijing, China
| | - Jing Fei
- State Key Laboratories of Agro-biotechnology, College of Biological Science, China Agricultural University, Beijing, China
| | - Xiaoxiang Hu
- State Key Laboratories of Agro-biotechnology, College of Biological Science, China Agricultural University, Beijing, China
- * E-mail:
| | - Ning Li
- State Key Laboratories of Agro-biotechnology, College of Biological Science, China Agricultural University, Beijing, China
- National Engineering Laboratory for Animal Breeding, China Agricultural University, Beijing, China
| |
Collapse
|
8
|
Structural Variant Detection by Large-scale Sequencing Reveals New Evolutionary Evidence on Breed Divergence between Chinese and European Pigs. Sci Rep 2016; 6:18501. [PMID: 26729041 PMCID: PMC4700453 DOI: 10.1038/srep18501] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2015] [Accepted: 11/19/2015] [Indexed: 01/28/2023] Open
Abstract
In this study, we performed a genome-wide SV detection among the genomes of thirteen pigs from diverse Chinese and European originated breeds by next genetation sequencing, and constrcuted a single-nucleotide resolution map involving 56,930 putative SVs. We firstly identified a SV hotspot spanning 35 Mb region on the X chromosome specifically in the genomes of Chinese originated individuals. Further scrutinizing this region by large-scale sequencing data of extra 111 individuals, we obtained the confirmatory evidence on our initial finding. Moreover, thirty five SV-related genes within the hotspot region, being of importance for reproduction ability, rendered significant different evolution rates between Chinese and European originated breeds. The SV hotspot identified herein offers a novel evidence for assessing phylogenetic relationships, as well as likely explains the genetic difference of corresponding phenotypes and features, among Chinese and European pig breeds. Furthermore, we employed various SVs to infer genetic structure of individuls surveyed. We found SVs can clearly detect the difference of genetic background among individuals. This clues us that genome-wide SVs can capture majority of geneic variation and be applied into cladistic analyses. Characterizing whole genome SVs demonstrated that SVs are significantly enriched/depleted with various genomic features.
Collapse
|
9
|
aCGH Analysis to Estimate Genetic Variations among Domesticated Chickens. BIOMED RESEARCH INTERNATIONAL 2016; 2016:1794329. [PMID: 27525263 PMCID: PMC4972930 DOI: 10.1155/2016/1794329] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/11/2016] [Accepted: 06/20/2016] [Indexed: 11/29/2022]
Abstract
Chickens have been familiar to humans since ancient times and have been used not only for culinary purposes but also for cultural purposes including ritual ceremonies and traditional entertainment. The various chicken breeds developed for these purposes often display distinct morphological and/or behavioural traits. For example, the Japanese Shamo is larger and more aggressive than other domesticated chickens, reflecting its role as a fighting cock breed, whereas Japanese Naganakidori breeds, which have long-crowing behaviour, were bred instead for their entertaining and aesthetic qualities. However, the genetic backgrounds of these distinct morphological and behavioural traits remain unclear. Therefore, the question arises as to which genomic regions in these chickens were acted upon by selective pressures through breeding. We compared the entire genomes of six chicken breeds domesticated for various cultural purposes by utilizing array comparative genomic hybridization. From these analyses, we identified 782 regions that underwent insertions, deletions, or mutations, representing man-made selection pressure in these chickens. Furthermore, we found that a number of genes diversified in domesticated chickens bred for cultural or entertainment purposes were different from those diversified in chickens bred for food, such as broilers and layers.
Collapse
|
10
|
Jiang Z, Wang H, Michal JJ, Zhou X, Liu B, Woods LCS, Fuchs RA. Genome Wide Sampling Sequencing for SNP Genotyping: Methods, Challenges and Future Development. Int J Biol Sci 2016; 12:100-8. [PMID: 26722221 PMCID: PMC4679402 DOI: 10.7150/ijbs.13498] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2015] [Accepted: 11/07/2015] [Indexed: 12/04/2022] Open
Abstract
Genetic polymorphisms, particularly single nucleotide polymorphisms (SNPs), have been widely used to advance quantitative, functional and evolutionary genomics. Ideally, all genetic variants among individuals should be discovered when next generation sequencing (NGS) technologies and platforms are used for whole genome sequencing or resequencing. In order to improve the cost-effectiveness of the process, however, the research community has mainly focused on developing genome-wide sampling sequencing (GWSS) methods, a collection of reduced genome complexity sequencing, reduced genome representation sequencing and selective genome target sequencing. Here we review the major steps involved in library preparation, the types of adapters used for ligation and the primers designed for amplification of ligated products for sequencing. Unfortunately, currently available GWSS methods have their drawbacks, such as inconsistency in the number of reads per sample library, the number of sites/targets per individual, and the number of reads per site/target, all of which result in missing data. Suggestions are proposed here to improve library construction, genotype calling accuracy, genome-wide marker density and read mapping rate. In brief, optimized GWSS library preparation should generate a unique set of target sites with dense distribution along chromosomes and even coverage per site across all individuals.
Collapse
Affiliation(s)
- Zhihua Jiang
- 1. Department of Animal Sciences, Washington State University, Pullman, WA 99164-7620, USA
| | - Hongyang Wang
- 1. Department of Animal Sciences, Washington State University, Pullman, WA 99164-7620, USA; ; 2. Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education and The Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, China
| | - Jennifer J Michal
- 1. Department of Animal Sciences, Washington State University, Pullman, WA 99164-7620, USA
| | - Xiang Zhou
- 1. Department of Animal Sciences, Washington State University, Pullman, WA 99164-7620, USA
| | - Bang Liu
- 2. Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education and The Cooperative Innovation Center for Sustainable Pig Production, Huazhong Agricultural University, Wuhan, China
| | - Leah C Solberg Woods
- 3. Department of Pediatrics, Human and Molecular Genetics Center and Children's Research Institute, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Rita A Fuchs
- 4. Department of Integrative Physiology and Neuroscience, Washington State University College of Veterinary Medicine, Pullman, WA 99164-7620, USA
| |
Collapse
|
11
|
Liao R, Wang Z, Chen Q, Tu Y, Chen Z, Wang Q, Yang C, Zhang X, Pan Y. An Efficient Genotyping Method in Chicken Based on Genome Reducing and Sequencing. PLoS One 2015; 10:e0137010. [PMID: 26313744 PMCID: PMC4551734 DOI: 10.1371/journal.pone.0137010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2015] [Accepted: 08/11/2015] [Indexed: 01/21/2023] Open
Abstract
Single nucleotide polymorphisms (SNPs) are essential for identifying the genetic mechanisms of complex traits. In the present study, we applied genotyping by genome reducing and sequencing (GGRS) method to construct a 252-plex sequencing library for SNP discovery and genotyping in chicken. The library was successfully sequenced on an Illumina HiSeq 2500 sequencer with a paired-end pattern; approximately 400 million raw reads were generated, and an average of approximately 1.4 million good reads per sample were generated. A total of 91,767 SNPs were identified after strict filtering, and all of the 252 samples and all of the chromosomes were well represented. Compared with the Illumina 60K chicken SNP chip data, approximately 34,131 more SNPs were identified using GGRS, and a higher SNP density was found using GGRS, which could be beneficial for downstream analysis. Using the GGRS method, more than 3528 samples can be sequenced simultaneously, and the cost is reduced to $18 per sample. To the best of our knowledge, this study describes the first report of such highly multiplexed sequencing in chicken, indicating potential applications for genome-wide association and genomic selection in chicken.
Collapse
Affiliation(s)
- Rongrong Liao
- School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Zhen Wang
- School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Qiang Chen
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, Yunnan, China
| | - Yingying Tu
- National Poultry Engineering Research Center, Animal Husbandry and Veterinary Research Institute, Shanghai Academy of Agricultural Sciences, Shanghai, China
| | - Zhenliang Chen
- School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Qishan Wang
- School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China; Key Laboratory of Veterinary Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Changsuo Yang
- National Poultry Engineering Research Center, Animal Husbandry and Veterinary Research Institute, Shanghai Academy of Agricultural Sciences, Shanghai, China
| | - Xiangzhe Zhang
- School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China; Key Laboratory of Veterinary Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Yuchun Pan
- School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China; Key Laboratory of Veterinary Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|
12
|
Bickhart DM, Liu GE. The challenges and importance of structural variation detection in livestock. Front Genet 2014; 5:37. [PMID: 24600474 PMCID: PMC3927395 DOI: 10.3389/fgene.2014.00037] [Citation(s) in RCA: 83] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2013] [Accepted: 01/31/2014] [Indexed: 01/25/2023] Open
Abstract
Recent studies in humans and other model organisms have demonstrated that structural variants (SVs) comprise a substantial proportion of variation among individuals of each species. Many of these variants have been linked to debilitating diseases in humans, thereby cementing the importance of refining methods for their detection. Despite progress in the field, reliable detection of SVs still remains a problem even for human subjects. Many of the underlying problems that make SVs difficult to detect in humans are amplified in livestock species, whose lower quality genome assemblies and incomplete gene annotation can often give rise to false positive SV discoveries. Regardless of the challenges, SV detection is just as important for livestock researchers as it is for human researchers, given that several productive traits and diseases have been linked to copy number variations (CNVs) in cattle, sheep, and pig. Already, there is evidence that many beneficial SVs have been artificially selected in livestock such as a duplication of the agouti signaling protein gene that causes white coat color in sheep. In this review, we will list current SV and CNV discoveries in livestock and discuss the problems that hinder routine discovery and tracking of these polymorphisms. We will also discuss the impacts of selective breeding on CNV and SV frequencies and mention how SV genotyping could be used in the future to improve genetic selection.
Collapse
Affiliation(s)
- Derek M Bickhart
- Animal Improvement Programs Laboratory, United States Department of Agriculture-Agricultural Research Service Beltsville, MD, USA
| | - George E Liu
- Bovine Functional Genomics Laboratory, United States Department of Agriculture-Agricultural Research Service Beltsville, MD, USA
| |
Collapse
|
13
|
Dalloul RA, Zimin AV, Settlage RE, Kim S, Reed KM. Next-generation sequencing strategies for characterizing the turkey genome. Poult Sci 2014; 93:479-84. [DOI: 10.3382/ps.2013-03560] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
|
14
|
Greminger MP, Stölting KN, Nater A, Goossens B, Arora N, Bruggmann R, Patrignani A, Nussberger B, Sharma R, Kraus RHS, Ambu LN, Singleton I, Chikhi L, van Schaik CP, Krützen M. Generation of SNP datasets for orangutan population genomics using improved reduced-representation sequencing and direct comparisons of SNP calling algorithms. BMC Genomics 2014; 15:16. [PMID: 24405840 PMCID: PMC3897891 DOI: 10.1186/1471-2164-15-16] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2013] [Accepted: 12/21/2013] [Indexed: 12/30/2022] Open
Abstract
Background High-throughput sequencing has opened up exciting possibilities in population and conservation genetics by enabling the assessment of genetic variation at genome-wide scales. One approach to reduce genome complexity, i.e. investigating only parts of the genome, is reduced-representation library (RRL) sequencing. Like similar approaches, RRL sequencing reduces ascertainment bias due to simultaneous discovery and genotyping of single-nucleotide polymorphisms (SNPs) and does not require reference genomes. Yet, generating such datasets remains challenging due to laboratory and bioinformatical issues. In the laboratory, current protocols require improvements with regards to sequencing homologous fragments to reduce the number of missing genotypes. From the bioinformatical perspective, the reliance of most studies on a single SNP caller disregards the possibility that different algorithms may produce disparate SNP datasets. Results We present an improved RRL (iRRL) protocol that maximizes the generation of homologous DNA sequences, thus achieving improved genotyping-by-sequencing efficiency. Our modifications facilitate generation of single-sample libraries, enabling individual genotype assignments instead of pooled-sample analysis. We sequenced ~1% of the orangutan genome with 41-fold median coverage in 31 wild-born individuals from two populations. SNPs and genotypes were called using three different algorithms. We obtained substantially different SNP datasets depending on the SNP caller. Genotype validations revealed that the Unified Genotyper of the Genome Analysis Toolkit and SAMtools performed significantly better than a caller from CLC Genomics Workbench (CLC). Of all conflicting genotype calls, CLC was only correct in 17% of the cases. Furthermore, conflicting genotypes between two algorithms showed a systematic bias in that one caller almost exclusively assigned heterozygotes, while the other one almost exclusively assigned homozygotes. Conclusions Our enhanced iRRL approach greatly facilitates genotyping-by-sequencing and thus direct estimates of allele frequencies. Our direct comparison of three commonly used SNP callers emphasizes the need to question the accuracy of SNP and genotype calling, as we obtained considerably different SNP datasets depending on caller algorithms, sequencing depths and filtering criteria. These differences affected scans for signatures of natural selection, but will also exert undue influences on demographic inferences. This study presents the first effort to generate a population genomic dataset for wild-born orangutans with known population provenance.
Collapse
Affiliation(s)
- Maja P Greminger
- Evolutionary Genetics Group, Anthropological Institute and Museum, University of Zurich, Zurich, Switzerland.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Fan WL, Ng CS, Chen CF, Lu MYJ, Chen YH, Liu CJ, Wu SM, Chen CK, Chen JJ, Mao CT, Lai YT, Lo WS, Chang WH, Li WH. Genome-wide patterns of genetic variation in two domestic chickens. Genome Biol Evol 2013; 5:1376-92. [PMID: 23814129 PMCID: PMC3730349 DOI: 10.1093/gbe/evt097] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Domestic chickens are excellent models for investigating the genetic basis of phenotypic diversity, as numerous phenotypic changes in physiology, morphology, and behavior in chickens have been artificially selected. Genomic study is required to study genome-wide patterns of DNA variation for dissecting the genetic basis of phenotypic traits. We sequenced the genomes of the Silkie and the Taiwanese native chicken L2 at ∼23- and 25-fold average coverage depth, respectively, using Illumina sequencing. The reads were mapped onto the chicken reference genome (including 5.1% Ns) to 92.32% genome coverage for the two breeds. Using a stringent filter, we identified ∼7.6 million single-nucleotide polymorphisms (SNPs) and 8,839 copy number variations (CNVs) in the mapped regions; 42% of the SNPs have not found in other chickens before. Among the 68,906 SNPs annotated in the chicken sequence assembly, 27,852 were nonsynonymous SNPs located in 13,537 genes. We also identified hundreds of shared and divergent structural and copy number variants in intronic and intergenic regions and in coding regions in the two breeds. Functional enrichments of identified genetic variants were discussed. Radical nsSNP-containing immunity genes were enriched in the QTL regions associated with some economic traits for both breeds. Moreover, genetic changes involved in selective sweeps were detected. From the selective sweeps identified in our two breeds, several genes associated with growth, appetite, and metabolic regulation were identified. Our study provides a framework for genetic and genomic research of domestic chickens and facilitates the domestic chicken as an avian model for genomic, biomedical, and evolutionary studies.
Collapse
Affiliation(s)
- Wen-Lang Fan
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Identification of null alleles and deletions from SNP genotypes for an intercross between domestic and wild chickens. G3-GENES GENOMES GENETICS 2013; 3:1253-60. [PMID: 23708300 PMCID: PMC3737165 DOI: 10.1534/g3.113.006643] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Abstract
We analyzed genotypes from ~10K single-nucleotide polymorphisms (SNPs) in two families of an F2 intercross between Red Junglefowl and White Leghorn chickens. Possible null alleles were found by patterns of incompatible and missing genotypes. We estimated that 2.6% of SNPs had null alleles compared with 2.3% with genotyping errors and that 40% of SNPs in which a parent and offspring were genotyped as different homozygotes had null alleles. Putative deletions were identified by null alleles at adjacent markers. We found two candidate deletions that were supported by fluorescence intensity data from a 60K SNP chip. One of the candidate deletions was from the Red Junglefowl, and one was present in both the Red Junglefowl and White Leghorn. Both candidate deletions spanned protein-coding regions and were close to a previously detected quantitative trait locus affecting body weight in this population. This study demonstrates that the ~50K SNP genotyping arrays now available for several agricultural species can be used to identify null alleles and deletions in data from large families. We suggest that our approach could be a useful complement to linkage analysis in experimental crosses.
Collapse
|
17
|
A complex genomic rearrangement involving the endothelin 3 locus causes dermal hyperpigmentation in the chicken. PLoS Genet 2011; 7:e1002412. [PMID: 22216010 PMCID: PMC3245302 DOI: 10.1371/journal.pgen.1002412] [Citation(s) in RCA: 98] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2011] [Accepted: 10/22/2011] [Indexed: 02/03/2023] Open
Abstract
Dermal hyperpigmentation or Fibromelanosis (FM) is one of the few examples of skin pigmentation phenotypes in the chicken, where most other pigmentation variants influence feather color and patterning. The Silkie chicken is the most widespread and well-studied breed displaying this phenotype. The presence of the dominant FM allele results in extensive pigmentation of the dermal layer of skin and the majority of internal connective tissue. Here we identify the causal mutation of FM as an inverted duplication and junction of two genomic regions separated by more than 400 kb in wild-type individuals. One of these duplicated regions contains endothelin 3 (EDN3), a gene with a known role in promoting melanoblast proliferation. We show that EDN3 expression is increased in the developing Silkie embryo during the time in which melanoblasts are migrating, and elevated levels of expression are maintained in the adult skin tissue. We have examined four different chicken breeds from both Asia and Europe displaying dermal hyperpigmentation and conclude that the same structural variant underlies this phenotype in all chicken breeds. This complex genomic rearrangement causing a specific monogenic trait in the chicken illustrates how novel mutations with major phenotypic effects have been reused during breed formation in domestic animals. The process of animal domestication has been a long and ongoing effort of the human race to cultivate beneficial traits in agriculturally productive or otherwise beneficial species. We are just now beginning to understand the effect this type of selection pressure has had on genetic variation and overall genome architecture using quickly advancing modern genetic and genomic technologies. Here we show how along the path of animal domestication a single large rearrangement involving a duplication and inversion of two distinct regions of the chicken genome occurred, likely disrupting long-range cis-regulatory elements of endothelin 3 (EDN3) and resulting in a very extreme skin pigmentation phenotype. Dermal hyperpigmentation, or Fibromelanosis (FM), is a defining characteristic of the Silkie chicken breed, which originates in China. Chickens very similar to the Silkie have been described in ancient Chinese texts on traditional medicine, illustrating how unique phenotypes in domesticated animals are incorporated into human culture and tradition that persists to this day. The presence of the same rearrangement in other FM chicken breeds found around the world highlights both the causality of this mutation as well as how humans serve to spread genetic variation linked to novel traits in domestic animals.
Collapse
|
18
|
Genome-wide genetic marker discovery and genotyping using next-generation sequencing. Nat Rev Genet 2011; 12:499-510. [PMID: 21681211 DOI: 10.1038/nrg3012] [Citation(s) in RCA: 1438] [Impact Index Per Article: 110.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The advent of next-generation sequencing (NGS) has revolutionized genomic and transcriptomic approaches to biology. These new sequencing tools are also valuable for the discovery, validation and assessment of genetic markers in populations. Here we review and discuss best practices for several NGS methods for genome-wide genetic marker development and genotyping that use restriction enzyme digestion of target genomes to reduce the complexity of the target. These new methods -- which include reduced-representation sequencing using reduced-representation libraries (RRLs) or complexity reduction of polymorphic sequences (CRoPS), restriction-site-associated DNA sequencing (RAD-seq) and low coverage genotyping -- are applicable to both model organisms with high-quality reference genome sequences and, excitingly, to non-model species with no existing genomic data.
Collapse
|