1
Wang Y, Gou Y, Yuan R, Zou Q, Zhang X, Zheng T, Fei K, Shi R, Zhang M, Li Y, Gong Z, Luo C, Xiong Y, Shan D, Wei C, Shen L, Tang G, Li M, Zhu L, Li X, Jiang Y. A chromosome-level genome of Chenghua pig provides new insights into the domestication and local adaptation of pigs. Int J Biol Macromol 2024; 270:131796. [PMID: 38677688 DOI: 10.1016/j.ijbiomac.2024.131796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/26/2023] [Revised: 03/24/2024] [Accepted: 04/04/2024] [Indexed: 04/29/2024]
Abstract
Although China has abundant pig genetic resources, the domestication history of pigs in China and the adaptive evolution of Chinese pig breeds at different latitudes have rarely been elucidated at the genome-wide level. To fill this gap, we first assembled a high-quality chromosome-level genome of the Chenghua pig and used it as a benchmark to analyse the genomes of 272 samples from three genera across three continents. The divergence of the three species belonging to three genera, Phacochoerus africanus, Potamochoerus porcus, and Sus scrofa, was assessed. Introgression among pig breeds redefined the migration routes as running mainly from southern China to central and southwestern China, then to eastern China and northern China, and finally to Europe. The domestication of pigs in China occurred ∼12,000 years ago, earlier than the available Chinese archaeological evidence of domestication. In addition, FBN1 and NR6A1 were identified in our study as candidate genes related to extreme skin thickness differences in Eurasian pig breeds and adaptive evolution at different latitudes in Chinese pig breeds, respectively. Our study provides a new resource for the pig genomic pool and refines our understanding of pig genetic diversity, domestication, migration, and adaptive evolution at different latitudes.
Affiliation(s)
- Yifei Wang
- Department of Zoology, College of Life Science, Sichuan Agricultural University, Ya'an, Sichuan 625014, China
- Yuwei Gou
- College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan 611130, China
- Rong Yuan
- Chengdu Livestock and Poultry Genetic Resources Protection Center, Chengdu, Sichuan 610081, China
- Qin Zou
- Department of Zoology, College of Life Science, Sichuan Agricultural University, Ya'an, Sichuan 625014, China
- Xukun Zhang
- Academy for Engineering and Technology, Fudan University, Shanghai 200433, China
- Ting Zheng
- Department of Zoology, College of Life Science, Sichuan Agricultural University, Ya'an, Sichuan 625014, China
- Kaixin Fei
- Department of Zoology, College of Life Science, Sichuan Agricultural University, Ya'an, Sichuan 625014, China
- Rui Shi
- Department of Zoology, College of Life Science, Sichuan Agricultural University, Ya'an, Sichuan 625014, China
- Mei Zhang
- Department of Zoology, College of Life Science, Sichuan Agricultural University, Ya'an, Sichuan 625014, China
- Yujing Li
- Department of Zoology, College of Life Science, Sichuan Agricultural University, Ya'an, Sichuan 625014, China
- Zhengyin Gong
- Department of Zoology, College of Life Science, Sichuan Agricultural University, Ya'an, Sichuan 625014, China
- Chenggang Luo
- Chengdu Livestock and Poultry Genetic Resources Protection Center, Chengdu, Sichuan 610081, China
- Ying Xiong
- Department of Zoology, College of Life Science, Sichuan Agricultural University, Ya'an, Sichuan 625014, China
- Dai Shan
- BGI Genomics, BGI-Shenzhen, Shenzhen 518083, China
- Chenyang Wei
- BGI Genomics, BGI-Shenzhen, Shenzhen 518083, China
- Linyuan Shen
- College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan 611130, China
- Guoqing Tang
- College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan 611130, China
- Mingzhou Li
- College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan 611130, China
- Li Zhu
- College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan 611130, China
- Xuewei Li
- College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan 611130, China
- Yanzhi Jiang
- Department of Zoology, College of Life Science, Sichuan Agricultural University, Ya'an, Sichuan 625014, China
2
Bhati M, Mapel XM, Lloret-Villas A, Pausch H. Structural variants and short tandem repeats impact gene expression and splicing in bovine testis tissue. Genetics 2023; 225:iyad161. [PMID: 37655920 PMCID: PMC10627265 DOI: 10.1093/genetics/iyad161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/05/2023] [Revised: 06/05/2023] [Accepted: 08/24/2023] [Indexed: 09/02/2023] Open
Abstract
Structural variants (SVs) and short tandem repeats (STRs) are significant sources of genetic variation. However, the impacts of these variants on gene regulation have not been investigated in cattle. Here, we genotyped and characterized 19,408 SVs and 374,821 STRs in 183 bovine genomes and investigated their impact on molecular phenotypes derived from testis transcriptomes. We found that 71% of STRs were multiallelic. The vast majority (95%) of STRs and SVs were in intergenic and intronic regions. Only 37% of SVs and 40% of STRs were in high linkage disequilibrium (LD) (R² > 0.8) with surrounding SNPs/insertions and deletions (Indels), indicating that SNP-based association testing and genomic prediction are blind to a nonnegligible portion of genetic variation. We showed that both SVs and STRs were more than 2-fold enriched among expression and splicing QTL (e/sQTL) relative to SNPs/Indels and were often associated with differential expression and splicing of multiple genes. Deletions and duplications had larger impacts on splicing and expression than any other type of SV. Exonic duplications predominantly increased gene expression either through alternative splicing or other mechanisms, whereas expression- and splicing-associated STRs primarily resided in intronic regions and exhibited bimodal effects on the molecular phenotypes investigated. Most e/sQTL resided within 100 kb of the affected genes or splicing junctions. We pinpoint candidate causal STRs and SVs associated with the expression of SLC13A4 and TTC7B and alternative splicing of a lncRNA and CAPP1. We provide a catalog of STRs and SVs for taurine cattle and show that these variants contribute substantially to gene expression and splicing variation.
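The LD criterion in this abstract (R² > 0.8 between an SV or STR and nearby SNPs/Indels) can be illustrated with a minimal sketch. It assumes biallelic genotypes coded as 0/1/2 alternate-allele dosages and uses the squared Pearson correlation of dosages as the LD measure; the abstract does not state how the authors computed LD, and the names `ld_r2` and `is_tagged` are hypothetical.

```python
from statistics import mean


def ld_r2(g1, g2):
    """Squared Pearson correlation between two genotype dosage vectors.

    Assumption: genotypes are coded 0/1/2 (alternate-allele dosage) for the
    same individuals in the same order. This is an illustration, not
    necessarily the estimator used in the cited study.
    """
    m1, m2 = mean(g1), mean(g2)
    cov = sum((a - m1) * (b - m2) for a, b in zip(g1, g2))
    var1 = sum((a - m1) ** 2 for a in g1)
    var2 = sum((b - m2) ** 2 for b in g2)
    if var1 == 0 or var2 == 0:
        return 0.0  # a monomorphic variant carries no LD information
    return cov * cov / (var1 * var2)


def is_tagged(variant, nearby_snps, threshold=0.8):
    """True if at least one nearby SNP exceeds the LD threshold."""
    return any(ld_r2(variant, snp) > threshold for snp in nearby_snps)


# Hypothetical dosages for six animals.
sv = [0, 1, 2, 1, 0, 2]
snp_linked = [2, 1, 0, 1, 2, 0]    # mirrors the SV (alleles swapped)
snp_unlinked = [1, 0, 1, 2, 1, 1]  # uncorrelated with the SV

print(is_tagged(sv, [snp_linked]))    # True
print(is_tagged(sv, [snp_unlinked]))  # False
```

A variant for which `is_tagged` returns False against every surrounding SNP is the kind of variation that, per the abstract, SNP-based association testing would miss.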
Affiliation(s)
- Meenu Bhati
- Animal Genomics, ETH Zurich, Universitaetstrasse 2, 8092, Zurich, Switzerland
- Xena Marie Mapel
- Animal Genomics, ETH Zurich, Universitaetstrasse 2, 8092, Zurich, Switzerland
- Hubert Pausch
- Animal Genomics, ETH Zurich, Universitaetstrasse 2, 8092, Zurich, Switzerland
3
Shi Y, Niu Y, Zhang P, Luo H, Liu S, Zhang S, Wang J, Li Y, Liu X, Song T, Xu T, He S. Characterization of genome-wide STR variation in 6487 human genomes. Nat Commun 2023; 14:2092. [PMID: 37045857 PMCID: PMC10097659 DOI: 10.1038/s41467-023-37690-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Received: 08/04/2022] [Accepted: 03/27/2023] [Indexed: 04/14/2023] Open
Abstract
Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing depth. Here, we depicted a comprehensive map of 366,013 polymorphic STRs (pSTRs) constructed from 6487 deeply sequenced genomes, comprising 3983 Chinese samples (~31.5x, NyuWa) and 2504 samples from the 1000 Genomes Project (~33.3x, 1KGP). We found that STR mutations were affected by motif length, chromosome context and epigenetic features. We identified 3273 and 1117 pSTRs whose repeat numbers were associated with gene expression and 3'UTR alternative polyadenylation, respectively. We also implemented population analysis, investigated population differentiated signatures, and genotyped 60 known disease-causing STRs. Overall, this study further extends the scale of STR variation in humans and propels our understanding of the semantics of STRs.
Affiliation(s)
- Yirong Shi
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Yiwei Niu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
- Peng Zhang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- Huaxia Luo
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- Shuai Liu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
- Sijia Zhang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
- Jiajia Wang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- Yanyan Li
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- Xinyue Liu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Tingrui Song
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- Tao Xu
- National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- Shandong First Medical University & Shandong Academy of Medical Sciences, Jinan, 250117, Shandong, China
- Shunmin He
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
4
Verbiest M, Maksimov M, Jin Y, Anisimova M, Gymrek M, Bilgin Sonay T. Mutation and selection processes regulating short tandem repeats give rise to genetic and phenotypic diversity across species. J Evol Biol 2023; 36:321-336. [PMID: 36289560 PMCID: PMC9990875 DOI: 10.1111/jeb.14106] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Received: 02/08/2022] [Revised: 06/29/2022] [Accepted: 08/01/2022] [Indexed: 02/03/2023]
Abstract
Short tandem repeats (STRs) are units of 1-6 bp that repeat in a tandem fashion in DNA. Along with single nucleotide polymorphisms and large structural variations, they are among the major genomic variants underlying genetic, and likely phenotypic, divergence. STRs experience mutation rates that are orders of magnitude higher than other well-studied genotypic variants. Frequent copy number changes result in a wide range of alleles, and provide unique opportunities for modulating complex phenotypes through variation in repeat length. While classical studies have identified key roles of individual STR loci, the advent of improved sequencing technology, high-quality genome assemblies for diverse species, and bioinformatics methods for genome-wide STR analysis now enable more systematic study of STR variation across wide evolutionary ranges. In this review, we explore mutation and selection processes that affect STR copy number evolution, and how these processes give rise to varying STR patterns both within and across species. Finally, we review recent examples of functional and adaptive changes linked to STRs.
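The definition that opens this abstract (units of 1-6 bp repeated in tandem) can be made concrete with a minimal sketch that scans a DNA string for perfect repeats. The function name `find_strs` and the `min_copies` threshold are assumptions for illustration; the sketch ignores interrupted or imperfect repeats and sequencing error, which real genome-wide STR analysis has to handle.

```python
import re


def find_strs(seq, min_copies=3, max_motif=6):
    """Return (start, motif, copies) for perfect tandem repeats in seq.

    Assumptions: motifs are 1..max_motif bp long, only exact copies count,
    and a locus needs at least min_copies (>= 2) consecutive copies.
    """
    # Lazy motif group so the shortest repeating unit is reported,
    # followed by at least (min_copies - 1) further copies of it.
    pattern = re.compile(
        r"([ACGT]{1,%d}?)\1{%d,}" % (max_motif, min_copies - 1)
    )
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        copies = len(match.group(0)) // len(motif)
        hits.append((match.start(), motif, copies))
    return hits


print(find_strs("TTGACACACACAGG"))  # [(3, 'AC', 4)]
```

Comparing the `copies` value at the same locus across individuals gives the repeat-length variation that the review describes as the main axis of STR polymorphism.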
Affiliation(s)
- Max Verbiest
- Institute of Computational Life Sciences, School of Life Sciences and Facility Management, Zürich University of Applied Sciences, Wädenswil, Switzerland
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Mikhail Maksimov
- Department of Computer Science & Engineering, University of California San Diego, La Jolla, California, USA
- Department of Medicine, University of California San Diego, La Jolla, California, USA
- Ye Jin
- Department of Medicine, University of California San Diego, La Jolla, California, USA
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
- Maria Anisimova
- Institute of Computational Life Sciences, School of Life Sciences and Facility Management, Zürich University of Applied Sciences, Wädenswil, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Melissa Gymrek
- Department of Computer Science & Engineering, University of California San Diego, La Jolla, California, USA
- Department of Medicine, University of California San Diego, La Jolla, California, USA
- Tugce Bilgin Sonay
- Institute of Ecology, Evolution and Environmental Biology, Columbia University, New York, New York, USA
5
Blaj I, Tetens J, Bennewitz J, Thaller G, Falker-Gieske C. Structural variants and tandem repeats in the founder individuals of four F2 pig crosses and implications to F2 GWAS results. BMC Genomics 2022; 23:631. [PMID: 36057580 PMCID: PMC9440560 DOI: 10.1186/s12864-022-08716-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 02/01/2022] [Accepted: 06/23/2022] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND Structural variants and tandem repeats are relevant sources of genomic variation that are not routinely analyzed in genome wide association studies, mainly due to challenging identification and genotyping. Here, we profiled these variants via state-of-the-art strategies in the founder animals of four F2 pig crosses using whole-genome sequence data (20x coverage). The variants were compared at a founder level with the commonly screened SNPs and small indels. At the F2 level, we carried out an association study using imputed structural variants and tandem repeats with four growth and carcass traits, followed by a comparison with a previously conducted association study based on SNPs and small indels. RESULTS A total of 13,201 high confidence structural variants and 103,730 polymorphic tandem repeats (with a repeat length of 2-20 bp) were profiled in the founders. We observed a moderate to high (r from 0.48 to 0.57) level of co-localization between SNPs or small indels and structural variants or tandem repeats. In the association step, 56.56% of the significant variants were not in high LD with significantly associated SNPs and small indels identified for the same traits in the earlier study and would thus presumably not be tagged in a standard association study. For the four growth and carcass traits investigated, many of the already proposed candidate genes in our previous studies were confirmed and additional ones were identified. Interestingly, a common pattern in how structural variants or tandem repeats regulate the phenotypic traits emerged. Many of the significant variants were embedded in or near long non-coding RNAs, drawing attention to their functional importance. Through which specific mechanisms the identified long non-coding RNAs and their associated structural variants or tandem repeats contribute to quantitative trait variation will need further investigation.
CONCLUSIONS The current study provides insights into the characteristics of structural variants and tandem repeats and their role in association studies. A systematic incorporation of these variants into genome wide association studies is advised. While not of immediate interest for genomic prediction purposes, this will be particularly beneficial for elucidating biological mechanisms driving the complex trait variation.
Affiliation(s)
- Iulia Blaj
- Institute of Animal Breeding and Husbandry, Kiel University, Kiel, Germany
- Jens Tetens
- Department of Animal Sciences, Georg-August-University, Göttingen, Germany
- Center for Integrated Breeding Research, Georg-August-University, Göttingen, Germany
- Jörn Bennewitz
- Institute of Animal Husbandry and Breeding, University of Hohenheim, Stuttgart, Germany
- Georg Thaller
- Institute of Animal Breeding and Husbandry, Kiel University, Kiel, Germany
6
Gong H, Liu W, Wu Z, Zhang M, Sun Y, Ling Z, Xiao S, Ai H, Xin Y, Yang B, Huang L. Evolutionary insights into porcine genomic structural variations based on a novel constructed dataset from 24 worldwide diverse populations. Evol Appl 2022. [DOI: 10.1111/eva.13455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/27/2022] Open
Affiliation(s)
- Huanfa Gong
- State Key Laboratory of Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, P.R. China
- Key Laboratory of Molecular Animal Nutrition, Ministry of Education, College of Animal Sciences, Zhejiang University, Hangzhou, P.R. China
- Key Laboratory of Animal Nutrition and Feed Science in Eastern China, Ministry of Agriculture, College of Animal Sciences, Zhejiang University, Hangzhou, P.R. China
- Weiwei Liu
- State Key Laboratory of Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, P.R. China
- Zhongzi Wu
- State Key Laboratory of Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, P.R. China
- Mingpeng Zhang
- State Key Laboratory of Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, P.R. China
- Yingchun Sun
- State Key Laboratory of Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, P.R. China
- Ziqi Ling
- State Key Laboratory of Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, P.R. China
- Shijun Xiao
- State Key Laboratory of Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, P.R. China
- Huashui Ai
- State Key Laboratory of Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, P.R. China
- Yuyun Xin
- State Key Laboratory of Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, P.R. China
- Bin Yang
- State Key Laboratory of Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, P.R. China
- Lusheng Huang
- State Key Laboratory of Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, P.R. China