1
|
Wang C, Wang S, Zhao Y, Liu J, Zhang D, Wang F, Fan H, Li C, Jiang L. A biogeographical ancestry inference pipeline using PCA-XGBoost model and its application in Asian populations. Forensic Sci Int Genet 2025; 77:103239. [PMID: 40037006 DOI: 10.1016/j.fsigen.2025.103239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Revised: 01/09/2025] [Accepted: 02/12/2025] [Indexed: 03/06/2025]
Abstract
Biogeographical ancestry (BGA) inference plays a crucial role in genetics, anthropology, forensic science, and medical research. Current methods like principal component analysis (PCA) and ADMIXTURE, based on single nucleotide polymorphisms, are commonly used. Here, we introduce a bio-geographical ancestry inference pipeline that integrates prior population structure and clustering. Our pipeline first analyzes genetic structure on cleaned data to obtain optimal parameters and classification model labels. An XGBoost (eXtreme Gradient Boosting) classification model is constructed using principal components from PCA, and model predictions are evaluated with LR (likelihood ratio). The pipeline was applied to a dataset of Asian populations, with a first prediction accuracy of 96.27 % achieved. The LR-based evaluation accuracy reached 98.96 %, showing an improvement of 2.69 % with the introduction of LR assessment. This highlights the robust predictive capability of our pipeline and the improved accuracy in evaluation with LR. This successful application will benefit genetic research, human history studies, and criminal investigations. Additionally, the pipeline's versatility allows application to new datasets.
Collapse
Affiliation(s)
- Chunnain Wang
- School of Computer Science, Shaanxi Normal University, Xian, Shaanxi 710119, China; Key Laboratory of Forensic Genetics, Beijing Engineering Research Center of Crime Scene Evidence Examination, National Engineering Laboratory for Forensic Science, Institute of Forensic Science, Beijing 100038, China
| | - Shuaiqi Wang
- Key Laboratory of Forensic Genetics, Beijing Engineering Research Center of Crime Scene Evidence Examination, National Engineering Laboratory for Forensic Science, Institute of Forensic Science, Beijing 100038, China; School of Investigation, People's Public Security University of China, Beijing 100038, China
| | - Yiru Zhao
- Key Laboratory of Forensic Genetics, Beijing Engineering Research Center of Crime Scene Evidence Examination, National Engineering Laboratory for Forensic Science, Institute of Forensic Science, Beijing 100038, China; Jiangsu International Joint Research Center of Genomics, Jiangsu Key Laboratory of Phylogenomics and Comparative Genomics, School of Life Science, Jiangsu Normal University, Xuzhou, Jiangsu 221116, China
| | - Jun Liu
- Key Laboratory of Forensic Genetics, Beijing Engineering Research Center of Crime Scene Evidence Examination, National Engineering Laboratory for Forensic Science, Institute of Forensic Science, Beijing 100038, China; School of Forensic Medicine, Shanxi Medical University, Jinzhong, Shanxi 030600, China
| | - Deqin Zhang
- Key Laboratory of Forensic Genetics, Beijing Engineering Research Center of Crime Scene Evidence Examination, National Engineering Laboratory for Forensic Science, Institute of Forensic Science, Beijing 100038, China; Institute of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou 550004, China
| | - Fuyang Wang
- Key Laboratory of Forensic Genetics, Beijing Engineering Research Center of Crime Scene Evidence Examination, National Engineering Laboratory for Forensic Science, Institute of Forensic Science, Beijing 100038, China
| | - Hong Fan
- School of Computer Science, Shaanxi Normal University, Xian, Shaanxi 710119, China.
| | - Caixia Li
- Key Laboratory of Forensic Genetics, Beijing Engineering Research Center of Crime Scene Evidence Examination, National Engineering Laboratory for Forensic Science, Institute of Forensic Science, Beijing 100038, China.
| | - Li Jiang
- Key Laboratory of Forensic Genetics, Beijing Engineering Research Center of Crime Scene Evidence Examination, National Engineering Laboratory for Forensic Science, Institute of Forensic Science, Beijing 100038, China.
| |
Collapse
|
2
|
He Y, Lei C, Wan C, Zeng S, Zhang T, Luo F, Li R, Li X, Zhao A, Xiao D, Luo Y, Shan K, Qi X, Jin X. A comprehensive whole genome database of ethnic minority populations. Sci Rep 2024; 14:13954. [PMID: 38886537 PMCID: PMC11183174 DOI: 10.1038/s41598-024-63892-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 06/03/2024] [Indexed: 06/20/2024] Open
Abstract
China, is characterized by its remarkable ethnical diversity, which necessitates whole genome variation data from multiple populations as crucial tools for advancing population genetics and precision medical research. However, there has been a scarcity of research concentrating on the whole genome of ethnic minority groups. To fill this gap, we developed the Guizhou Multi-ethnic Genome Database (GMGD). It comprises whole genome sequencing data from 476 healthy unrelated individuals spanning 11 ethnic minorities groups in Guizhou Province, Southwest China, including Bouyei, Dong, Miao, Yi, Bai, Gelo, Zhuang, Tujia, Yao, Hui, and Sui. The GMGD database comprises more than 16.33 million variants in GRCh38 and 16.20 million variants in GRCh37. Among these, approximately 11.9% (1,956,322) of the variants in GRCh38 and 18.5% (3,009,431) of the variants in GRCh37 are entirely new and do not exist in the dbSNP database. These novel variants shed light on the genetic diversity landscape across these populations, providing valuable insights with an average coverage of 5.5 ×. This makes GMGD the largest genome-wide database encompassing the most diverse ethnic groups to date. The GMGD interactive interface facilitates researchers with multi-dimensional mutation search methods and displays population frequency differences among global populations. Furthermore, GMGD is equipped with a genotype-imputation function, enabling enhanced capabilities for low-depth genomic research or targeted region capture studies. GMGD offers unique insights into the genomic variation landscape of different ethnic groups, which are freely accessible at https://db.cngb.org/pop/gmgd/ .
Collapse
Affiliation(s)
- Yan He
- Key Laboratory of Endemic and Ethnic Diseases, Ministry of Education and Key Laboratory of Medical Molecular Biology of Guizhou Province, Collaborative Innovation Center for Prevention and Control of Endemic and Ethnic Regional Diseases Co-constructed by the Province and Ministry, Guizhou Medical University, Guiyang, 550004, Guizhou, China
| | - Changgui Lei
- BGI Research, Shenzhen, 518083, China
- BGI Research, Guiyang, 550000, China
- BGI Research, Wuhan, 430074, China
| | - Chanjuan Wan
- Key Laboratory of Endemic and Ethnic Diseases, Ministry of Education and Key Laboratory of Medical Molecular Biology of Guizhou Province, Collaborative Innovation Center for Prevention and Control of Endemic and Ethnic Regional Diseases Co-constructed by the Province and Ministry, Guizhou Medical University, Guiyang, 550004, Guizhou, China
| | - Shuang Zeng
- Key Laboratory of Endemic and Ethnic Diseases, Ministry of Education and Key Laboratory of Medical Molecular Biology of Guizhou Province, Collaborative Innovation Center for Prevention and Control of Endemic and Ethnic Regional Diseases Co-constructed by the Province and Ministry, Guizhou Medical University, Guiyang, 550004, Guizhou, China
- BGI Research, Shenzhen, 518083, China
- BGI Research, Guiyang, 550000, China
- BGI Research, Wuhan, 430074, China
| | - Ting Zhang
- Key Laboratory of Endemic and Ethnic Diseases, Ministry of Education and Key Laboratory of Medical Molecular Biology of Guizhou Province, Collaborative Innovation Center for Prevention and Control of Endemic and Ethnic Regional Diseases Co-constructed by the Province and Ministry, Guizhou Medical University, Guiyang, 550004, Guizhou, China
| | - Fei Luo
- BGI Research, Shenzhen, 518083, China
- BGI Research, Guiyang, 550000, China
- BGI Research, Wuhan, 430074, China
| | - Ruichao Li
- Key Laboratory of Endemic and Ethnic Diseases, Ministry of Education and Key Laboratory of Medical Molecular Biology of Guizhou Province, Collaborative Innovation Center for Prevention and Control of Endemic and Ethnic Regional Diseases Co-constructed by the Province and Ministry, Guizhou Medical University, Guiyang, 550004, Guizhou, China
| | - Xiaokun Li
- BGI Research, Shenzhen, 518083, China
- BGI Research, Guiyang, 550000, China
| | - Anshu Zhao
- Key Laboratory of Endemic and Ethnic Diseases, Ministry of Education and Key Laboratory of Medical Molecular Biology of Guizhou Province, Collaborative Innovation Center for Prevention and Control of Endemic and Ethnic Regional Diseases Co-constructed by the Province and Ministry, Guizhou Medical University, Guiyang, 550004, Guizhou, China
| | - Defu Xiao
- BGI Research, Shenzhen, 518083, China
- BGI Research, Guiyang, 550000, China
| | - Yunyan Luo
- Key Laboratory of Endemic and Ethnic Diseases, Ministry of Education and Key Laboratory of Medical Molecular Biology of Guizhou Province, Collaborative Innovation Center for Prevention and Control of Endemic and Ethnic Regional Diseases Co-constructed by the Province and Ministry, Guizhou Medical University, Guiyang, 550004, Guizhou, China
- BGI Research, Guiyang, 550000, China
| | - Keren Shan
- Key Laboratory of Endemic and Ethnic Diseases, Ministry of Education and Key Laboratory of Medical Molecular Biology of Guizhou Province, Collaborative Innovation Center for Prevention and Control of Endemic and Ethnic Regional Diseases Co-constructed by the Province and Ministry, Guizhou Medical University, Guiyang, 550004, Guizhou, China
| | - Xiaolan Qi
- Key Laboratory of Endemic and Ethnic Diseases, Ministry of Education and Key Laboratory of Medical Molecular Biology of Guizhou Province, Collaborative Innovation Center for Prevention and Control of Endemic and Ethnic Regional Diseases Co-constructed by the Province and Ministry, Guizhou Medical University, Guiyang, 550004, Guizhou, China.
| | - Xin Jin
- BGI Research, Shenzhen, 518083, China.
- BGI Research, Guiyang, 550000, China.
- Shenzhen Key Laboratory of Transomics Biotechnologies, BGI Research, Shenzhen, China.
- School of Medicine, South China University of Technology, Guangzhou, China.
| |
Collapse
|
3
|
Zhang H, Yang M, Zhang H, Ren Z, Wang Q, Liu Y, Jin X, Ji J, Feng Y, Cai C, Ran Q, Li C, Huang J. Forensic features and phylogenetic structure survey of four populations from southwest China via the autosomal insertion/deletion markers. Forensic Sci Res 2024; 9:owad052. [PMID: 38765700 PMCID: PMC11102079 DOI: 10.1093/fsr/owad052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Accepted: 05/09/2023] [Indexed: 05/22/2024] Open
Abstract
Insertion/Deletion (InDel) polymorphisms, characterized by their smaller amplicons, reduced mutation rates, and compatibility with the prevalent capillary electrophoresis (CE) platforms in forensic laboratories, significantly contribute to the advancement and application of genetic analysis. Guizhou province in China serves as an important region for investigating the genetic structure, ethnic group origins, and human evolution. However, DNA data and the sampling of present-day populations are lacking, especially about the InDel markers. Here, we reported data on 47 autosomal InDels from 592 individuals from four populations in Guizhou (Han, Dong, Yi, and Chuanqing). Genotyping was performed with the AGCU InDel 50 kit to evaluate their utility for forensic purposes and to explore the population genetic structure. Our findings showed no significant deviations from Hardy-Weinberg and linkage equilibriums. The combined power of discrimination (CPD) and the combined power of exclusion (CPE) for each population demonstrated that the kit could be applied to forensic individual identification and was an effective supplement for parentage testing. Genetic structure analyses, including principal component analysis, multidimensional scaling, genetic distance calculation, STRUCTURE, and phylogenetic analysis, highlighted that the genetic proximity of the studied populations correlates with linguistic, geographical, and cultural factors. The observed genetic variances within four research populations were less pronounced than those discerned between populations across different regions. Notably, the Guizhou Han, Dong, and Chuanqing populations showed closer genetic affiliations with linguistically similar groups than the Guizhou Yi. These results underscore the potential of InDel markers in forensic science and provide insights into the genetic landscape and human evolution in multi-ethnic regions like Guizhou. Key points InDel markers show promise for forensic individual identification and parentage testing via the AGCU InDel 50 kit.Genetic analysis of Guizhou populations reveals correlations with linguistic, geographical, and cultural factors.Guizhou Han, Dong, and Chuanqing populations showed closer genetic affiliations with linguistically similar groups than the Guizhou Yi.
Collapse
Affiliation(s)
- Han Zhang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
- Institute of Forensic Science, Fudan University, Shanghai, China
| | - Meiqing Yang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Hongling Zhang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Zheng Ren
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Qiyan Wang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Yubo Liu
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Xiaoye Jin
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Jingyan Ji
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Yuhang Feng
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Changsheng Cai
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Qianchong Ran
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Chengtao Li
- Institute of Forensic Science, Fudan University, Shanghai, China
| | - Jiang Huang
- The Key Laboratory of Environmental Pollution Monitoring and Disease Control, Ministry of Education, Guizhou Medical University, Guiyang, China
| |
Collapse
|
4
|
Yang M, He G, Ren Z, Wang Q, Liu Y, Zhang H, Zhang H, Chen J, Ji J, Zhao J, Guo J, Zhu K, Yang X, Wang R, Ma H, Wang CC, Huang J. Genomic Insights Into the Unique Demographic History and Genetic Structure of Five Hmong-Mien-Speaking Miao and Yao Populations in Southwest China. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.849195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Southern China was the original center of multiple ancestral populations related to modern Hmong-Mien, Tai-Kadai, Austroasiatic, and Austronesian people. More recent genetic surveys have focused on the fine-scale genetic structure and admixture history of southern Chinese populations, but the genetic formation and diversification of Hmong-Mien speakers are far from clear due to the sparse genetic sampling. Here, we reported nearly 700,000 single-nucleotide polymorphisms (SNPs) data from 130 Guizhou Miao and Yao individuals. We used principal component analysis, ADMIXTURE, f-statistics, qpAdm, phylogenetic tree, fineSTRUCTURE, and ALDER to explore the fine-scale population genetic structure and admixture pattern of Hmong-Mien people. The sharing allele patterns showed that our studied populations had a strong genetic affinity with ancient and modern groups from southern and southeastern East Asia. We identified one unique ancestry component maximized in Yao people, which widely existed in other Hmong-Mien-speaking populations in southern China and Southeast Asia and ancient samples of Guangxi. Guizhou Hmong-Mien speakers harbored the dominant proportions of ancestry related to southern indigenous East Asians and minor proportions of northern ancestry related to Yellow River farmers, suggesting the possibility of genetic admixture between Hmong-Mien people and recent southward Sino-Tibetan-related populations. Furthermore, we found a genetic substructure among geographically different Miao and Yao people in Leishan and Songtao. The Yao and Miao people in Leishan harbored more southern East Asian ancestry, but Miao in Songtao received more northern East Asian genetic influence. We observed high mtDNA but low Y-chromosome diversity in studied Hmong-Mien groups, supporting the role of sex-specific residence in influencing human genetic variation. Our data provide valuable clues for further exploring population dynamics in southern China.
Collapse
|
5
|
Chen J, He G, Ren Z, Wang Q, Liu Y, Zhang H, Yang M, Zhang H, Ji J, Zhao J, Guo J, Chen J, Zhu K, Yang X, Wang R, Ma H, Tao L, Liu Y, Shen Q, Yang W, Wang CC, Huang J. Fine-Scale Population Admixture Landscape of Tai–Kadai-Speaking Maonan in Southwest China Inferred From Genome-Wide SNP Data. Front Genet 2022; 13:815285. [PMID: 35251126 PMCID: PMC8891617 DOI: 10.3389/fgene.2022.815285] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 01/27/2022] [Indexed: 12/27/2022] Open
Abstract
Guizhou Province harbors extensive ethnolinguistic and cultural diversity with Sino-Tibetan-, Hmong–Mien-, and Tai–Kadai-speaking populations. However, previous genetic analyses mainly focused on the genetic admixture history of the former two linguistic groups. The admixture history of Tai–Kadai-speaking populations in Guizhou needed to be characterized further. Thus, we genotyped genome-wide SNP data from 41 Tai–Kadai-speaking Maonan people and made a comprehensive population genetic analysis to explore their genetic origin and admixture history based on the pattern of the sharing alleles and haplotypes. We found a genetic affinity among geographically different Tai–Kadai-speaking populations, especially for Guizhou Maonan people and reference Maonan from Guangxi. Furthermore, formal tests based on the f3/f4-statistics further identified an adjacent connection between Maonan and geographically adjacent Hmong–Mien and Sino-Tibetan people, which was consistent with their historically documented shared material culture (Zhang et al., iScience, 2020, 23, 101032). Fitted qpAdm-based two-way admixture models with ancestral sources from northern and southern East Asians demonstrated that Maonan people were an admixed population with primary ancestry related to Guangxi historical people and a minor proportion of ancestry from Northeast Asians, consistent with their linguistically supported southern China origin. Here, we presented the landscape of genetic structure and diversity of Maonan people and a simple demographic model for their evolutionary process. Further whole-genome-sequence–based projects can be presented with more detailed information about the population history and adaptative history of the Guizhou Maonan people.
Collapse
Affiliation(s)
- Jing Chen
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Guanglin He
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China
- Department of Anthropology and Ethnology, School of Sociology and Anthropology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
- State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
- Institute Of Rare Diseases, West China Hospital of Sichuan University, Chengdu, China
| | - Zheng Ren
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Qiyan Wang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Yubo Liu
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Hongling Zhang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Meiqing Yang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Han Zhang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Jingyan Ji
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
| | - Jing Zhao
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China
- Department of Anthropology and Ethnology, School of Sociology and Anthropology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
- State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
| | - Jianxin Guo
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China
- Department of Anthropology and Ethnology, School of Sociology and Anthropology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
- State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
| | - Jinwen Chen
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China
- Department of Anthropology and Ethnology, School of Sociology and Anthropology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
- State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
| | - Kongyang Zhu
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China
- Department of Anthropology and Ethnology, School of Sociology and Anthropology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
- State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
| | - Xiaomin Yang
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China
- Department of Anthropology and Ethnology, School of Sociology and Anthropology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
- State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
| | - Rui Wang
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China
- Department of Anthropology and Ethnology, School of Sociology and Anthropology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
- State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
| | - Hao Ma
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China
- Department of Anthropology and Ethnology, School of Sociology and Anthropology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
- State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
| | - Le Tao
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China
- Department of Anthropology and Ethnology, School of Sociology and Anthropology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
- State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
| | - Yilan Liu
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China
- Department of Anthropology and Ethnology, School of Sociology and Anthropology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
- State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
| | - Qu Shen
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China
- Department of Anthropology and Ethnology, School of Sociology and Anthropology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
- State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
| | - Wenjiao Yang
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China
- Department of Anthropology and Ethnology, School of Sociology and Anthropology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
- State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
| | - Chuan-Chao Wang
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, China
- Department of Anthropology and Ethnology, School of Sociology and Anthropology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
- State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
- *Correspondence: Chuan-Chao Wang, ; Jiang Huang,
| | - Jiang Huang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, China
- *Correspondence: Chuan-Chao Wang, ; Jiang Huang,
| |
Collapse
|
6
|
Liu Y, Xie J, Wang M, Liu C, Zhu J, Zou X, Li W, Wang L, Leng C, Xu Q, Yeh HY, Wang CC, Wen X, Liu C, He G. Genomic Insights Into the Population History and Biological Adaptation of Southwestern Chinese Hmong-Mien People. Front Genet 2022; 12:815160. [PMID: 35047024 PMCID: PMC8762323 DOI: 10.3389/fgene.2021.815160] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 12/03/2021] [Indexed: 01/19/2023] Open
Abstract
Hmong-Mien (HM) -speaking populations, widely distributed in South China, the north of Thailand, Laos, and Vietnam, have experienced different settlement environments, dietary habits, and pathogenic exposure. However, their specific biological adaptation remained largely uncharacterized, which is important in the population evolutionary genetics and Trans-Omics for regional Precision Medicine. Besides, the origin and genetic diversity of HM people and their phylogenetic relationship with surrounding modern and ancient populations are also unknown. Here, we reported genome-wide SNPs in 52 representative Miao people and combined them with 144 HM people from 13 geographically representative populations to characterize the full genetic admixture and adaptive landscape of HM speakers. We found that obvious genetic substructures existed in geographically different HM populations; one localized in the HM clines, and others possessed affinity with Han Chinese. We also identified one new ancestral lineage specifically existed in HM people, which spatially distributed from Sichuan and Guizhou in the north to Thailand in the south. The sharing patterns of the newly identified homogenous ancestry component combined the estimated admixture times via the decay of linkage disequilibrium and haplotype sharing in GLOBETROTTER suggested that the modern HM-speaking populations originated from Southwest China and migrated southward in the historic period, which is consistent with the reconstructed phenomena of linguistic and archeological documents. Additionally, we identified specific adaptive signatures associated with several important human nervous system biological functions. Our pilot work emphasized the importance of anthropologically informed sampling and deeply genetic structure reconstruction via whole-genome sequencing in the next step in the deep Chinese Population Genomic Diversity Project (CPGDP), especially in the regions with rich ethnolinguistic diversity.
Collapse
Affiliation(s)
- Yan Liu
- School of Basic Medical Sciences, North Sichuan Medical College, Nanchong, China.,Medical Imaging Key Laboratory of Sichuan Province, North Sichuan Medical College, Nanchong, China
| | - Jie Xie
- School of Basic Medical Sciences, North Sichuan Medical College, Nanchong, China
| | - Mengge Wang
- Guangzhou Forensic Science Institute, Guangzhou, China.,Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Changhui Liu
- Guangzhou Forensic Science Institute, Guangzhou, China
| | - Jingrong Zhu
- Department of Anthropology and Ethnology, Xiamen University, Xiamen, China
| | - Xing Zou
- College of Medicine, Chongqing University, Chongqing, China
| | - Wenshan Li
- College of Medical Imaging, North Sichuan Medical College, Nanchong, China
| | - Lin Wang
- College of Clinical Medicine, North Sichuan Medical College, Nanchong, China
| | - Cuo Leng
- College of Medical Imaging, North Sichuan Medical College, Nanchong, China
| | - Quyi Xu
- Guangzhou Forensic Science Institute, Guangzhou, China
| | - Hui-Yuan Yeh
- School of Humanities, Nanyang Technological University, Singapore, Singapore
| | - Chuan-Chao Wang
- State Key Laboratory of Cellular Stress Biology, National Institute for Data Science in Health and Medicine, School of Life Sciences, Xiamen University, Xiamen, China.,Department of Anthropology and Ethnology, Institute of Anthropology, School of Sociology and Anthropology, Xiamen University, Xiamen, China.,State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
| | - Xiaohong Wen
- School of Basic Medical Sciences, North Sichuan Medical College, Nanchong, China
| | - Chao Liu
- Guangzhou Forensic Science Institute, Guangzhou, China.,Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Guanglin He
- School of Humanities, Nanyang Technological University, Singapore, Singapore.,State Key Laboratory of Cellular Stress Biology, National Institute for Data Science in Health and Medicine, School of Life Sciences, Xiamen University, Xiamen, China.,Department of Anthropology and Ethnology, Institute of Anthropology, School of Sociology and Anthropology, Xiamen University, Xiamen, China.,State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen, China
| |
Collapse
|
7
|
Liu Y, Yang J, Li Y, Tang R, Yuan D, Wang Y, Wang P, Deng S, Zeng S, Li H, Chen G, Zou X, Wang M, He G. Significant East Asian Affinity of the Sichuan Hui Genomic Structure Suggests the Predominance of the Cultural Diffusion Model in the Genetic Formation Process. Front Genet 2021; 12:626710. [PMID: 34194465 PMCID: PMC8237860 DOI: 10.3389/fgene.2021.626710] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 04/28/2021] [Indexed: 11/13/2022] Open
Abstract
The ancestral origin and genomic history of Chinese Hui people remain to be explored due to the paucity of genome-wide data. Some evidence argues that an eastward migration of Central Asians gave rise to modern Hui people, which is referred to as the demic diffusion hypothesis; other evidence favors the cultural diffusion hypothesis, which posits that East Asians adopted Muslim culture to form the modern culturally distinct populations. However, the extent to which the observed genetic structure of the Huis was mediated by the movement of people or the assimilation of Muslim culture also remains highly contentious. Analyses of over 700 K SNPs in 109 western Chinese individuals (49 Sichuan Huis and 60 geographically close Nanchong Hans) together with the available ancient and modern Eurasian sequences allowed us to fully explore the genomic makeup and origin of Hui and neighboring Han populations. The results from PCA, ADMIXTURE, and allele-sharing-based f-statistics revealed a strong genomic affinity between Sichuan Huis and Neolithic-to-modern Northern East Asians, which suggested a massive gene influx from East Asians into the Sichuan Hui people. Three-way admixture models in the qpWave/qpAdm analyses further revealed a small stream of gene influx from western Eurasians into the Sichuan Hui people, which was further directly confirmed via the admixture event from the temporally distinct Western sources to Sichuan Hui people in the qpGraph-based phylogenetic model, suggesting the key role of the cultural diffusion model in the genetic formation of the Sichuan Huis. ALDER-based admixture date estimation showed that this observed western Eurasian admixture signal was introduced into the Sichuan Huis during the historic periods, which was concordant with the extensive western-eastern communication along the Silk Road and historically documented Huis' migration history. In summary, although significant cultural differentiation exists between Hui people and their neighbors, our genomic analysis showed their strong genetic affinity with modern and ancient Northern East Asians. Our results support the hypothesis that the Sichuan Huis arose from a mixture of minor western Eurasian ancestry and predominant East Asian ancestry.
Collapse
Affiliation(s)
- Yan Liu
- School of Basic Medical Sciences, North Sichuan Medical College, Nanchong, China
| | - Junbao Yang
- School of Basic Medical Sciences, North Sichuan Medical College, Nanchong, China
| | | | - Renkuan Tang
- Department of Forensic Medicine, College of Basic Medicine, Chongqing Medical University, Chongqing, China
| | - Didi Yuan
- Department of Forensic Medicine, College of Basic Medicine, Chongqing Medical University, Chongqing, China
| | - Yicheng Wang
- Department of Forensic Medicine, College of Basic Medicine, Chongqing Medical University, Chongqing, China
| | - Peixin Wang
- College of Medical Information, Chongqing Medical University, Chongqing, China
| | - Shudan Deng
- School of Medical Imaging, North Sichuan Medical College, Nanchong, China
| | - Simei Zeng
- School of Basic Medical Sciences, North Sichuan Medical College, Nanchong, China
| | - Hongliang Li
- School of Basic Medical Sciences, North Sichuan Medical College, Nanchong, China
| | - Gang Chen
- Hunan Key Lab of Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, China
| | - Xing Zou
- Department of Forensic Genetics, Institute of Forensic Medicine, West China School of Basic Science and Forensic Medicine, Sichuan University, Chengdu, China
| | - Mengge Wang
- Department of Forensic Genetics, Institute of Forensic Medicine, West China School of Basic Science and Forensic Medicine, Sichuan University, Chengdu, China
| | - Guanglin He
- Department of Forensic Genetics, Institute of Forensic Medicine, West China School of Basic Science and Forensic Medicine, Sichuan University, Chengdu, China
- Department of Anthropology and Ethnology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, School of Life Sciences, Xiamen University, Xiamen, China
| |
Collapse
|