1
|
Song M, Zhou Y, Zhao C, Song F, Hou Y. YHP: Y-chromosome Haplogroup Predictor for predicting male lineages based on Y-STRs. Forensic Sci Int 2024; 361:112113. [PMID: 38936202 DOI: 10.1016/j.forsciint.2024.112113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Revised: 05/24/2024] [Accepted: 06/16/2024] [Indexed: 06/29/2024]
Abstract
Human Y chromosome reflects the evolutionary process of males. Male lineage tracing by Y chromosome is of great use in evolutionary, forensic, and anthropological studies. Identifying the male lineage based on the specific distribution of Y haplogroups narrows down the investigation scope, which has been used in forensic scenarios. However, existing software aids in familial searching using Y-STRs (Y-chromosome short tandem repeats) to predict Y-SNP (Y-chromosome single nucleotide polymorphism) haplogroups, they often lack resolution. In this study, we developed YHP (Y Haplogroup Predictor), a novel software offering high-resolution haplogroup inference without requiring extensive Y-SNP sequencing. Leveraging existing datasets (219 haplogroups, 4064 samples in total), YHP predicts haplogroups with 0.923 accuracy under the highest haplogroup resolution, employing a random forest algorithm. YHP, available on Github (https://github.com/cissy123/YHP-Y-Haplogroup-Predictor-), facilitates high-resolution haplogroup prediction, haplotype mismatch analysis, and haplotype similarity comparison. Notably, it demonstrates efficacy in East Asian populations, benefiting from training data from eight distinct East Asian ethnic populations. Moreover, it enables seamless integration of additional training sets, extending its utility to diverse populations.
Collapse
Affiliation(s)
- Mengyuan Song
- Department of Forensic Genetics, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China; Department of Laboratory Medicine, West China Hospital, Sichuan University, Chengdu, China
| | - Yuxiang Zhou
- Department of Forensic Genetics, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China
| | - Chenxi Zhao
- College of Computer Science, Sichuan University, Chengdu, China
| | - Feng Song
- Department of Forensic Genetics, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China.
| | - Yiping Hou
- Department of Forensic Genetics, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China.
| |
Collapse
|
2
|
Zhang H, Huang X, Jin X, Ren Z, Wang Q, Yang M, Xu R, Yuan X, Yang D, Liu H, Shen W, Zhang H, Que Y, Huang J. Comprehensive analyses of genetic diversities and population structure of the Guizhou Dong group based on 44 Y-markers. PeerJ 2023; 11:e16183. [PMID: 37780380 PMCID: PMC10538297 DOI: 10.7717/peerj.16183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Accepted: 09/05/2023] [Indexed: 10/03/2023] Open
Abstract
Background The non-recombining region of the human Y chromosome (NRY) is a strictly paternally inherited genetic marker and the best material to trace the paternal lineages of populations. Y chromosomal short tandem repeat (Y-STR) is characterized by high polymorphism and paternal inheritance pattern, so it has been widely used in forensic medicine and population genetic research. This study aims to understand the genetic distribution of Y-STRs in the Guizhou Dong population, provide reference data for forensic application, and explore the phylogenetic relationships between the Guizhou Dong population and other comparison populations. Methods Based on the allele profile of 44 Y-markers in the Guizhou Dong group, we estimate their allele frequencies and haplotype frequencies. In addition, we also compare the forensic application efficiency of different Y-STR sets in the Guizhou Dong group. Finally, genetic relationships among Guizhou Dong and other reference populations are dissected by the multi-dimensional scaling and the phylogenetic tree. Results A total of 393 alleles are observed in 312 Guizhou Dong individuals for these Y-markers, with allele frequencies ranging from 0.0032 to 0.9679. The haplotype diversity and discriminatory capacity for these Y-markers in the Guizhou Dong population are 0.99984 and 0.97440, respectively. The population genetic analyses of the Guizhou Dong group and other reference populations show that the Guizhou Dong group has the closest genetic relationship with the Hunan Dong population, and followed by the Guizhou Tujia population. Conclusions In conclusion, these 44 Y-markers can be used as an effective tool for male differentiation in the Guizhou Dong group. The haplotype data in this study not only enrich the Y-STR data of different ethnic groups in China, but also have important significance for population genetics and forensic research.
Collapse
Affiliation(s)
- Hongling Zhang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Xiaolan Huang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Xiaoye Jin
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Zheng Ren
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Qiyan Wang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Meiqing Yang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Ronglan Xu
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Xiang Yuan
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Daiquan Yang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Hongyan Liu
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Wanyi Shen
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Huiying Zhang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Yangjie Que
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| | - Jiang Huang
- Department of Forensic Medicine, Guizhou Medical University, Guiyang, Guizhou, China
| |
Collapse
|
3
|
Li J, Song F, Lang M, Xie M. Comprehensive insights into the genetic background of Chinese populations using Y chromosome markers. ROYAL SOCIETY OPEN SCIENCE 2023; 10:230814. [PMID: 37736526 PMCID: PMC10509572 DOI: 10.1098/rsos.230814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 09/01/2023] [Indexed: 09/23/2023]
Abstract
China is located in East Asia. With a high genetic and cultural diversity, human migration in China has always been a hot topic of genetics research. To explore the origins and migration routes of Chinese males, 3333 Chinese individuals (Han, Hui, Mongolia, Yi and Kyrgyz) with 27 Y-STRs and 143 Y-SNPs from published literature were analysed. Our data showed that there are five dominant haplogroups (O2-M122, O1-F265, C-M130, N-M231, R-M207) in China. Combining analysis of haplogroup frequencies, geographical positions and time with the most recent common ancestor (TMRCA), we found that haplogroups C-M130, N-M231 and R1-M173 and O1a-M175 probably migrated into China via the northern route. Interestingly, we found that haplogroup C*-M130 in China may originate in South Asia, whereas the major subbranches C2a-L1373 and C2b-F1067 migrated from northern China. The results of BATWING showed that the common ancestry of Y haplogroup in China can be traced back to 17 000 years ago, which was concurrent with global temperature increases after the Last Glacial Maximum.
Collapse
Affiliation(s)
- Jienan Li
- Department of Forensic Science, School of Basic Medical Sciences, Central South University, Changsha, Hunan Province, People's Republic of China
| | - Feng Song
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, People's Republic of China
| | - Min Lang
- Sichuan University Law School, Sichuan University, Chengdu, People's Republic of China
| | - Mingkun Xie
- Department of Obstetrics, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, Hunan 410008, People's Republic of China
| |
Collapse
|
4
|
Zhang X, Li K, Duan Y. Population data and phylogenetic analysis of 37 Y-STR loci in the Hui population from Yunnan Province, Southwest China. Ann Hum Biol 2023; 50:196-199. [PMID: 36878885 DOI: 10.1080/03014460.2023.2188258] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/08/2023]
Abstract
Background: Previous studies of the genetic polymorphism of the Y-chromosomal short tandem repeat (Y-STR) of Huis were focused on the northwest of China. However, the population genetic characteristics of Chinese Hui residing in Yunnan province, Southwest China, remain unclear.Aim: To provide genetic data for 37 Y-STRs in the Chinese Hui population of Yunnan province, as well as to investigate population genetic relationships between the Chinese Hui and another 26 populations from China and neighbouring countries.Subjects and methods: 326 unrelated healthy male individuals were genotyped using the GoldeneyeTM Y Plus PCR Amplification Kit. Genetic relationships between different populations were analysed using YHRD's AMOVA tools.Results: A total of 279 haplotypes were detected, out of which 244 were unique. The overall haplotype diversity (HD) and discrimination capacity (DC) were 0.9989 and 0.8611, respectively. The gene diversity (GD) ranged from 0.0544 (DYS645) to 0.9656 (DYS385).Conclusions: The population comparison indicated that Muslim populations (Hui, Salar, and Uighur) showed significant genetic affinity than other populations. Our results could be applied in forensic practice and population genetic studies.
Collapse
Affiliation(s)
- Xiufeng Zhang
- School of Forensic Medicine, Kunming Medical University, Kunming, China.,Judicial Expertise Center, Kunming Medical University, Kunming, China
| | - Kunya Li
- Yuxi Public Security Bureau, Yuxi, China
| | - Yifan Duan
- School of Forensic Medicine, Kunming Medical University, Kunming, China
| |
Collapse
|
5
|
Xu C, Wei W, Zuo M, Zuo W, Li K, Lian X, Wang S, Zhou X, Zhang X, Zhong S. Genetic polymorphisms and phylogenetic characteristics of Tibeto-Burman-speaking Lahu population from southwest China based on 41 Y-STR loci. Ann Hum Biol 2023; 50:75-81. [PMID: 36688851 DOI: 10.1080/03014460.2023.2171123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
BACKGROUND Male sex-linked Y-chromosome short tandem repeats (Y-STRs) have been widely used in forensic cases and population genetics research. At present, the forensic-related Y-STR data in the Chinese Lahu population are still poorly understood. AIM To enrich the available Y-STR data of this Chinese minority population and investigate its phylogenetic relationships with other reported populations. SUBJECTS AND METHODS The genetic polymorphisms of 41 Y-STR loci were analysed in 299 unrelated healthy Lahu male individuals from Southwest China. Phylogenetic analyses were performed by multidimensional scaling analysis and neighbor-joining phylogenetic tree construction. RESULTS A total of 379 alleles were observed at the 41 Y-STR loci. The allele frequencies ranged from 0.0033 to 0.9666. The genetic diversity values ranged from 0.0653 to 0.9072. A total of 254 different haplotypes of the 41 Y-STR loci were observed in 299 individuals. The values of haplotype diversity, haplotype match probability, and discrimination capacity were 0.9987, 0.0047, and 0.8495, respectively. The phylogenetic analysis indicated that the Tibeto-Burman-speaking Lahu population showed a close genetic relationship with the Yunnan Yi population. CONCLUSIONS The haplotype data of the present study can enrich the forensic databases of this Chinese minority population and will be useful for population genetics and forensic DNA application.
Collapse
Affiliation(s)
- Chongchong Xu
- School of Forensic Medicine, Kunming Medical University, Kunming, China
| | - Wei Wei
- School of Forensic Medicine, Kunming Medical University, Kunming, China
| | - Mingxin Zuo
- School of Forensic Medicine, Kunming Medical University, Kunming, China
| | - Wu Zuo
- Institute of Criminal Science and Technology, Yuxi Municipal Public Security Bureau, Yuxi, China
| | - Kuan Li
- School of Forensic Medicine, Kunming Medical University, Kunming, China
| | - Xinqing Lian
- School of Forensic Medicine, Kunming Medical University, Kunming, China
| | - Shixu Wang
- School of Forensic Medicine, Kunming Medical University, Kunming, China
| | - Xuemei Zhou
- School of Forensic Medicine, Kunming Medical University, Kunming, China
| | - Xiufeng Zhang
- School of Forensic Medicine, Kunming Medical University, Kunming, China.,Judicial Expertise Center of Kunming Medical University, Kunming, China
| | - Shurong Zhong
- School of Forensic Medicine, Kunming Medical University, Kunming, China.,Judicial Expertise Center of Kunming Medical University, Kunming, China
| |
Collapse
|
6
|
Zhang X, Zheng H, Liu C. Genetic diversity of 23 Y-STR loci of the Lisu ethnic minority residing in Chuxiong Yi Autonomous Prefecture, Yunnan province, Southwest China. Ann Hum Biol 2023; 50:356-359. [PMID: 37523197 DOI: 10.1080/03014460.2023.2224972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 05/17/2023] [Accepted: 05/30/2023] [Indexed: 08/01/2023]
Abstract
BACKGROUND The Lisu group is a unique minority in Yunnan province. However, there is a lack of Y-STR population data for Chinese Lisu and the genetic structure of the Lisu group and other populations is unclear. AIM To provide genetic data for 23 Y-STRs in the Chinese Lisu population from Chuxiong Yi Autonomous Prefecture, as well as to analyse population genetic relationships between Chinese Lisu ethnic minority and other reference groups. SUBJECTS AND METHODS 423 unrelated healthy Lisu males were genotyped using the PowerPlex® Y23 system. Forensic parameters were calculated according to the previously published studies. Genetic structure analysis among Chinese Lisu and other populations was conducted using the YHRD's AMOVA tools. RESULTS Gene diversity (GD) ranged from 0.2,466 (DYS438) to 0.8,945 (DYS385a/b) among the 23 Y-STR loci. According to haplotype analysis, 323 different haplotypes were obtained, out of which 271 were unique. The haplotype diversity (HD) and discrimination capacity (DC) were 0.9,977 and 0.7,636, respectively. MDS plot indicated that the Chuxiong Lisu group is genetically related to the Yunnan Yi group. CONCLUSIONS This is the first report on Y-STR population data for the Chinese Lisu population. These data would be valuable for forensic applications.
Collapse
Affiliation(s)
- Xiufeng Zhang
- School of Forensic Medicine, Kunming Medical University, Kunming, Yunnan Province, China
- Judicial Expertise Center, Kunming Medical University, Kunming, Yunnan Province, China
| | - Hecheng Zheng
- Chuxiong Public Security Bureau, Chuxiong, Yunnan Province, China
| | - Chengjing Liu
- Chuxiong Public Security Bureau, Chuxiong, Yunnan Province, China
| |
Collapse
|
7
|
Fan GY, Jiang DZ, Jiang YH, Song W, He YY, Wuo NA. Phylogenetic analyses of 41 Y-STRs and machine learning-based haplogroup prediction in the Qingdao Han population from Shandong province, Eastern China. Ann Hum Biol 2023; 50:35-41. [PMID: 36636009 DOI: 10.1080/03014460.2023.2168057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
BACKGROUND Known for its rich history and culture, Qingdao is a typical symbol of Chinese maritime culture. Its unique genetic landscape has aroused interest among geneticists and forensic scientists. However, the genetic landscape of Qingdao has never been uncovered. AIM This investigation intends to provide light on Qingdao's paternal genetic diversity and its evolutionary connections to other Han subgroups. SUBJECTS AND METHODS The genetic polymorphisms of 41 Y-chromosomal short tandem repeat (STR) loci in the Qingdao Han were investigated using SureID® PathFinder Plus Kit. Phylogenetic studies were performed using genotype data from 52 East Asian groups at 23 common Y-STR loci. A multidimensional scaling plot and cladogram were constructed. Linear Discriminant Analysis (LDA) was carried out for predicting categories among the Han people. The k-nearest neighbour (kNN) algorithm was utilised to designate Y-SNP haplogroups for each haplotype. RESULTS The Qingdao Han were genetically far from the Tibeto-Burman populations and close with the Han people from northern China. LDA indicated a deep integration among the present-day Han people. By the kNN model, the predicted O2a2 and O2a1 were shown to be the predominant Y-SNP haplogroups. CONCLUSIONS This study would be helpful for reconstructing the patrilineal history in China and establishing a more comprehensive Y-STR database.
Collapse
Affiliation(s)
- Guang-Yao Fan
- Forensic Center, College of Medicine, Shaoxing University, Shaoxing, China
| | - De-Zhi Jiang
- Criminal Technology Department, Qingdao Public Security Bureau, Shandong, China
| | - Yao-Heng Jiang
- Criminal Technology Department, Qingdao Public Security Bureau, Shandong, China
| | - Wei Song
- Criminal Technology Department, Qingdao Public Security Bureau, Shandong, China
| | - Ying-Yun He
- Ningbo Health Gene Technologies Co. Ltd, Ningbo, China
| | | |
Collapse
|
8
|
Forensic Analysis and Genetic Structure Construction of Chinese Chongming Island Han Based on Y Chromosome STRs and SNPs. Genes (Basel) 2022; 13:genes13081363. [PMID: 36011274 PMCID: PMC9407086 DOI: 10.3390/genes13081363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2022] [Revised: 07/26/2022] [Accepted: 07/27/2022] [Indexed: 02/01/2023] Open
Abstract
Y-chromosome short tandem repeat (Y-STR) and Y-chromosome single nucleotide polymorphism (Y-SNP) are genetic markers on the male Y chromosome for individual identification, forensic applications, and paternal genetic history analysis. In this study we successfully genotyped 38 Y-STR loci and 24 Y-SNP loci of Pudong Han (n = 689) and Chongming Han (n = 530) in Shanghai. The haplotype diversity of the Y filer platinum genotyping system was the highest in the Han population in the Pudong area of Shanghai (0.99996) and Chongming Island (0.99997). The proportion of unique haplotypes was 97.10% (Pudong) and 98.49% (Chongming), respectively. The multidimensional scaling analysis and phylogenetic analysis were performed according to the genetic distance Rst, which was calculated based on the Y-STR gene frequency data. Moreover, we made a comparison on the frequency distribution analysis and principal component analysis of haplogroups in both populations. As a result, Shanghai Pudong Han, Chongming Island Han, and Jiangsu Han were determined to have a strong genetic affinity. The haplogroup distribution characteristics of the Pudong Han and Chongming Han populations were similar to those of the southern Han population. The results of haplotype network analysis showed that Jiangsu Wujiang Han and Jiangsu Changshu Han had more paternal genetic contributions to the formation of Shanghai Pudong Han and Chongming Island Han. Through the joint analysis of SNPs and STRs, this study deeply analyzed the paternal genetic structure of the Pudong Han and Chongming Han populations. The addition of Y-SNP haplogroups to forensic applications can provide information for pedigree investigation.
Collapse
|