1
|
Alladio E, Poggiali B, Cosenza G, Pilli E. Multivariate statistical approach and machine learning for the evaluation of biogeographical ancestry inference in the forensic field. Sci Rep 2022; 12:8974. [PMID: 35643723 PMCID: PMC9148302 DOI: 10.1038/s41598-022-12903-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Accepted: 04/13/2022] [Indexed: 11/24/2022] Open
Abstract
The biogeographical ancestry (BGA) of a trace or a person/skeleton refers to the component of ethnicity, constituted of biological and cultural elements, that is biologically determined. Nowadays, many individuals are interested in exploring their genealogy, and the capability to distinguish biogeographic information about population groups and subgroups via DNA analysis plays an essential role in several fields such as in forensics. In fact, for investigative and intelligence purposes, it is beneficial to inference the biogeographical origins of perpetrators of crimes or victims of unsolved cold cases when no reference profile from perpetrators or database hits for comparative purposes are available. Current approaches for biogeographical ancestry estimation using SNPs data are usually based on PCA and Structure software. The present study provides an alternative method that involves multivariate data analysis and machine learning strategies to evaluate BGA discriminating power of unknown samples using different commercial panels. Starting from 1000 Genomes project, Simons Genome Diversity Project and Human Genome Diversity Project datasets involving African, American, Asian, European and Oceania individuals, and moving towards further and more geographically restricted populations, powerful multivariate techniques such as Partial Least Squares-Discriminant Analysis (PLS-DA) and machine learning techniques such as XGBoost were employed, and their discriminating power was compared. PLS-DA method provided more robust classifications than XGBoost method, showing that the adopted approach might be an interesting tool for forensic experts to infer BGA information from the DNA profile of unknown individuals, but also highlighting that the commercial forensic panels could be inadequate to discriminate populations at intra-continental level.
Collapse
Affiliation(s)
- Eugenio Alladio
- Department of Chemistry, University of Turin, Turin, Italy.,Centro Regionale Antidoping e di Tossicologia "A. Bertinaria", Orbassano, Torino, Italy
| | - Brando Poggiali
- Department of Biology, Forensic Molecular Anthropology Laboratory, University of Florence, Florence, Italy
| | - Giulia Cosenza
- Department of Biology, Forensic Molecular Anthropology Laboratory, University of Florence, Florence, Italy
| | - Elena Pilli
- Department of Biology, Forensic Molecular Anthropology Laboratory, University of Florence, Florence, Italy.
| |
Collapse
|
2
|
Alnaqbi H, Tay GK, Chehadeh SEH, Alsafar H. Characterizing the diversity of MHC conserved extended haplotypes using families from the United Arab Emirates. Sci Rep 2022; 12:7165. [PMID: 35504942 PMCID: PMC9065074 DOI: 10.1038/s41598-022-11256-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Accepted: 04/20/2022] [Indexed: 11/19/2022] Open
Abstract
Aside from its anthropological relevance, the characterization of the allele frequencies of genes in the human Major Histocompatibility Complex (MHC) and the combination of these alleles that make up MHC conserved extended haplotypes (CEHs) is necessary for histocompatibility matching in transplantation as well as mapping disease association loci. The structure and content of the MHC region in Middle Eastern populations remain poorly characterized, posing challenges when establishing disease association studies in ethnic groups that inhabit the region and reducing the capacity to translate genetic research into clinical practice. This study was conceived to address a gap of knowledge, aiming to characterize CEHs in the United Arab Emirates (UAE) population through segregation analysis of high-resolution, pedigree-phased, MHC haplotypes derived from 41 families. Twenty per cent (20.5%) of the total haplotype pool derived from this study cohort were identified as putative CEHs in the UAE population. These consisted of CEHs that have been previously detected in other ethnic groups, including the South Asian CEH 8.2 [HLA- C*07:02-B*08:01-DRB1*03:01-DQA1*05:01-DQB1*02:01 (H.F. 0.094)] and the common East Asian CEH 58.1 [HLA- C*03:02-B*58:01-DRB1*03:01- DQA1*05:01-DQB1*02:01 (H.F. 0.024)]. Additionally, three novel CEHs were identified in the current cohort, including HLA- C*15:02-B*40:06-DRB1*16:02-DQB1*05:02 (H.F. 0.035), HLA- C*16:02-B*51:01-DRB1*16:01-DQA1*01:02-DQB1*05:02 (H.F. 0.029), and HLA- C*03:02-B*58:01-DRB1*16:01-DQA1*01:02-DQB1*05:02 (H.F. 0.024). Overall, the results indicate a substantial gene flow with neighbouring ethnic groups in the contemporary UAE population including South Asian, East Asian, African, and European populations. Importantly, alleles and haplotypes that have been previously associated with autoimmune diseases (e.g., Type 1 Diabetes) were also present. In this regard, this study emphasizes that an appreciation for ethnic differences can provide insights into subpopulation-specific disease-related polymorphisms, which has remained a difficult endeavour.
Collapse
Affiliation(s)
- Halima Alnaqbi
- Center for Biotechnology, Khalifa University of Science and Technology, P.O. BOX 127788, Abu Dhabi, UAE.,Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Guan K Tay
- Center for Biotechnology, Khalifa University of Science and Technology, P.O. BOX 127788, Abu Dhabi, UAE.,Division of Psychiatry, UWA Medical School, The University of Western Australia, Perth, WA, Australia.,School of Medical and Health Sciences, Edith Cowan University, Joondalup, WA, Australia
| | - Sarah El Hajj Chehadeh
- Center for Biotechnology, Khalifa University of Science and Technology, P.O. BOX 127788, Abu Dhabi, UAE.,Department of Pharmacology, College of Medicine and Health Sciences, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Habiba Alsafar
- Center for Biotechnology, Khalifa University of Science and Technology, P.O. BOX 127788, Abu Dhabi, UAE. .,Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates. .,Department of Genetics and Molecular Biology, College of Medicine and Health Sciences, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.
| |
Collapse
|
3
|
Mousa M, Vurivi H, Kannout H, Uddin M, Alkaabi N, Mahboub B, Tay GK, Alsafar HS. Genome-wide association study of hospitalized COVID-19 patients in the United Arab Emirates. EBioMedicine 2021; 74:103695. [PMID: 34775353 PMCID: PMC8587122 DOI: 10.1016/j.ebiom.2021.103695] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 10/26/2021] [Accepted: 10/27/2021] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND The heterogeneity in symptomatology and phenotypic profile attributable to COVID-19 is widely unknown. The objective of this manuscript is to conduct a trans-ancestry genome wide association study (GWAS) meta-analysis of COVID-19 severity to improve the understanding of potentially causal targets for SARS-CoV-2. METHODS This cross-sectional study recruited 646 participants in the UAE that were divided into two phenotypic groups based on the severity of COVID-19 phenotypes, hospitalized (n=482) and non-hospitalized (n=164) participants. Hospitalized participants were COVID-19 patients that developed acute respiratory distress syndrome (ARDS), pneumonia or progression to respiratory failure that required supplemental oxygen therapy or mechanical ventilation support or had severe complications such as septic shock or multi-organ failure. We conducted a trans-ancestry meta-analysis GWAS of European (n=302), American (n=102), South Asian (n=99), and East Asian (n=107) ancestry populations. We also carried out comprehensive post-GWAS analysis, including enrichment of SNP associations in tissues and cell-types, expression quantitative trait loci and differential expression analysis. FINDINGS Eight genes demonstrated a strong association signal: VWA8 gene in locus 13p14·11 (SNP rs10507497; p=9·54 x10-7), PDE8B gene in locus 5q13·3 (SNP rs7715119; p=2·19 x10-6), CTSC gene in locus 11q14·2 (rs72953026; p=2·38 x10-6), THSD7B gene in locus 2q22·1 (rs7605851; p=3·07x10-6), STK39 gene in locus 2q24·3 (rs7595310; p=4·55 x10-6), FBXO34 gene in locus 14q22·3 (rs10140801; p=8·26 x10-6), RPL6P27 gene in locus 18p11·31 (rs11659676; p=8·88 x10-6), and METTL21C gene in locus 13q33·1 (rs599976; p=8·95 x10-6). The genes are expressed in the lung, associated to tumour progression, emphysema, airway obstruction, and surface tension within the lung, as well as an association to T-cell-mediated inflammation and the production of inflammatory cytokines. INTERPRETATION We have discovered eight highly plausible genetic association with hospitalized cases in COVID-19. Further studies must be conducted on worldwide population genetics to facilitate the development of population specific therapeutics to mitigate this worldwide challenge. FUNDING This review was commissioned as part of a project to study the host cell receptors of coronaviruses funded by Khalifa University's CPRA grant (Reference number 2020-004).
Collapse
Affiliation(s)
- Mira Mousa
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates; Nuffield Department of Women's and Reproduction Health, Oxford University, Oxford, United Kingdom
| | - Hema Vurivi
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Hussein Kannout
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Maimunah Uddin
- Department of Pediatric Infectious Disease, Sheikh Khalifa Medical City, Abu Dhabi, United Arab Emirates
| | - Nawal Alkaabi
- Department of Pediatric Infectious Disease, Sheikh Khalifa Medical City, Abu Dhabi, United Arab Emirates
| | - Bassam Mahboub
- Dubai Health Authority, Rashid Hospital, Dubai, United Arab Emirates
| | - Guan K Tay
- Division of Psychiatry, Faculty of Health and Medical Sciences, the University of Western Australia, Crawley, Western Australia, Australia; School of Medical and Health Sciences, Edith Cowan University, Joondalup, Western Australia, Australia
| | - Habiba S Alsafar
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates; Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates; Department of Biomedical Engineering, College of Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates; Department of Genetics and Molecular Biology, College of Medicine and Health Sciences, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.
| |
Collapse
|
4
|
Mousa M, Al-Jefout M, Alsafar H, Becker CM, Zondervan KT, Rahmioglu N. Impact of Endometriosis in Women of Arab Ancestry on: Health-Related Quality of Life, Work Productivity, and Diagnostic Delay. Front Glob Womens Health 2021; 2:708410. [PMID: 34816238 PMCID: PMC8593935 DOI: 10.3389/fgwh.2021.708410] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 08/23/2021] [Indexed: 12/04/2022] Open
Abstract
Introduction: Endometriosis has a negative effect on health-related quality of life (HRQoL), wellbeing and daily functioning. Endometriosis is an under-researched condition within non-western populations. Cultural representations are needed to understand the relative roles of societal norms, traditional factors, and religious sensitivities on the impact of endometriosis on HRQoL in various populations. In particular, there is a lack of emphasis placed in understanding the association of HRQoL on endometriosis in Arab women. Method: In this prospective case-control study, 2,610 Arab ancestry women in the United Arab Emirates were recruited to investigate the impact of endometriosis on HRQoL, diagnostic delay, psychological co-morbidities, work productivity, and physical activity. Participants completed the following standardized, validated questionnaires: Short Form-36 version 2 questionnaire, the World Endometriosis Research Foundation EPHect minimum clinical questionnaire version, and Work Productivity and Activity Impairment questionnaire. Translations to the Arabic language, validated using the forward-backward translation method, of the questionnaires were utilized. Results: HRQoL scores were significantly impaired in women with endometriosis, as demonstrated in the Physical Composite Scores and Mental Composite Scores in the symptomatic control group (p = 0.001; p = 0.003, respectively) and the asymptomatic control group (p < 0.001; p < 0.001, respectively). Susceptibility and severity of multiple pain syndromes and infertility in women with endometriosis was the main indicator of lower HRQoL. Anxiety (p = 0.007) and depression (p = 0.005) were significantly associated with endometriosis, in comparison to symptomatic controls. The average diagnostic delay was 11.61 years, however single women experience 15.81 years of diagnosis delay, with approximately 18% (n = 15) of the single women experiencing more than a 20-year delay in diagnosis. The intensity of physical activity was not associated with endometriosis, when compared to symptomatic (p = 0.405) or asymptomatic controls (p = 0.144). Conclusion: For the first time, we provide evidence from a combined hospital, clinic, and population-based study that Arab women with endometriosis experience significant impacts on HRQoL, substantial diagnostic delay after the onset of symptoms, significant association to psychological disorders (anxiety and depression), and a negative impact on work productivity. Future research must focus on understanding the personal and culturally centered beliefs of Arab women to ensure a positive HRQoL trajectory by improving diagnosis and management strategies.
Collapse
Affiliation(s)
- Mira Mousa
- Nuffield Department of Women's and Reproductive Health, Endometriosis CaRe Centre, University of Oxford, Oxford, United Kingdom
| | - Moamar Al-Jefout
- Department of Obstetrics and Gynaecology, College of Medical and Health Sciences, United Arab Emirates University, Al Ain, United Arab Emirates
- Department of Obstetrics and Gynaecology, Mutah Medical Faculty, Mutah University, Al-Karak, Jordan
| | - Habiba Alsafar
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
- Department of Genetics and Molecular Biology, College of Medicine and Health Sciences, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Christian M. Becker
- Nuffield Department of Women's and Reproductive Health, Endometriosis CaRe Centre, University of Oxford, Oxford, United Kingdom
| | - Krina T. Zondervan
- Nuffield Department of Women's and Reproductive Health, Endometriosis CaRe Centre, University of Oxford, Oxford, United Kingdom
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Nilufer Rahmioglu
- Nuffield Department of Women's and Reproductive Health, Endometriosis CaRe Centre, University of Oxford, Oxford, United Kingdom
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
5
|
Genetic etiology of hereditary hearing loss in the Gulf Cooperation Council countries. Hum Genet 2021; 141:595-605. [PMID: 34338889 DOI: 10.1007/s00439-021-02323-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2021] [Accepted: 07/19/2021] [Indexed: 10/20/2022]
Abstract
The past 30 years have seen an exponential growth concerning the identification of genes and variants responsible for hereditary hearing loss (HL) worldwide. This has led to a huge gain in our understanding of molecular mechanisms of hearing and deafness, which improved diagnosis for populations with hereditary HL. Many communities around the world, especially in the Middle East and North Africa, have a high prevalence of consanguineous marriages. Congenital monogenic conditions, such as recessive HL, are more common in these populations due to high consanguinity rates. Many studies have shown that high rates of consanguinity, endogamy, and first cousin marriages were observed in the six countries of the Gulf Cooperation Council (GCC). The intent of this study is to investigate the etiology of HL in the GCC region. A deep literature review of genes and variants responsible for HL in this region revealed 89 recessive DNA pathogenic variants reported in 138 cases/familial cases. A total of 21 genes responsible for non-syndromic hearing loss (NSHL) and 17 genes associated with syndromic hearing loss (SHL) were reported in cases from the GCC region. Out of 156 reported affected cases, 112 showed HL only, and 44 showed HL associated with other clinical manifestations. This data suggests that in the GCC region 72% of HL forms are non-syndromic and 28% are syndromic. For individuals with NSHL, 66% of variants were detected in four genes (GJB2, OTOF, TMC1 and CDH23), with a predominance of variants located in the GJB2 gene (37.5%). However, among SHL, Usher syndrome was the more frequent as it has been observed in 41% of the reported syndromic GCC cases. Finally, our analysis showed that HL genetics testing and research in the GCC region took advantage of the next generation sequencing (NGS)-based techniques, as approximately 58% of reported variants were identified using this technology.
Collapse
|
6
|
Daw Elbait G, Henschel A, Tay GK, Al Safar HS. A Population-Specific Major Allele Reference Genome From The United Arab Emirates Population. Front Genet 2021; 12:660428. [PMID: 33968136 PMCID: PMC8102833 DOI: 10.3389/fgene.2021.660428] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 03/19/2021] [Indexed: 12/30/2022] Open
Abstract
The ethnic composition of the population of a country contributes to the uniqueness of each national DNA sequencing project and, ideally, individual reference genomes are required to reduce the confounding nature of ethnic bias. This work represents a representative Whole Genome Sequencing effort of an understudied population. Specifically, high coverage consensus sequences from 120 whole genomes and 33 whole exomes were used to construct the first ever population specific major allele reference genome for the United Arab Emirates (UAE). When this was applied and compared to the archetype hg19 reference, assembly of local Emirati genomes was reduced by ∼19% (i.e., some 1 million fewer calls). In compiling the United Arab Emirates Reference Genome (UAERG), sets of annotated 23,038,090 short (novel: 1,790,171) and 137,713 structural (novel: 8,462) variants; their allele frequencies (AFs) and distribution across the genome were identified. Population-specific genetic characteristics including loss-of-function variants, admixture, and ancestral haplogroup distribution were identified and reported here. We also detect a strong correlation between F ST and admixture components in the UAE. This baseline study was conceived to establish a high-quality reference genome and a genetic variations resource to enable the development of regional population specific initiatives and thus inform the application of population studies and precision medicine in the UAE.
Collapse
Affiliation(s)
- Gihan Daw Elbait
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Andreas Henschel
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
- Department of Electrical Engineering and Computer Science, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Guan K. Tay
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
- Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
- Division of Psychiatry, Faculty of Health and Medical Sciences, The University of Western Australia, Crawley, WA, Australia
- School of Medical and Health Sciences, Edith Cowan University, Joondalup, WA, Australia
| | - Habiba S. Al Safar
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
- Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
- Department of Genetics and Molecular Biology, College of Medicine and Health Sciences, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| |
Collapse
|