1
|
Pereira JL, de Souza CA, Neyra JEM, Leite JMRS, Cerqueira A, Mingroni-Netto RC, Soler JMP, Rogero MM, Sarti FM, Fisberg RM. Genetic Ancestry and Self-Reported "Skin Color/Race" in the Urban Admixed Population of São Paulo City, Brazil. Genes (Basel) 2024; 15:917. [PMID: 39062696 PMCID: PMC11276533 DOI: 10.3390/genes15070917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2024] [Revised: 07/08/2024] [Accepted: 07/10/2024] [Indexed: 07/28/2024] Open
Abstract
Epidemiological studies frequently classify groups based on phenotypes like self-reported skin color/race, which inaccurately represent genetic ancestry and may lead to misclassification, particularly among individuals of multiracial backgrounds. This study aimed to characterize both global and local genome-wide genetic ancestries and to assess their relationship with self-reported skin color/race in an admixed population of Sao Paulo city. We analyzed 226,346 single-nucleotide polymorphisms from 841 individuals participating in the population-based ISA-Nutrition study. Our findings confirmed the admixed nature of the population, demonstrating substantial European, significant Sub-Saharan African, and minor Native American ancestries, irrespective of skin color. A correlation was observed between global genetic ancestry and self-reported color-race, which was more evident in the extreme proportions of African and European ancestries. Individuals with higher African ancestry tended to identify as Black, those with higher European ancestry tended to identify as White, and individuals with higher Native American ancestry were more likely to self-identify as Mixed, a group with diverse ancestral compositions. However, at the individual level, this correlation was notably weak, and no deviations were observed for specific regions throughout the individual's genome. Our findings emphasize the significance of accurately defining and thoroughly analyzing race and ancestry, especially within admixed populations.
Collapse
Affiliation(s)
- Jaqueline L. Pereira
- Department of Nutrition, School of Public Health, University of São Paulo, São Paulo 01246-904, Brazil; (J.L.P.); (J.M.R.S.L.); (M.M.R.)
| | - Camila A. de Souza
- Department of Statistics, Institute of Mathematics and Statistics, University of São Paulo, São Paulo 05508-090, Brazil; (C.A.d.S.); (J.E.M.N.); (J.M.P.S.)
| | - Jennyfer E. M. Neyra
- Department of Statistics, Institute of Mathematics and Statistics, University of São Paulo, São Paulo 05508-090, Brazil; (C.A.d.S.); (J.E.M.N.); (J.M.P.S.)
| | - Jean M. R. S. Leite
- Department of Nutrition, School of Public Health, University of São Paulo, São Paulo 01246-904, Brazil; (J.L.P.); (J.M.R.S.L.); (M.M.R.)
| | - Andressa Cerqueira
- Department of Statistics, Federal University of Sao Carlos, São Carlos 13565-905, Brazil;
| | - Regina C. Mingroni-Netto
- Human Genome and Stem Cell Research Center, Department of Genetics and Evolutionary Biology, Biosciences Institute, University of São Paulo, São Paulo 05508-090, Brazil;
| | - Julia M. P. Soler
- Department of Statistics, Institute of Mathematics and Statistics, University of São Paulo, São Paulo 05508-090, Brazil; (C.A.d.S.); (J.E.M.N.); (J.M.P.S.)
| | - Marcelo M. Rogero
- Department of Nutrition, School of Public Health, University of São Paulo, São Paulo 01246-904, Brazil; (J.L.P.); (J.M.R.S.L.); (M.M.R.)
| | - Flavia M. Sarti
- School of Arts, Sciences and Humanities, University of Sao Paulo, São Paulo 03828-000, Brazil;
| | - Regina M. Fisberg
- Department of Nutrition, School of Public Health, University of São Paulo, São Paulo 01246-904, Brazil; (J.L.P.); (J.M.R.S.L.); (M.M.R.)
| |
Collapse
|
2
|
Nikpay M. Genome-wide search identified DNA methylation sites that regulate the metabolome. Front Genet 2023; 14:1093882. [PMID: 37274792 PMCID: PMC10233745 DOI: 10.3389/fgene.2023.1093882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Accepted: 05/09/2023] [Indexed: 06/07/2023] Open
Abstract
Background: Identifying DNA methylation sites that regulate the metabolome is important for several purposes. In this study, publicly available GWAS data were integrated to find methylation sites that impact metabolome through a discovery and replication scheme and by using Mendelian randomization. Results: The outcome of analyses revealed 107 methylation sites associated with 84 metabolites at the genome-wide significance level (p<5e-8) at both the discovery and replication stages. A large percentage of the observed associations (85%) were with lipids, significantly higher than expected (p = 0.0003). A number of CpG (methylation) sites showed specificity e.g., cg20133200 within PFKP was associated with glucose only and cg10760299 within GATM impacted the level of creatinine; in contrast, there were sites associated with numerous metabolites e.g., cg20102877 on the 2p23.3 region was associated with 39 metabolites. Integrating transcriptome data enabled identifying genes (N = 82) mediating the impact of methylation sites on the metabolome and cardiometabolic traits. For example, PABPC4 mediated the impact of cg15123755-HDL on type-2 diabetes. KCNK7 mediated the impact of cg21033440-lipids on hypertension. POC5, ILRUN, FDFT1, and NEIL2 mediated the impact of CpG sites on obesity through metabolic pathways. Conclusion: This study provides a catalog of DNA methylation sites that regulate the metabolome for downstream applications.
Collapse
|
3
|
De Oliveira TC, Secolin R, Lopes-Cendes I. A review of ancestrality and admixture in Latin America and the caribbean focusing on native American and African descendant populations. Front Genet 2023; 14:1091269. [PMID: 36741309 PMCID: PMC9893294 DOI: 10.3389/fgene.2023.1091269] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2022] [Accepted: 01/09/2023] [Indexed: 01/21/2023] Open
Abstract
Genomics can reveal essential features about the demographic evolution of a population that may not be apparent from historical elements. In recent years, there has been a significant increase in the number of studies applying genomic epidemiological approaches to understand the genetic structure and diversity of human populations in the context of demographic history and for implementing precision medicine. These efforts have traditionally been applied predominantly to populations of European origin. More recently, initiatives in the United States and Africa are including more diverse populations, establishing new horizons for research in human populations with African and/or Native ancestries. Still, even in the most recent projects, the under-representation of genomic data from Latin America and the Caribbean (LAC) is remarkable. In addition, because the region presents the most recent global miscegenation, genomics data from LAC may add relevant information to understand population admixture better. Admixture in LAC started during the colonial period, in the 15th century, with intense miscegenation between European settlers, mainly from Portugal and Spain, with local indigenous and sub-Saharan Africans brought through the slave trade. Since, there are descendants of formerly enslaved and Native American populations in the LAC territory; they are considered vulnerable populations because of their history and current living conditions. In this context, studying LAC Native American and African descendant populations is important for several reasons. First, studying human populations from different origins makes it possible to understand the diversity of the human genome better. Second, it also has an immediate application to these populations, such as empowering communities with the knowledge of their ancestral origins. Furthermore, because knowledge of the population genomic structure is an essential requirement for implementing genomic medicine and precision health practices, population genomics studies may ensure that these communities have access to genomic information for risk assessment, prevention, and the delivery of optimized treatment; thus, helping to reduce inequalities in the Western Hemisphere. Hoping to set the stage for future studies, we review different aspects related to genetic and genomic research in vulnerable populations from LAC countries.
Collapse
Affiliation(s)
- Thais C. De Oliveira
- Department of Translational Medicine, School of Medical Sciences, University of Campinas (UNICAMP), Campinas, Brazil
- The Brazilian Institute of Neuroscience and Neurotechnology (BRAINN), Campinas, Brazil
| | - Rodrigo Secolin
- Department of Translational Medicine, School of Medical Sciences, University of Campinas (UNICAMP), Campinas, Brazil
- The Brazilian Institute of Neuroscience and Neurotechnology (BRAINN), Campinas, Brazil
| | - Iscia Lopes-Cendes
- Department of Translational Medicine, School of Medical Sciences, University of Campinas (UNICAMP), Campinas, Brazil
- The Brazilian Institute of Neuroscience and Neurotechnology (BRAINN), Campinas, Brazil
| |
Collapse
|
4
|
Balagué-Dobón L, Cáceres A, González JR. Fully exploiting SNP arrays: a systematic review on the tools to extract underlying genomic structure. Brief Bioinform 2022; 23:bbac043. [PMID: 35211719 PMCID: PMC8921734 DOI: 10.1093/bib/bbac043] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 01/25/2022] [Accepted: 01/28/2022] [Indexed: 12/12/2022] Open
Abstract
Single nucleotide polymorphisms (SNPs) are the most abundant type of genomic variation and the most accessible to genotype in large cohorts. However, they individually explain a small proportion of phenotypic differences between individuals. Ancestry, collective SNP effects, structural variants, somatic mutations or even differences in historic recombination can potentially explain a high percentage of genomic divergence. These genetic differences can be infrequent or laborious to characterize; however, many of them leave distinctive marks on the SNPs across the genome allowing their study in large population samples. Consequently, several methods have been developed over the last decade to detect and analyze different genomic structures using SNP arrays, to complement genome-wide association studies and determine the contribution of these structures to explain the phenotypic differences between individuals. We present an up-to-date collection of available bioinformatics tools that can be used to extract relevant genomic information from SNP array data including population structure and ancestry; polygenic risk scores; identity-by-descent fragments; linkage disequilibrium; heritability and structural variants such as inversions, copy number variants, genetic mosaicisms and recombination histories. From a systematic review of recently published applications of the methods, we describe the main characteristics of R packages, command-line tools and desktop applications, both free and commercial, to help make the most of a large amount of publicly available SNP data.
Collapse
|
5
|
Franceschi VB, Caldana GD, de Menezes Mayer A, Cybis GB, Neves CAM, Ferrareze PAG, Demoliner M, de Almeida PR, Gularte JS, Hansen AW, Weber MN, Fleck JD, Zimerman RA, Kmetzsch L, Spilki FR, Thompson CE. Genomic epidemiology of SARS-CoV-2 in Esteio, Rio Grande do Sul, Brazil. BMC Genomics 2021; 22:371. [PMID: 34016042 PMCID: PMC8136996 DOI: 10.1186/s12864-021-07708-w] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 05/11/2021] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Brazil is the third country most affected by Coronavirus disease-2019 (COVID-19), but viral evolution in municipality resolution is still poorly understood in Brazil and it is crucial to understand the epidemiology of viral spread. We aimed to track molecular evolution and spread of Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in Esteio (Southern Brazil) using phylogenetics and phylodynamics inferences from 21 new genomes in global and regional context. Importantly, the case fatality rate (CFR) in Esteio (3.26%) is slightly higher compared to the Rio Grande do Sul (RS) state (2.56%) and the entire Brazil (2.74%). RESULTS We provided a comprehensive view of mutations from a representative sampling from May to October 2020, highlighting two frequent mutations in spike glycoprotein (D614G and V1176F), an emergent mutation (E484K) in spike Receptor Binding Domain (RBD) characteristic of the B.1.351 and P.1 lineages, and the adjacent replacement of 2 amino acids in Nucleocapsid phosphoprotein (R203K and G204R). E484K was found in two genomes from mid-October, which is the earliest description of this mutation in Southern Brazil. Lineages containing this substitution must be subject of intense surveillance due to its association with immune evasion. We also found two epidemiologically-related clusters, including one from patients of the same neighborhood. Phylogenetics and phylodynamics analysis demonstrates multiple introductions of the Brazilian most prevalent lineages (B.1.1.33 and B.1.1.248) and the establishment of Brazilian lineages ignited from the Southeast to other Brazilian regions. CONCLUSIONS Our data show the value of correlating clinical, epidemiological and genomic information for the understanding of viral evolution and its spatial distribution over time. This is of paramount importance to better inform policy making strategies to fight COVID-19.
Collapse
Affiliation(s)
- Vinícius Bonetti Franceschi
- Center of Biotechnology, Graduate Program in Cell and Molecular Biology (PPGBCM), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
| | - Gabriel Dickin Caldana
- Graduate Program in Health Sciences, Universidade Federal de Ciências da Saúde de Porto Alegre (UFCSPA), Porto Alegre, RS, Brazil
| | - Amanda de Menezes Mayer
- Center of Biotechnology, Graduate Program in Cell and Molecular Biology (PPGBCM), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
| | - Gabriela Bettella Cybis
- Department of Statistics, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
| | - Carla Andretta Moreira Neves
- Graduate Program in Health Sciences, Universidade Federal de Ciências da Saúde de Porto Alegre (UFCSPA), Porto Alegre, RS, Brazil
| | - Patrícia Aline Gröhs Ferrareze
- Graduate Program in Health Sciences, Universidade Federal de Ciências da Saúde de Porto Alegre (UFCSPA), Porto Alegre, RS, Brazil
| | - Meriane Demoliner
- Molecular Microbiology Laboratory, Universidade Feevale, Novo Hamburgo, RS, Brazil
| | | | | | - Alana Witt Hansen
- Molecular Microbiology Laboratory, Universidade Feevale, Novo Hamburgo, RS, Brazil
| | - Matheus Nunes Weber
- Molecular Microbiology Laboratory, Universidade Feevale, Novo Hamburgo, RS, Brazil
| | - Juliane Deise Fleck
- Molecular Microbiology Laboratory, Universidade Feevale, Novo Hamburgo, RS, Brazil
| | | | - Lívia Kmetzsch
- Center of Biotechnology, Graduate Program in Cell and Molecular Biology (PPGBCM), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
| | | | - Claudia Elizabeth Thompson
- Center of Biotechnology, Graduate Program in Cell and Molecular Biology (PPGBCM), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil. .,Graduate Program in Health Sciences, Universidade Federal de Ciências da Saúde de Porto Alegre (UFCSPA), Porto Alegre, RS, Brazil. .,Department of Pharmacosciences, Universidade Federal de Ciências da Saúde de Porto Alegre (UFCSPA), 245/200C Sarmento Leite St, Porto Alegre, RS, 90050-170, Brazil.
| |
Collapse
|