1
|
Laplana M, Lopez-Ortega R, Fibla J. Polygenic risk score comparator (PRScomp): Test population vs. worldwide populations. Int J Med Inform 2024; 183:105333. [PMID: 38184939 DOI: 10.1016/j.ijmedinf.2023.105333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 12/19/2023] [Accepted: 12/28/2023] [Indexed: 01/09/2024]
Abstract
BACKGROUND Polygenic risk scores (PRS) are a powerful tool for predicting an individual's genetic risk for complex diseases. METHODS We have developed a web service (PRScomp) as a user-friendly tool to evaluate PRS of the user own population and compare it with worldwide populations. RESULTS A disease/trait database has been constructed from GWAS Catalog summary statistics. Genotype data of test population is uploaded and merged with the reference dataset (1000 Genome Project and Human Genome Diversity Project) to obtain a file including the common SNPs. The user can select a disease/trait from the database and a curated set of risk markers is used to calculate summatory PRS. Distribution of z-scored PRS values is presented in publication-ready plots and text files that can be downloaded. DISCUSSION PRScomp can be useful for public health decision-making by identifying population-specific genetic risk factors and informing the development of targeted interventions for at-risk populations.
Collapse
Affiliation(s)
- Marina Laplana
- Departament de Ciències Mèdiques Bàsiques, Universitat de Lleida, IRBLleida, Av. Alcalde Rovira Roure, 80, 25198 Lleida, Spain.
| | - Ricard Lopez-Ortega
- Departament de Ciències Mèdiques Bàsiques, Universitat de Lleida, IRBLleida, Av. Alcalde Rovira Roure, 80, 25198 Lleida, Spain; Unitat de Citogenètica i Genètica Mèdica, Hospital Universitari Arnau de Vilanova, IRBLleida, Av. Alcalde Rovira Roure, 80, 25198 Lleida, Spain
| | - Joan Fibla
- Departament de Ciències Mèdiques Bàsiques, Universitat de Lleida, IRBLleida, Av. Alcalde Rovira Roure, 80, 25198 Lleida, Spain.
| |
Collapse
|
2
|
Naret O, Kutalik Z, Hodel F, Xu ZM, Marques-Vidal P, Fellay J. Improving polygenic prediction with genetically inferred ancestry. HGG ADVANCES 2022; 3:100109. [PMID: 35571679 PMCID: PMC9095896 DOI: 10.1016/j.xhgg.2022.100109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 04/11/2022] [Indexed: 11/21/2022] Open
Abstract
Genome-wide association studies (GWASs) have demonstrated that most common diseases have a strong genetic component from many genetic variants each with a small effect size. GWAS summary statistics have allowed the construction of polygenic scores (PGSs) estimating part of the individual risk for common diseases. Here, we propose to improve PGS-based risk estimation by incorporating genetic ancestry derived from genome-wide genotyping data. Our method involves three cohorts: a base (or discovery) for association studies, a target for phenotype/risk prediction, and a map for ancestry mapping; successively, (1) it generates for each individual in the base and target cohorts a set of principal components based on the map cohort-called mapped PCs, (2) it associates in the base cohort the phenotype with the mapped-PCs, and (3) it uses the mapped PCs in the target cohort to generate a phenotypic predictor called the ancestry score. We evaluated the ancestry score by comparing a predictive model using a PGS with one combining a PGS and an ancestry score. First, we performed simulations and found that the ancestry score has a greater impact on traits that correlate with ancestry-specific variants. Second, we showed, using UK Biobank data, that the ancestry score improves genetic prediction for our nine phenotypes to very different degrees. Third, we performed simulations and found that the more heterogeneous the base and target cohorts, the more beneficial the ancestry score is. Finally, we validated our approach under realistic conditions with UK Biobank as the base cohort and Swiss individuals from the CoLaus|PsyCoLaus study as the target cohort.
Collapse
Affiliation(s)
- Olivier Naret
- School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Zoltan Kutalik
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- University Center for Primary Care and Public Health, Lausanne, Switzerland
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
| | - Flavia Hodel
- School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Zhi Ming Xu
- School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Pedro Marques-Vidal
- Department of Medicine, Internal Medicine, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
| | - Jacques Fellay
- School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Precision Medicine Unit, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
| |
Collapse
|
3
|
Chande AT, Nagar SD, Rishishwar L, Mariño-Ramírez L, Medina-Rivas MA, Valderrama-Aguirre AE, Jordan IK, Gallo JE. The Impact of Ethnicity and Genetic Ancestry on Disease Prevalence and Risk in Colombia. Front Genet 2021; 12:690366. [PMID: 34650589 PMCID: PMC8507149 DOI: 10.3389/fgene.2021.690366] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Accepted: 08/11/2021] [Indexed: 11/13/2022] Open
Abstract
Currently, the vast majority of genomic research cohorts are made up of participants with European ancestry. Genomic medicine will only reach its full potential when genomic studies become more broadly representative of global populations. We are working to support the establishment of genomic medicine in developing countries in Latin America via studies of ethnically and ancestrally diverse Colombian populations. The goal of this study was to analyze the effect of ethnicity and genetic ancestry on observed disease prevalence and predicted disease risk in Colombia. Population distributions of Colombia's three major ethnic groups - Mestizo, Afro-Colombian, and Indigenous - were compared to disease prevalence and socioeconomic indicators. Indigenous and Mestizo ethnicity show the highest correlations with disease prevalence, whereas the effect of Afro-Colombian ethnicity is substantially lower. Mestizo ethnicity is mostly negatively correlated with six high-impact health conditions and positively correlated with seven of eight common cancers; Indigenous ethnicity shows the opposite effect. Malaria prevalence in particular is strongly correlated with ethnicity. Disease prevalence co-varies across geographic regions, consistent with the regional distribution of ethnic groups. Ethnicity is also correlated with regional variation in human development, partially explaining the observed differences in disease prevalence. Patterns of genetic ancestry and admixture for a cohort of 624 individuals from Medellín were compared to disease risk inferred via polygenic risk scores (PRS). African genetic ancestry is most strongly correlated with predicted disease risk, whereas European and Native American ancestry show weaker effects. African ancestry is mostly positively correlated with disease risk, and European ancestry is mostly negatively correlated. The relationships between ethnicity and disease prevalence do not show an overall correspondence with the relationships between ancestry and disease risk. We discuss possible reasons for the divergent health effects of ethnicity and ancestry as well as the implication of our results for the development of precision medicine in Colombia.
Collapse
Affiliation(s)
- Aroon T Chande
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, United States.,IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, GA, United States.,PanAmerican Bioinformatics Institute, Cali, Colombia
| | - Shashwat Deepali Nagar
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, United States.,IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, GA, United States.,PanAmerican Bioinformatics Institute, Cali, Colombia
| | - Lavanya Rishishwar
- IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, GA, United States.,PanAmerican Bioinformatics Institute, Cali, Colombia
| | - Leonardo Mariño-Ramírez
- PanAmerican Bioinformatics Institute, Cali, Colombia.,National Institute on Minority Health and Health Disparities, National Institutes of Health, Bethesda, MD, United States
| | - Miguel A Medina-Rivas
- Centro de Investigación en Biodiversidad y Hábitat, Universidad Tecnológica del Chocó, Quibdó, Colombia
| | - Augusto E Valderrama-Aguirre
- Biomedical Research Institute (COL0082529), Cali, Colombia.,Department of Biomedical Sciences, Universidad Santiago de Cali, Cali, Colombia.,Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
| | - I King Jordan
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, United States.,IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, GA, United States.,PanAmerican Bioinformatics Institute, Cali, Colombia
| | | |
Collapse
|
4
|
Chande AT, Rishishwar L, Ban D, Nagar SD, Conley AB, Rowell J, Valderrama-Aguirre AE, Medina-Rivas MA, Jordan IK. The Phenotypic Consequences of Genetic Divergence between Admixed Latin American Populations: Antioquia and Chocó, Colombia. Genome Biol Evol 2021; 12:1516-1527. [PMID: 32681795 PMCID: PMC7513793 DOI: 10.1093/gbe/evaa154] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/12/2020] [Indexed: 12/11/2022] Open
Abstract
Genome-wide association studies have uncovered thousands of genetic variants that are associated with a wide variety of human traits. Knowledge of how trait-associated variants are distributed within and between populations can provide insight into the genetic basis of group-specific phenotypic differences, particularly for health-related traits. We analyzed the genetic divergence levels for 1) individual trait-associated variants and 2) collections of variants that function together to encode polygenic traits, between two neighboring populations in Colombia that have distinct demographic profiles: Antioquia (Mestizo) and Chocó (Afro-Colombian). Genetic ancestry analysis showed 62% European, 32% Native American, and 6% African ancestry for Antioquia compared with 76% African, 10% European, and 14% Native American ancestry for Chocó, consistent with demography and previous results. Ancestry differences can confound cross-population comparison of polygenic risk scores (PRS); however, we did not find any systematic bias in PRS distributions for the two populations studied here, and population-specific differences in PRS were, for the most part, small and symmetrically distributed around zero. Both genetic differentiation at individual trait-associated single nucleotide polymorphisms and population-specific PRS differences between Antioquia and Chocó largely reflected anthropometric phenotypic differences that can be readily observed between the populations along with reported disease prevalence differences. Cases where population-specific differences in genetic risk did not align with observed trait (disease) prevalence point to the importance of environmental contributions to phenotypic variance, for both infectious and complex, common disease. The results reported here are distributed via a web-based platform for searching trait-associated variants and PRS divergence levels at http://map.chocogen.com (last accessed August 12, 2020).
Collapse
Affiliation(s)
- Aroon T Chande
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia.,IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, Georgia.,PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia
| | - Lavanya Rishishwar
- IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, Georgia.,PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia
| | - Dongjo Ban
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia.,PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia
| | - Shashwat D Nagar
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia.,PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia
| | - Andrew B Conley
- IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, Georgia.,PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia
| | - Jessica Rowell
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia
| | - Augusto E Valderrama-Aguirre
- PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia.,Biomedical Research Institute (COL0082529), Cali, Colombia.,Universidad Santiago de Cali, Colombia
| | - Miguel A Medina-Rivas
- PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia.,Centro de Investigación en Biodiversidad y Hábitat, Universidad Tecnológica del Chocó, Quibdó, Colombia
| | - I King Jordan
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia.,IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, Georgia.,PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia
| |
Collapse
|
5
|
Sookoian S, Pirola CJ. Precision medicine in nonalcoholic fatty liver disease: New therapeutic insights from genetics and systems biology. Clin Mol Hepatol 2020; 26:461-475. [PMID: 32906228 PMCID: PMC7641575 DOI: 10.3350/cmh.2020.0136] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Revised: 07/16/2020] [Accepted: 07/26/2020] [Indexed: 12/13/2022] Open
Abstract
Despite more than two decades of extensive research focusing on nonalcoholic fatty liver disease (NAFLD), no approved therapy for steatohepatitis-the severe histological form of the disease-presently exists. More importantly, new drugs and small molecules with diverse molecular targets on the pathways of hepatocyte injury, inflammation, and fibrosis cannot achieve the primary efficacy endpoints. Precision medicine can potentially overcome this issue, as it is founded on extensive knowledge of the druggable genome/proteome. Hence, this review summarizes significant trends and developments in precision medicine with a particular focus on new potential therapeutic discoveries modeled via systems biology approaches. In addition, we computed and simulated the potential utility of the NAFLD polygenic risk score, which could be conceptually very advantageous not only for early disease detection but also for implementing actionable measures. Incomplete knowledge of the druggable NAFLD genome severely impedes the drug discovery process and limits the likelihood of identifying robust and safe drug candidates. Thus, we close this article with some insights into emerging disciplines, such as chemical genetics, that may accelerate accurate identification of the druggable NAFLD genome/proteome.
Collapse
Affiliation(s)
- Silvia Sookoian
- Institute of Medical Research A Lanari, School of Medicine, University of Buenos Aires, Autonomous City of Buenos Aires, Argentina
- Department of Clinical and Molecular Hepatology, Institute of Medical Research (IDIM), National Scientific and Technical Research Council (CONICET)-University of Buenos Aires, Autonomous City of Buenos Aires, Argentina
| | - Carlos J. Pirola
- Institute of Medical Research A Lanari, School of Medicine, University of Buenos Aires, Autonomous City of Buenos Aires, Argentina
- Department of Molecular Genetics and Biology of Complex Diseases, Institute of Medical Research (IDIM), National Scientific and Technical Research Council (CONICET)-University of Buenos Aires, Autonomous City of Buenos Aires, Argentina
| |
Collapse
|
6
|
Chande AT, Rishishwar L, Conley AB, Valderrama-Aguirre A, Medina-Rivas MA, Jordan IK. Ancestry effects on type 2 diabetes genetic risk inference in Hispanic/Latino populations. BMC MEDICAL GENETICS 2020; 21:132. [PMID: 32580712 PMCID: PMC7315475 DOI: 10.1186/s12881-020-01068-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Accepted: 06/10/2020] [Indexed: 12/18/2022]
Abstract
Background Hispanic/Latino (HL) populations bear a disproportionately high burden of type 2 diabetes (T2D). The ability to predict T2D genetic risk using polygenic risk scores (PRS) offers great promise for improved screening and prevention. However, there are a number of complications related to the accurate inference of genetic risk across HL populations with distinct ancestry profiles. We investigated how ancestry affects the inference of T2D genetic risk using PRS in diverse HL populations from Colombia and the United States (US). In Colombia, we compared T2D genetic risk for the Mestizo population of Antioquia to the Afro-Colombian population of Chocó, and in the US, we compared European-American versus Mexican-American populations. Methods Whole genome sequences and genotypes from the 1000 Genomes Project and the ChocoGen Research Project were used for genetic ancestry inference and for T2D polygenic risk score (PRS) calculation. Continental ancestry fractions for HL genomes were inferred via comparison with African, European, and Native American reference genomes, and PRS were calculated using T2D risk variants taken from multiple genome-wide association studies (GWAS) conducted on cohorts with diverse ancestries. A correction for ancestry bias in T2D risk inference based on the frequencies of ancestral versus derived alleles was developed and applied to PRS calculations in the HL populations studied here. Results T2D genetic risk in Colombian and US HL populations is positively correlated with African and Native American ancestry and negatively correlated with European ancestry. The Afro-Colombian population of Chocó has higher predicted T2D risk than Antioquia, and the Mexican-American population has higher predicted risk than the European-American population. The inferred relative risk of T2D is robust to differences in the ancestry of the GWAS cohorts used for variant discovery. For trans-ethnic GWAS, population-specific variants and variants with same direction effects across populations yield consistent results. Nevertheless, the control for bias in T2D risk prediction confirms that explicit consideration of genetic ancestry can yield more reliable cross-population genetic risk inferences. Conclusions T2D associations that replicate across populations provide for more reliable risk inference, and modeling population-specific frequencies of ancestral and derived risk alleles can help control for biases in PRS estimation.
Collapse
Affiliation(s)
- Aroon T Chande
- School of Biological Sciences, Georgia Institute of Technology, 950 Atlantic Drive, Atlanta, GA, 30332, USA.,IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, GA, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia
| | - Lavanya Rishishwar
- IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, GA, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia
| | - Andrew B Conley
- IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, GA, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia
| | - Augusto Valderrama-Aguirre
- School of Biological Sciences, Georgia Institute of Technology, 950 Atlantic Drive, Atlanta, GA, 30332, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia.,Biomedical Research Institute (COL0082529), Cali, Colombia.,Universidad Santiago de Cali, Cali, Colombia
| | - Miguel A Medina-Rivas
- PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia.,Centro de Investigación en Biodiversidad y Hábitat, Universidad Tecnológica del Chocó, Quibdó, Chocó, Colombia
| | - I King Jordan
- School of Biological Sciences, Georgia Institute of Technology, 950 Atlantic Drive, Atlanta, GA, 30332, USA. .,IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, GA, USA. .,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia.
| |
Collapse
|
7
|
Scholkmann F, Wolf U. The Pulse-Respiration Quotient: A Powerful but Untapped Parameter for Modern Studies About Human Physiology and Pathophysiology. Front Physiol 2019; 10:371. [PMID: 31024336 PMCID: PMC6465339 DOI: 10.3389/fphys.2019.00371] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Accepted: 03/18/2019] [Indexed: 01/06/2023] Open
Abstract
A specific and unique aspect of cardiorespiratory activity can be captured by dividing the heart rate (HR) by the respiration rate (RR), giving the pulse-respiration quotient (PRQ = HR/RR). In this review article, we summarize the main findings of studies using and investigating the PRQ. We describe why the PRQ is a powerful parameter that captures complex regulatory states of the cardiorespiratory system, and we highlight the need to re-introduce the use of this parameter into modern studies about human physiology and pathophysiology. In particular, we show that the PRQ (i) changes during human development, (ii) is time-dependent (ultradian, circadian, and infradian rhythms), (iii) shows specific patterns during sleep, (iv) changes with physical activity and body posture, (v) is linked with psychophysical and cognitive activity, (vi) is sex-dependent, and (vii) is determined by the individual physiological constitution. Furthermore, we discuss the medical aspects of the PRQ in terms of applications for disease classification and monitoring. Finally, we explain why there should be a revival in the use of the PRQ for basic research about human physiology and for applications in medicine, and we give recommendations for the use of the PRQ in studies and medical applications.
Collapse
Affiliation(s)
- Felix Scholkmann
- Institute of Complementary and Integrative Medicine, University of Bern, Bern, Switzerland
| | | |
Collapse
|