1
|
Borges VM, Horimoto ARVR, Wijsman EM, Kimura L, Nunes K, Nato AQ, Mingroni-Netto RC. Genomic Exploration of Essential Hypertension in African-Brazilian Quilombo Populations: A Comprehensive Approach With Pedigree Analysis and Family-Based Association Studies. J Am Heart Assoc 2025; 14:e036193. [PMID: 40118787 DOI: 10.1161/jaha.124.036193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/23/2024] [Accepted: 01/27/2025] [Indexed: 03/23/2025]
Abstract
BACKGROUND Essential hypertension (EH) is a global health issue. Despite extensive research, much of EH heritability remains unexplained. We investigated the genetic basis of EH in African-derived individuals from partially isolated quilombo populations in Vale do Ribeira (São Paulo, Brazil). METHODS AND RESULTS Samples from 431 individuals (167 affected, 261 unaffected, 3 unknown) were genotyped using a 650 000 single-nucleotide polymorphism array. Estimated global ancestry proportions were 47% African, 36% European, and 16% Native American. We constructed 6 pedigrees using additional data from 673 individuals and created 3 nonoverlapping single-nucleotide polymorphism subpanels. We phased haplotypes and performed local ancestry analysis to account for admixture. Genome-wide linkage analysis and fine-mapping via family-based association studies were conducted, prioritizing EH-associated genes through a systematic approach involving databases like PubMed, ClinVar, and GWAS (Genome-Wide Association Studies) Catalog. Linkage analysis identified 22 regions of interest with logarithm of the odds scores ranging from 1.45 to 3.03, encompassing 2363 genes. Fine-mapping (family-based association studies) identified 60 EH-related candidate genes and 117 suggestive/significant variants. Among these, 14 genes, including PHGDH, S100A10, MFN2, and RYR2, were strongly related to hypertension harboring 29 suggestive/significant single-nucleotide polymorphisms. CONCLUSIONS Through a complementary approach combining admixture-adjusted Genome-wide linkage analysis based on Markov chain Monte Carlo methods, family-based association studies on known and imputed data, and gene prioritizing, new loci, variants, and candidate genes were identified. These findings provide targets for future research, replication in other populations, facilitate personalized treatments, and improve public health toward African-derived underrepresented populations. Limitations include restricted single-nucleotide polymorphism coverage, self-reported pedigree data, and lack of available EH genomic studies on admixed populations for independent validation, despite the performed genetic correlation analyses using summary statistics.
Collapse
Affiliation(s)
- Vinicius Magalhães Borges
- Human Genome and Stem Cells Research Center, Institute of Biosciences University of Séo Paulo São Paulo Brazil
- Department of Biomedical Sciences, Joan C. Edwards School of Medicine Marshall University Huntington WV USA
| | - Andrea R V R Horimoto
- Division of Medical Genetics, Department of Medicine University of Washington Seattle WA USA
| | - Ellen Marie Wijsman
- Division of Medical Genetics, Department of Medicine University of Washington Seattle WA USA
| | - Lilian Kimura
- Human Genome and Stem Cells Research Center, Institute of Biosciences University of Séo Paulo São Paulo Brazil
| | - Kelly Nunes
- Human Genome and Stem Cells Research Center, Institute of Biosciences University of Séo Paulo São Paulo Brazil
| | - Alejandro Q Nato
- Department of Biomedical Sciences, Joan C. Edwards School of Medicine Marshall University Huntington WV USA
| | - Regina Célia Mingroni-Netto
- Human Genome and Stem Cells Research Center, Institute of Biosciences University of Séo Paulo São Paulo Brazil
| |
Collapse
|
2
|
Israeli S, Maiers M, Louzoun Y. Graph-Based Imputation Methods and Their Applications to Single Donors and Families. Methods Mol Biol 2024; 2809:193-214. [PMID: 38907899 DOI: 10.1007/978-1-0716-3874-3_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/24/2024]
Abstract
The outcome of Hematopoietic Stem Cell (HSCT) and organ transplant is strongly affected by the matching of the HLA alleles of the donor and the recipient. However, donors and sometimes recipients are often typed at low resolution, with some alleles either missing or ambiguous. Thus, imputation methods are required to detect the most probably high-resolution HLA haplotypes consistent with a typing. Such imputation algorithms require predefined haplotype frequencies. As such, the phasing of the typing is required for both imputation and frequency generation.We have developed a new approach to HLA haplotype and genotype imputation, where first all candidate phases of a typing are explicated, and then the ambiguity within each phase is solved. This ambiguity is solved through a graph structure of all partial haplotypes and the haplotypes consistent with them.This phasing approach was used to produce an imputation algorithm (GRIMM-Graph Imputation and Matching). GRIMM was then combined with the possibility of combining information from multiple races to produce MR-GRIMM (Multi-Race GRIMM). When family information is available, the phasing of each family member can be restricted by the others. We propose GRAMM (GRaph-bAsed faMily iMputation) to phase alleles in family pedigree HLA typing data and in mother-cord blood unit pairs. Finally, we combined MR-GRIMM with an expectation-maximization (EM) algorithm to estimate haplotype frequencies sharing information between races to produce MR-GRIMME (MR-GRIMM EM).We have shown that these algorithms naturally combine information between races and family members. The accuracy of each of these algorithms is significantly better than its current parallel methods. MR-GRIMM leads to high accuracy in matching predictions. GRAMM better imputes family members than either MR-GRIMM or any existing algorithm and has practically no phasing errors. MR-GRIMME obtains a higher likelihood than existing algorithms.MR-GRIMM, MR-GRIMME, and GRAMM are available as servers or through stand-alone versions in GITHUB and PyPi, as detailed in the appropriate sections.
Collapse
Affiliation(s)
- Sapir Israeli
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| | - Martin Maiers
- Center for International Blood and Marrow Transplant Research (CIBMTR), National Marrow Donor Program/Be The Match, Minneapolis, MN, USA
| | - Yoram Louzoun
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel.
| |
Collapse
|
3
|
Ansbacher-Feldman Z, Israeli S, Maiers M, Gragert L, De Santis D, Israeli M, Louzoun Y. GRAMM: A new method for analysis of HLA in families. HLA 2023; 102:477-488. [PMID: 37102220 DOI: 10.1111/tan.15075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2022] [Revised: 03/23/2023] [Accepted: 04/05/2023] [Indexed: 04/28/2023]
Abstract
Recently, haplo-identical transplantation with multiple HLA mismatches has become a viable option for stem cell transplants. Haplotype sharing detection requires the imputation of donor and recipient. We show that even in high-resolution typing when all alleles are known, there is a 15% error rate in haplotype phasing, and even more in low-resolution typings. Similarly, in related donors, the parents' haplotypes should be imputed to determine what haplotype each child inherited. We propose graph-based family imputation (GRAMM) to phase alleles in family pedigree HLA typing data, and in mother-cord blood unit pairs. We show that GRAMM has practically no phasing errors when pedigree data are available. We apply GRAMM to simulations with different typing resolutions as well as paired cord-mother typings, and show very high phasing accuracy, and improved allele imputation accuracy. We use GRAMM to detect recombination events and show that the rate of falsely detected recombination events (false-positive rate) in simulations is very low. We then apply recombination detection to typed families to estimate the recombination rate in Israeli and Australian population datasets. The estimated recombination rate has an upper bound of 10%-20% per family (1%-4% per individual).
Collapse
Affiliation(s)
| | - Sapir Israeli
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| | - Martin Maiers
- Center for Blood and Marrow Transplant Research, Minneapolis, Minnesota, USA
- National Marrow Donor Program/Be The Match, Minneapolis, Minnesota, USA
| | - Loren Gragert
- Center for Blood and Marrow Transplant Research, Minneapolis, Minnesota, USA
- National Marrow Donor Program/Be The Match, Minneapolis, Minnesota, USA
- Department of Pathology and Laboratory Medicine, Tulane Cancer Center, Tulane University School of Medicine, New Orleans, Louisiana, USA
| | - Dianne De Santis
- Department of Clinical Immunology, PathWest, Fiona Stanley Hospital, Perth, Australia
| | - Moshe Israeli
- Tissue Typing Laboratory, Beilinson Hospital, Rabin Medical Center, Petach-Tikva, Israel
- Department of Digital Medical Technologies, Holon Institute of Technology, Holon, Israel
| | - Yoram Louzoun
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| |
Collapse
|
4
|
Baud A, McPeek S, Chen N, Hughes KA. Indirect Genetic Effects: A Cross-disciplinary Perspective on Empirical Studies. J Hered 2022; 113:1-15. [PMID: 34643239 PMCID: PMC8851665 DOI: 10.1093/jhered/esab059] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Indirect genetic effects (IGE) occur when an individual's phenotype is influenced by genetic variation in conspecifics. Opportunities for IGE are ubiquitous, and, when present, IGE have profound implications for behavioral, evolutionary, agricultural, and biomedical genetics. Despite their importance, the empirical study of IGE lags behind the development of theory. In large part, this lag can be attributed to the fact that measuring IGE, and deconvoluting them from the direct genetic effects of an individual's own genotype, is subject to many potential pitfalls. In this Perspective, we describe current challenges that empiricists across all disciplines will encounter in measuring and understanding IGE. Using ideas and examples spanning evolutionary, agricultural, and biomedical genetics, we also describe potential solutions to these challenges, focusing on opportunities provided by recent advances in genomic, monitoring, and phenotyping technologies. We hope that this cross-disciplinary assessment will advance the goal of understanding the pervasive effects of conspecific interactions in biology.
Collapse
Affiliation(s)
- Amelie Baud
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.,the Universitat Pompeu Fabra (UPF), Barcelona,Spain
| | - Sarah McPeek
- the Department of Biology, University of Virginia, Charlottesville, VA 22904, USA
| | - Nancy Chen
- the Department of Biology, University of Rochester, Rochester, NY 14627,USA
| | - Kimberly A Hughes
- the Department of Biological Science, Florida State University, Tallahassee, FL 32303,USA
| |
Collapse
|
5
|
Mdyogolo S, MacNeil MD, Neser FWC, Scholtz MM, Makgahlela ML. Assessing accuracy of genotype imputation in the Afrikaner and Brahman cattle breeds of South Africa. Trop Anim Health Prod 2022; 54:90. [PMID: 35133512 DOI: 10.1007/s11250-022-03102-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 02/01/2022] [Indexed: 11/26/2022]
Abstract
Imputation may be used to rescue genomic data from animals that would otherwise be eliminated due to a lower than desired call rate. The aim of this study was to compare the accuracy of genotype imputation for Afrikaner, Brahman, and Brangus cattle of South Africa using within- and multiple-breed reference populations. A total of 373, 309, and 101 Afrikaner, Brahman, and Brangus cattle, respectively, were genotyped using the GeneSeek Genomic Profiler 150 K panel that contained 141,746 markers. Markers with MAF ≤ 0.02 and call rates ≤ 0.95 or that deviated from Hardy Weinberg Equilibrium frequency with a probability of ≤ 0.0001 were excluded from the data as were animals with a call rate ≤ 0.90. The remaining data included 99,086 SNPs and 360 Afrikaner, 75,291 SNPs and 288 animals Brahman, and 97,897 SNPs and 99 Brangus animals. A total of 7986, 7002, and 7000 SNP from 50 Afrikaner and Brahman and 30 Brangus cattle, respectively, were masked and then imputed using BEAGLE v3 and FImpute v2. The within-breed imputation yielded accuracies ranging from 89.9 to 96.6% for the three breeds. The multiple-breed imputation yielded corresponding accuracies from 69.21 to 88.35%. The results showed that population homogeneity and numerical representation for within and across breed strategies, respectively, are crucial components for improving imputation accuracies.
Collapse
Affiliation(s)
- S Mdyogolo
- Department of Animal Breeding and Genetics, Agricultural Research Council, Irene, South Africa.
- Department of Animal, Wildlife and Grassland Sciences, University of the Free State, Bloemfontein, South Africa.
| | - M D MacNeil
- Department of Animal Breeding and Genetics, Agricultural Research Council, Irene, South Africa
- Department of Animal, Wildlife and Grassland Sciences, University of the Free State, Bloemfontein, South Africa
- Delta G, Miles City, MT, USA
| | - F W C Neser
- Department of Animal, Wildlife and Grassland Sciences, University of the Free State, Bloemfontein, South Africa
| | - M M Scholtz
- Department of Animal Breeding and Genetics, Agricultural Research Council, Irene, South Africa
- Department of Animal, Wildlife and Grassland Sciences, University of the Free State, Bloemfontein, South Africa
| | - M L Makgahlela
- Department of Animal Breeding and Genetics, Agricultural Research Council, Irene, South Africa
- Department of Animal, Wildlife and Grassland Sciences, University of the Free State, Bloemfontein, South Africa
| |
Collapse
|
6
|
Miller DB, Robison R, Piccolo SR. Toward a methodology for evaluating DNA variants in nuclear families. PLoS One 2021; 16:e0258375. [PMID: 34624066 PMCID: PMC8500447 DOI: 10.1371/journal.pone.0258375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Accepted: 09/27/2021] [Indexed: 11/22/2022] Open
Abstract
The genetic underpinnings of most pediatric-cancer cases are unknown. Population-based studies use large sample sizes but have accounted for only a small proportion of the estimated heritability of pediatric cancers. Pedigree-based studies are infeasible for most human populations. One alternative is to collect genetic data from a single nuclear family and use inheritance patterns within the family to filter candidate variants. This approach can be applied to common and rare variants, including those that are private to a given family or to an affected individual. We evaluated this approach using genetic data from three nuclear families with 5, 4, and 7 children, respectively. Only one child in each nuclear family had been diagnosed with cancer, and neither parent had been affected. Diagnoses for the affected children were benign low-grade astrocytoma, Wilms tumor (stage 2), and Burkitt's lymphoma, respectively. We used whole-genome sequencing to profile normal cells from each family member and a linked-read technology for genomic phasing. For initial variant filtering, we used global minor allele frequencies, deleteriousness scores, and functional-impact annotations. Next, we used genetic variation in the unaffected siblings as a guide to filter the remaining variants. As a way to evaluate our ability to detect variant(s) that may be relevant to disease status, the corresponding author blinded the primary author to affected status; the primary author then assigned a risk score to each child. Based on this evidence, the primary author predicted which child had been affected in each family. The primary author's prediction was correct for the child who had been diagnosed with a Wilms tumor; the child with Burkitt's lymphoma had the second-highest risk score among the seven children in that family. This study demonstrates a methodology for filtering and evaluating candidate genomic variants and genes within nuclear families that may merit further exploration.
Collapse
Affiliation(s)
- Dustin B. Miller
- Department of Biology, Brigham Young University, Provo, UT, United States of America
| | - Reid Robison
- Department of Biology, Brigham Young University, Provo, UT, United States of America
- Department of Psychiatry, University of Utah, Salt Lake City, UT, United States of America
| | - Stephen R. Piccolo
- Department of Biology, Brigham Young University, Provo, UT, United States of America
| |
Collapse
|
7
|
Galla SJ, Brown L, Couch-Lewis Ngāi Tahu Te Hapū O Ngāti Wheke Ngāti Waewae Y, Cubrinovska I, Eason D, Gooley RM, Hamilton JA, Heath JA, Hauser SS, Latch EK, Matocq MD, Richardson A, Wold JR, Hogg CJ, Santure AW, Steeves TE. The relevance of pedigrees in the conservation genomics era. Mol Ecol 2021; 31:41-54. [PMID: 34553796 PMCID: PMC9298073 DOI: 10.1111/mec.16192] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Revised: 09/12/2021] [Accepted: 09/17/2021] [Indexed: 01/21/2023]
Abstract
Over the past 50 years conservation genetics has developed a substantive toolbox to inform species management. One of the most long‐standing tools available to manage genetics—the pedigree—has been widely used to characterize diversity and maximize evolutionary potential in threatened populations. Now, with the ability to use high throughput sequencing to estimate relatedness, inbreeding, and genome‐wide functional diversity, some have asked whether it is warranted for conservation biologists to continue collecting and collating pedigrees for species management. In this perspective, we argue that pedigrees remain a relevant tool, and when combined with genomic data, create an invaluable resource for conservation genomic management. Genomic data can address pedigree pitfalls (e.g., founder relatedness, missing data, uncertainty), and in return robust pedigrees allow for more nuanced research design, including well‐informed sampling strategies and quantitative analyses (e.g., heritability, linkage) to better inform genomic inquiry. We further contend that building and maintaining pedigrees provides an opportunity to strengthen trusted relationships among conservation researchers, practitioners, Indigenous Peoples, and Local Communities.
Collapse
Affiliation(s)
- Stephanie J Galla
- Department of Biological Sciences, Boise State University, Boise, Idaho, USA.,School of Biological Sciences, University of Canterbury, Christchurch, Canterbury, New Zealand
| | - Liz Brown
- New Zealand Department of Conservation, Twizel, Canterbury, New Zealand
| | | | - Ilina Cubrinovska
- School of Biological Sciences, University of Canterbury, Christchurch, Canterbury, New Zealand
| | - Daryl Eason
- New Zealand Department of Conservation, Invercargill, Southland, New Zealand
| | - Rebecca M Gooley
- Smithsonian-Mason School of Conservation, Front Royal, Maryland, USA.,Center for Species Survival, Smithsonian Conservation Biology Institute, National Zoological Park, Washington, District of Columbia, USA
| | - Jill A Hamilton
- Department of Biological Sciences, North Dakota State University, Fargo, North Dakota, USA
| | - Julie A Heath
- Department of Biological Sciences, Boise State University, Boise, Idaho, USA
| | - Samantha S Hauser
- Department of Biological Sciences, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
| | - Emily K Latch
- Department of Biological Sciences, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
| | - Marjorie D Matocq
- Department of Natural Resources and Environmental Science, Program in Ecology, Evolution and Conservation Biology, University of Nevada Reno, Reno, Nevada, USA
| | - Anne Richardson
- The Isaac Conservation and Wildlife Trust, Christchurch, Canterbury, New Zealand
| | - Jana R Wold
- School of Biological Sciences, University of Canterbury, Christchurch, Canterbury, New Zealand
| | - Carolyn J Hogg
- School of Life and Environmental Sciences, University of Sydney, Sydney, NSW, Australia
| | - Anna W Santure
- School of Biological Sciences, University of Auckland, Auckland, Auckland, New Zealand
| | - Tammy E Steeves
- School of Biological Sciences, University of Canterbury, Christchurch, Canterbury, New Zealand
| |
Collapse
|
8
|
Delpuech E, Aliakbari A, Labrune Y, Fève K, Billon Y, Gilbert H, Riquet J. Identification of genomic regions affecting production traits in pigs divergently selected for feed efficiency. Genet Sel Evol 2021; 53:49. [PMID: 34126920 PMCID: PMC8201702 DOI: 10.1186/s12711-021-00642-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Accepted: 05/28/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Feed efficiency is a major driver of the sustainability of pig production systems. Understanding the biological mechanisms that underlie these agronomic traits is an important issue for environment questions and farms' economy. This study aimed at identifying genomic regions that affect residual feed intake (RFI) and other production traits in two pig lines divergently selected for RFI during nine generations (LRFI, low RFI; HRFI, high RFI). RESULTS We built a whole dataset of 570,447 single nucleotide polymorphisms (SNPs) in 2426 pigs with records for 24 production traits after both imputation and prediction of genotypes using pedigree information. Genome-wide association studies (GWAS) were performed including both lines (global-GWAS) or each line independently (LRFI-GWAS and HRFI-GWAS). Forty-five chromosomal regions were detected in the global-GWAS, whereas 28 and 42 regions were detected in the HRFI-GWAS and LRFI-GWAS, respectively. Among these 45 regions, only 13 were shared between at least two analyses, and only one was common between the three GWAS but it affects different traits. Among the five quantitative trait loci (QTL) detected for RFI, two were close to QTL for meat quality traits and two pinpointed novel genomic regions that harbor candidate genes involved in cell proliferation and differentiation processes of gastrointestinal tissues or in lipid metabolism-related signaling pathways. In most cases, different QTL regions were detected between the three designs, which suggests a strong impact of the dataset structure on the detection power and could be due to the changes in allelic frequencies during the establishment of lines. CONCLUSIONS In addition to efficiently detecting known and new QTL regions for feed efficiency, the combination of GWAS carried out per line or simultaneously using all individuals highlighted chromosomal regions that affect production traits and presented significant changes in allelic frequencies across generations. Further analyses are needed to estimate whether these regions correspond to traces of selection or result from genetic drift.
Collapse
Affiliation(s)
- Emilie Delpuech
- GenPhySE, Université de Toulouse, INRAE, ENVT, 31320, Castanet-Tolosan, France
| | - Amir Aliakbari
- GenPhySE, Université de Toulouse, INRAE, ENVT, 31320, Castanet-Tolosan, France
| | - Yann Labrune
- GenPhySE, Université de Toulouse, INRAE, ENVT, 31320, Castanet-Tolosan, France
| | - Katia Fève
- GenPhySE, Université de Toulouse, INRAE, ENVT, 31320, Castanet-Tolosan, France
| | | | - Hélène Gilbert
- GenPhySE, Université de Toulouse, INRAE, ENVT, 31320, Castanet-Tolosan, France
| | - Juliette Riquet
- GenPhySE, Université de Toulouse, INRAE, ENVT, 31320, Castanet-Tolosan, France.
| |
Collapse
|
9
|
Ergun U, Say B, Ergun SG, Percin FE, Inan L, Kaygisiz S, Asal PG, Yurteri B, Struchalin M, Shtokalo D, Ergun MA. Genome-wide association and whole exome sequencing studies reveal a novel candidate locus for restless legs syndrome. Eur J Med Genet 2021; 64:104186. [PMID: 33662638 DOI: 10.1016/j.ejmg.2021.104186] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 02/05/2021] [Accepted: 02/27/2021] [Indexed: 12/19/2022]
Abstract
INTRODUCTION The restless legs syndrome (RLS) is a common heritable neurologic disorder which is characterized by an irresistible desire to move and unpleasant sensations in the legs. METHODS We aim to identify new variants associated with RLS by performing genome-wide linkage and subsequent association analysis of forty member's family with history of RLS. RESULTS We found evidence of linkage for three loci 7q21.11 (HLOD = 3.02), 7q21.13-7q21.3 (HLOD = 3.02) and 7q22.3 (HLOD = 3.09). Fine-mapping of those regions in association study using exome sequencing identified SEMA3A (p-value = 8.5·10-4), PPP1R9A (p-value = 7.2·10-4), PUS7 (p-value = 8.7·10-4), CDHR3 (p-value = 7.2·10-4), HBP1 (p-value = 1.5·10-4) and COG5 (p-value = 1.5·10-4) genes with p-values below significance threshold. CONCLUSION Linkage analysis with subsequent association study of exome variants identified six new genes associated with RLS mapped on 7q21 and q22.
Collapse
Affiliation(s)
- Ufuk Ergun
- Kırıkkale University Faculty of Medicine, Department of Neurology, Kırıkkale, Turkey
| | - Bahar Say
- Kırıkkale University Faculty of Medicine, Department of Neurology, Kırıkkale, Turkey
| | - Sezen Guntekin Ergun
- Hacettepe University Faculty of Medicine, Department of Medical Biology, Anakara, Turkey
| | - Ferda Emriye Percin
- Gazi University Faculty of Medicine, Department of Medical Genetics, Ankara, Turkey
| | - Levent Inan
- Ministry of Health Ankara Research and Training Hospital Neurology and Algology Department, Ankara, Turkey
| | - Sukran Kaygisiz
- Ministry of Health Ordu University Traning and Research Hospital, Ordu, Turkey
| | - Pınar Gelener Asal
- Dr. Suat Gunsel University of Kyrenia Hospital, Kyrenia, Turkish Republic of Northern Cyprus
| | - Buket Yurteri
- Hacettepe University Faculty of Medicine, Department of Pediatric Basic Sciences, Ankara, Turkey
| | | | - Dmitry Shtokalo
- AcademGene Ltd, Russia; A.P.Ershov Institute of Informatics Systems SB RAS, Russia
| | - Mehmet Ali Ergun
- Gazi University Faculty of Medicine, Department of Medical Genetics, Ankara, Turkey.
| |
Collapse
|
10
|
Zhou Q, Tang D, Huang W, Yang Z, Zhang Y, Hamilton JP, Visser RGF, Bachem CWB, Robin Buell C, Zhang Z, Zhang C, Huang S. Haplotype-resolved genome analyses of a heterozygous diploid potato. Nat Genet 2020; 52:1018-1023. [PMID: 32989320 PMCID: PMC7527274 DOI: 10.1038/s41588-020-0699-x] [Citation(s) in RCA: 123] [Impact Index Per Article: 24.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2019] [Accepted: 08/24/2020] [Indexed: 02/07/2023]
Abstract
Potato (Solanum tuberosum L.) is the most important tuber crop worldwide. Efforts are underway to transform the crop from a clonally propagated tetraploid into a seed-propagated, inbred-line-based hybrid, but this process requires a better understanding of potato genome. Here, we report the 1.67-Gb haplotype-resolved assembly of a diploid potato, RH89-039-16, using a combination of multiple sequencing strategies, including circular consensus sequencing. Comparison of the two haplotypes revealed ~2.1% intragenomic diversity, including 22,134 predicted deleterious mutations in 10,642 annotated genes. In 20,583 pairs of allelic genes, 16.6% and 30.8% exhibited differential expression and methylation between alleles, respectively. Deleterious mutations and differentially expressed alleles were dispersed throughout both haplotypes, complicating strategies to eradicate deleterious alleles or stack beneficial alleles via meiotic recombination. This study offers a holistic view of the genome organization of a clonally propagated diploid species and provides insights into technological evolution in resolving complex genomes.
Collapse
Affiliation(s)
- Qian Zhou
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Area, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- Peng Cheng Laboratory, Shenzhen, China
| | - Dié Tang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Area, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Wu Huang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops of the Ministry of Agriculture, Sino-Dutch Joint Laboratory of Horticultural Genomics, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Zhongmin Yang
- College of Horticulture, Northwest Agriculture and Forest University, Yangling, China
| | - Yu Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Area, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - John P Hamilton
- Department of Plant Biology, Michigan State University, East Lansing, MI, USA
| | - Richard G F Visser
- Plant Breeding, Wageningen University and Research, Wageningen, the Netherlands
| | | | - C Robin Buell
- Department of Plant Biology, Michigan State University, East Lansing, MI, USA
| | - Zhonghua Zhang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops of the Ministry of Agriculture, Sino-Dutch Joint Laboratory of Horticultural Genomics, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, China
- College of Horticulture, Qingdao Agricultural University, Qingdao, China
| | - Chunzhi Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Area, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Sanwen Huang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Area, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China.
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops of the Ministry of Agriculture, Sino-Dutch Joint Laboratory of Horticultural Genomics, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, China.
| |
Collapse
|
11
|
Yuan J, Xing H, Lamy AL, Lencz T, Pe’er I. Leveraging correlations between variants in polygenic risk scores to detect heterogeneity in GWAS cohorts. PLoS Genet 2020; 16:e1009015. [PMID: 32956347 PMCID: PMC7529195 DOI: 10.1371/journal.pgen.1009015] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Revised: 10/01/2020] [Accepted: 07/29/2020] [Indexed: 11/18/2022] Open
Abstract
Evidence from both GWAS and clinical observation has suggested that certain psychiatric, metabolic, and autoimmune diseases are heterogeneous, comprising multiple subtypes with distinct genomic etiologies and Polygenic Risk Scores (PRS). However, the presence of subtypes within many phenotypes is frequently unknown. We present CLiP (Correlated Liability Predictors), a method to detect heterogeneity in single GWAS cohorts. CLiP calculates a weighted sum of correlations between SNPs contributing to a PRS on the case/control liability scale. We demonstrate mathematically and through simulation that among i.i.d. homogeneous cases generated by a liability threshold model, significant anti-correlations are expected between otherwise independent predictors due to ascertainment on the hidden liability score. In the presence of heterogeneity from distinct etiologies, confounding by covariates, or mislabeling, these correlation patterns are altered predictably. We further extend our method to two additional association study designs: CLiP-X for quantitative predictors in applications such as transcriptome-wide association, and CLiP-Y for quantitative phenotypes, where there is no clear distinction between cases and controls. Through simulations, we demonstrate that CLiP and its extensions reliably distinguish between homogeneous and heterogeneous cohorts when the PRS explains as low as 3% of variance on the liability scale and cohorts comprise 50, 000 − 100, 000 samples, an increasingly practical size for modern GWAS. We apply CLiP to heterogeneity detection in schizophrenia cohorts totaling > 50, 000 cases and controls collected by the Psychiatric Genomics Consortium. We observe significant heterogeneity in mega-analysis of the combined PGC data (p-value 8.54 × 0−4), as well as in individual cohorts meta-analyzed using Fisher’s method (p-value 0.03), based on significantly associated variants. We also apply CLiP-Y to detect heterogeneity in neuroticism in over 10, 000 individuals from the UK Biobank and detect heterogeneity with a p-value of 1.68 × 10−9. Scores were not significantly reduced when partitioning by known subclusters (“Depression” and “Worry”), suggesting that these factors are not the primary source of observed heterogeneity. Several traits, such as bipolar disease, are known to be heterogeneous and comprise distinct subtypes with unique genomic associations. For other traits such as schizophrenia, heterogeneity may be suspected, but specific subtypes are less well characterized. Furthermore, conventional mixture model-based methods to detect subtypes in genome-wide association data struggle with the high polygenicity of complex traits. We propose CLiP (Correlated Liability Predictors), a method that does not identify subtype-specific effects, but is very well-powered to detect heterogeneity of any kind within the very weak signals of GWAS. CLiP serves as a method to flag particular phenotypes for potential further study into the genomic factors driving heterogeneity, as well as a means to evaluate the transferability of polygenic risk scores. We also develop extensions of CLiP applicable to scoring heterogeneity in quantitative phenotypes and quantitative predictors such as gene expression. We apply CLiP to scoring heterogeneity in schizophrenia cohorts from the Psychiatric Genomics Consortium and neuroticism in individuals in the UK Biobank and find significant heterogeneity in both phenotypes, warranting further study.
Collapse
Affiliation(s)
- Jie Yuan
- Department of Computer Science, Columbia University, New York, United States of America
- * E-mail:
| | - Henry Xing
- Department of Computer Science, Columbia University, New York, United States of America
| | - Alexandre Louis Lamy
- Department of Computer Science, Columbia University, New York, United States of America
| | | | - Todd Lencz
- The Center for Psychiatric Neuroscience, Feinstein Institutes for Medical Research, New York, United States of America
| | - Itsik Pe’er
- Department of Computer Science, Columbia University, New York, United States of America
| |
Collapse
|
12
|
Abstract
Genotype imputation infers missing genotypes in silico using haplotype information from reference samples with genotypes from denser genotyping arrays or sequencing. This approach can confer a number of improvements on genome-wide association studies: it can improve statistical power to detect associations by reducing the number of missing genotypes; it can simplify data harmonization for meta-analyses by improving overlap of genomic variants between differently-genotyped sample sets; and it can increase the overall number and density of genomic variants available for association testing. This article reviews the general concepts behind imputation, describes imputation approaches and methods for various types of genotype data, including family-based data, and identifies web-based resources that can be used in different steps of the imputation process. For practical application, it provides a step-by-step guide to implementation of a two-step imputation process consisting of phasing of the study genotypes and the imputation of reference panel genotypes into the study haplotypes. In addition, this review describes recently developed haplotype reference panel resources and online imputation servers that are capable of remotely and securely implementing an imputation workflow on uploaded genotype array data. © 2019 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- Adam C Naj
- Department of Biostatistics, Epidemiology, and Informatics and Center for Clinical Epidemiology and Biostatistics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania.,Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| |
Collapse
|
13
|
Abney M, ElSherbiny A. Kinpute: using identity by descent to improve genotype imputation. Bioinformatics 2019; 35:4321-4326. [PMID: 30918937 PMCID: PMC6821425 DOI: 10.1093/bioinformatics/btz221] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2018] [Revised: 02/21/2019] [Accepted: 03/26/2019] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Genotype imputation, though generally accurate, often results in many genotypes being poorly imputed, particularly in studies where the individuals are not well represented by standard reference panels. When individuals in the study share regions of the genome identical by descent (IBD), it is possible to use this information in combination with a study-specific reference panel (SSRP) to improve the imputation results. Kinpute uses IBD information-due to recent, familial relatedness or distant, unknown ancestors-in conjunction with the output from linkage disequilibrium (LD) based imputation methods to compute more accurate genotype probabilities. Kinpute uses a novel method for IBD imputation, which works even in the absence of a pedigree, and results in substantially improved imputation quality. RESULTS Given initial estimates of average IBD between subjects in the study sample, Kinpute uses a novel algorithm to select an optimal set of individuals to sequence and use as an SSRP. Kinpute is designed to use as input both this SSRP and the genotype probabilities output from other LD-based imputation software, and uses a new method to combine the LD imputed genotype probabilities with IBD configurations to substantially improve imputation. We tested Kinpute on a human population isolate where 98 individuals have been sequenced. In half of this sample, whose sequence data was masked, we used Impute2 to perform LD-based imputation and Kinpute was used to obtain higher accuracy genotype probabilities. Measures of imputation accuracy improved significantly, particularly for those genotypes that Impute2 imputed with low certainty. AVAILABILITY AND IMPLEMENTATION Kinpute is an open-source and freely available C++ software package that can be downloaded from. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mark Abney
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Aisha ElSherbiny
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| |
Collapse
|