1
|
Khvorykh GV, Sapozhnikov NA, Limborska SA, Khrunin AV. Evaluation of Density-Based Spatial Clustering for Identifying Genomic Loci Associated with Ischemic Stroke in Genome-Wide Data. Int J Mol Sci 2023; 24:15355. [PMID: 37895035 PMCID: PMC10607504 DOI: 10.3390/ijms242015355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Revised: 09/19/2023] [Accepted: 09/28/2023] [Indexed: 10/29/2023] Open
Abstract
The genetic architecture of ischemic stroke (IS), which is one of the leading causes of death worldwide, is complex and underexplored. The traditional approach for associative gene mapping is genome-wide association studies (GWASs), testing individual single-nucleotide polymorphisms (SNPs) across the genomes of case and control groups. The purpose of this research is to develop an alternative approach in which groups of SNPs are examined rather than individual ones. We proposed, validated and applied to real data a new workflow consisting of three key stages: grouping SNPs in clusters, inferring the haplotypes in the clusters and testing haplotypes for the association with phenotype. To group SNPs, we applied the clustering algorithms DBSCAN and HDBSCAN to linkage disequilibrium (LD) matrices, representing pairwise r2 values between all genotyped SNPs. These clustering algorithms have never before been applied to genotype data as part of the workflow of associative studies. In total, 883,908 SNPs and insertion/deletion polymorphisms from people of European ancestry (4929 cases and 652 controls) were processed. The subsequent testing for frequencies of haplotypes restored in the clusters of SNPs revealed dozens of genes associated with IS and suggested the complex role that protocadherin molecules play in IS. The developed workflow was validated with the use of a simulated dataset of similar ancestry and the same sample sizes. The results of classic GWASs are also provided and discussed. The considered clustering algorithms can be applied to genotypic data to identify the genomic loci associated with different qualitative traits, using the workflow presented in this research.
Collapse
Affiliation(s)
| | | | | | - Andrey V. Khrunin
- National Research Centre “Kurchatov Institute”, Kurchatov Sq. 2, Moscow 123182, Russia; (G.V.K.); (N.A.S.); (S.A.L.)
| |
Collapse
|
2
|
Rijlaarsdam J, Cosin-Tomas M, Schellhas L, Abrishamcar S, Malmberg A, Neumann A, Felix JF, Sunyer J, Gutzkow KB, Grazuleviciene R, Wright J, Kampouri M, Zar HJ, Stein DJ, Heinonen K, Räikkönen K, Lahti J, Hüls A, Caramaschi D, Alemany S, Cecil CAM. DNA methylation and general psychopathology in childhood: an epigenome-wide meta-analysis from the PACE consortium. Mol Psychiatry 2023; 28:1128-1136. [PMID: 36385171 PMCID: PMC7614743 DOI: 10.1038/s41380-022-01871-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 10/25/2022] [Accepted: 11/02/2022] [Indexed: 11/17/2022]
Abstract
The general psychopathology factor (GPF) has been proposed as a way to capture variance shared between psychiatric symptoms. Despite a growing body of evidence showing both genetic and environmental influences on GPF, the biological mechanisms underlying these influences remain unclear. In the current study, we conducted epigenome-wide meta-analyses to identify both probe- and region-level associations of DNA methylation (DNAm) with school-age general psychopathology in six cohorts from the Pregnancy And Childhood Epigenetics (PACE) Consortium. DNAm was examined both at birth (cord blood; prospective analysis) and during school-age (peripheral whole blood; cross-sectional analysis) in total samples of N = 2178 and N = 2190, respectively. At school-age, we identified one probe (cg11945228) located in the Bromodomain-containing protein 2 gene (BRD2) that negatively associated with GPF (p = 8.58 × 10-8). We also identified a significant differentially methylated region (DMR) at school-age (p = 1.63 × 10-8), implicating the SHC Adaptor Protein 4 (SHC4) gene and the EP300-interacting inhibitor of differentiation 1 (EID1) gene that have been previously implicated in multiple types of psychiatric disorders in adulthood, including obsessive compulsive disorder, schizophrenia, and major depressive disorder. In contrast, no prospective associations were identified with DNAm at birth. Taken together, results of this study revealed some evidence of an association between DNAm at school-age and GPF. Future research with larger samples is needed to further assess DNAm variation associated with GPF.
Collapse
Affiliation(s)
- Jolien Rijlaarsdam
- Department of Child and Adolescent Psychiatry/ Psychology, Erasmus MC University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Marta Cosin-Tomas
- ISGlobal, Barcelona Institute for Global Health, Barcelona, Spain.
- Universitat Pompeu Fabra, Barcelona, Spain.
- Centro de investigación biomédica en red en epidemiología y salud pública (ciberesp), Madrid, Spain.
| | - Laura Schellhas
- School of Psychological Science, MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Institute for Sex Research, Sexual Medicine and Forensic Psychiatry, University Medical Center Hamburg, Eppendorf, Germany
| | - Sarina Abrishamcar
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA, USA
| | - Anni Malmberg
- Department of Psychology & Logopedics, University of Helsinki, Helsinki, Finland
| | | | - Janine F Felix
- The Generation R Study Group, Erasmus MC University Medical Center Rotterdam, Rotterdam, The Netherlands
- Department of Pediatrics, Erasmus MC University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Jordi Sunyer
- ISGlobal, Barcelona Institute for Global Health, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
- Centro de investigación biomédica en red en epidemiología y salud pública (ciberesp), Madrid, Spain
| | - Kristine B Gutzkow
- Division of Climate and Environmental Health, Norwegian Institute of Public Health (NIPH), Oslo, Norway
| | - Regina Grazuleviciene
- Department of Environmental Science, Vytautas Magnus University, 44248, Kaunas, Lithuania
| | - John Wright
- Bradford Institute for Health Research, Bradford Teaching Hospitals NHS Foundation Trust, Bradford, UK
| | - Mariza Kampouri
- Department of Social Medicine, University of Crete, Crete, Greece
| | - Heather J Zar
- Department of Paediatrics and Child Health, Red Cross War Memorial Children's Hospital, University of Cape Town, Cape Town, South Africa
- South African Medical Research Council (SAMRC) Unit on Child and Adolescent Health, University of Cape Town, Cape Town, South Africa
| | - Dan J Stein
- Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa
- South African Medical Research Council (SAMRC) Unit on Risk and Resilience in Mental Disorders, Neuroscience Institute, University of Cape Town, Cape Town, South Africa
| | - Kati Heinonen
- Department of Psychology & Logopedics, University of Helsinki, Helsinki, Finland
- Psychology/ Welfare Sciences, Faculty of Social Sciences, Tampere University, Tampere, Finland
| | - Katri Räikkönen
- Department of Psychology & Logopedics, University of Helsinki, Helsinki, Finland
| | - Jari Lahti
- Department of Psychology & Logopedics, University of Helsinki, Helsinki, Finland
| | - Anke Hüls
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA, USA
- Gangarosa Department of Environmental Health, Rollins School of Public Health, Emory University, Atlanta, GA, USA
| | - Doretta Caramaschi
- Medical Research Council Integrative Epidemiology Unit, Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK
- Department of Psychology, , University of Exeter, Exeter, UK
| | - Silvia Alemany
- ISGlobal, Barcelona Institute for Global Health, Barcelona, Spain
- Psychiatric Genetics Unit, Group of Psychiatry, Mental Health and Addiction, Vall d'Hebron Research Institute (VHIR), Universitat Autònoma de Barcelona, Barcelona, Spain
- Biomedical Network Research Centre on Mental Health (CIBERSAM), Instituto de Salud Carlos III, Barcelona, Spain
| | - Charlotte A M Cecil
- Department of Child and Adolescent Psychiatry/ Psychology, Erasmus MC University Medical Center Rotterdam, Rotterdam, The Netherlands.
- Department of Epidemiology, Erasmus MC University Medical Center Rotterdam, Rotterdam, The Netherlands.
- Molecular Epidemiology, Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands.
| |
Collapse
|
3
|
Gomez-Raya L, Rauw WM. Failing the four-gamete test enables exact phasing: the Corners’ Algorithm. GENETICS SELECTION EVOLUTION 2022; 54:74. [PMCID: PMC9661815 DOI: 10.1186/s12711-022-00763-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 10/24/2022] [Indexed: 11/16/2022]
Abstract
Abstract
Background
Failing the four-gamete test for two polymorphic DNA markers is an indication that two or three rather than four haplotypes segregate in the population. The objective of this paper is to show that when just three haplotypes are segregating, all three haplotypes can be fully and unambiguously phase-resolved.
Theory and methods
The Corners’ Algorithm tests the four corners in a 3 × 3 table of two-locus genotypes. If one of the four corners is filled with zeroes, then the missing haplotype is identified and the phases of all three haplotypes can be unambiguously resolved for all individuals. Three applications of this method are proposed when the four-gamete test fails: (1) direct estimation of linkage disequilibrium (LD), (2) haplotype-based genome-wide association studies (GWAS) of three haplotypes (single-marker GWAS tests for two out of three haplotypes only), and (3) haplotyping of chromosomal regions that are comprised of pairs of single nucleotide polymorphisms (SNPs) that consist of just three haplotypes. An example based on 435 sows with performance records for total number of piglets born is used to illustrate the methods.
Results
Of 20,339 SNPs, approximately 50% of the pairs of flanking SNPs failed the four-gamete test. For those, the expectation maximization (EM) algorithm gave the same results. The average of the absolute value of the difference in r2 between flanking SNPs across the genome between the two methods was 0.00082. Single-marker GWAS (using two of three haplotypes) detected significant associations for total number of piglets born on chromosomes 1, 2, 6, 9, 10, 12, 13, 14, 15, and 18. Haplotype-based GWAS using the third haplotype resolved with the Corners’ Algorithm detected additional significant associations for total number of piglets born on chromosomes 2, 5, 10, 13, 14, 15, and 18. Estimated substitution effects ranged from 0.40 to 1.35 piglets. Haplotyping of chromosomal regions that failed the four-gamete test for any pair of SNPs covered 961 Mb out of the 2249 Mb by the SNP array.
Conclusions
The Corner’s Algorithm allows to fully phase haplotypes when the four-gamete test fails. Longer haplotypes in chromosomal regions in which the four-gamete test fails for any pair of SNPs can be used as a multi-allelic marker with increased polymorphism information content.
Collapse
|
4
|
A machine learning-based SNP-set analysis approach for identifying disease-associated susceptibility loci. Sci Rep 2022; 12:15817. [PMID: 36138111 PMCID: PMC9499949 DOI: 10.1038/s41598-022-19708-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 09/02/2022] [Indexed: 11/17/2022] Open
Abstract
Identifying disease-associated susceptibility loci is one of the most pressing and crucial challenges in modeling complex diseases. Existing approaches to biomarker discovery are subject to several limitations including underpowered detection, neglect for variant interactions, and restrictive dependence on prior biological knowledge. Addressing these challenges necessitates more ingenious ways of approaching the “missing heritability” problem. This study aims to discover disease-associated susceptibility loci by augmenting previous genome-wide association study (GWAS) using the integration of random forest and cluster analysis. The proposed integrated framework is applied to a hepatitis B virus surface antigen (HBsAg) seroclearance GWAS data. Multiple cluster analyses were performed on (1) single nucleotide polymorphisms (SNPs) considered significant by GWAS and (2) SNPs with the highest feature importance scores obtained using random forest. The resulting SNP-sets from the cluster analyses were subsequently tested for trait-association. Three susceptibility loci possibly associated with HBsAg seroclearance were identified: (1) SNP rs2399971, (2) gene LINC00578, and (3) locus 11p15. SNP rs2399971 is a biomarker reported in the literature to be significantly associated with HBsAg seroclearance in patients who had received antiviral treatment. The latter two loci are linked with diseases influenced by the presence of hepatitis B virus infection. These findings demonstrate the potential of the proposed integrated framework in identifying disease-associated susceptibility loci. With further validation, results herein could aid in better understanding complex disease etiologies and provide inputs for a more advanced disease risk assessment for patients.
Collapse
|
5
|
McCartney DL, Hillary RF, Conole ELS, Banos DT, Gadd DA, Walker RM, Nangle C, Flaig R, Campbell A, Murray AD, Maniega SM, Valdés-Hernández MDC, Harris MA, Bastin ME, Wardlaw JM, Harris SE, Porteous DJ, Tucker-Drob EM, McIntosh AM, Evans KL, Deary IJ, Cox SR, Robinson MR, Marioni RE. Blood-based epigenome-wide analyses of cognitive abilities. Genome Biol 2022; 23:26. [PMID: 35039062 PMCID: PMC8762878 DOI: 10.1186/s13059-021-02596-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Accepted: 12/29/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Blood-based markers of cognitive functioning might provide an accessible way to track neurodegeneration years prior to clinical manifestation of cognitive impairment and dementia. RESULTS Using blood-based epigenome-wide analyses of general cognitive function, we show that individual differences in DNA methylation (DNAm) explain 35.0% of the variance in general cognitive function (g). A DNAm predictor explains ~4% of the variance, independently of a polygenic score, in two external cohorts. It also associates with circulating levels of neurology- and inflammation-related proteins, global brain imaging metrics, and regional cortical volumes. CONCLUSIONS As sample sizes increase, the ability to assess cognitive function from DNAm data may be informative in settings where cognitive testing is unreliable or unavailable.
Collapse
Affiliation(s)
- Daniel L. McCartney
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU UK
| | - Robert F. Hillary
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU UK
| | - Eleanor L. S. Conole
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ UK
- Centre for Clinical Brain Sciences, UK Dementia Research Institute at the University of Edinburgh, Chancellor’s Building, 49 Little France Crescent, Edinburgh BioQuarter, Edinburgh, EH16 4SB UK
| | - Daniel Trejo Banos
- Department of Quantitative Biomedicine, University of Zurich, Zurich, Switzerland
- Biomedical Informatics, University Hospital of Zurich, Zurich, Switzerland
| | - Danni A. Gadd
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU UK
| | - Rosie M. Walker
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU UK
- Centre for Clinical Brain Sciences, UK Dementia Research Institute at the University of Edinburgh, Chancellor’s Building, 49 Little France Crescent, Edinburgh BioQuarter, Edinburgh, EH16 4SB UK
| | - Cliff Nangle
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU UK
| | - Robin Flaig
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU UK
| | - Archie Campbell
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU UK
| | - Alison D. Murray
- Aberdeen Biomedical Imaging Centre, University of Aberdeen, Aberdeen, Scotland, UK
| | - Susana Muñoz Maniega
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ UK
- Centre for Clinical Brain Sciences, UK Dementia Research Institute at the University of Edinburgh, Chancellor’s Building, 49 Little France Crescent, Edinburgh BioQuarter, Edinburgh, EH16 4SB UK
| | - María del C. Valdés-Hernández
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ UK
- Centre for Clinical Brain Sciences, UK Dementia Research Institute at the University of Edinburgh, Chancellor’s Building, 49 Little France Crescent, Edinburgh BioQuarter, Edinburgh, EH16 4SB UK
| | - Mathew A. Harris
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ UK
| | - Mark E. Bastin
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ UK
- Centre for Clinical Brain Sciences, UK Dementia Research Institute at the University of Edinburgh, Chancellor’s Building, 49 Little France Crescent, Edinburgh BioQuarter, Edinburgh, EH16 4SB UK
| | - Joanna M. Wardlaw
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ UK
- Centre for Clinical Brain Sciences, UK Dementia Research Institute at the University of Edinburgh, Chancellor’s Building, 49 Little France Crescent, Edinburgh BioQuarter, Edinburgh, EH16 4SB UK
| | - Sarah E. Harris
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ UK
| | - David J. Porteous
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU UK
| | - Elliot M. Tucker-Drob
- Department of Psychology, University of Texas, Austin, TX USA
- Population Research Center and Center on Aging and Population Sciences, University of Texas, Austin, TX USA
| | - Andrew M. McIntosh
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU UK
- Division of Psychiatry, Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, UK
| | - Kathryn L. Evans
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU UK
| | - Ian J. Deary
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ UK
| | - Simon R. Cox
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ UK
| | | | - Riccardo E. Marioni
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU UK
| |
Collapse
|
6
|
Kasyanov E, Rakitko A, Rukavishnikov G, Golimbet V, Shmukler A, Iliinsky V, Neznanov N, Kibitov A, Mazo G. Contemporary GWAS studies of depression: the critical role of phenotyping. Zh Nevrol Psikhiatr Im S S Korsakova 2022; 122:50-61. [DOI: 10.17116/jnevro202212201150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
7
|
Matallana-Ramirez LP, Whetten RW, Sanchez GM, Payn KG. Breeding for Climate Change Resilience: A Case Study of Loblolly Pine ( Pinus taeda L.) in North America. FRONTIERS IN PLANT SCIENCE 2021; 12:606908. [PMID: 33995428 PMCID: PMC8119900 DOI: 10.3389/fpls.2021.606908] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Accepted: 04/08/2021] [Indexed: 05/25/2023]
Abstract
Earth's atmosphere is warming and the effects of climate change are becoming evident. A key observation is that both the average levels and the variability of temperature and precipitation are changing. Information and data from new technologies are developing in parallel to provide multidisciplinary opportunities to address and overcome the consequences of these changes in forest ecosystems. Changes in temperature and water availability impose multidimensional environmental constraints that trigger changes from the molecular to the forest stand level. These can represent a threat for the normal development of the tree from early seedling recruitment to adulthood both through direct mortality, and by increasing susceptibility to pathogens, insect attack, and fire damage. This review summarizes the strengths and shortcomings of previous work in the areas of genetic variation related to cold and drought stress in forest species with particular emphasis on loblolly pine (Pinus taeda L.), the most-planted tree species in North America. We describe and discuss the implementation of management and breeding strategies to increase resilience and adaptation, and discuss how new technologies in the areas of engineering and genomics are shaping the future of phenotype-genotype studies. Lessons learned from the study of species important in intensively-managed forest ecosystems may also prove to be of value in helping less-intensively managed forest ecosystems adapt to climate change, thereby increasing the sustainability and resilience of forestlands for the future.
Collapse
Affiliation(s)
- Lilian P. Matallana-Ramirez
- Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, Raleigh, NC, United States
| | - Ross W. Whetten
- Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, Raleigh, NC, United States
| | - Georgina M. Sanchez
- Center for Geospatial Analytics, North Carolina State University, Raleigh, Raleigh, NC, United States
| | - Kitt G. Payn
- Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, Raleigh, NC, United States
| |
Collapse
|
8
|
Genome-wide haplotype association study in imaging genetics using whole-brain sulcal openings of 16,304 UK Biobank subjects. Eur J Hum Genet 2021; 29:1424-1437. [PMID: 33664500 PMCID: PMC8440755 DOI: 10.1038/s41431-021-00827-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2020] [Revised: 12/18/2020] [Accepted: 02/04/2021] [Indexed: 11/29/2022] Open
Abstract
Neuroimaging-genetics cohorts gather two types of data: brain imaging and genetic data. They allow the discovery of associations between genetic variants and brain imaging features. They are invaluable resources to study the influence of genetics and environment in the brain features variance observed in normal and pathological populations. This study presents a genome-wide haplotype analysis for 123 brain sulcus opening value (a measure of sulcal width) across the whole brain that include 16,304 subjects from UK Biobank. Using genetic maps, we defined 119,548 blocks of low recombination rate distributed along the 22 autosomal chromosomes and analyzed 1,051,316 haplotypes. To test associations between haplotypes and complex traits, we designed three statistical approaches. Two of them use a model that includes all the haplotypes for a single block, while the last approach considers each haplotype independently. All the statistics produced were assessed as rigorously as possible. Thanks to the rich imaging dataset at hand, we used resampling techniques to assess False Positive Rate for each statistical approach in a genome-wide and brain-wide context. The results on real data show that genome-wide haplotype analyses are more sensitive than single-SNP approach and account for local complex Linkage Disequilibrium (LD) structure, which makes genome-wide haplotype analysis an interesting and statistically sound alternative to the single-SNP counterpart.
Collapse
|
9
|
Qu L, Shen MM, Dou TC, Ma M, Lu J, Wang XG, Guo J, Hu YP, Li YF, Wang KH. Genome-wide association studies for mottled eggs in chickens using a high-density single-nucleotide polymorphism array. Animal 2020; 15:100051. [PMID: 33516007 DOI: 10.1016/j.animal.2020.100051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2020] [Revised: 08/12/2020] [Accepted: 08/13/2020] [Indexed: 10/22/2022] Open
Abstract
Mottled eggs in layer chickens are gaining increasing attention because of the economic impact on the egg industry caused by the reduced sale value of commodity eggs. However, the genetic architecture underlying mottled eggs is not well understood. The genetic architecture underlying the mottled egg trait was investigated using genome-wide association studies (GWAS) by high-density arrays, using a total of 407 pink eggs and 799 blue eggs from an F2 resource population generated by crossing Dongxiang Blue-shelled and White Leghorn chickens. The mottled egg score in blue eggs was found to be higher than that in pink eggs. The single-nucleotide polymorphism heritability of mottled egg at laying day and storage for 7 days was 0.18 and 0.20, respectively. Bivariate GWAS provided 29 significant loci, mainly located on GGA2, GGA3, GGA8, GGA10, GGA15, GGA17, and GGA23, affecting mottled egg on laying day. Candidate genes RIMS2, SLC25A32, RIMBP2, VPS13B, and RGS3 were obtained for mottled eggshell by bivariate GWAS and gene annotation. Our findings provide new insights into the genetic architecture of mottled egg in hens, and demonstrate that a genomic selection method would be profitable for breeding out the mottled egg trait.
Collapse
Affiliation(s)
- L Qu
- Jiangsu Institute of Poultry Science, Chinese Academy of Agricultural Sciences, 225125 Yangzhou, Jiangsu, China
| | - M M Shen
- Jiangsu Institute of Poultry Science, Chinese Academy of Agricultural Sciences, 225125 Yangzhou, Jiangsu, China; College of Biotechnology, Jiangsu University of Science and Technology, 212003 Zhenjiang, Jiangsu, China
| | - T C Dou
- Jiangsu Institute of Poultry Science, Chinese Academy of Agricultural Sciences, 225125 Yangzhou, Jiangsu, China
| | - M Ma
- Jiangsu Institute of Poultry Science, Chinese Academy of Agricultural Sciences, 225125 Yangzhou, Jiangsu, China
| | - J Lu
- Jiangsu Institute of Poultry Science, Chinese Academy of Agricultural Sciences, 225125 Yangzhou, Jiangsu, China
| | - X G Wang
- Jiangsu Institute of Poultry Science, Chinese Academy of Agricultural Sciences, 225125 Yangzhou, Jiangsu, China
| | - J Guo
- Jiangsu Institute of Poultry Science, Chinese Academy of Agricultural Sciences, 225125 Yangzhou, Jiangsu, China
| | - Y P Hu
- Jiangsu Institute of Poultry Science, Chinese Academy of Agricultural Sciences, 225125 Yangzhou, Jiangsu, China
| | - Y F Li
- Jiangsu Institute of Poultry Science, Chinese Academy of Agricultural Sciences, 225125 Yangzhou, Jiangsu, China
| | - K H Wang
- Jiangsu Institute of Poultry Science, Chinese Academy of Agricultural Sciences, 225125 Yangzhou, Jiangsu, China.
| |
Collapse
|
10
|
Zhao J, Hua S, Wang W, Fan W, Tang W, Zhang Y, Zhang C. Identification of TNFA influencing MDD risk and clinical features in Han Chinese. Cytokine 2020; 129:155030. [DOI: 10.1016/j.cyto.2020.155030] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Revised: 01/22/2020] [Accepted: 02/05/2020] [Indexed: 01/07/2023]
|
11
|
Zhang H, Shen LY, Xu ZC, Kramer LM, Yu JQ, Zhang XY, Na W, Yang LL, Cao ZP, Luan P, Reecy JM, Li H. Haplotype-based genome-wide association studies for carcass and growth traits in chicken. Poult Sci 2020; 99:2349-2361. [PMID: 32359570 PMCID: PMC7597553 DOI: 10.1016/j.psj.2020.01.009] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Revised: 01/20/2020] [Accepted: 01/20/2020] [Indexed: 12/15/2022] Open
Abstract
There have been several genome-wide association study (GWAS) reported for carcass, growth, and meat traits in chickens. Most of these studies have been based on single SNPs GWAS. In contrast, haplotype-based GWAS reports have been limited. In the present study, 2 Northeast Agricultural University broiler lines divergently selected for abdominal fat content (NEAUHLF) and genotyped with the chicken 60K SNP chip were used to perform a haplotype-based GWAS. The lean and fat chicken lines were selected for abdominal fat content for 11 yr. Abdominal fat weight was significantly different between the 2 lines; however, there was no difference for body weight between the lean and fat lines. A total of 132 haplotype windows were significantly associated with abdominal fat weight. These significantly associated haplotype windows were primarily located on chromosomes 2, 4, 8, 10, and 26. Seven candidate genes, including SHH, LMBR1, FGF7, IL16, PLIN1, IGF1R, and SLC16A1, were located within these associated regions. These genes may play important roles in the control of abdominal fat content. Two regions on chromosomes 3 and 10 were significantly associated with testis weight. These 2 regions were previously detected by the single SNP GWAS using this same resource population. TCF21 on chromosome 3 was identified as a potentially important candidate gene for testis growth and development based on gene expression analysis and the reported function of this gene. TCF12, which was previously detected in our SNP by SNP interaction analysis, was located in a region on chromosome 10 that was significantly associated with testis weight. Six candidate genes, including TNFRSF1B, PLOD1, NPPC, MTHFR, EPHB2, and SLC35A3, on chromosome 21 may play important roles in bone development based on the known function of these genes. In addition, several regions were significantly associated with other carcass and growth traits, but no candidate genes were identified. The results of the present study may be helpful in understanding the genetic mechanisms of carcass and growth traits in chickens.
Collapse
Affiliation(s)
- Hui Zhang
- Key Laboratory of Chicken Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, PR China
| | - Lin-Yong Shen
- Key Laboratory of Chicken Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, PR China
| | - Zi-Chun Xu
- Key Laboratory of Chicken Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, PR China
| | - Luke M Kramer
- Department of Animal Science, Iowa State University, Ames, IA 50011, USA
| | - Jia-Qiang Yu
- Key Laboratory of Chicken Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, PR China
| | - Xin-Yang Zhang
- Key Laboratory of Chicken Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, PR China
| | - Wei Na
- Key Laboratory of Chicken Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, PR China
| | - Li-Li Yang
- Key Laboratory of Chicken Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, PR China
| | - Zhi-Ping Cao
- Key Laboratory of Chicken Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, PR China
| | - Peng Luan
- Key Laboratory of Chicken Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, PR China
| | - James M Reecy
- Department of Animal Science, Iowa State University, Ames, IA 50011, USA.
| | - Hui Li
- Key Laboratory of Chicken Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Genetics, Breeding and Reproduction, Education Department of Heilongjiang Province, College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, PR China.
| |
Collapse
|
12
|
Maul S, Giegling I, Fabbri C, Corponi F, Serretti A, Rujescu D. Genetics of resilience: Implications from genome-wide association studies and candidate genes of the stress response system in posttraumatic stress disorder and depression. Am J Med Genet B Neuropsychiatr Genet 2020; 183:77-94. [PMID: 31583809 DOI: 10.1002/ajmg.b.32763] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/09/2019] [Revised: 07/22/2019] [Accepted: 09/03/2019] [Indexed: 12/28/2022]
Abstract
Resilience is the ability to cope with critical situations through the use of personal and socially mediated resources. Since a lack of resilience increases the risk of developing stress-related psychiatric disorders such as posttraumatic stress disorder (PTSD) and major depressive disorder (MDD), a better understanding of the biological background is of great value to provide better prevention and treatment options. Resilience is undeniably influenced by genetic factors, but very little is known about the exact underlying mechanisms. A recently published genome-wide association study (GWAS) on resilience has identified three new susceptibility loci, DCLK2, KLHL36, and SLC15A5. Further interesting results can be found in association analyses of gene variants of the stress response system, which is closely related to resilience, and PTSD and MDD. Several promising genes, such as the COMT (catechol-O-methyltransferase) gene, the serotonin transporter gene (SLC6A4), and neuropeptide Y (NPY) suggest gene × environment interaction between genetic variants, childhood adversity, and the occurrence of PTSD and MDD, indicating an impact of these genes on resilience. GWAS on PTSD and MDD provide another approach to identifying new disease-associated loci and, although the functional significance for disease development for most of these risk genes is still unknown, they are potential candidates due to the overlap of stress-related psychiatric disorders and resilience. In the future, it will be important for genetic studies to focus more on resilience than on pathological phenotypes, to develop reasonable concepts for measuring resilience, and to establish international cooperations to generate sufficiently large samples.
Collapse
Affiliation(s)
- Stephan Maul
- Department of Psychiatry, Psychotherapy, and Psychosomatics, Martin-Luther-University Halle-Wittenberg, Halle, Germany
| | - Ina Giegling
- Department of Psychiatry, Psychotherapy, and Psychosomatics, Martin-Luther-University Halle-Wittenberg, Halle, Germany
| | - Chiara Fabbri
- Department of Biomedical and Neuromotor Sciences, University of Bologna, Bologna, Italy
| | - Filippo Corponi
- Department of Biomedical and Neuromotor Sciences, University of Bologna, Bologna, Italy
| | - Alessandro Serretti
- Department of Biomedical and Neuromotor Sciences, University of Bologna, Bologna, Italy
| | - Dan Rujescu
- Department of Psychiatry, Psychotherapy, and Psychosomatics, Martin-Luther-University Halle-Wittenberg, Halle, Germany
| |
Collapse
|
13
|
Al Bkhetan Z, Zobel J, Kowalczyk A, Verspoor K, Goudey B. Exploring effective approaches for haplotype block phasing. BMC Bioinformatics 2019; 20:540. [PMID: 31666002 PMCID: PMC6822470 DOI: 10.1186/s12859-019-3095-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Accepted: 09/10/2019] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND Knowledge of phase, the specific allele sequence on each copy of homologous chromosomes, is increasingly recognized as critical for detecting certain classes of disease-associated mutations. One approach for detecting such mutations is through phased haplotype association analysis. While the accuracy of methods for phasing genotype data has been widely explored, there has been little attention given to phasing accuracy at haplotype block scale. Understanding the combined impact of the accuracy of phasing tool and the method used to determine haplotype blocks on the error rate within the determined blocks is essential to conduct accurate haplotype analyses. RESULTS We present a systematic study exploring the relationship between seven widely used phasing methods and two common methods for determining haplotype blocks. The evaluation focuses on the number of haplotype blocks that are incorrectly phased. Insights from these results are used to develop a haplotype estimator based on a consensus of three tools. The consensus estimator achieved the most accurate phasing in all applied tests. Individually, EAGLE2, BEAGLE and SHAPEIT2 alternate in being the best performing tool in different scenarios. Determining haplotype blocks based on linkage disequilibrium leads to more correctly phased blocks compared to a sliding window approach. We find that there is little difference between phasing sections of a genome (e.g. a gene) compared to phasing entire chromosomes. Finally, we show that the location of phasing error varies when the tools are applied to the same data several times, a finding that could be important for downstream analyses. CONCLUSIONS The choice of phasing and block determination algorithms and their interaction impacts the accuracy of phased haplotype blocks. This work provides guidance and evidence for the different design choices needed for analyses using haplotype blocks. The study highlights a number of issues that may have limited the replicability of previous haplotype analysis.
Collapse
Affiliation(s)
- Ziad Al Bkhetan
- School of Computing & Information Systems, University of Melbourne, Parkville, 3010, Australia
| | - Justin Zobel
- School of Computing & Information Systems, University of Melbourne, Parkville, 3010, Australia
| | - Adam Kowalczyk
- School of Computing & Information Systems, University of Melbourne, Parkville, 3010, Australia.,Centre for Neural Engineering, University of Melbourne, Carlton, 3053, Australia.,Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, 00-662, Poland.,Centre for Epidemiology and Biostatistics, The University of Melbourne, Parkville, 3010, Australia
| | - Karin Verspoor
- School of Computing & Information Systems, University of Melbourne, Parkville, 3010, Australia.
| | - Benjamin Goudey
- Centre for Epidemiology and Biostatistics, The University of Melbourne, Parkville, 3010, Australia.,IBM Australia - Research, Southgate, 3006, Australia
| |
Collapse
|
14
|
Welsh P, Preiss D, Hayward C, Shah ASV, McAllister D, Briggs A, Boachie C, McConnachie A, Padmanabhan S, Welsh C, Woodward M, Campbell A, Porteous D, Mills NL, Sattar N. Cardiac Troponin T and Troponin I in the General Population. Circulation 2019; 139:2754-2764. [PMID: 31014085 PMCID: PMC6571179 DOI: 10.1161/circulationaha.118.038529] [Citation(s) in RCA: 187] [Impact Index Per Article: 37.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
BACKGROUND There is great interest in widening the use of high-sensitivity cardiac troponins for population cardiovascular disease (CVD) and heart failure screening. However, it is not clear whether cardiac troponin T (cTnT) and troponin I (cTnI) are equivalent measures of risk in this setting. We aimed to compare and contrast (1) the association of cTnT and cTnI with CVD and non-CVD outcomes, and (2) their determinants in a genome-wide association study. METHODS High-sensitivity cTnT and cTnI were measured in serum from 19 501 individuals in Generation Scotland Scottish Family Health Study. Median follow-up was 7.8 years (quartile 1 to quartile 3, 7.1-9.2). Associations of each troponin with a composite CVD outcome (1177 events), CVD death (n=266), non-CVD death (n=374), and heart failure (n=216) were determined by using Cox models. A genome-wide association study was conducted using a standard approach developed for the cohort. RESULTS Both cTnI and cTnT were strongly associated with CVD risk in unadjusted models. After adjusting for classical risk factors, the hazard ratio for a 1 SD increase in log transformed troponin was 1.24 (95% CI, 1.17-1.32) and 1.11 (1.04-1.19) for cTnI and cTnT, respectively; ratio of hazard ratios 1.12 (1.04-1.21). cTnI, but not cTnT, was associated with myocardial infarction and coronary heart disease. Both cTnI and cTnT had strong associations with CVD death and heart failure. By contrast, cTnT, but not cTnI, was associated with non-CVD death; ratio of hazard ratios 0.77 (0.67-0.88). We identified 5 loci (53 individual single-nucleotide polymorphisms) that had genome-wide significant associations with cTnI, and a different set of 4 loci (4 single-nucleotide polymorphisms) for cTnT. CONCLUSIONS The upstream genetic causes of low-grade elevations in cTnI and cTnT appear distinct, and their associations with outcomes also differ. Elevations in cTnI are more strongly associated with some CVD outcomes, whereas cTnT is more strongly associated with the risk of non-CVD death. These findings help inform the selection of an optimal troponin assay for future clinical care and research in this setting.
Collapse
Affiliation(s)
- Paul Welsh
- Institute of Cardiovascular and Medical Sciences (P.W., S.P., C.W., N.S.), University of Glasgow, United Kingdom
| | - David Preiss
- MRC Population Health Research Unit, Clinical Trial Service Unit and Epidemiological Studies Unit (D. Preiss), University of Oxford, United Kingdom
| | - Caroline Hayward
- MRC Human Genetics Unit, MRC Institute of Genetics and Molecular Medicine (C.H.), University of Edinburgh, United Kingdom
| | - Anoop S V Shah
- BHF Centre for Cardiovascular Science (A.S.V.S., N.L.M.), University of Edinburgh, United Kingdom
| | - David McAllister
- Institute of Cardiovascular and Medical Sciences (P.W., S.P., C.W., N.S.), University of Glasgow, United Kingdom
| | - Andrew Briggs
- Institute of Health and Wellbeing (A.B.), University of Glasgow, United Kingdom
| | - Charles Boachie
- Robertson Centre for Biostatistics (C.B., A.M.), University of Glasgow, United Kingdom
| | - Alex McConnachie
- Robertson Centre for Biostatistics (C.B., A.M.), University of Glasgow, United Kingdom
| | - Sandosh Padmanabhan
- Institute of Cardiovascular and Medical Sciences (P.W., S.P., C.W., N.S.), University of Glasgow, United Kingdom
| | - Claire Welsh
- Institute of Cardiovascular and Medical Sciences (P.W., S.P., C.W., N.S.), University of Glasgow, United Kingdom
| | - Mark Woodward
- The George Institute for Global Health (M.W.), University of Oxford, United Kingdom.,The George Institute for Global Health, University of New South Wales, Sydney, Australia (M.W.).,Department of Epidemiology, Johns Hopkins University, Baltimore, MD (M.W.)
| | - Archie Campbell
- Centre for Genomic and Experimental Medicine, MRC Institute of Genetics and Molecular Medicine (A.C., D. Porteous), University of Edinburgh, United Kingdom.,Usher Institute for Population Health Sciences and Informatics (A.C.), University of Edinburgh, United Kingdom
| | - David Porteous
- Centre for Genomic and Experimental Medicine, MRC Institute of Genetics and Molecular Medicine (A.C., D. Porteous), University of Edinburgh, United Kingdom
| | - Nicholas L Mills
- BHF Centre for Cardiovascular Science (A.S.V.S., N.L.M.), University of Edinburgh, United Kingdom
| | - Naveed Sattar
- Institute of Cardiovascular and Medical Sciences (P.W., S.P., C.W., N.S.), University of Glasgow, United Kingdom
| |
Collapse
|
15
|
Corponi F, Bonassi S, Vieta E, Albani D, Frustaci A, Ducci G, Landi S, Boccia S, Serretti A, Fabbri C. Genetic basis of psychopathological dimensions shared between schizophrenia and bipolar disorder. Prog Neuropsychopharmacol Biol Psychiatry 2019; 89:23-29. [PMID: 30149091 DOI: 10.1016/j.pnpbp.2018.08.023] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Revised: 08/22/2018] [Accepted: 08/23/2018] [Indexed: 12/25/2022]
Abstract
Shared genetic vulnerability between schizophrenia (SCZ) and bipolar disorder (BP) was demonstrated, but the genetic underpinnings of specific symptom domains are unclear. This study investigated which genes and gene sets may modulate specific psychopathological domains and if genome-wide significant loci previously associated with SCZ or BP may play a role. Genome-wide data were available in patients with SCZ (n = 226) or BP (n = 228). Phenotypes under investigation were depressive and positive symptoms severity, suicidal ideation, onset age and substance use disorder comorbidity. Genome-wide analyses were performed at gene and gene set level, while 148 genome-wide significant loci previously associated with SCZ and/or BP were investigated. Each sample was analyzed separately then a meta-analysis was performed. SH3GL2 and CLVS1 genes were associated with suicidal ideation in SCZ (p = 5.62e-08 and 0.01, respectively), the former also in the meta-analysis (p = .01). SHC4 gene was associated with depressive symptoms severity in BP (p = .003). A gene set involved in cellular differentiation (GO:0048661) was associated with substance disorder comorbidity in the meta-analysis (p = .03). Individual loci previously associated with SCZ or BP did not modulate the phenotypes of interest. This study provided confirmatory and new findings. SH3GL2 (endophilin A1) showed a role in suicidal ideation that may be due to its relevance to the glutamate system. SHC4 regulates BDNF-induced MAPK activation and was previously associated with depression. CLVS1 is involved in lysosome maturation and was for the first time associated with a psychiatric trait. GO:0048661 may mediate the risk of substance disorder through an effect on neurodevelopment/neuroplasticity.
Collapse
Affiliation(s)
- Filippo Corponi
- Department of Biomedical and NeuroMotor Sciences, University of Bologna, Italy
| | - Stefano Bonassi
- Unit of Clinical and Molecular Epidemiology, IRCCS San Raffaele Pisana, Rome, Italy; Department of Human Sciences and Quality of Life Promotion, San Raffaele University, Rome, Italy
| | - Eduard Vieta
- Bipolar Disorders Unit, Institute of Neuroscience, Hospital Clínic, University of Barcelona, IDIBAPS, CIBERSAM, Barcelona, Catalonia, Spain
| | - Diego Albani
- Laboratory of Biology of Neurodegenerative Disorders, Neuroscience Department, IRCCS Istituto di Ricerche Farmacologiche "Mario Negri", Milan, Italy
| | - Alessandra Frustaci
- Barnet, Enfield and Haringey Mental Health NHS Trust, St.Ann's Hospital, St.Ann's Road, N15 3 TH London, UK
| | | | - Stefano Landi
- Dipartimento di Biologia, Universita' di Pisa, Pisa, Italy
| | - Stefania Boccia
- Section of Hygiene, Institute of Public Health, Universita' Cattolica del Sacro Cuore, Fondazione Policlinico "Agostino Gemelli" IRCCS, Rome, Italy
| | - Alessandro Serretti
- Department of Biomedical and NeuroMotor Sciences, University of Bologna, Italy.
| | - Chiara Fabbri
- Department of Biomedical and NeuroMotor Sciences, University of Bologna, Italy
| |
Collapse
|
16
|
Stanaway IB, Hall TO, Rosenthal EA, Palmer M, Naranbhai V, Knevel R, Namjou-Khales B, Carroll RJ, Kiryluk K, Gordon AS, Linder J, Howell KM, Mapes BM, Lin FTJ, Joo YY, Hayes MG, Gharavi AG, Pendergrass SA, Ritchie MD, de Andrade M, Croteau-Chonka DC, Raychaudhuri S, Weiss ST, Lebo M, Amr SS, Carrell D, Larson EB, Chute CG, Rasmussen-Torvik LJ, Roy-Puckelwartz MJ, Sleiman P, Hakonarson H, Li R, Karlson EW, Peterson JF, Kullo IJ, Chisholm R, Denny JC, Jarvik GP, Crosslin DR. The eMERGE genotype set of 83,717 subjects imputed to ~40 million variants genome wide and association with the herpes zoster medical record phenotype. Genet Epidemiol 2018; 43:63-81. [PMID: 30298529 PMCID: PMC6375696 DOI: 10.1002/gepi.22167] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2018] [Revised: 08/10/2018] [Accepted: 08/28/2018] [Indexed: 12/30/2022]
Abstract
The Electronic Medical Records and Genomics (eMERGE) network is a network of medical centers with electronic medical records linked to existing biorepository samples for genomic discovery and genomic medicine research. The network sought to unify the genetic results from 78 Illumina and Affymetrix genotype array batches from 12 contributing medical centers for joint association analysis of 83,717 human participants. In this report, we describe the imputation of eMERGE results and methods to create the unified imputed merged set of genome‐wide variant genotype data. We imputed the data using the Michigan Imputation Server, which provides a missing single‐nucleotide variant genotype imputation service using the minimac3 imputation algorithm with the Haplotype Reference Consortium genotype reference set. We describe the quality control and filtering steps used in the generation of this data set and suggest generalizable quality thresholds for imputation and phenotype association studies. To test the merged imputed genotype set, we replicated a previously reported chromosome 6 HLA‐B herpes zoster (shingles) association and discovered a novel zoster‐associated loci in an epigenetic binding site near the terminus of chromosome 3 (3p29).
Collapse
Affiliation(s)
- Ian B Stanaway
- Department of Biomedical Informatics Medical Education, School of Medicine, University of Washington, Seattle, Washington
| | - Taryn O Hall
- Department of Biomedical Informatics Medical Education, School of Medicine, University of Washington, Seattle, Washington
| | - Elisabeth A Rosenthal
- Division of Medical Genetics, School of Medicine, University of Washington, Seattle, Washington
| | - Melody Palmer
- Division of Medical Genetics, School of Medicine, University of Washington, Seattle, Washington
| | - Vivek Naranbhai
- Department of Biomedical Informatics Medical Education, School of Medicine, University of Washington, Seattle, Washington.,Harvard Medical School, Harvard University, Cambridge, Massachusetts
| | - Rachel Knevel
- Harvard Medical School, Harvard University, Cambridge, Massachusetts
| | - Bahram Namjou-Khales
- Department of Pediatrics, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
| | - Robert J Carroll
- Departments of Biomedical Informatics and Medicine, Vanderbilt University, Nashville, Tennessee
| | - Krzysztof Kiryluk
- Department of Medicine, Columbia University, New York City, New York
| | - Adam S Gordon
- Division of Medical Genetics, School of Medicine, University of Washington, Seattle, Washington
| | - Jodell Linder
- Vanderbilt Institute for Clinical and Translational Research, School of Medicine, Vanderbilt University, Nashville, Tennessee
| | - Kayla Marie Howell
- Vanderbilt Institute for Clinical and Translational Research, School of Medicine, Vanderbilt University, Nashville, Tennessee
| | - Brandy M Mapes
- Vanderbilt Institute for Clinical and Translational Research, School of Medicine, Vanderbilt University, Nashville, Tennessee
| | - Frederick T J Lin
- Feinberg School of Medicine, Northwestern University, Chicago, Illinois
| | | | - M Geoffrey Hayes
- Feinberg School of Medicine, Northwestern University, Chicago, Illinois
| | - Ali G Gharavi
- Department of Medicine, Columbia University, New York City, New York
| | | | - Marylyn D Ritchie
- Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania
| | | | | | - Soumya Raychaudhuri
- Harvard Medical School, Harvard University, Cambridge, Massachusetts.,Program in Medical and Population Genetics, Broad Institute of Massachusetts Technical Institute and Harvard University, Cambridge, Massachusetts
| | - Scott T Weiss
- Harvard Medical School, Harvard University, Cambridge, Massachusetts
| | - Matt Lebo
- Harvard Medical School, Harvard University, Cambridge, Massachusetts
| | - Sami S Amr
- Harvard Medical School, Harvard University, Cambridge, Massachusetts
| | - David Carrell
- Kaiser Permanente Washington Health Research Institute (Formerly Group Health Cooperative-Seattle), Kaiser Permanente, Seattle, Washington
| | - Eric B Larson
- Kaiser Permanente Washington Health Research Institute (Formerly Group Health Cooperative-Seattle), Kaiser Permanente, Seattle, Washington
| | - Christopher G Chute
- Schools of Medicine, Public Health, and Nursing, Johns Hopkins University, Baltimore, Maryland
| | | | | | - Patrick Sleiman
- Children's Hospital of Philadelphia, Philadelphia, Pennsylvania
| | | | - Rongling Li
- National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland
| | - Elizabeth W Karlson
- Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Josh F Peterson
- Departments of Biomedical Informatics and Medicine, Vanderbilt University, Nashville, Tennessee
| | | | - Rex Chisholm
- Feinberg School of Medicine, Northwestern University, Chicago, Illinois
| | - Joshua Charles Denny
- Departments of Biomedical Informatics and Medicine, Vanderbilt University, Nashville, Tennessee
| | - Gail P Jarvik
- Division of Medical Genetics, School of Medicine, University of Washington, Seattle, Washington
| | -
- National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland
| | - David R Crosslin
- Department of Biomedical Informatics Medical Education, School of Medicine, University of Washington, Seattle, Washington
| |
Collapse
|
17
|
Kerr SM, Campbell A, Marten J, Vitart V, McIntosh AM, Porteous DJ, Hayward C. Electronic health record and genome-wide genetic data in Generation Scotland participants. Wellcome Open Res 2017; 2:85. [PMID: 29062915 PMCID: PMC5645708 DOI: 10.12688/wellcomeopenres.12600.1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/17/2017] [Indexed: 12/26/2022] Open
Abstract
This article provides the first detailed demonstration of the research value of the Electronic Health Record (EHR) linked to research data in Generation Scotland Scottish Family Health Study (GS:SFHS) participants, together with how to access this data. The structured, coded variables in the routine biochemistry, prescribing and morbidity records, in particular, represent highly valuable phenotypic data for a genomics research resource. Access to a wealth of other specialized datasets, including cancer, mental health and maternity inpatient information, is also possible through the same straightforward and transparent application process. The EHR linked dataset is a key component of GS:SFHS, a biobank conceived in 1999 for the purpose of studying the genetics of health areas of current and projected public health importance. Over 24,000 adults were recruited from 2006 to 2011, with broad and enduring written informed consent for biomedical research. Consent was obtained from 23,603 participants for GS:SFHS study data to be linked to their Scottish National Health Service (NHS) records, using their Community Health Index number. This identifying number is used for NHS Scotland procedures (registrations, attendances, samples, prescribing and investigations) and allows healthcare records for individuals to be linked across time and location. Here, we describe the NHS EHR dataset on the sub-cohort of 20,032 GS:SFHS participants with consent and mechanism for record linkage plus extensive genetic data. Together with existing study phenotypes, including family history and environmental exposures, such as smoking, the EHR is a rich resource of real world data that can be used in research to characterise the health trajectory of participants, available at low cost and a high degree of timeliness, matched to DNA, urine and serum samples and genome-wide genetic information.
Collapse
Affiliation(s)
- Shona M Kerr
- MRC Human Genetics Unit, University of Edinburgh, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK
| | - Archie Campbell
- Generation Scotland, Centre for Genomic and Experimental Medicine, University of Edinburgh, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK
| | - Jonathan Marten
- MRC Human Genetics Unit, University of Edinburgh, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK
| | - Veronique Vitart
- MRC Human Genetics Unit, University of Edinburgh, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK
| | - Andrew M McIntosh
- Centre for Cognitive Ageing and Cognitive Epidemiology, Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ, UK.,Division of Psychiatry, University of Edinburgh, Royal Edinburgh Hospital, Edinburgh, EH10 5HF, UK
| | - David J Porteous
- Generation Scotland, Centre for Genomic and Experimental Medicine, University of Edinburgh, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK.,Medical Genetics Section, Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - Caroline Hayward
- MRC Human Genetics Unit, University of Edinburgh, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK
| |
Collapse
|