1
|
Al-Chalabi A, Andrews J, Farhan S. Recent advances in the genetics of familial and sporadic ALS. INTERNATIONAL REVIEW OF NEUROBIOLOGY 2024; 176:49-74. [PMID: 38802182 DOI: 10.1016/bs.irn.2024.04.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
ALS shows complex genetic inheritance patterns. In about 5% to 10% of cases, there is a family history of ALS or a related condition such as frontotemporal dementia in a first or second degree relative, and for about 80% of such people a pathogenic gene variant can be identified. Such variants are also seen in people with no family history because of factor influencing the expression of genes, such as age. Genetic susceptibility factors also contribute to risk, and the heritability of ALS is between 40% and 60%. The genetic variants influencing ALS risk include single base changes, repeat expansions, copy number variants, and others. Here we review what is known of the genetic landscape and architecture of ALS.
Collapse
Affiliation(s)
- Ammar Al-Chalabi
- Department of Basic and Clinical Neuroscience, King's College London, London, United Kingdom.
| | - Jinsy Andrews
- Department of Neurology, Columbia University, New York, NY, United States
| | - Sali Farhan
- Department of Neurology and Neurosurgery, Montreal Neurological Institute-Hospital, Montreal, QC, Canada; Department of Human Genetics, Montreal Neurological Institute-Hospital, Montreal, QC, Canada
| |
Collapse
|
2
|
Jana S, Sutton M, Mollayeva T, Chan V, Colantonio A, Escobar MD. Application of multiple testing procedures for identifying relevant comorbidities, from a large set, in traumatic brain injury for research applications utilizing big health-administrative data. Front Big Data 2022; 5:793606. [PMID: 36247970 PMCID: PMC9563390 DOI: 10.3389/fdata.2022.793606] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Accepted: 08/25/2022] [Indexed: 11/13/2022] Open
Abstract
Background Multiple testing procedures (MTP) are gaining increasing popularity in various fields of biostatistics, especially in statistical genetics. However, in injury surveillance research utilizing the growing amount and complexity of health-administrative data encoded in the International Statistical Classification of Diseases and Related Health Problems 10th Revision (ICD-10), few studies involve MTP and discuss their applications and challenges. Objective We aimed to apply MTP in the population-wide context of comorbidity preceding traumatic brain injury (TBI), one of the most disabling injuries, to find a subset of comorbidity that can be targeted in primary injury prevention. Methods In total, 2,600 ICD-10 codes were used to assess the associations between TBI and comorbidity, with 235,003 TBI patients, on a matched data set of patients without TBI. McNemar tests were conducted on each 2,600 ICD-10 code, and appropriate multiple testing adjustments were applied using the Benjamini-Yekutieli procedure. To study the magnitude and direction of associations, odds ratios with 95% confidence intervals were constructed. Results Benjamini-Yekutieli procedure captured 684 ICD-10 codes, out of 2,600, as codes positively associated with a TBI event, reducing the effective number of codes for subsequent analysis and comprehension. Conclusion Our results illustrate the utility of MTP for data mining and dimension reduction in TBI research utilizing big health-administrative data to support injury surveillance research and generate ideas for injury prevention.
Collapse
Affiliation(s)
- Sayantee Jana
- Department of Mathematics, Indian Institute of Technology, Hyderabad, India
- *Correspondence: Sayantee Jana
| | | | - Tatyana Mollayeva
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
- KITE Research Institute Toronto Rehabilitation Institute, University Health Network, Toronto, ON, Canada
- Rehabilitation Sciences Institute, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
- Global Brain Health Institute, Institute of Neuroscience, Trinity College Dublin, Dublin, Ireland
- Acquired Brain Injury Research Lab, University of Toronto, Toronto, ON, Canada
| | - Vincy Chan
- KITE Research Institute Toronto Rehabilitation Institute, University Health Network, Toronto, ON, Canada
- Rehabilitation Sciences Institute, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
- Acquired Brain Injury Research Lab, University of Toronto, Toronto, ON, Canada
- Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, ON, Canada
- Faculty of Health Sciences, Ontario Tech University, Oshawa, ON, Canada
| | - Angela Colantonio
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
- KITE Research Institute Toronto Rehabilitation Institute, University Health Network, Toronto, ON, Canada
- Rehabilitation Sciences Institute, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
- Acquired Brain Injury Research Lab, University of Toronto, Toronto, ON, Canada
- ICES (fomerly Institute for Clinical Evaluative Sciences), Toronto, ON, Canada
| | | |
Collapse
|
3
|
Gupta PK. GWAS for genetics of complex quantitative traits: Genome to pangenome and SNPs to SVs and k-mers. Bioessays 2021; 43:e2100109. [PMID: 34486143 DOI: 10.1002/bies.202100109] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Revised: 08/21/2021] [Accepted: 08/23/2021] [Indexed: 12/22/2022]
Abstract
The development of improved methods for genome-wide association studies (GWAS) for genetics of quantitative traits has been an active area of research during the last 25 years. This activity initially started with the use of mixed linear model (MLM), which was variously modified. During the last decade, however, with the availability of high throughput next generation sequencing (NGS) technology, development and use of pangenomes and novel markers including structural variations (SVs) and k-mers for GWAS has taken over as a new thrust area of research. Pangenomes and SVs are now available in humans, livestock, and a number of plant species, so that these resources along with k-mers are being used in GWAS for exploring additional genetic variation that was hitherto not available for analysis. These developments have resulted in significant improvement in GWAS methodology for detection of marker-trait associations (MTAs) that are relevant to human healthcare and crop improvement.
Collapse
Affiliation(s)
- Pushpendra K Gupta
- Department of Genetics and Plant Breeding, Ch. Charan Singh University Meerut, Meerut, Uttar Pradesh, India
| |
Collapse
|
4
|
Gupta PK. Quantitative genetics: pan-genomes, SVs, and k-mers for GWAS. Trends Genet 2021; 37:868-871. [PMID: 34183185 DOI: 10.1016/j.tig.2021.05.006] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 05/20/2021] [Accepted: 05/25/2021] [Indexed: 01/30/2023]
Abstract
For identification of marker-trait associations (MTAs) for complex traits in animals and plants, thousands of genome-wide association studies (GWAS) were conducted during the past two decades. This involved regular improvement in methodology. Initially, a reference genome and SNPs were used; more recently pan-genomes and the markers structural variations (SVs)/k-mers are also being used.
Collapse
Affiliation(s)
- Pushpendra K Gupta
- Molecular Biology Laboratory, Department of Genetics and Plant Breeding, CCS University Meerut, Meerut, India.
| |
Collapse
|
5
|
Roberts R, Chang CC. A Journey through Genetic Architecture and Predisposition of Coronary Artery Disease. Curr Genomics 2020; 21:382-398. [PMID: 33093801 PMCID: PMC7536803 DOI: 10.2174/1389202921999200630145241] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Revised: 05/18/2020] [Accepted: 05/26/2020] [Indexed: 01/14/2023] Open
Abstract
Introduction To halt the spread of coronary artery disease (CAD), the number one killer in the world, requires primary prevention. Fifty percent of all Americans are expected to experience a cardiac event; the challenge is identifying those at risk. 40 to 60% of predisposition to CAD is genetic. The first genetic risk variant, 9p21, was discovered in 2007. Genome-Wide Association Studies has since discovered hundreds of genetic risk variants. The genetic burden for CAD can be expressed as a single number, Genetic Risk Score (GRS). Assessment of GRS to risk stratify for CAD was superior to conventional risk factors in several large clinical trials assessing statin therapy, and more recently in a population of nearly 500,000 (UK Biobank). Studies were performed based on prospective genetic risk stratification for CAD. These studies showed that a favorable lifestyle was associated with a 46% reduction in cardiac events and programmed exercise, a 50% reduction in cardiac events. Genetic risk score is superior to conventional risk factors, and is markedly attenuated by lifestyle changes and drug therapy. Genetic risk can be determined at birth or any time thereafter. Conclusion Utilizing the GRS to risk stratify young, asymptomatic individuals could provide a paradigm shift in the primary prevention of CAD and significantly halt its spread.
Collapse
Affiliation(s)
- Robert Roberts
- 1Cardiovascular Genomics & Genetics, University of Arizona, College of Medicine, Phoenix, AZ, USA; 2Cardiovascular Genomics & Genetics, Phoenix, AZ, USA
| | - Chih Chao Chang
- 1Cardiovascular Genomics & Genetics, University of Arizona, College of Medicine, Phoenix, AZ, USA; 2Cardiovascular Genomics & Genetics, Phoenix, AZ, USA
| |
Collapse
|
6
|
Abegaz F, Chaichoompu K, Génin E, Fardo DW, König IR, Mahachie John JM, Van Steen K. Principals about principal components in statistical genetics. Brief Bioinform 2020; 20:2200-2216. [PMID: 30219892 DOI: 10.1093/bib/bby081] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2018] [Revised: 07/21/2018] [Accepted: 08/12/2018] [Indexed: 12/13/2022] Open
Abstract
Principal components (PCs) are widely used in statistics and refer to a relatively small number of uncorrelated variables derived from an initial pool of variables, while explaining as much of the total variance as possible. Also in statistical genetics, principal component analysis (PCA) is a popular technique. To achieve optimal results, a thorough understanding about the different implementations of PCA is required and their impact on study results, compared to alternative approaches. In this review, we focus on the possibilities, limitations and role of PCs in ancestry prediction, genome-wide association studies, rare variants analyses, imputation strategies, meta-analysis and epistasis detection. We also describe several variations of classic PCA that deserve increased attention in statistical genetics applications.
Collapse
|
7
|
Abstract
PURPOSE OF REVIEW The goal of the review is to provide a comprehensive overview of the current understanding of the mechanisms underlying variation in human stature. RECENT FINDINGS Human height is an anthropometric trait that varies considerably within human populations as well as across the globe. Historically, much research focus was placed on understanding the biology of growth plate chondrocytes and how modifications to core chondrocyte proliferation and differentiation pathways potentially shaped height attainment in normal as well as pathological contexts. Recently, much progress has been made to improve our understanding regarding the mechanisms underlying the normal and pathological range of height variation within as well as between human populations, and today, it is understood to reflect complex interactions among a myriad of genetic, environmental, and evolutionary factors. Indeed, recent improvements in genetics (e.g., GWAS) and breakthroughs in functional genomics (e.g., whole exome sequencing, DNA methylation analysis, ATAC-sequencing, and CRISPR) have shed light on previously unknown pathways/mechanisms governing pathological and common height variation. Additionally, the use of an evolutionary perspective has also revealed important mechanisms that have shaped height variation across the planet. This review provides an overview of the current knowledge of the biological mechanisms underlying height variation by highlighting new research findings on skeletal growth control with an emphasis on previously unknown pathways/mechanisms influencing pathological and common height variation. In this context, this review also discusses how evolutionary forces likely shaped the genomic architecture of height across the globe.
Collapse
Affiliation(s)
| | - Terence D Capellini
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
8
|
Chen Y, Adrianto I, Ianuzzi MC, Garman L, Montgomery CG, Rybicki BA, Levin AM, Li J. Extended methods for gene-environment-wide interaction scans in studies of admixed individuals with varying degrees of relationships. Genet Epidemiol 2019; 43:414-426. [PMID: 30793815 DOI: 10.1002/gepi.22196] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2018] [Revised: 12/26/2018] [Accepted: 01/24/2019] [Indexed: 11/08/2022]
Abstract
The etiology of many complex diseases involves both environmental exposures and inherited genetic predisposition as well as interactions between them. Gene-environment-wide interaction studies (GEWIS) provide a means to identify the interactions between genetic variation and environmental exposures that underlie disease risk. However, current GEWIS methods lack the capability to adjust for the potentially complex correlations in studies with varying degrees of relationships (both known and unknown) among individuals in admixed populations. We developed novel generalized estimating equation (GEE) based methods-GEE-adaptive and GEE-joint-to account for phenotypic correlations due to kinship while accounting for covariates, including, measures of genome-wide ancestry. In simulation studies of admixed individuals, both methods controlled family-wise error rates, an advantage over the case-only approach. They demonstrated higher power than traditional case-control methods across a wide range of underlying alternative hypotheses, especially where both marginal and interaction effects were present. We applied the proposed method to conduct a GEWIS of a known sarcoidosis risk factor (insecticide exposure) and risk of sarcoidosis in African Americans and identified two novel loci with suggestive evidence of G × E interaction.
Collapse
Affiliation(s)
- Yalei Chen
- Department of Public Health Sciences, Henry Ford Health System, Detroit, Michigan.,Center for Bioinformatics, Henry Ford Health System, Detroit, Michigan
| | - Indra Adrianto
- Department of Public Health Sciences, Henry Ford Health System, Detroit, Michigan.,Center for Bioinformatics, Henry Ford Health System, Detroit, Michigan
| | - Michael C Ianuzzi
- Department of Internal Medicine, Northwell Staten Island University Hospital, Staten Island, New York, New York
| | - Lori Garman
- Arthritis and Clinical Immunology Research Program, Oklahoma Medical Research Foundation, Oklahoma City, Oklahoma
| | - Courtney G Montgomery
- Arthritis and Clinical Immunology Research Program, Oklahoma Medical Research Foundation, Oklahoma City, Oklahoma
| | - Benjamin A Rybicki
- Department of Public Health Sciences, Henry Ford Health System, Detroit, Michigan
| | - Albert M Levin
- Department of Public Health Sciences, Henry Ford Health System, Detroit, Michigan.,Center for Bioinformatics, Henry Ford Health System, Detroit, Michigan
| | - Jia Li
- Department of Public Health Sciences, Henry Ford Health System, Detroit, Michigan.,Center for Bioinformatics, Henry Ford Health System, Detroit, Michigan
| |
Collapse
|
9
|
Physical Confirmation and Comparative Genomics of the Rat Mammary carcinoma susceptibility 3 Quantitative Trait Locus. G3-GENES GENOMES GENETICS 2017; 7:1767-1773. [PMID: 28391240 PMCID: PMC5473756 DOI: 10.1534/g3.117.039388] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Human breast and rat mammary cancer susceptibility are complex phenotypes where complete sets of risk associated loci remain to be identified for both species. We tested multiple congenic rat strains to physically confirm and positionally map rat Mammary carcinoma susceptibility 3 (Mcs3)-a mammary cancer resistance allele previously predicted at Rattus norvegicus chromosome 1 (RNO1). The mammary cancer susceptible Wistar Furth (WF) strain was the recipient, and the mammary cancer resistant Copenhagen (Cop) strain was the RNO1-segment donor for congenics. Inbred WF females averaged 6.3 carcinogen-induced mammary carcinomas per rat. Two WF.Cop congenic strains averaged 2.8 and 3.4 mammary carcinomas per rat, which confirmed Mcs3 as an independently acting allele. Two other WF.Cop congenic strains averaged 6.6 and 8.1 mammary carcinomas per rat, and, thus, did not contain Mcs3 Rat Mcs3 was delimited to 27.8 Mb of RNO1 from rs8149408 to rs105131702 (RNO1:143700228-171517317 of RGSC 6.0/rn6). Human genetic variants with p values for association to breast cancer risk below 10-7 had not been reported for Mcs3 orthologous loci; however, human variants located in Mcs3-orthologous regions with potential association to risk (10-7 < p < 10-3) were listed in some population-based studies. Further, rat Mcs3 contains sequence orthologous to human 11q13/14-a region frequently amplified in female breast cancer. We conclude that Mcs3 is an independently acting mammary carcinoma resistance allele. Human population-based, genome-targeted association studies interrogating Mcs3 orthologous loci may yield novel breast cancer risk associated variants and genes.
Collapse
|
10
|
Chen CCM, Keith JM, Mengersen KL. Accurate phenotyping: Reconciling approaches through Bayesian model averaging. PLoS One 2017; 12:e0176136. [PMID: 28423058 PMCID: PMC5396931 DOI: 10.1371/journal.pone.0176136] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2016] [Accepted: 04/05/2017] [Indexed: 11/18/2022] Open
Abstract
Genetic research into complex diseases is frequently hindered by a lack of clear biomarkers for phenotype ascertainment. Phenotypes for such diseases are often identified on the basis of clinically defined criteria; however such criteria may not be suitable for understanding the genetic composition of the diseases. Various statistical approaches have been proposed for phenotype definition; however our previous studies have shown that differences in phenotypes estimated using different approaches have substantial impact on subsequent analyses. Instead of obtaining results based upon a single model, we propose a new method, using Bayesian model averaging to overcome problems associated with phenotype definition. Although Bayesian model averaging has been used in other fields of research, this is the first study that uses Bayesian model averaging to reconcile phenotypes obtained using multiple models. We illustrate the new method by applying it to simulated genetic and phenotypic data for Kofendred personality disorder-an imaginary disease with several sub-types. Two separate statistical methods were used to identify clusters of individuals with distinct phenotypes: latent class analysis and grade of membership. Bayesian model averaging was then used to combine the two clusterings for the purpose of subsequent linkage analyses. We found that causative genetic loci for the disease produced higher LOD scores using model averaging than under either individual model separately. We attribute this improvement to consolidation of the cores of phenotype clusters identified using each individual method.
Collapse
Affiliation(s)
- Carla Chia-Ming Chen
- Australian Institute of Marine Science, Cape Cleveland QLD, Australia
- ARC Centre of Excellence for Mathematical & Statistical Frontiers, Queensland University of Technology, Brisbane QLD, Australia
| | | | - Kerrie Lee Mengersen
- ARC Centre of Excellence for Mathematical & Statistical Frontiers, Queensland University of Technology, Brisbane QLD, Australia
| |
Collapse
|
11
|
Zhao Y, Li X, Zhu S. rs78378222 polymorphism in the 3′-untranslated region of TP53 contributes to development of age-associated cataracts by modifying microRNA-125b-induced apoptosis of lens epithelial cells. Mol Med Rep 2016; 14:2305-10. [DOI: 10.3892/mmr.2016.5465] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2015] [Accepted: 04/29/2016] [Indexed: 11/05/2022] Open
|
12
|
Hu X, Ren J, Ren X, Huang S, Sabiel SAI, Luo M, Nevo E, Fu C, Peng J, Sun D. Association of Agronomic Traits with SNP Markers in Durum Wheat (Triticum turgidum L. durum (Desf.)). PLoS One 2015; 10:e0130854. [PMID: 26110423 PMCID: PMC4482485 DOI: 10.1371/journal.pone.0130854] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2015] [Accepted: 05/25/2015] [Indexed: 01/11/2023] Open
Abstract
Association mapping is a powerful approach to detect associations between traits of interest and genetic markers based on linkage disequilibrium (LD) in molecular plant breeding. In this study, 150 accessions of worldwide originated durum wheat germplasm (Triticum turgidum spp. durum) were genotyped using 1,366 SNP markers. The extent of LD on each chromosome was evaluated. Association of single nucleotide polymorphisms (SNP) markers with ten agronomic traits measured in four consecutive years was analyzed under a mix linear model (MLM). Two hundred and one significant association pairs were detected in the four years. Several markers were associated with one trait, and also some markers were associated with multiple traits. Some of the associated markers were in agreement with previous quantitative trait loci (QTL) analyses. The function and homology analyses of the corresponding ESTs of some SNP markers could explain many of the associations for plant height, length of main spike, number of spikelets on main spike, grain number per plant, and 1000-grain weight, etc. The SNP associations for the observed traits are generally clustered in specific chromosome regions of the wheat genome, mainly in 2A, 5A, 6A, 7A, 1B, and 6B chromosomes. This study demonstrates that association mapping can complement and enhance previous QTL analyses and provide additional information for marker-assisted selection.
Collapse
Affiliation(s)
- Xin Hu
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan Hubei, 430070, China
| | - Jing Ren
- Shandong Provincial Key Laboratory of Functional Macromolecular Biophysics, Institute of Biophysics, Dezhou University, Dezhou, Shandong, 253023, China
| | - Xifeng Ren
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan Hubei, 430070, China
| | - Sisi Huang
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan Hubei, 430070, China
| | - Salih A. I. Sabiel
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan Hubei, 430070, China
| | - Mingcheng Luo
- Department of Plant Sciences, University of California Davis, Davis, CA, 95616, United States of America
| | - Eviatar Nevo
- Institute of Evolution, University of Haifa, Mount Carmel, Haifa, 31905, Israel
| | - Chunjie Fu
- Science and Technology Center, China National Seed Group Co., Ltd, Wuhan, Hubei, 430206, China
| | - Junhua Peng
- Science and Technology Center, China National Seed Group Co., Ltd, Wuhan, Hubei, 430206, China
| | - Dongfa Sun
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan Hubei, 430070, China
- Hubei Collaborative Innovation Center for Grain Industry, Jingzhou, Hubei, 434025, China
| |
Collapse
|
13
|
Okada Y. From the era of genome analysis to the era of genomic drug discovery: a pioneering example of rheumatoid arthritis. Clin Genet 2014; 86:432-40. [PMID: 25060537 DOI: 10.1111/cge.12465] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2014] [Revised: 07/19/2014] [Accepted: 07/21/2014] [Indexed: 01/18/2023]
Abstract
Although we have obtained comprehensive catalogs of genetic risk loci that are linked to human diseases, little is known regarding how to devise a systematic strategy to integrate genetic study results with diverse biological resources. Such strategies will be crucial for providing novel insights into disease biology and for aiding drug discovery as an ultimate goal. Here we describe the current progress in this field using a pioneering example of large-scale genetic association studies on rheumatoid arthritis (RA), an autoimmune disease characterized by inflammation and destruction of joints. Through functional and bioinformatic annotations of risk single nucleotide polymorphisms (SNPs) and genes from >100 RA risk loci identified by genome-wide association study (GWAS) meta-analysis, we found novel biological insights into RA pathogenicity. Further, by integrating RA genetic findings with the complete catalog of approved drugs for RA and other diseases, we provide empirical data to indicate that human genetic-based approaches may be useful for supporting 'genetics-driven genomic drug discovery' efforts in complex human traits and suggest that further development of integrative approaches should be undertaken.
Collapse
Affiliation(s)
- Y Okada
- Department of Human Genetics and Disease Diversity, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo, Japan; Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| |
Collapse
|
14
|
Gusareva ES, Van Steen K. Practical aspects of genome-wide association interaction analysis. Hum Genet 2014; 133:1343-58. [DOI: 10.1007/s00439-014-1480-y] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2014] [Accepted: 08/18/2014] [Indexed: 12/31/2022]
|
15
|
Yang HC, Lin CW, Chen CW, Chen JJ. Applying genome-wide gene-based expression quantitative trait locus mapping to study population ancestry and pharmacogenetics. BMC Genomics 2014; 15:319. [PMID: 24779372 PMCID: PMC4236814 DOI: 10.1186/1471-2164-15-319] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2013] [Accepted: 04/15/2014] [Indexed: 01/13/2023] Open
Abstract
BACKGROUND Gene-based analysis has become popular in genomic research because of its appealing biological and statistical properties compared with those of a single-locus analysis. However, only a few, if any, studies have discussed a mapping of expression quantitative trait loci (eQTL) in a gene-based framework. Neither study has discussed ancestry-informative eQTL nor investigated their roles in pharmacogenetics by integrating single nucleotide polymorphism (SNP)-based eQTL (s-eQTL) and gene-based eQTL (g-eQTL). RESULTS In this g-eQTL mapping study, the transcript expression levels of genes (transcript-level genes; T-genes) were correlated with the SNPs of genes (sequence-level genes; S-genes) by using a method of gene-based partial least squares (PLS). Ancestry-informative transcripts were identified using a rank-score-based multivariate association test, and ancestry-informative eQTL were identified using Fisher's exact test. Furthermore, key ancestry-predictive eQTL were selected in a flexible discriminant analysis. We analyzed SNPs and gene expression of 210 independent people of African-, Asian- and European-descent. We identified numerous cis- and trans-acting g-eQTL and s-eQTL for each population by using PLS. We observed ancestry information enriched in eQTL. Furthermore, we identified 2 ancestry-informative eQTL associated with adverse drug reactions and/or drug response. Rs1045642, located on MDR1, is an ancestry-informative eQTL (P = 2.13E-13, using Fisher's exact test) associated with adverse drug reactions to amitriptyline and nortriptyline and drug responses to morphine. Rs20455, located in KIF6, is an ancestry-informative eQTL (P = 2.76E-23, using Fisher's exact test) associated with the response to statin drugs (e.g., pravastatin and atorvastatin). The ancestry-informative eQTL of drug biotransformation genes were also observed; cross-population cis-acting expression regulators included SPG7, TAP2, SLC7A7, and CYP4F2. Finally, we also identified key ancestry-predictive eQTL and established classification models with promising training and testing accuracies in separating samples from close populations. CONCLUSIONS In summary, we developed a gene-based PLS procedure and a SAS macro for identifying g-eQTL and s-eQTL. We established data archives of eQTL for global populations. The program and data archives are accessible at http://www.stat.sinica.edu.tw/hsinchou/genetics/eQTL/HapMapII.htm. Finally, the results from our investigations regarding the interrelationship between eQTL, ancestry information, and pharmacodynamics provide rich resources for future eQTL studies and practical applications in population genetics and medical genetics.
Collapse
Affiliation(s)
- Hsin-Chou Yang
- Institute of Statistical Science, Academia Sinica, No 128, Academia Road, Section 2, Nankang, Taipei, Taiwan
- School of Public Health, National Defense Medical Center, Taipei, Taiwan
| | - Chien-Wei Lin
- Institute of Statistical Science, Academia Sinica, No 128, Academia Road, Section 2, Nankang, Taipei, Taiwan
| | - Chia-Wei Chen
- Institute of Statistical Science, Academia Sinica, No 128, Academia Road, Section 2, Nankang, Taipei, Taiwan
| | - James J Chen
- National Center for Toxicological Research, Food and Drug Administration, Little Rock, Arkansas, USA
| |
Collapse
|
16
|
Wang W, Ingles SA, Torres-Mejía G, Stern MC, Stanczyk FZ, Schwartz GG, Nelson DO, Fejerman L, Wolff RK, Slattery ML, John EM. Genetic variants and non-genetic factors predict circulating vitamin D levels in Hispanic and non-Hispanic White women: the Breast Cancer Health Disparities Study. INTERNATIONAL JOURNAL OF MOLECULAR EPIDEMIOLOGY AND GENETICS 2014; 5:31-46. [PMID: 24596595 PMCID: PMC3939005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Received: 11/15/2013] [Accepted: 12/30/2013] [Indexed: 06/03/2023]
Abstract
Genome-wide association studies (GWAS) have identified common polymorphisms in or near GC, CYP2R1, CYP24A1, and NADSYN1/DHCR7 genes to be associated with circulating levels of 25-hydroxyvitamin D [25(OH)D] in European populations. To replicate these GWAS findings, we examined six selected polymorphisms from these regions and their relation with circulating 25(OH)D levels in 1,605 Hispanic women (629 U.S. Hispanics and 976 Mexicans) and 354 non-Hispanic White (NHW) women. We also assessed the potential interactions between these variants and known non-genetic predictors of 25(OH)D levels, including body mass index (BMI), sunlight exposure and vitamin D intake from diet and supplements. The minor alleles of the two GC polymorphisms (rs7041 and rs2282679) were significantly associated with lower 25(OH)D levels in both Hispanic and NHW women. The CYP2R1 polymorphism, rs2060793, also was significantly associated with 25(OH)D levels in both groups. We found no significant associations for the polymorphisms in the CYP24A1. In Hispanic controls, 25(OH)D levels were significantly associated with the rs12785878T and rs1790349G haplotype in the NADSYN1/DHCR7 region. Significant interactions between GC rs2282679 and BMI and between rs12785878 and time spent in outdoor activities were observed. These results provide further support for the contribution of common genetic variants to individual variability in circulating 25(OH)D levels. The observed interactions between SNPs and non-genetic factors warrant confirmation.
Collapse
Affiliation(s)
- Wei Wang
- Cancer Prevention Institute of CaliforniaFremont, CA 94538, USA
- Biomedical Informatics Training Program, Stanford UniversityStanford, CA 94305, USA
| | - Sue Ann Ingles
- Department of Preventive Medicine, Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern CaliforniaLos Angeles, CA 90089, USA
- Department of Obstetrics and Gynecology, Keck School of Medicine, University of Southern CaliforniaLos Angeles, CA 90033, USA
| | | | - Mariana C Stern
- Department of Preventive Medicine, Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern CaliforniaLos Angeles, CA 90089, USA
| | - Frank Z Stanczyk
- Department of Obstetrics and Gynecology, Keck School of Medicine, University of Southern CaliforniaLos Angeles, CA 90033, USA
| | - Gary G Schwartz
- Department of Cancer Biology, School of Medicine, Wake Forest UniversityMedical Center Boulevard, Winston-Salem, NC 27157, USA
| | - David O Nelson
- Cancer Prevention Institute of CaliforniaFremont, CA 94538, USA
| | - Laura Fejerman
- Division of General Internal Medicine, Department of Medicine,Institute for Human Genetics and Helen Diller Family Comprehensive Cancer Center, University of CaliforniaSan Francisco, CA 94158, USA
| | - Roger K Wolff
- Department of Medicine, University of UtahSalt Lake City, UT 84132, USA
| | - Martha L Slattery
- Department of Medicine, University of UtahSalt Lake City, UT 84132, USA
| | - Esther M John
- Cancer Prevention Institute of CaliforniaFremont, CA 94538, USA
- Division of Epidemiology, Department of Health Research and Policy, Stanford Cancer Institute, Stanford University School of MedicineStanford, CA 94503, USA
| |
Collapse
|
17
|
Sanders J, Samuelson DJ. Significant overlap between human genome-wide association-study nominated breast cancer risk alleles and rat mammary cancer susceptibility loci. Breast Cancer Res 2014; 16:R14. [PMID: 24467842 PMCID: PMC4054882 DOI: 10.1186/bcr3607] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2013] [Accepted: 01/10/2014] [Indexed: 11/13/2022] Open
Abstract
Introduction Human population-based genome-wide association (GWA) studies identify low penetrance breast cancer risk alleles; however, GWA studies alone do not definitively determine causative genes or mechanisms. Stringent genome- wide statistical significance level requirements, set to avoid false-positive associations, yield many false-negative associations. Laboratory rats (Rattus norvegicus) are useful to study many aspects of breast cancer, including genetic susceptibility. Several rat mammary cancer associated loci have been identified using genetic linkage and congenic strain based-approaches. Here, we sought to determine the amount of overlap between GWA study nominated human breast and rat mammary cancer susceptibility loci. Methods We queried published GWA studies to identify two groups of SNPs, one that reached genome-wide significance and one comprised of SNPs failing a validation step and not reaching genome- wide significance. Human genome locations of these SNPs were compared to known rat mammary carcinoma susceptibility loci to determine if risk alleles existed in both species. Rat genome regions not known to associate with mammary cancer risk were randomly selected as control regions. Results Significantly more human breast cancer risk GWA study nominated SNPs mapped at orthologs of rat mammary cancer loci than to regions not known to contain rat mammary cancer loci. The rat genome was useful to predict associations that had met human genome-wide significance criteria and weaker associations that had not. Conclusions Integration of human and rat comparative genomics may be useful to parse out false-negative associations in GWA studies of breast cancer risk.
Collapse
|
18
|
Larson NB, Jenkins GD, Larson MC, Vierkant RA, Sellers TA, Phelan CM, Schildkraut JM, Sutphen R, Pharoah PPD, Gayther SA, Wentzensen N, Goode EL, Fridley BL. Kernel canonical correlation analysis for assessing gene-gene interactions and application to ovarian cancer. Eur J Hum Genet 2014; 22:126-31. [PMID: 23591404 PMCID: PMC3865403 DOI: 10.1038/ejhg.2013.69] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2012] [Revised: 01/11/2013] [Accepted: 01/16/2013] [Indexed: 01/24/2023] Open
Abstract
Although single-locus approaches have been widely applied to identify disease-associated single-nucleotide polymorphisms (SNPs), complex diseases are thought to be the product of multiple interactions between loci. This has led to the recent development of statistical methods for detecting statistical interactions between two loci. Canonical correlation analysis (CCA) has previously been proposed to detect gene-gene coassociation. However, this approach is limited to detecting linear relations and can only be applied when the number of observations exceeds the number of SNPs in a gene. This limitation is particularly important for next-generation sequencing, which could yield a large number of novel variants on a limited number of subjects. To overcome these limitations, we propose an approach to detect gene-gene interactions on the basis of a kernelized version of CCA (KCCA). Our simulation studies showed that KCCA controls the Type-I error, and is more powerful than leading gene-based approaches under a disease model with negligible marginal effects. To demonstrate the utility of our approach, we also applied KCCA to assess interactions between 200 genes in the NF-κB pathway in relation to ovarian cancer risk in 3869 cases and 3276 controls. We identified 13 significant gene pairs relevant to ovarian cancer risk (local false discovery rate <0.05). Finally, we discuss the advantages of KCCA in gene-gene interaction analysis and its future role in genetic association studies.
Collapse
Affiliation(s)
- Nicholas B Larson
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Gregory D Jenkins
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Melissa C Larson
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Robert A Vierkant
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | | | | | | | - Rebecca Sutphen
- Department of Pediatrics, Universty of South Florida College of Medicine, Tampa, FL, USA
| | | | - Simon A Gayther
- Department of Preventative Medicine, University of Southern California, Los Angeles, CA, USA
| | - Nicolas Wentzensen
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Ovarian Cancer Association Consortium
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
- Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, USA
- Duke Comprehensive Cancer Center, Duke University, Durham, NC, USA
- Department of Pediatrics, Universty of South Florida College of Medicine, Tampa, FL, USA
- Department of Oncology, University of Cambridge, Cambridge, UK
- Department of Preventative Medicine, University of Southern California, Los Angeles, CA, USA
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
- Department of Biostatistics, University of Kansas Medical Center, Kansas City, KS, USA
| | - Ellen L Goode
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Brooke L Fridley
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
- Department of Biostatistics, University of Kansas Medical Center, Kansas City, KS, USA
| |
Collapse
|
19
|
Becker N. Aktives Monitoring kleinräumiger Krebshäufungen. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz 2014; 57:41-6. [DOI: 10.1007/s00103-013-1875-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
20
|
Liu B, Shen X, Pan W. Semi-supervised spectral clustering with application to detect population stratification. Front Genet 2013; 4:215. [PMID: 24298278 PMCID: PMC3829479 DOI: 10.3389/fgene.2013.00215] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2013] [Accepted: 10/04/2013] [Indexed: 11/22/2022] Open
Abstract
In genetic association studies, unaccounted population stratification can cause spurious associations in a discovery process of identifying disease-associated genetic markers. In such a situation, prior information is often available for some subjects' population identities. To leverage the additional information, we propose a semi-supervised clustering approach for detecting population stratification. This approach maintains the advantages of spectral clustering, while is integrated with the additional identity information, leading to sharper clustering performance. To demonstrate utility of our approach, we analyze a whole-genome sequencing dataset from the 1000 Genomes Project, consisting of the genotypes of 607 individuals sampled from three continental groups involving 10 subpopulations. This is compared against a semi-supervised spectral clustering method, in addition to a spectral clustering method, with the known subpopulation information by the Rand index and an adjusted Rand (ARand) index. The numerical results suggest that the proposed method outperforms its competitors in detecting population stratification.
Collapse
Affiliation(s)
- Binghui Liu
- Division of Biostatistics, School of Public Health, University of Minnesota Minneapolis, MN, USA ; School of Statistics, University of Minnesota Minneapolis, MN, USA
| | | | | |
Collapse
|
21
|
Single nucleotide variation in the TP53 3' untranslated region in diffuse large B-cell lymphoma treated with rituximab-CHOP: a report from the International DLBCL Rituximab-CHOP Consortium Program. Blood 2013; 121:4529-40. [PMID: 23515929 DOI: 10.1182/blood-2012-12-471722] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
We identified multiple single nucleotide variants (SNVs) in the TP53 3' untranslated region (3'UTR) in tumor specimens from 244 patients with diffuse large B-cell lymphoma (DLBCL). Patients carrying a wild-type TP53 coding sequence (CDS) and 1 or more 3'UTR SNVs had a better 5-year survival rate than patients carrying a wild-type CDS and the reference 3'UTR, yet there is no statistically significance difference in overall survival (OS). In contrast, 3'UTR variation predicted poorer OS for patients with a mutant TP53 CDS. We then sequenced TP53 3'UTR in 247 additional DLBCL patients as a validation set. Altogether, we identified 187 novel SNVs; 36 occurred at least twice. Most of the newly identified 3'UTR SNVs were located at sites that are complementary to seed sequences of microRNAs (miRNAs) that are predicted or experimentally known to target TP53. Three SNVs disrupt the seed match between miR-125b and the TP53 3'UTR, thereby impeding suppression of p53 by this miRNA. In addition, a germline SNV (rs78378222) located in the TP53 polyadenylation signal resulted in downregulation of both p53 messenger RNA and protein levels and reduction of cellular apoptosis. This study is the first to demonstrate the prognostic value of the TP53 3'UTR in cancer.
Collapse
|
22
|
Aljasir B, Ioannidis JPA, Yurkiewich A, Moher D, Higgins JPT, Arora P, Little J. Assessment of systematic effects of methodological characteristics on candidate genetic associations. Hum Genet 2013; 132:167-78. [PMID: 23095857 DOI: 10.1007/s00439-012-1237-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2012] [Accepted: 10/08/2012] [Indexed: 12/30/2022]
Abstract
Candidate genetic association studies have been found to have a low replication rate in the past. Here, we aimed to assess whether aspects of reported methodological characteristics in genetic association studies may be related to the magnitude of effects observed. An observational, literature-based investigation of 511 case-control studies of genetic association studies indexed in 2007, was undertaken. Meta-regression analyses were used to assess the relationship between 23 reported methodological characteristics and the magnitude of genetic associations. The 511 studies had been conducted in 52 countries and were published in 220 journals (median impact factor 5.1). The multivariate meta-regression model of methodological characteristics plus disease category accounted for 17.2 % of the between-study variance in the magnitude of the reported genetic associations. Our findings are consistent with the view that better conducted and better reported genetic association research may lead to less inflated results.
Collapse
Affiliation(s)
- Badr Aljasir
- Department of Epidemiology and Community Medicine, University of Ottawa, Ottawa, ON, Canada
| | | | | | | | | | | | | |
Collapse
|
23
|
Ikegawa S. A short history of the genome-wide association study: where we were and where we are going. Genomics Inform 2012; 10:220-5. [PMID: 23346033 PMCID: PMC3543921 DOI: 10.5808/gi.2012.10.4.220] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2012] [Revised: 10/31/2012] [Accepted: 11/01/2012] [Indexed: 01/03/2023] Open
Abstract
Recent rapid advances in genetic research are ushering us into the genome sequence era, where an individual's genome information is utilized for clinical practice. The most spectacular results of the human genome study have been provided by genome-wide association studies (GWASs). This is a review of the history of GWASs as related to my work. Further efforts are necessary to make full use of its potential power to medicine.
Collapse
Affiliation(s)
- Shiro Ikegawa
- Laboratory of Bone and Joint Diseases, Center for Genomic Medicine, RIKEN, Tokyo 108-8639, Japan
| |
Collapse
|
24
|
Zhao Y, Chen F, Zhai R, Lin X, Wang Z, Su L, Christiani DC. Correction for population stratification in random forest analysis. Int J Epidemiol 2012; 41:1798-806. [PMID: 23148107 DOI: 10.1093/ije/dys183] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
BACKGROUND Population structure (PS), including population stratification and admixture, is a significant confounder in genome-wide association studies (GWAS), as it may produce spurious associations. Random forest (RF) has been increasingly applied in GWAS data analysis because of its advantage in analysing high dimensional genetic data. RF creates importance measures for single nucleotide polymorphisms (SNPs), which are helpful for feature selections. However, if PS is not appropriately corrected, RF tends to give high importance to disease-unrelated SNPs with different frequencies of allele or genotype among subpopulations, leading to inaccurate results. METHODS In this study, the authors propose to correct for the confounding effect of PS by including the information of PS in RF analysis. The correction procedure starts by extracting the information of PS using EIGENSTRAT or multi-dimensional scaling clustering procedure from a large number of structure inference SNPs. Phenotype and genotypes adjusted by the information of PS are then used as the outcome and predictors in RF analysis. RESULTS Extensive simulations indicate that the importance measure of the causal SNP is increased following the PS correction. By analysing a real dataset, the proposed correction removes the spurious association between the lactase gene and height. CONCLUSION The authors propose a simple method to correct for PS in RF analysis on GWAS data. Further studies in real GWAS datasets are required to validate the robustness of the proposed approach.
Collapse
Affiliation(s)
- Yang Zhao
- Environmental and Occupational Medicine and Epidemiology Program, Department of Environmental Health, Harvard School of Public Health, Harvard University, Boston, MA, USA
| | | | | | | | | | | | | |
Collapse
|
25
|
Höglund JK, Guldbrandtsen B, Lund MS, Sahana G. Analyzes of genome-wide association follow-up study for calving traits in dairy cattle. BMC Genet 2012; 13:71. [PMID: 22888914 PMCID: PMC3465222 DOI: 10.1186/1471-2156-13-71] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2012] [Accepted: 08/02/2012] [Indexed: 12/01/2022] Open
Abstract
BACKGROUND There is often a pronounced disagreement between results obtained from different genome-wide association studies in cattle. There are multiple reasons for this disagreement. Particularly the presence of false positives leads to a need to validate detected QTL before they are optimally incorporated or weighted in selection decisions or further studied for causal gene. In dairy cattle progeny testing scheme new data is routinely accumulated which can be used to validate previously discovered associations. However, the data is not an independent sample and the sample size may not be sufficient to have enough power to validate previous discoveries. Here we compared two strategies to validate previously detected QTL when new data is added from the same study population. We compare analyzing a combined dataset (COMB) including all data presently available to only analyzing a validation dataset (VAL) i.e. a new dataset not previously analyzed as an independent replication. Secondly, we confirm SNP detected in the Reference population (REF) (i.e. previously analyzed dataset consists of older bulls) in the VAL dataset. RESULTS Clearly the results from the combined (COMB) dataset which had nearly twice the sample size of other two subsets allowed the detection of far more significant associations than the two smaller subsets. The number of significant SNPs in REF (older bulls) was about four times higher compare to VAL (younger bulls) though both had similar sample sizes, 2,219 and 2,039 respectively. A total of 424 SNP-trait combinations on 22 chromosomes showed genome-wide significant association involving 284 unique SNPs in the COMB dataset. In the REF data set 101 associations (73 unique SNPs) and in the VAL 24 associations (18 unique SNPs) were found genome-wide significant. Sixty-eight percent of the SNPs in the REF dataset could be confirmed in the VAL dataset. Out of 469 unique SNPs showing chromosome-wide significant association with calving traits in the REF dataset 321 could be confirmed in the VAL dataset at P < 0.05. CONCLUSIONS The follow-up study for GWAS in cattle will depend on the aim of the study. If the aim is to discover novel QTL, analyses of the COMB dataset is recommended, while in case of identification of the causal mutation underlying a QTL, confirmation of the discovered SNPs are necessary to avoid following a false positive.
Collapse
Affiliation(s)
- Johanna K Höglund
- Department of Molecular Biology and Genetics, Aarhus University, P.O. Box 50, Tjele, DK-8830, Denmark
- VikingGenetics, Ebeltoftvej 16, Assentoft, Randers, SØ, DK-8960, Denmark
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, P.O. Box 7070, Uppsala, 750 07, Sweden
| | - Bernt Guldbrandtsen
- Department of Molecular Biology and Genetics, Aarhus University, P.O. Box 50, Tjele, DK-8830, Denmark
| | - Mogens S Lund
- Department of Molecular Biology and Genetics, Aarhus University, P.O. Box 50, Tjele, DK-8830, Denmark
| | - Goutam Sahana
- Department of Molecular Biology and Genetics, Aarhus University, P.O. Box 50, Tjele, DK-8830, Denmark
| |
Collapse
|
26
|
Abstract
The power of a statistical test is the probability that it will yield a statistically significant result given that the null hypothesis is false. In other words, it represents the chance that the study will be successful in detecting a true effect and is dependent on a number of factors, including the magnitude of the effect, the sample size and study design, and the specified false-positive rate. Power calculations are primarily performed during the planning stages of a study, most typically in determining the sample size required. Consideration of statistical power can also sometimes shed light on the results of completed studies, particularly in the interpretation of negative results. In this article, we review the fundamentals of statistical power, discuss how power is calculated (using a genetic case/control study as an example), and consider the most pertinent factors that influence power in genetic studies. Finally, we focus on power in the context of modern whole-genome association studies, in which issues of coverage, multiple testing, and staged designs are paramount.
Collapse
|
27
|
Todd S, Fazil Baksh M, Whitehead J. Sequential methods for pharmacogenetic studies. Comput Stat Data Anal 2012. [DOI: 10.1016/j.csda.2011.02.019] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
28
|
Abstract
Hematological traits are essential biomedical indicators that are widely used in clinical practice. The elucidation of the etiology that determines an individual's hematological traits would have a substantial impact. Hematological traits are known to be heritable, and it has been suggested that genetic factors contribute significantly to the inter-individual variance of these traits. Here, we review our current knowledge regarding the genetic architecture of hematological traits in humans, most of which has been obtained through recent developments in genome-wide association studies (GWAS). In addition to current knowledge, which is based on the hematological traits of the three major blood-cell lineages (white blood cells; WBC, red blood cells; RBC, and platelets; PLT), we propose future approaches that would be useful as a next step in the post-GWAS era.
Collapse
|
29
|
Molecular genetic studies of gene identification for sarcopenia. Hum Genet 2011; 131:1-31. [PMID: 21706341 DOI: 10.1007/s00439-011-1040-7] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2011] [Accepted: 06/12/2011] [Indexed: 02/07/2023]
Abstract
Sarcopenia, which is characterized by a progressive decrease of skeletal muscle mass and function with aging, is closely related to several common diseases (such as cardiovascular and airway diseases) and functional impairment/disability. Strong genetic determination has been reported for muscle mass and muscle strength, two most commonly recognized and studied risk phenotypes for sarcopenia, with heritability ranging from 30 to 85% for muscle strength and 45-90% for muscle mass. Sarcopenia has been the subject of increasing genetic research over the past decade. This review is designed to comprehensively summarize the most important and representative molecular genetic studies designed to identify genetic factors associated with sarcopenia. We have methodically reviewed whole-genome linkage studies in humans, quantitative trait loci mapping in animal models, candidate gene association studies, newly reported genome-wide association studies, DNA microarrays and microRNA studies of sarcopenia or related skeletal muscle phenotypes. The major results of each study are tabulated for easy comparison and reference. The findings of representative studies are discussed with respect to their influence on our present understanding of the genetics of sarcopenia. This is a comprehensive review of molecular genetic studies of gene identification for sarcopenia, and an overarching theme for this review is that the currently accumulating results are tentative and occasionally inconsistent and should be interpreted with caution pending further investigation. Consequently, this overview should enhance recognition of the need to validate/replicate the genetic variants underlying sarcopenia in large human cohorts and animal. We believe that further progress in understanding the genetic etiology of sarcopenia will provide valuable insights into important fundamental biological mechanisms underlying muscle physiology that will ultimately lead to improved ability to recognize individuals at risk for developing sarcopenia and our ability to treat this debilitating condition.
Collapse
|
30
|
Physical confirmation and mapping of overlapping rat mammary carcinoma susceptibility QTLs, Mcs2 and Mcs6. PLoS One 2011; 6:e19891. [PMID: 21625632 PMCID: PMC3097214 DOI: 10.1371/journal.pone.0019891] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2011] [Accepted: 04/14/2011] [Indexed: 12/03/2022] Open
Abstract
Only a portion of the estimated heritability of breast cancer susceptibility has been explained by individual loci. Comparative genetic approaches that first use an experimental organism to map susceptibility QTLs are unbiased methods to identify human orthologs to target in human population-based genetic association studies. Here, overlapping rat mammary carcinoma susceptibility (Mcs) predicted QTLs, Mcs6 and Mcs2, were physically confirmed and mapped to identify the human orthologous region. To physically confirm Mcs6 and Mcs2, congenic lines were established using the Wistar-Furth (WF) rat strain, which is susceptible to developing mammary carcinomas, as the recipient (genetic background) and either Wistar-Kyoto (WKy, Mcs6) or Copenhagen (COP, Mcs2), which are resistant, as donor strains. By comparing Mcs phenotypes of WF.WKy congenic lines with distinct segments of WKy chromosome 7 we physically confirmed and mapped Mcs6 to ∼33 Mb between markers D7Rat171 and gUwm64-3. The predicted Mcs2 QTL was also physically confirmed using segments of COP chromosome 7 introgressed into a susceptible WF background. The Mcs6 and Mcs2 overlapping genomic regions contain multiple annotated genes, but none have a clear or well established link to breast cancer susceptibility. Igf1 and Socs2 are two of multiple potential candidate genes in Mcs6. The human genomic region orthologous to rat Mcs6 is on chromosome 12 from base positions 71,270,266 to 105,502,699. This region has not shown a genome-wide significant association to breast cancer risk in pun studies of breast cancer susceptibility.
Collapse
|
31
|
Notaridou M, Quaye L, Dafou D, Jones C, Song H, Høgdall E, Kjaer SK, Christensen L, Høgdall C, Blaakaer J, McGuire V, Wu AH, Van Den Berg DJ, Pike MC, Gentry-Maharaj A, Wozniak E, Sher T, Jacobs IJ, Tyrer J, Schildkraut JM, Moorman PG, Iversen ES, Jakubowska A, Mędrek K, Lubiński J, Ness RB, Moysich KB, Lurie G, Wilkens LR, Carney ME, Wang-Gohrke S, Doherty JA, Rossing MA, Beckmann MW, Thiel FC, Ekici AB, Chen X, Beesley J, Gronwald J, Fasching PA, Chang-Claude J, Goodman MT, Chenevix-Trench G, Berchuck A, Pearce CL, Whittemore AS, Menon U, Pharoah PD, Gayther SA, Ramus SJ. Common alleles in candidate susceptibility genes associated with risk and development of epithelial ovarian cancer. Int J Cancer 2011; 128:2063-74. [PMID: 20635389 PMCID: PMC3098608 DOI: 10.1002/ijc.25554] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2010] [Revised: 05/26/2010] [Accepted: 06/24/2010] [Indexed: 12/26/2022]
Abstract
Common germline genetic variation in the population is associated with susceptibility to epithelial ovarian cancer. Microcell-mediated chromosome transfer and expression microarray analysis identified nine genes associated with functional suppression of tumorogenicity in ovarian cancer cell lines; AIFM2, AKTIP, AXIN2, CASP5, FILIP1L, RBBP8, RGC32, RUVBL1 and STAG3. Sixty-three tagging single nucleotide polymorphisms (tSNPs) in these genes were genotyped in 1,799 invasive ovarian cancer cases and 3,045 controls to look for associations with disease risk. Two SNPs in RUVBL1, rs13063604 and rs7650365, were associated with increased risk of serous ovarian cancer [HetOR = 1.42 (1.15-1.74) and the HomOR = 1.63 (1.10-1.42), p-trend = 0.0002] and [HetOR = 0.97 (0.80-1.17), HomOR = 0.74 (0.58-0.93), p-trend = 0.009], respectively. We genotyped rs13063604 and rs7650365 in an additional 4,590 cases and 6,031 controls from ten sites from the United States, Europe and Australia; however, neither SNP was significant in Stage 2. We also evaluated the potential role of tSNPs in these nine genes in ovarian cancer development by testing for allele-specific loss of heterozygosity (LOH) in 286 primary ovarian tumours. We found frequent LOH for tSNPs in AXIN2, AKTIP and RGC32 (64, 46 and 34%, respectively) and one SNP, rs1637001, in STAG3 showed significant allele-specific LOH with loss of the common allele in 94% of informative tumours (p = 0.015). Array comparative genomic hybridisation indicated that this nonrandom allelic imbalance was due to amplification of the rare allele. In conclusion, we show evidence for the involvement of a common allele of STAG3 in the development of epithelial ovarian cancer.
Collapse
Affiliation(s)
- Maria Notaridou
- Gynaecological Oncology Unit, UCL EGA Institute for Women’s Health, University College London, United Kingdom
| | - Lydia Quaye
- Gynaecological Oncology Unit, UCL EGA Institute for Women’s Health, University College London, United Kingdom
| | - Dimitra Dafou
- Department of Medical and Molecular Genetics, King’s College London School of Medicine, Guy’s Hospital, London, United Kingdom
| | - Chris Jones
- Gynaecological Oncology Unit, UCL EGA Institute for Women’s Health, University College London, United Kingdom
| | - Honglin Song
- CR-UK Department of Oncology, University of Cambridge, Strangeways Research Laboratory, Cambridge, United Kingdom
| | - Estrid Høgdall
- Department of Viruses, Hormones and Cancer, Institute of Cancer Epidemiology, Danish Cancer Society, Copenhagen, Denmark
| | - Susanne K. Kjaer
- Department of Viruses, Hormones and Cancer, Institute of Cancer Epidemiology, Danish Cancer Society, Copenhagen, Denmark
| | - Lise Christensen
- Department of Pathology, Bispebjerg Hospital, University of Copenhagen, Copenhagen, Denmark
| | - Claus Høgdall
- The Gynaecologic Clinic, The Juliane Marie Centre, Rigshospitalet, University of Copenhagen, Denmark
| | - Jan Blaakaer
- Department of Gynaecology and Obstetrics, Aarhus University Hospital, Skejby, Aarhus, Denmark
| | - Valerie McGuire
- Department of Health Research and Policy, Stanford University School of Medicine, Stanford, CA
| | - Anna H. Wu
- University of Southern California, Keck School of Medicine, Department of Preventive Medicine, Los Angeles, CA
| | - David J. Van Den Berg
- University of Southern California, Keck School of Medicine, Department of Preventive Medicine, Los Angeles, CA
| | - Malcolm C. Pike
- University of Southern California, Keck School of Medicine, Department of Preventive Medicine, Los Angeles, CA
| | - Aleksandra Gentry-Maharaj
- Gynaecological Oncology Unit, UCL EGA Institute for Women’s Health, University College London, United Kingdom
| | - Eva Wozniak
- Gynaecological Oncology Unit, UCL EGA Institute for Women’s Health, University College London, United Kingdom
| | - Tanya Sher
- Gynaecological Oncology Unit, UCL EGA Institute for Women’s Health, University College London, United Kingdom
| | - Ian J. Jacobs
- Gynaecological Oncology Unit, UCL EGA Institute for Women’s Health, University College London, United Kingdom
| | - Jonathan Tyrer
- CR-UK Department of Oncology, University of Cambridge, Strangeways Research Laboratory, Cambridge, United Kingdom
| | | | - Patricia G. Moorman
- Department of Community and Family Medicine, Duke University Medical Center, Durham, NC
| | - Edwin S. Iversen
- Department of Statistical Science, Duke University, Medical Center, Durham, NC
| | - Anna Jakubowska
- Department of Genetics and Pathology, International Hereditary Cancer Center, Pomeranian Medical University, Szczecin, Poland
| | - Krzysztof Mędrek
- Department of Genetics and Pathology, International Hereditary Cancer Center, Pomeranian Medical University, Szczecin, Poland
| | - Jan Lubiński
- Department of Genetics and Pathology, International Hereditary Cancer Center, Pomeranian Medical University, Szczecin, Poland
| | | | - Kirsten B. Moysich
- Department of Cancer Prevention and Control, Roswell Park Cancer Institute, Buffalo, NY
| | - Galina Lurie
- Cancer Research Center of Hawaii, University of Hawaii, Honolulu, HI
| | - Lynne R. Wilkens
- Cancer Research Center of Hawaii, University of Hawaii, Honolulu, HI
| | - Michael E. Carney
- Cancer Research Center of Hawaii, University of Hawaii, Honolulu, HI
| | - Shan Wang-Gohrke
- Department of Obstetrics and Gynecology, University of Ulm, Ulm, Germany
| | - Jennifer A. Doherty
- Program in Epidemiology, Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington
| | - Mary Anne Rossing
- Program in Epidemiology, Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington
| | - Matthias W. Beckmann
- Department of Gynecology and Obstetrics, University Hospital Erlangen, Erlangen, Germany
| | - Falk C. Thiel
- Department of Gynecology and Obstetrics, University Hospital Erlangen, Erlangen, Germany
| | - Arif B. Ekici
- Institute of Human Genetics, Friedrich-Alexander-University Erlangen-Nuremberg, Erlangen, Germany
| | - Xiaoqing Chen
- Genetics and Population Health, The Queensland Institute of Medical Research, Post Office Royal Brisbane Hospital, Australia
| | - Jonathan Beesley
- Genetics and Population Health, The Queensland Institute of Medical Research, Post Office Royal Brisbane Hospital, Australia
| | | | - Jacek Gronwald
- Department of Genetics and Pathology, International Hereditary Cancer Center, Pomeranian Medical University, Szczecin, Poland
| | - Peter A. Fasching
- Department of Gynecology and Obstetrics, University Hospital Erlangen, Erlangen, Germany
- Division of Hematology and Oncology, University of California at Los Angeles, David Geffen School of Medicine, Los Angeles, CA
| | - Jenny Chang-Claude
- Division of Cancer Epidemiology, German Cancer Research Center, Heidelberg, Germany
| | - Marc T. Goodman
- Cancer Research Center of Hawaii, University of Hawaii, Honolulu, HI
| | - Georgia Chenevix-Trench
- Genetics and Population Health, The Queensland Institute of Medical Research, Post Office Royal Brisbane Hospital, Australia
| | - Andrew Berchuck
- Department of Obstetrics and Gynecology/Division of Gynecologic Oncology, Duke University Medical Center, Durham, NC, 27710
| | - C. Leigh Pearce
- University of Southern California, Keck School of Medicine, Department of Preventive Medicine, Los Angeles, CA
| | - Alice S. Whittemore
- Department of Health Research and Policy, Stanford University School of Medicine, Stanford, CA
| | - Usha Menon
- Gynaecological Oncology Unit, UCL EGA Institute for Women’s Health, University College London, United Kingdom
| | - Paul D.P. Pharoah
- CR-UK Department of Oncology, University of Cambridge, Strangeways Research Laboratory, Cambridge, United Kingdom
| | - Simon A. Gayther
- Gynaecological Oncology Unit, UCL EGA Institute for Women’s Health, University College London, United Kingdom
| | - Susan J. Ramus
- Gynaecological Oncology Unit, UCL EGA Institute for Women’s Health, University College London, United Kingdom
| | | |
Collapse
|
32
|
Factors affecting the effective number of tests in genetic association studies: a comparative study of three PCA-based methods. J Hum Genet 2011; 56:428-35. [PMID: 21451529 DOI: 10.1038/jhg.2011.34] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The number of tested marker becomes numerous in genetic association studies (GAS) and one major challenge is to derive the multiple testing threshold. Some approaches calculating an effective number (M(eff)) of tests in GAS were developed and have been shown to be promising. As yet, there have been no comparisons of their robustness to influencing factors. We evaluated the performance of three principal component analysis (PCA)-based M(eff) estimation formulas (M(eff-C) in Cheverud (2001), M(eff-L) in Li and Ji (2005), and M(eff-G) in Galwey (2009)). Four influencing factors including LD measurements, marker density, population samples and the total number of tested markers were considered. We validated them by the Bonferroni's method and the permutation test with 10 000 random shuffles based on three real data sets. For each factor, M(eff-C) yielded conservative threshold except with D' coefficient, and M(eff-G) would be too liberal compared with the permutation test. Our results indicated that M(eff-L) based on r(2) coefficient achieve close approximation of the permutation threshold. As for a large number of markers, we recommended to use M(eff-L) with r(2) coefficient according to fixed-length separation, as well as fixed-number separation, to obtain accurate estimate of the multiple testing threshold and to save more computational time.
Collapse
|
33
|
Jiang L, Liu J, Sun D, Ma P, Ding X, Yu Y, Zhang Q. Genome wide association studies for milk production traits in Chinese Holstein population. PLoS One 2010; 5:e13661. [PMID: 21048968 PMCID: PMC2965099 DOI: 10.1371/journal.pone.0013661] [Citation(s) in RCA: 176] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2010] [Accepted: 10/04/2010] [Indexed: 11/21/2022] Open
Abstract
Genome-wide association studies (GWAS) based on high throughput SNP genotyping technologies open a broad avenue for exploring genes associated with milk production traits in dairy cattle. Motivated by pinpointing novel quantitative trait nucleotide (QTN) across Bos Taurus genome, the present study is to perform GWAS to identify genes affecting milk production traits using current state-of-the-art SNP genotyping technology, i.e., the Illumina BovineSNP50 BeadChip. In the analyses, the five most commonly evaluated milk production traits are involved, including milk yield (MY), milk fat yield (FY), milk protein yield (PY), milk fat percentage (FP) and milk protein percentage (PP). Estimated breeding values (EBVs) of 2,093 daughters from 14 paternal half-sib families are considered as phenotypes within the framework of a daughter design. Association tests between each trait and the 54K SNPs are achieved via two different analysis approaches, a paternal transmission disequilibrium test (TDT)-based approach (L1-TDT) and a mixed model based regression analysis (MMRA). In total, 105 SNPs were detected to be significantly associated genome-wise with one or multiple milk production traits. Of the 105 SNPs, 38 were commonly detected by both methods, while four and 63 were solely detected by L1-TDT and MMRA, respectively. The majority (86 out of 105) of the significant SNPs is located within the reported QTL regions and some are within or close to the reported candidate genes. In particular, two SNPs, ARS-BFGL-NGS-4939 and BFGL-NGS-118998, are located close to the DGAT1 gene (160bp apart) and within the GHR gene, respectively. Our findings herein not only provide confirmatory evidences for previously findings, but also explore a suite of novel SNPs associated with milk production traits, and thus form a solid basis for eventually unraveling the causal mutations for milk production traits in dairy cattle.
Collapse
Affiliation(s)
- Li Jiang
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
| | - Jianfeng Liu
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
| | - Dongxiao Sun
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
| | - Peipei Ma
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
| | - Xiangdong Ding
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
| | - Ying Yu
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
| | - Qin Zhang
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, People's Republic of China
- * E-mail:
| |
Collapse
|
34
|
Abstract
Endometriosis is a common cause of morbidity in women with an unknown etiology. Studies have demonstrated the familial nature of endometriosis and suggest that inheritance occurs in a polygenic/multifactorial fashion. Studies have attempted to define the gene or genes responsible for endometriosis through association or linkage studies with candidate genes or DNA mapping technology. A number of genomics studies have demonstrated significant alterations in gene expression in endometriosis. A more thorough understanding of the genetics and genomics of endometriosis will facilitate understanding the basic biology of the disease and open new inroads to diagnosis and treatment of this enigmatic condition.
Collapse
|
35
|
Lascorz J, Försti A, Chen B, Buch S, Steinke V, Rahner N, Holinski-Feder E, Morak M, Schackert HK, Görgens H, Schulmann K, Goecke T, Kloor M, Engel C, Büttner R, Kunkel N, Weires M, Hoffmeister M, Pardini B, Naccarati A, Vodickova L, Novotny J, Schreiber S, Krawczak M, Bröring CD, Völzke H, Schafmayer C, Vodicka P, Chang-Claude J, Brenner H, Burwinkel B, Propping P, Hampe J, Hemminki K. Genome-wide association study for colorectal cancer identifies risk polymorphisms in German familial cases and implicates MAPK signalling pathways in disease susceptibility. Carcinogenesis 2010; 31:1612-9. [PMID: 20610541 DOI: 10.1093/carcin/bgq146] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Genetic susceptibility accounts for approximately 35% of all colorectal cancer (CRC). Ten common low-risk variants contributing to CRC risk have been identified through genome-wide association studies (GWASs). In our GWAS, 610 664 genotyped single-nucleotide polymorphisms (SNPs) passed the quality control filtering in 371 German familial CRC patients and 1263 controls, and replication studies were conducted in four additional case-control sets (4915 cases and 5607 controls). Known risk loci at 8q24.21 and 11q23 were confirmed, and a previously unreported association, rs12701937, located between the genes GLI3 (GLI family zinc finger 3) and INHBA (inhibin, beta A) [P = 1.1 x 10(-3), odds ratio (OR) 1.14, 95% confidence interval (CI) 1.05-1.23, dominant model in the combined cohort], was identified. The association was stronger in familial cases compared with unselected cases (P = 2.0 x 10(-4), OR 1.36, 95% CI 1.16-1.60, dominant model). Two other unreported SNPs, rs6038071, 40 kb upstream of CSNK2A1 (casein kinase 2, alpha 1 polypeptide) and an intronic marker in MYO3A (myosin IIIA), rs11014993, associated with CRC only in the familial CRC cases (P = 2.5 x 10(-3), recessive model, and P = 2.7 x 10(-4), dominant model). Three software tools successfully pointed to the overrepresentation of genes related to the mitogen-activated protein kinase (MAPK) signalling pathways among the 1340 most strongly associated markers from the GWAS (allelic P value < 10(-3)). The risk of CRC increased significantly with an increasing number of risk alleles in seven genes involved in MAPK signalling events (P(trend) = 2.2 x 10(-16), OR(per allele) = 1.34, 95% CI 1.11-1.61).
Collapse
Affiliation(s)
- Jesús Lascorz
- Division of Molecular Genetic Epidemiology, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Abstract
Genome-wide association studies (GWAS) provide an important avenue for undertaking an agnostic evaluation of the association between common genetic variants and risk of disease. Recent advances in our understanding of human genetic variation and the technology to measure such variation have made GWAS feasible. Over the past few years a multitude of GWAS have identified and replicated many associated variants. These findings are enriching our knowledge about the genetic basis of disease and leading some to advocate using GWA study results for genetic testing. For many of the GWA study results, however, the underlying mechanisms remain unclear and the findings explain only a limited amount of heritability. These issues may be clarified by more detailed investigations, including analyses of less common variants, sequence-level data, and environmental exposures. Such studies should help clarify the potential value of genetic testing to the public's health.
Collapse
Affiliation(s)
- John S Witte
- Institute for Human Genetics, Departments of Epidemiology and Biostatistics and Urology, University of California, San Francisco, San Francisco, California 94158-9001, USA.
| |
Collapse
|
37
|
Gene signature is associated with early stage rectal cancer recurrence. J Am Coll Surg 2010; 211:187-95. [PMID: 20670856 DOI: 10.1016/j.jamcollsurg.2010.03.035] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2010] [Revised: 03/23/2010] [Accepted: 03/24/2010] [Indexed: 01/22/2023]
Abstract
BACKGROUND Despite expected excellent outcomes of surgical resection for early stage rectal cancers, 20% of stage I and II rectal cancers recur. Identifying biologic factors that predict the subset prone to recur could allow more directed therapy. This study identifies a tumor gene expression profile that accurately predicts disease recurrence. STUDY DESIGN Stage I/II rectal cancer patients treated by surgery alone at a single institution were included and classified as having recurrent or nonrecurrent cancer. Tumor mRNA was isolated from frozen tissue and evaluated for total genome gene expression by microarray analysis. Background-corrected and normalized microarray data were analyzed using BAMarray software. Selected genes were further analyzed using unsupervised clustering and nearest-centroid classification. A balanced K-fold scoring-pair algorithm using 1,000 independent replications was used for gene signature development. RESULTS Sixty-nine patients with disease-free survival and 31 patients with recurrent disease were included at a median follow-up of 105 months (interquartile range 114 months) and 32 months (interquartile range 25 months), respectively. Demographics and tumor characteristics between groups were similar. Fifty-two genes from 43,148 probes were differentially expressed, and a 36-gene signature was found to be statistically associated with recurrence using a scoring-pair algorithm. Accuracy to identify recurrence as measured by area under the receiver operating characteristic curve was 0.803. CONCLUSIONS Differential gene expression within rectal cancers is associated with recurrence of early stage disease. A 36-gene signature correlates with an increased risk of more or less aggressive tumor behavior. This information obtainable at biopsy may assist in determining treatment decisions.
Collapse
|
38
|
Abstract
A two-stage design is cost-effective for genome-wide association studies (GWAS) testing hundreds of thousands of single nucleotide polymorphisms (SNPs). In this design, each SNP is genotyped in stage 1 using a fraction of case-control samples. Top-ranked SNPs are selected and genotyped in stage 2 using additional samples. A joint analysis, combining statistics from both stages, is applied in the second stage. Follow-up studies can be regarded as a two-stage design. Once some potential SNPs are identified, independent samples are further genotyped and analyzed separately or jointly with previous data to confirm the findings. When the underlying genetic model is known, an asymptotically optimal trend test (TT) can be used at each analysis. In practice, however, genetic models for SNPs with true associations are usually unknown. In this case, the existing methods for analysis of the two-stage design and follow-up studies are not robust across different genetic models. We propose a simple robust procedure with genetic model selection to the two-stage GWAS. Our results show that, if the optimal TT has about 80% power when the genetic model is known, then the existing methods for analysis of the two-stage design have minimum powers about 20% across the four common genetic models (when the true model is unknown), while our robust procedure has minimum powers about 70% across the same genetic models. The results can be also applied to follow-up and replication studies with a joint analysis.
Collapse
Affiliation(s)
- Minjung Kwak
- Office of Biostatistics Research, National Heart, Lung and Blood Institute, 6701 Rockledge Drive, MSC 7913, Bethesda, Maryland 20892-7913, USA
| | | | | |
Collapse
|
39
|
Luo S, Mukherjee B, Chen J, Chatterjee N. Shrinkage estimation for robust and efficient screening of single-SNP association from case-control genome-wide association studies. Genet Epidemiol 2010; 33:740-50. [PMID: 19434716 DOI: 10.1002/gepi.20428] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Population-based case-control design has become one of the most popular approaches for conducting genome-wide association scans for rare diseases like cancer. In this article, we propose a novel method for improving the power of the widely used single-single-nucleotide polymorphism (SNP) two-degrees-of-freedom (2 d.f.) association test for case-control studies by exploiting the common assumption of Hardy-Weinberg Equilibrium (HWE) for the underlying population. A key feature of the method is that it can relax the assumed model constraints via a completely data-adaptive shrinkage estimation approach so that the number of false-positive results due to the departure of HWE is controlled. The method is computationally simple and is easily scalable to association tests involving hundreds of thousands or millions of genetic markers. Simulation studies as well as an application involving data from a real genome-wide association study illustrate that the proposed method is very robust for large-scale association studies and can improve the power for detecting susceptibility SNPs with recessive effects, when compared to existing methods. Implications of the general estimation strategy beyond the simple 2 d.f. association test are discussed.
Collapse
Affiliation(s)
- Sheng Luo
- University of Texas Health Science Center, Houston, USA
| | | | | | | |
Collapse
|
40
|
The Informative Extremes: Using Both Nearest and Farthest Individuals Can Improve Relief Algorithms in the Domain of Human Genetics. EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS 2010. [DOI: 10.1007/978-3-642-12211-8_16] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
41
|
Gu X, Frankowski RF, Rosner GL, Relling M, Peng B, Amos CI. A modified forward multiple regression in high-density genome-wide association studies for complex traits. Genet Epidemiol 2009; 33:518-25. [PMID: 19365845 DOI: 10.1002/gepi.20404] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Genome-wide association studies (GWAS) have been widely used to identify genetic effects on complex diseases or traits. Most currently used methods are based on separate single-nucleotide polymorphism (SNP) analyses. Because this approach requires correction for multiple testing to avoid excessive false-positive results, it suffers from reduced power to detect weak genetic effects under limited sample size. To increase the power to detect multiple weak genetic factors and reduce false-positive results caused by multiple tests and dependence among test statistics, a modified forward multiple regression (MFMR) approach is proposed. Simulation studies show that MFMR has higher power than the Bonferroni and false discovery rate procedures for detecting moderate and weak genetic effects, and MFMR retains an acceptable-false positive rate even if causal SNPs are correlated with many SNPs due to population stratification or other unknown reasons.
Collapse
Affiliation(s)
- Xiangjun Gu
- Department of Epidemiology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas 77030, USA
| | | | | | | | | | | |
Collapse
|
42
|
Gupta V, Khadgawat R, Sachdeva MP. Significance of genome-wide association studies in molecular anthropology. Genet Test Mol Biomarkers 2009; 13:711-5. [PMID: 19810820 DOI: 10.1089/gtmb.2009.0072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The successful advent of a genome-wide approach in association studies raises the hopes of human geneticists for solving a genetic maze of complex traits especially the disorders. This approach, which is replete with the application of cutting-edge technology and supported by big science projects (like Human Genome Project; and even more importantly the International HapMap Project) and various important databases (SNP database, CNV database, etc.), has had unprecedented success in rapidly uncovering many of the genetic determinants of complex disorders. The magnitude of this approach in the genetics of classical anthropological variables like height, skin color, eye color, and other genome diversity projects has certainly expanded the horizons of molecular anthropology. Therefore, in this article we have proposed a genome-wide association approach in molecular anthropological studies by providing lessons from the exemplary study of the Wellcome Trust Case Control Consortium. We have also highlighted the importance and uniqueness of Indian population groups in facilitating the design and finding optimum solutions for other genome-wide association-related challenges.
Collapse
Affiliation(s)
- Vipin Gupta
- South Asia Network for Chronic Disease, Public Health Foundation of India, New Delhi, India.
| | | | | |
Collapse
|
43
|
Fridley BL. Bayesian variable and model selection methods for genetic association studies. Genet Epidemiol 2009; 33:27-37. [PMID: 18618760 DOI: 10.1002/gepi.20353] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Variable selection is growing in importance with the advent of high throughput genotyping methods requiring analysis of hundreds to thousands of single nucleotide polymorphisms (SNPs) and the increased interest in using these genetic studies to better understand common, complex diseases. Up to now, the standard approach has been to analyze the genotypes for each SNP individually to look for an association with a disease. Alternatively, combinations of SNPs or haplotypes are analyzed for association. Another added complication in studying complex diseases or phenotypes is that genetic risk for the disease is often due to multiple SNPs in various locations on the chromosome with small individual effects that may have a collectively large effect on the phenotype. Hence, multi-locus SNP models, as opposed to single SNP models, may better capture the true underlying genotypic-phenotypic relationship. Thus, innovative methods for determining which SNPs to include in the model are needed. The goal of this article is to describe several methods currently available for variable and model selection using Bayesian approaches and to illustrate their application for genetic association studies using both real and simulated candidate gene data for a complex disease. In particular, Bayesian model averaging (BMA), stochastic search variable selection (SSVS), and Bayesian variable selection (BVS) using a reversible jump Markov chain Monte Carlo (MCMC) for candidate gene association studies are illustrated using a study of age-related macular degeneration (AMD) and simulated data.
Collapse
Affiliation(s)
- Brooke L Fridley
- Division of Biostatistics, Department of Health Sciences Research, Mayo Clinic College of Medicine, Rochester, Minnesota 55905, USA.
| |
Collapse
|
44
|
Association between common germline genetic variation in 94 candidate genes or regions and risks of invasive epithelial ovarian cancer. PLoS One 2009; 4:e5983. [PMID: 19543528 PMCID: PMC2695002 DOI: 10.1371/journal.pone.0005983] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2009] [Accepted: 05/08/2009] [Indexed: 12/16/2022] Open
Abstract
Background Recent studies have identified several single nucleotide polymorphisms (SNPs) in the population that are associated with variations in the risks of many different diseases including cancers such as breast, prostate and colorectal. For ovarian cancer, the known highly penetrant susceptibility genes (BRCA1 and BRCA2) are probably responsible for only 40% of the excess familial ovarian cancer risks, suggesting that other susceptibility genes of lower penetrance exist. Methods We have taken a candidate approach to identifying moderate risk susceptibility alleles for ovarian cancer. To date, we have genotyped 340 SNPs from 94 candidate genes or regions, in up to 1,491 invasive epithelial ovarian cancer cases and 3,145 unaffected controls from three different population based studies from the UK, Denmark and USA. Results After adjusting for population stratification by genomic control, 18 SNPs (5.3%) were significant at the 5% level, and 5 SNPs (1.5%) were significant at the 1% level. The most significant association was for the SNP rs2107425, located on chromosome 11p15.5, which has previously been identified as a susceptibility allele for breast cancer from a genome wide association study (P-trend = 0.0012). When SNPs/genes were stratified into 7 different pathways or groups of validation SNPs, the breast cancer associated SNPs were the only group of SNPs that were significantly associated with ovarian cancer risk (P-heterogeneity = 0.0003; P-trend = 0.0028; adjusted (for population stratification) P-trend = 0.006). We did not find statistically significant associations when the combined data for all SNPs were analysed using an admixture maximum likelihood (AML) experiment-wise test for association (P-heterogeneity = 0.051; P-trend = 0.068). Conclusion These data suggest that a proportion of the SNPs we evaluated were associated with ovarian cancer risk, but that the effect sizes were too small to detect associations with individual SNPs.
Collapse
|
45
|
Abstract
High-dimensional and highly correlated data leading to non- or weakly identified effects are commonplace. Maximum likelihood will typically fail in such situations and a variety of shrinkage methods have been proposed. Standard techniques, such as ridge regression or the lasso, shrink estimates toward zero, with some approaches allowing coefficients to be selected out of the model by achieving a value of zero. When substantive information is available, estimates can be shrunk to nonnull values; however, such information may not be available. We propose a Bayesian semiparametric approach that allows shrinkage to multiple locations. Coefficients are given a mixture of heavy-tailed double exponential priors, with location and scale parameters assigned Dirichlet process hyperpriors to allow groups of coefficients to be shrunk toward the same, possibly nonzero, mean. Our approach favors sparse, but flexible, structure by shrinking toward a small number of random locations. The methods are illustrated using a study of genetic polymorphisms and Parkinson's disease.
Collapse
Affiliation(s)
- Richard F Maclehose
- Division of Biostatistics, University of Minnesota, Minneapolis, Minnesota 55455, USA.
| | | |
Collapse
|
46
|
Tsoi LC, Boehnke M, Klein RL, Zheng WJ. Evaluation of genome-wide association study results through development of ontology fingerprints. ACTA ACUST UNITED AC 2009; 25:1314-20. [PMID: 19349285 DOI: 10.1093/bioinformatics/btp158] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
MOTIVATION Genome-wide association (GWA) studies may identify multiple variants that are associated with a disease or trait. To narrow down candidates for further validation, quantitatively assessing how identified genes relate to a phenotype of interest is important. RESULTS We describe an approach to characterize genes or biological concepts (phenotypes, pathways, diseases, etc.) by ontology fingerprint--the set of Gene Ontology (GO) terms that are overrepresented among the PubMed abstracts discussing the gene or biological concept together with the enrichment p-value of these terms generated from a hypergeometric enrichment test. We then quantify the relevance of genes to the trait from a GWA study by calculating similarity scores between their ontology fingerprints using enrichment p-values. We validate this approach by correctly identifying corresponding genes for biological pathways with a 90% average area under the ROC curve (AUC). We applied this approach to rank genes identified through a GWA study that are associated with the lipid concentrations in plasma as well as to prioritize genes within linkage disequilibrium (LD) block. We found that the genes with highest scores were: ABCA1, lipoprotein lipase (LPL) and cholesterol ester transfer protein, plasma for high-density lipoprotein; low-density lipoprotein receptor, APOE and APOB for low-density lipoprotein; and LPL, APOA1 and APOB for triglyceride. In addition, we identified genes relevant to lipid metabolism from the literature even in cases where such knowledge was not reflected in current annotation of these genes. These results demonstrate that ontology fingerprints can be used effectively to prioritize genes from GWA studies for experimental validation.
Collapse
Affiliation(s)
- Lam C Tsoi
- Bioinformatics Graduate Program, Department of Biostatistics, Bioinformatics and Epidemiology, Medical University of South Carolina, Charleston, SC, USA
| | | | | | | |
Collapse
|
47
|
Quaye L, Song H, Ramus SJ, Gentry-Maharaj A, Høgdall E, DiCioccio RA, McGuire V, Wu AH, Van Den Berg DJ, Pike MC, Wozniak E, Doherty JA, Rossing MA, Ness RB, Moysich KB, Høgdall C, Blaakaer J, Easton DF, Ponder BAJ, Jacobs IJ, Menon U, Whittemore AS, Krüger-Kjaer S, Pearce CL, Pharoah PDP, Gayther SA. Tagging single-nucleotide polymorphisms in candidate oncogenes and susceptibility to ovarian cancer. Br J Cancer 2009; 100:993-1001. [PMID: 19240718 PMCID: PMC2661781 DOI: 10.1038/sj.bjc.6604947] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2008] [Revised: 01/19/2009] [Accepted: 01/26/2009] [Indexed: 01/02/2023] Open
Abstract
Low-moderate risk alleles that are relatively common in the population may explain a significant proportion of the excess familial risk of ovarian cancer (OC) not attributed to highly penetrant genes. In this study, we evaluated the risks of OC associated with common germline variants in five oncogenes (BRAF, ERBB2, KRAS, NMI and PIK3CA) known to be involved in OC development. Thirty-four tagging SNPs in these genes were genotyped in approximately 1800 invasive OC cases and 3000 controls from population-based studies in Denmark, the United Kingdom and the United States. We found no evidence of disease association for SNPs in BRAF, KRAS, ERBB2 and PIK3CA when OC was considered as a single disease phenotype; but after stratification by histological subtype, we found borderline evidence of association for SNPs in KRAS and BRAF with mucinous OC and in ERBB2 and PIK3CA with endometrioid OC. For NMI, we identified a SNP (rs11683487) that was associated with a decreased risk of OC (unadjusted P(dominant)=0.004). We then genotyped rs11683487 in another 1097 cases and 1792 controls from an additional three case-control studies from the United States. The combined odds ratio was 0.89 (95% confidence interval (CI): 0.80-0.99) and remained statistically significant (P(dominant)=0.032). We also identified two haplotypes in ERBB2 associated with an increased OC risk (P(global)=0.034) and a haplotype in BRAF that had a protective effect (P(global)=0.005). In conclusion, these data provide borderline evidence of association for common allelic variation in the NMI with risk of epithelial OC.
Collapse
Affiliation(s)
- L Quaye
- Gynaecological Oncology Department, UCL EGA Institute for Women's Health, University College London, London, UK
| | - H Song
- Strangeways Research Laboratory, CR-UK Department of Oncology, University of Cambridge, Cambridge, UK
| | - S J Ramus
- Gynaecological Oncology Department, UCL EGA Institute for Women's Health, University College London, London, UK
| | - A Gentry-Maharaj
- Gynaecological Oncology Department, UCL EGA Institute for Women's Health, University College London, London, UK
| | - E Høgdall
- Department of Virus, Hormones and Cancer, Institute of Cancer Epidemiology, Danish Cancer Society, Copenhagen, Denmark
| | - R A DiCioccio
- Department of Cancer Genetics, Roswell Park Cancer Institute, Buffalo, NY, USA
| | - V McGuire
- Department of Health Research and Policy, Stanford University School of Medicine, Stanford, CA, USA
| | - A H Wu
- Department of Preventive Medicine, University of Southern California, Keck School of Medicine, Los Angeles, CA, USA
| | - D J Van Den Berg
- Department of Preventive Medicine, University of Southern California, Keck School of Medicine, Los Angeles, CA, USA
| | - M C Pike
- Department of Preventive Medicine, University of Southern California, Keck School of Medicine, Los Angeles, CA, USA
| | - E Wozniak
- Gynaecological Oncology Department, UCL EGA Institute for Women's Health, University College London, London, UK
| | - J A Doherty
- JD Program in Epidemiology, Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - M A Rossing
- JD Program in Epidemiology, Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - R B Ness
- Department of Epidemiology and University of Pittsburgh Cancer Institute, Pittsburgh, PA, USA
| | - K B Moysich
- Department of Cancer Genetics, Roswell Park Cancer Institute, Buffalo, NY, USA
| | - C Høgdall
- Gynaecology Clinic, The Juliane Marie Centre, Rigshospitalet, University of Copenhagen, Denmark, CR-UK
| | - J Blaakaer
- Department of Gynaecology and Obstetrics, Aarhus University Hospital, Skejby, Aarhus, Denmark
| | - The Ovarian Cancer Association Consortium
- Gynaecological Oncology Department, UCL EGA Institute for Women's Health, University College London, London, UK
- Strangeways Research Laboratory, CR-UK Department of Oncology, University of Cambridge, Cambridge, UK
- Department of Virus, Hormones and Cancer, Institute of Cancer Epidemiology, Danish Cancer Society, Copenhagen, Denmark
- Department of Cancer Genetics, Roswell Park Cancer Institute, Buffalo, NY, USA
- Department of Health Research and Policy, Stanford University School of Medicine, Stanford, CA, USA
- Department of Preventive Medicine, University of Southern California, Keck School of Medicine, Los Angeles, CA, USA
- JD Program in Epidemiology, Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Department of Epidemiology and University of Pittsburgh Cancer Institute, Pittsburgh, PA, USA
- Gynaecology Clinic, The Juliane Marie Centre, Rigshospitalet, University of Copenhagen, Denmark, CR-UK
- Department of Gynaecology and Obstetrics, Aarhus University Hospital, Skejby, Aarhus, Denmark
- Department of Oncology, Strangeways Research Laboratory, Genetic Epidemiology Unit, University of Cambridge, Cambridge, UK
| | - D F Easton
- Department of Oncology, Strangeways Research Laboratory, Genetic Epidemiology Unit, University of Cambridge, Cambridge, UK
| | - B A J Ponder
- Strangeways Research Laboratory, CR-UK Department of Oncology, University of Cambridge, Cambridge, UK
| | - I J Jacobs
- Gynaecological Oncology Department, UCL EGA Institute for Women's Health, University College London, London, UK
| | - U Menon
- Gynaecological Oncology Department, UCL EGA Institute for Women's Health, University College London, London, UK
| | - A S Whittemore
- Department of Preventive Medicine, University of Southern California, Keck School of Medicine, Los Angeles, CA, USA
| | - S Krüger-Kjaer
- Department of Virus, Hormones and Cancer, Institute of Cancer Epidemiology, Danish Cancer Society, Copenhagen, Denmark
- Gynaecology Clinic, The Juliane Marie Centre, Rigshospitalet, University of Copenhagen, Denmark, CR-UK
| | - C L Pearce
- Department of Preventive Medicine, University of Southern California, Keck School of Medicine, Los Angeles, CA, USA
| | - P D P Pharoah
- Strangeways Research Laboratory, CR-UK Department of Oncology, University of Cambridge, Cambridge, UK
| | - S A Gayther
- Gynaecological Oncology Department, UCL EGA Institute for Women's Health, University College London, London, UK
| |
Collapse
|
48
|
Quaye L, Dafou D, Ramus SJ, Song H, Gentry-Maharaj A, Maharaj AG, Notaridou M, Hogdall E, Kjaer SK, Christensen L, Hogdall C, Easton DF, Jacobs I, Menon U, Pharoah PDP, Gayther SA. Functional complementation studies identify candidate genes and common genetic variants associated with ovarian cancer survival. Hum Mol Genet 2009; 18:1869-78. [PMID: 19270026 DOI: 10.1093/hmg/ddp107] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Common germline genetic variation and/or somatic alterations in tumours may be associated with survival in women diagnosed with ovarian cancer. The successful identification of genetic associations relies on a suitable strategy for identifying and testing candidate genes. We used microcell-mediated chromosome transfer approach and expression microarray analysis to identify genes that were associated with neoplastic suppression in ovarian cancer cell lines. Sixty-five tagging single nucleotide polymorphisms (tSNPs) in nine candidate genes were genotyped in approximately 1700 invasive ovarian cancer cases to look for associations with survival. For two of these genes, loss of heterozygosity (LOH) analysis of tSNPs in 314 ovarian tumours was used to identify associations between somatic gene deletions and survival. We identified significant associations with survival for a tSNP in caspase 5 (CASP5) [hazard ratio (HR) = 1.13 (95% CI: 1.00-1.27), P = 0.042] and two tSNPs in the retinoblastoma binding protein (RBBP8) gene [HR = 0.85 (95% CI: 0.75-0.95), P = 0.007 and HR = 0.83 (95% CI: 0.71-0.95), P = 0.009]. After adjusting for multiple prognostic factors in a multivariate Cox regression analysis, both associations in RBBP8 remained significant (P = 0.028 and 0.036). We then genotyped 314 ovarian tumours for several tSNPs in CASP5 and RBBP8 to identify gene deletions by LOH. For RBBP8, 35% of tumours in 101 informative cases showed somatic allelic deletion; LOH of RBBP8 was associated with a significantly worse prognosis [HR = 2.19 (95% CI: 1.36-3.54), P = 0.001]. In summary, a novel in vitro functional approach in ovarian cancer cells has identified RBBP8 as a gene for which both germline genetic variation and somatic alterations in tumours are associated with survival in ovarian cancer patients.
Collapse
Affiliation(s)
- Lydia Quaye
- Gynaecological Oncology Unit, UCL EGA Institute for Women's Health, University College London, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
49
|
Abstract
Twin and family studies have demonstrated that most cognitive traits are moderately to highly heritable. Neurodevelopmental disorders such as dyslexia, autism, and specific language impairment (SLI) also show strong genetic influence. Nevertheless, it has proved difficult for researchers to identify genes that would explain substantial amounts of variance in cognitive traits or disorders. Although this observation may seem paradoxical, it fits with a multifactorial model of how complex human traits are influenced by numerous genes that interact with one another, and with the environment, to produce a specific phenotype. Such a model can also explain why genetic influences on cognition have not vanished in the course of human evolution. Recent linkage and association studies of SLI and dyslexia are reviewed to illustrate these points. The role of nonheritable genetic mutations (sporadic copy number variants) in causing autism is also discussed. Finally, research on phenotypic correlates of allelic variation in the genes ASPM and microcephalin is considered; initial interest in these as genes for brain size or intelligence has been dampened by a failure to find phenotypic differences in people with different versions of these genes. There is a current vogue for investigators to include measures of allelic variants in studies of cognition and cognitive disorders. It is important to be aware that the effect sizes associated with these variants are typically small and hard to detect without extremely large sample sizes.
Collapse
Affiliation(s)
- D V M Bishop
- Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom.
| |
Collapse
|
50
|
Barnett GC, West CML, Dunning AM, Elliott RM, Coles CE, Pharoah PDP, Burnet NG. Normal tissue reactions to radiotherapy: towards tailoring treatment dose by genotype. Nat Rev Cancer 2009; 9:134-42. [PMID: 19148183 PMCID: PMC2670578 DOI: 10.1038/nrc2587] [Citation(s) in RCA: 508] [Impact Index Per Article: 33.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
A key challenge in radiotherapy is to maximize radiation doses to cancer cells while minimizing damage to surrounding healthy tissue. As severe toxicity in a minority of patients limits the doses that can be safely given to the majority, there is interest in developing a test to measure an individual's radiosensitivity before treatment. Variation in sensitivity to radiation is an inherited genetic trait and recent progress in genotyping raises the possibility of genome-wide studies to characterize genetic profiles that predict patient response to radiotherapy.
Collapse
Affiliation(s)
- Gillian C Barnett
- Department of Oncology, University of Cambridge, Oncology Centre, Addenbrooke's Hospital, Hills Road, Cambridge CB2 0QQ, UK.
| | | | | | | | | | | | | |
Collapse
|