1
|
Hanson HA, Leiser CL, Madsen MJ, Gardner J, Knight S, Cessna M, Sweeney C, Doherty JA, Smith KR, Bernard PS, Camp NJ. Family Study Designs Informed by Tumor Heterogeneity and Multi-Cancer Pleiotropies: The Power of the Utah Population Database. Cancer Epidemiol Biomarkers Prev 2020; 29:807-815. [PMID: 32098891 PMCID: PMC7168701 DOI: 10.1158/1055-9965.epi-19-0912] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Revised: 01/15/2020] [Accepted: 02/18/2020] [Indexed: 02/01/2023] Open
Abstract
BACKGROUND Previously, family-based designs and high-risk pedigrees have illustrated value for the discovery of high- and intermediate-risk germline breast cancer susceptibility genes. However, genetic heterogeneity is a major obstacle hindering progress. New strategies and analytic approaches will be necessary to make further advances. One opportunity with the potential to address heterogeneity via improved characterization of disease is the growing availability of multisource databases. Specific to advances involving family-based designs are resources that include family structure, such as the Utah Population Database (UPDB). To illustrate the broad utility and potential power of multisource databases, we describe two different novel family-based approaches to reduce heterogeneity in the UPDB. METHODS Our first approach focuses on using pedigree-informed breast tumor phenotypes in gene mapping. Our second approach focuses on the identification of families with similar pleiotropies. We use a novel network-inspired clustering technique to explore multi-cancer signatures for high-risk breast cancer families. RESULTS Our first approach identifies a genome-wide significant breast cancer locus at 2q13 [P = 1.6 × 10-8, logarithm of the odds (LOD) equivalent 6.64]. In the region, IL1A and IL1B are of particular interest, key cytokine genes involved in inflammation. Our second approach identifies five multi-cancer risk patterns. These clusters include expected coaggregations (such as breast cancer with prostate cancer, ovarian cancer, and melanoma), and also identify novel patterns, including coaggregation with uterine, thyroid, and bladder cancers. CONCLUSIONS Our results suggest pedigree-informed tumor phenotypes can map genes for breast cancer, and that various different cancer pleiotropies exist for high-risk breast cancer pedigrees. IMPACT Both methods illustrate the potential for decreasing etiologic heterogeneity that large, population-based multisource databases can provide.See all articles in this CEBP Focus section, "Modernizing Population Science."
Collapse
Affiliation(s)
- Heidi A Hanson
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah.
- Utah Population Database, University of Utah, Salt Lake City, Utah
- Department of Surgery, University of Utah, Salt Lake City, Utah
| | - Claire L Leiser
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah
- Department of Epidemiology, University of Washington, Seattle, Washington
| | - Michael J Madsen
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah
| | - John Gardner
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah
| | | | - Melissa Cessna
- Intermountain Biorepository, Intermountain Healthcare, Salt Lake City, Utah
- Department of Pathology, Intermountain Medical Center, Intermountain Healthcare, Salt Lake City, Utah
| | - Carol Sweeney
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah
- Utah Cancer Registry, University of Utah, Salt Lake City, Utah
- Department of Internal Medicine, University of Utah School of Medicine, Salt Lake City, Utah
| | - Jennifer A Doherty
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah
- Utah Cancer Registry, University of Utah, Salt Lake City, Utah
- Department of Population Sciences, University of Utah School of Medicine, Salt Lake City, Utah
| | - Ken R Smith
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah
- Utah Population Database, University of Utah, Salt Lake City, Utah
- Department of Family and Consumer Studies, University of Utah, Salt Lake City, Utah
| | - Philip S Bernard
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah
- Department of Pathology, University of Utah, Salt Lake City, Utah
| | - Nicola J Camp
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah
- Utah Population Database, University of Utah, Salt Lake City, Utah
- Department of Internal Medicine, University of Utah School of Medicine, Salt Lake City, Utah
| |
Collapse
|
2
|
Madsen MJ, Knight S, Sweeney C, Factor R, Salama M, Stijleman IJ, Rajamanickam V, Welm BE, Arunachalam S, Jones B, Rachamadugu R, Rowe K, Cessna MH, Thomas A, Kushi LH, Caan BJ, Bernard PS, Camp NJ. Reparameterization of PAM50 Expression Identifies Novel Breast Tumor Dimensions and Leads to Discovery of a Genome-Wide Significant Breast Cancer Locus at 12q15. Cancer Epidemiol Biomarkers Prev 2018; 27:644-652. [PMID: 29650789 DOI: 10.1158/1055-9965.epi-17-0887] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Revised: 11/30/2017] [Accepted: 04/02/2018] [Indexed: 11/16/2022] Open
Abstract
Background: Breast tumor subtyping has failed to provide impact in susceptibility genetics. The PAM50 assay categorizes breast tumors into: Luminal A, Luminal B, HER2-enriched and Basal-like. However, tumors are often more complex than simple categorization can describe. The identification of heritable tumor characteristics has potential to decrease heterogeneity and increase power for gene finding.Methods: We used 911 sporadic breast tumors with PAM50 expression data to derive tumor dimensions using principal components (PC). Dimensions in 238 tumors from high-risk pedigrees were compared with the sporadic tumors. Proof-of-concept gene mapping, informed by tumor dimension, was performed using Shared Genomic Segment (SGS) analysis.Results: Five dimensions (PC1-5) explained the majority of the PAM50 expression variance: three captured intrinsic subtype, two were novel (PC3, PC5). All five replicated in 745 TCGA tumors. Both novel dimensions were significantly enriched in the high-risk pedigrees (intrinsic subtypes were not). SGS gene-mapping in a pedigree identified a 0.5 Mb genome-wide significant region at 12q15 This region segregated through 32 meioses to 8 breast cancer cases with extreme PC3 tumors (P = 2.6 × 10-8).Conclusions: PC analysis of PAM50 gene expression revealed multiple independent, quantitative measures of tumor diversity. These tumor dimensions show evidence for heritability and potential as powerful traits for gene mapping.Impact: Our study suggests a new approach to describe tumor expression diversity, provides new avenues for germline studies, and proposes a new breast cancer locus. Similar reparameterization of expression patterns may inform other studies attempting to model the effects of tumor heterogeneity. Cancer Epidemiol Biomarkers Prev; 27(6); 644-52. ©2018 AACR.
Collapse
Affiliation(s)
- Michael J Madsen
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah
| | - Stacey Knight
- School of Medicine, University of Utah, Salt Lake City, Utah.,Intermountain Healthcare, Salt Lake City, Utah
| | - Carol Sweeney
- School of Medicine, University of Utah, Salt Lake City, Utah
| | - Rachel Factor
- School of Medicine, University of Utah, Salt Lake City, Utah
| | - Mohamed Salama
- School of Medicine, University of Utah, Salt Lake City, Utah
| | - Inge J Stijleman
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah
| | | | - Bryan E Welm
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah.,School of Medicine, University of Utah, Salt Lake City, Utah
| | - Sasi Arunachalam
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah
| | - Brandt Jones
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah
| | | | - Kerry Rowe
- Intermountain Healthcare, Salt Lake City, Utah
| | | | - Alun Thomas
- School of Medicine, University of Utah, Salt Lake City, Utah
| | | | - Bette J Caan
- Division of Research, Kaiser Permanente, Oakland, California
| | - Philip S Bernard
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah.,School of Medicine, University of Utah, Salt Lake City, Utah
| | - Nicola J Camp
- Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah. .,School of Medicine, University of Utah, Salt Lake City, Utah
| |
Collapse
|
3
|
Waller RG, Darlington TM, Wei X, Madsen MJ, Thomas A, Curtin K, Coon H, Rajamanickam V, Musinsky J, Jayabalan D, Atanackovic D, Rajkumar SV, Kumar S, Slager S, Middha M, Galia P, Demangel D, Salama M, Joseph V, McKay J, Offit K, Klein RJ, Lipkin SM, Dumontet C, Vachon CM, Camp NJ. Novel pedigree analysis implicates DNA repair and chromatin remodeling in multiple myeloma risk. PLoS Genet 2018; 14:e1007111. [PMID: 29389935 PMCID: PMC5794067 DOI: 10.1371/journal.pgen.1007111] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2017] [Accepted: 11/10/2017] [Indexed: 01/10/2023] Open
Abstract
The high-risk pedigree (HRP) design is an established strategy to discover rare, highly-penetrant, Mendelian-like causal variants. Its success, however, in complex traits has been modest, largely due to challenges of genetic heterogeneity and complex inheritance models. We describe a HRP strategy that addresses intra-familial heterogeneity, and identifies inherited segments important for mapping regulatory risk. We apply this new Shared Genomic Segment (SGS) method in 11 extended, Utah, multiple myeloma (MM) HRPs, and subsequent exome sequencing in SGS regions of interest in 1063 MM / MGUS (monoclonal gammopathy of undetermined significance-a precursor to MM) cases and 964 controls from a jointly-called collaborative resource, including cases from the initial 11 HRPs. One genome-wide significant 1.8 Mb shared segment was found at 6q16. Exome sequencing in this region revealed predicted deleterious variants in USP45 (p.Gln691* and p.Gln621Glu), a gene known to influence DNA repair through endonuclease regulation. Additionally, a 1.2 Mb segment at 1p36.11 is inherited in two Utah HRPs, with coding variants identified in ARID1A (p.Ser90Gly and p.Met890Val), a key gene in the SWI/SNF chromatin remodeling complex. Our results provide compelling statistical and genetic evidence for segregating risk variants for MM. In addition, we demonstrate a novel strategy to use large HRPs for risk-variant discovery more generally in complex traits.
Collapse
Affiliation(s)
- Rosalie G. Waller
- University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | - Todd M. Darlington
- University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | - Xiaomu Wei
- Weill Cornell Medical College, New York, New York, United States of America
| | - Michael J. Madsen
- University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | - Alun Thomas
- University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | - Karen Curtin
- University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | - Hilary Coon
- University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | | | - Justin Musinsky
- Memorial Sloan Kettering Cancer Center, New York, New York, United States of America
| | - David Jayabalan
- Weill Cornell Medical College, New York, New York, United States of America
| | - Djordje Atanackovic
- University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | | | - Shaji Kumar
- Mayo Clinic, Rochester, Minnesota, United States of America
| | - Susan Slager
- Mayo Clinic, Rochester, Minnesota, United States of America
| | - Mridu Middha
- Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | | | | | - Mohamed Salama
- University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | - Vijai Joseph
- Memorial Sloan Kettering Cancer Center, New York, New York, United States of America
| | - James McKay
- International Agency for Research on Cancer, Lyon, France
| | - Kenneth Offit
- Memorial Sloan Kettering Cancer Center, New York, New York, United States of America
| | - Robert J. Klein
- Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Steven M. Lipkin
- Weill Cornell Medical College, New York, New York, United States of America
| | | | | | - Nicola J. Camp
- University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| |
Collapse
|
4
|
Edwards D. Modelling and visualizing fine-scale linkage disequilibrium structure. BMC Bioinformatics 2013; 14:179. [PMID: 23742095 PMCID: PMC3683336 DOI: 10.1186/1471-2105-14-179] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2013] [Accepted: 05/29/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Detailed study of genetic variation at the population level in humans and other species is now possible due to the availability of large sets of single nucleotide polymorphism data. Alleles at two or more loci are said to be in linkage disequilibrium (LD) when they are correlated or statistically dependent. Current efforts to understand the genetic basis of complex phenotypes are based on the existence of such associations, making study of the extent and distribution of linkage disequilibrium central to this endeavour. The objective of this paper is to develop methods to study fine-scale patterns of allelic association using probabilistic graphical models. RESULTS An efficient, linear-time forward-backward algorithm is developed to estimate chromosome-wide LD models by optimizing a penalized likelihood criterion, and a convenient way to display these models is described. To illustrate the methods they are applied to data obtained by genotyping 8341 pigs. It is found that roughly 20% of the porcine genome exhibits complex LD patterns, forming islands of relatively high genetic diversity. CONCLUSIONS The proposed algorithm is efficient and makes it feasible to estimate and visualize chromosome-wide LD models on a routine basis.
Collapse
Affiliation(s)
- David Edwards
- Department of Molecular Biology and Genetics, Centre for Quantitative Genetics and Genomics, Blichers Allé 20, Tjele 8830, Denmark.
| |
Collapse
|
5
|
Huang Y, Thomas A, Vieland VJ. Employing MCMC under the PPL framework to analyze sequence data in large pedigrees. Front Genet 2013; 4:59. [PMID: 23626600 PMCID: PMC3630390 DOI: 10.3389/fgene.2013.00059] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2013] [Accepted: 04/03/2013] [Indexed: 11/18/2022] Open
Abstract
The increased feasibility of whole-genome (or whole-exome) sequencing has led to renewed interest in using family data to find disease mutations. For clinical phenotypes that lend themselves to study in large families, this approach can be particularly effective, because it may be possible to obtain strong evidence of a causal mutation segregating in a single pedigree even under conditions of extreme locus and/or allelic heterogeneity at the population level. In this paper, we extend our capacity to carry out positional mapping in large pedigrees, using a combination of linkage analysis and within-pedigree linkage trait-variant disequilibrium analysis to fine map down to the level of individual sequence variants. To do this, we develop a novel hybrid approach to the linkage portion, combining the non-stochastic approach to integration over the trait model implemented in the software package Kelvin, with Markov chain Monte Carlo-based approximation of the marker likelihood using blocked Gibbs sampling as implemented in the McSample program in the JPSGCS package. We illustrate both the positional mapping template, as well as the efficacy of the hybrid algorithm, in application to a single large pedigree with phenotypes simulated under a two-locus trait model.
Collapse
Affiliation(s)
- Yungui Huang
- Battelle Center for Mathematical Medicine, The Research Institute at Nationwide Children's Hospital Columbus, OH, USA
| | | | | |
Collapse
|
6
|
Abel HJ, Thomas A. Case-control association testing by graphical modeling for the Genetic Analysis Workshop 17 mini-exome sequence data. BMC Proc 2011; 5 Suppl 9:S62. [PMID: 22373360 PMCID: PMC3287901 DOI: 10.1186/1753-6561-5-s9-s62] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
We generalize recent work on graphical models for linkage disequilibrium to estimate the conditional independence structure between all variables for individuals in the Genetic Analysis Workshop 17 unrelated individuals data set. Using a stepwise approach for computational efficiency and an extension of our previously described methods, we estimate a model that describes the relationships between the disease trait, all quantitative variables, all covariates, ethnic origin, and the loci most strongly associated with these variables. We performed our analysis for the first 50 replicate data sets. We found that our approach was able to describe the relationships between the outcomes and covariates and that it could correctly detect associations of disease with several loci and with a reasonable false-positive detection rate.
Collapse
Affiliation(s)
- Haley J Abel
- Division of Genetic Epidemiology, University of Utah, 391 Chipeta Way, Salt Lake City, UT 84105, USA.
| | | |
Collapse
|