1
|
Teixeira VH, Pipinikas CP, Pennycuick A, Lee-Six H, Chandrasekharan D, Beane J, Morris TJ, Karpathakis A, Feber A, Breeze CE, Ntolios P, Hynds RE, Falzon M, Capitanio A, Carroll B, Durrenberger PF, Hardavella G, Brown JM, Lynch AG, Farmery H, Paul DS, Chambers RC, McGranahan N, Navani N, Thakrar RM, Swanton C, Beck S, George PJ, Spira A, Campbell PJ, Thirlwell C, Janes SM. Deciphering the genomic, epigenomic, and transcriptomic landscapes of pre-invasive lung cancer lesions. Nat Med 2019; 25:517-525. [PMID: 30664780 PMCID: PMC7614970 DOI: 10.1038/s41591-018-0323-0] [Citation(s) in RCA: 151] [Impact Index Per Article: 30.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Accepted: 12/05/2018] [Indexed: 01/10/2023]
Abstract
The molecular alterations that occur in cells before cancer is manifest are largely uncharted. Lung carcinoma in situ (CIS) lesions are the pre-invasive precursor to squamous cell carcinoma. Although microscopically identical, their future is in equipoise, with half progressing to invasive cancer and half regressing or remaining static. The cellular basis of this clinical observation is unknown. Here, we profile the genomic, transcriptomic, and epigenomic landscape of CIS in a unique patient cohort with longitudinally monitored pre-invasive disease. Predictive modeling identifies which lesions will progress with remarkable accuracy. We identify progression-specific methylation changes on a background of widespread heterogeneity, alongside a strong chromosomal instability signature. We observed mutations and copy number changes characteristic of cancer and chart their emergence, offering a window into early carcinogenesis. We anticipate that this new understanding of cancer precursor biology will improve early detection, reduce overtreatment, and foster preventative therapies targeting early clonal events in lung cancer.
Collapse
Affiliation(s)
- Vitor H Teixeira
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK
| | - Christodoulos P Pipinikas
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK
- Research Department of Cancer Biology and Medical Genomics Laboratory, UCL Cancer Institute, University College London, London, UK
| | - Adam Pennycuick
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK
| | - Henry Lee-Six
- The Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Deepak Chandrasekharan
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK
| | - Jennifer Beane
- Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Tiffany J Morris
- Research Department of Cancer Biology and Medical Genomics Laboratory, UCL Cancer Institute, University College London, London, UK
| | - Anna Karpathakis
- Research Department of Cancer Biology and Medical Genomics Laboratory, UCL Cancer Institute, University College London, London, UK
| | - Andrew Feber
- Research Department of Cancer Biology and Medical Genomics Laboratory, UCL Cancer Institute, University College London, London, UK
| | - Charles E Breeze
- Research Department of Cancer Biology and Medical Genomics Laboratory, UCL Cancer Institute, University College London, London, UK
| | - Paschalis Ntolios
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK
| | - Robert E Hynds
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK
- CRUK Lung Cancer Centre of Excellence, UCL Cancer Institute, London, UK
- Cancer Evolution and Genome Instability Laboratory, The Francis Crick Institute, London, UK
| | - Mary Falzon
- Department of Pathology, University College London Hospitals NHS Trust, London, UK
| | - Arrigo Capitanio
- Department of Pathology, University College London Hospitals NHS Trust, London, UK
| | - Bernadette Carroll
- Department of Thoracic Medicine, University College London Hospital, London, UK
| | - Pascal F Durrenberger
- Center for Inflammation and Tissue Repair, UCL Respiratory, University College London, London, UK
| | - Georgia Hardavella
- Department of Thoracic Medicine, University College London Hospital, London, UK
| | - James M Brown
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK
| | - Andy G Lynch
- Computational Biology and Statistics Laboratory, Cancer Research UK Cambridge Institute, Cambridge, UK
- School of Medicine/School of Mathematics and Statistics, University of St Andrews, St Andrews, UK
| | - Henry Farmery
- Computational Biology and Statistics Laboratory, Cancer Research UK Cambridge Institute, Cambridge, UK
| | - Dirk S Paul
- Research Department of Cancer Biology and Medical Genomics Laboratory, UCL Cancer Institute, University College London, London, UK
| | - Rachel C Chambers
- Center for Inflammation and Tissue Repair, UCL Respiratory, University College London, London, UK
| | | | - Neal Navani
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK
- Department of Thoracic Medicine, University College London Hospital, London, UK
| | - Ricky M Thakrar
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK
- Department of Thoracic Medicine, University College London Hospital, London, UK
| | - Charles Swanton
- CRUK Lung Cancer Centre of Excellence, UCL Cancer Institute, London, UK
- Cancer Evolution and Genome Instability Laboratory, The Francis Crick Institute, London, UK
| | - Stephan Beck
- Research Department of Cancer Biology and Medical Genomics Laboratory, UCL Cancer Institute, University College London, London, UK
| | | | - Avrum Spira
- Department of Medicine, Boston University School of Medicine, Boston, MA, USA
- Johnson and Johnson Innovation, Cambridge, MA, USA
| | - Peter J Campbell
- The Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK
| | - Christina Thirlwell
- Research Department of Cancer Biology and Medical Genomics Laboratory, UCL Cancer Institute, University College London, London, UK
| | - Sam M Janes
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK.
- Department of Thoracic Medicine, University College London Hospital, London, UK.
| |
Collapse
|
3
|
Deshpande A, Weiss LA. Recurrent reciprocal copy number variants: Roles and rules in neurodevelopmental disorders. Dev Neurobiol 2018; 78:519-530. [PMID: 29575775 DOI: 10.1002/dneu.22587] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2017] [Revised: 02/08/2018] [Accepted: 03/13/2018] [Indexed: 12/14/2022]
Abstract
Deletions and duplications, called reciprocal CNVs when they occur at the same locus, are implicated in neurodevelopmental phenotypes ranging from morphological to behavioral. In this article, we propose three models of how differences in gene expression in deletion and duplication genotypes may result in deleterious phenotypes. To explore these models, we use examples of the similarities and differences in clinical phenotypes of five reciprocal CNVs known to cause neurodevelopmental disorders: 1q21.1, 7q11.23, 15q13.3, 16p11.2, and 22q11.2. These models and examples may inform some insights into better understanding of gene-phenotype relationships. © 2018 Wiley Periodicals, Inc. Develop Neurobiol 78: 519-530, 2018.
Collapse
Affiliation(s)
- Aditi Deshpande
- Department of Psychiatry, University of California, San Francisco, San Francisco, California, 94143.,Institute for Human Genetics, University of California, San Francisco, San Francisco, California, 94143.,Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, California, 94143
| | - Lauren A Weiss
- Department of Psychiatry, University of California, San Francisco, San Francisco, California, 94143.,Institute for Human Genetics, University of California, San Francisco, San Francisco, California, 94143.,Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, California, 94143
| |
Collapse
|
4
|
Zang Y, Zhao Q, Zhang Q, Li Y, Zhang S, Ma S. Inferring gene regulatory relationships with a high-dimensional robust approach. Genet Epidemiol 2017; 41:437-454. [PMID: 28464328 DOI: 10.1002/gepi.22047] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2016] [Revised: 02/12/2017] [Accepted: 02/17/2017] [Indexed: 11/11/2022]
Abstract
Gene expression (GE) levels have important biological and clinical implications. They are regulated by copy number alterations (CNAs). Modeling the regulatory relationships between GEs and CNAs facilitates understanding disease biology and can also have values in translational medicine. The expression level of a gene can be regulated by its cis-acting as well as trans-acting CNAs, and the set of trans-acting CNAs is usually not known, which poses a high-dimensional selection and estimation problem. Most of the existing studies share a common limitation in that they cannot accommodate long-tailed distributions or contamination of GE data. In this study, we develop a high-dimensional robust regression approach to infer the regulatory relationships between GEs and CNAs. A high-dimensional regression model is used to accommodate the effects of both cis-acting and trans-acting CNAs. A density power divergence loss function is used to accommodate long-tailed GE distributions and contamination. Penalization is adopted for regularized estimation and selection of relevant CNAs. The proposed approach is effectively realized using a coordinate descent algorithm. Simulation shows that it has competitive performance compared to the nonrobust benchmark and the robust LAD (least absolute deviation) approach. We analyze TCGA (The Cancer Genome Atlas) data on cutaneous melanoma and study GE-CNA regulations in the RAP (regulation of apoptosis) pathway, which further demonstrates the satisfactory performance of the proposed approach.
Collapse
Affiliation(s)
- Yangguang Zang
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China.,Department of Biostatistics, Yale University, New Haven, Connecticut, United States of America
| | - Qing Zhao
- Merck Research Lab, Rahway, New Jersey, United States of America
| | - Qingzhao Zhang
- School of Economics and Wang Yanan Institute for Studies in Economics, Xiamen University, Fujian Sheng, China
| | - Yang Li
- School of Statistics, Remin University of China, Beijing, China
| | - Sanguo Zhang
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Shuangge Ma
- Department of Biostatistics, Yale University, New Haven, Connecticut, United States of America.,School of Economics and Wang Yanan Institute for Studies in Economics, Xiamen University, Fujian Sheng, China
| |
Collapse
|
5
|
Li BJ, Li HL, Meng Z, Zhang Y, Lin H, Yue GH, Xia JH. Copy Number Variations in Tilapia Genomes. MARINE BIOTECHNOLOGY (NEW YORK, N.Y.) 2017; 19:11-21. [PMID: 28168542 DOI: 10.1007/s10126-017-9733-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2016] [Accepted: 12/23/2016] [Indexed: 06/06/2023]
Abstract
Discovering the nature and pattern of genome variation is fundamental in understanding phenotypic diversity among populations. Although several millions of single nucleotide polymorphisms (SNPs) have been discovered in tilapia, the genome-wide characterization of larger structural variants, such as copy number variation (CNV) regions has not been carried out yet. We conducted a genome-wide scan for CNVs in 47 individuals from three tilapia populations. Based on 254 Gb of high-quality paired-end sequencing reads, we identified 4642 distinct high-confidence CNVs. These CNVs account for 1.9% (12.411 Mb) of the used Nile tilapia reference genome. A total of 1100 predicted CNVs were found overlapping with exon regions of protein genes. Further association analysis based on linear model regression found 85 CNVs ranging between 300 and 27,000 base pairs significantly associated to population types (R 2 > 0.9 and P > 0.001). Our study sheds first insights on genome-wide CNVs in tilapia. These CNVs among and within tilapia populations may have functional effects on phenotypes and specific adaptation to particular environments.
Collapse
Affiliation(s)
- Bi Jun Li
- State Key Laboratory of Biocontrol, Institute of Aquatic Economic Animals and Guangdong Provincial Key Laboratory for Aquatic Economic Animals, College of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, People's Republic of China
| | - Hong Lian Li
- State Key Laboratory of Biocontrol, Institute of Aquatic Economic Animals and Guangdong Provincial Key Laboratory for Aquatic Economic Animals, College of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, People's Republic of China
| | - Zining Meng
- State Key Laboratory of Biocontrol, Institute of Aquatic Economic Animals and Guangdong Provincial Key Laboratory for Aquatic Economic Animals, College of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, People's Republic of China
| | - Yong Zhang
- State Key Laboratory of Biocontrol, Institute of Aquatic Economic Animals and Guangdong Provincial Key Laboratory for Aquatic Economic Animals, College of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, People's Republic of China
| | - Haoran Lin
- State Key Laboratory of Biocontrol, Institute of Aquatic Economic Animals and Guangdong Provincial Key Laboratory for Aquatic Economic Animals, College of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, People's Republic of China
| | - Gen Hua Yue
- Molecular Population Genetics and Breeding Group, Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore, 117604, Singapore.
- Department of Biological Sciences, National University of Singapore, Singapore, 117543, Singapore.
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore, 637551, Singapore.
| | - Jun Hong Xia
- State Key Laboratory of Biocontrol, Institute of Aquatic Economic Animals and Guangdong Provincial Key Laboratory for Aquatic Economic Animals, College of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, People's Republic of China.
| |
Collapse
|
6
|
Menezes RX, Mohammadi L, Goeman JJ, Boer JM. Analysing multiple types of molecular profiles simultaneously: connecting the needles in the haystack. BMC Bioinformatics 2016; 17:77. [PMID: 26860128 PMCID: PMC4746904 DOI: 10.1186/s12859-016-0926-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2015] [Accepted: 01/29/2016] [Indexed: 11/23/2022] Open
Abstract
Background It has been shown that a random-effects framework can be used to test the association between a gene’s expression level and the number of DNA copies of a set of genes. This gene-set modelling framework was later applied to find associations between mRNA expression and microRNA expression, by defining the gene sets using target prediction information. Methods and results Here, we extend the model introduced by Menezes et al. 2009 to consider the effect of not just copy number, but also of other molecular profiles such as methylation changes and loss-of-heterozigosity (LOH), on gene expression levels. We will consider again sets of measurements, to improve robustness of results and increase the power to find associations. Our approach can be used genome-wide to find associations and yields a test to help separate true associations from noise. We apply our method to colon and to breast cancer samples, for which genome-wide copy number, methylation and gene expression profiles are available. Our findings include interesting gene expression-regulating mechanisms, which may involve only one of copy number or methylation, or both for the same samples. We even are able to find effects due to different molecular mechanisms in different samples. Conclusions Our method can equally well be applied to cases where other types of molecular (high-dimensional) data are collected, such as LOH, SNP genotype and microRNA expression data. Computationally efficient, it represents a flexible and powerful tool to study associations between high-dimensional datasets. The method is freely available via the SIM BioConductor package. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-0926-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Renée X Menezes
- Department of Epidemiology and Biostatistics, VU University Medical Center, De Boelelaan 1089a, HV Amsterdam, 1081, The Netherlands.
| | | | - Jelle J Goeman
- Biostatistics, Department for Health Evidence, Radboud University Medical Center, Nijmegen, The Netherlands. .,Medical Statistics and Bioinformatics, Leiden University Medical Center, Nijmegen, The Netherlands.
| | - Judith M Boer
- Department of Pediatric Oncology and Hematology, Erasmus MC-Sophia Children's Hospital, Rotterdam, The Netherlands. .,Netherlands Bioinformatics Centre, Nijmegen, The Netherlands.
| |
Collapse
|