1
|
Denis M, Varghese RS, Barefoot ME, Tadesse MG, Ressom HW. A Bayesian two-step integrative procedure incorporating prior knowledge for the identification of miRNA-mRNAs involved in hepatocellular carcinoma. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2022; 2022:81-86. [PMID: 36085997 PMCID: PMC9473151 DOI: 10.1109/embc48229.2022.9871330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Recent studies have confirmed the role of miRNA regulation of gene expression in oncogenesis for various cancers. In parallel, prior knowledge about relationships between miRNA and mRNA have been accumulated from biological experiments or statistical analyses. Improved identification of disease-associated miRNA-mRNA pairs may be achieved by incorporating prior knowledge into integrative genomic analyses. In this study we focus on 39 patients with hepatocellular carcinoma (HCC) and 25 patients with liver cirrhosis and use a flexible Bayesian two-step integrative method. We found 66 significant miRNA-mRNA pairs, several of which contain molecules that have previously been identified as potential biomarkers. These results demonstrate the utility of the proposed approach in providing a better understanding of relationships between different biological levels, thereby giving insights into the biological mechanisms underlying the diseases, while providing a better selection of biomarkers that may serve as diagnostic, prognostic, or therapeutic biomarker candidates.
Collapse
|
2
|
Liu Y, Baggerly KA, Orouji E, Manyam G, Chen H, Lam M, Davis JS, Lee MS, Broom BM, Menter DG, Rai K, Kopetz S, Morris JS. Methylation-eQTL Analysis in Cancer Research. Bioinformatics 2021; 37:4014-4022. [PMID: 34117863 PMCID: PMC9188481 DOI: 10.1093/bioinformatics/btab443] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Revised: 03/15/2021] [Accepted: 06/11/2021] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION DNA methylation is a key epigenetic factor regulating gene expression. While promoter methylation has been well studied, recent publications have revealed that functionally important methylation also occurs in intergenic and distal regions, and varies across genes and tissue types. Given the growing importance of inter-platform integrative genomic analyses, there is an urgent need to develop methods to discover and characterize gene-level relationships between methylation and expression. RESULTS We introduce a novel sequential penalized regression approach to identify methylation-expression quantitative trait loci (methyl-eQTLs), a term that we have coined to represent, for each gene and tissue type, a sparse set of CpG loci best explaining gene expression and accompanying weights indicating direction and strength of association. Using TCGA and MD Anderson colorectal cohorts to build and validate our models, we demonstrate our strategy better explains expression variability than current commonly used gene-level methylation summaries. The methyl-eQTLs identified by our approach can be used to construct gene-level methylation summaries that are maximally correlated with gene expression for use in integrative models, and produce a tissue-specific summary of which genes appear to be strongly regulated by methylation. Our results introduce an important resource to the biomedical community for integrative genomics analyses involving DNA methylation. AVAILABILITY AND IMPLEMENTATION We produce an R Shiny app (https://rstudio-prd-c1.pmacs.upenn.edu/methyl-eQTL/) that interactively presents methyl-eQTL results for colorectal, breast, and pancreatic cancer. The source R code for this work is provided in the supplement. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yusha Liu
- Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA
| | - Keith A Baggerly
- Department of Bioinformatics and Computational Biology, M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Elias Orouji
- Department of Genomic Medicine, M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Ganiraju Manyam
- Department of Bioinformatics and Computational Biology, M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Huiqin Chen
- Department of Bioinformatics and Computational Biology, M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Michael Lam
- Department of Gastrointestinal Medical Oncology, M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Jennifer S Davis
- Department of Epidemiology, M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Michael S Lee
- Department of Medicine, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Bradley M Broom
- Department of Bioinformatics and Computational Biology, M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - David G Menter
- Department of Gastrointestinal Medical Oncology, M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Kunal Rai
- Department of Genomic Medicine, M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Scott Kopetz
- Department of Gastrointestinal Medical Oncology, M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Jeffrey S Morris
- Department of Biostatistics, Epidemiology and Informatics, The University of Pennsylvania, Philadelphia, PA 19104-6021, USA
| |
Collapse
|
3
|
Liu L, Zeng P, Yang S, Yuan Z. Leveraging methylation to identify the potential causal genes associated with survival in lung adenocarcinoma and lung squamous cell carcinoma. Oncol Lett 2020; 20:193-200. [PMID: 32537022 PMCID: PMC7291670 DOI: 10.3892/ol.2020.11564] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2019] [Accepted: 03/21/2020] [Indexed: 12/24/2022] Open
Abstract
Understanding the different genetic landscape between lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) is important for understanding the underlying molecular mechanism, which may facilitate the development of effective and precise treatments. Although previous studies have identified a number of differentially expressed genes (DEGs) responsible for lung cancer, it is unknown which of these genes are causal. The present study integrated DNA methylation, RNA sequencing, clinical characteristics and survival outcomes of patients with LUAD and LUSC from The Cancer Genome Atlas. DEGs were first identified using edgeR by comparing tumor and normal tissue, and differentially methylated probes (DMPs) were assessed using ChAMP. Candidate genes for further time-to-event instrumental variable analysis were selected as the intersecting genes between DEGs and the genes including DMP CpG sites within the transcription start site (TSS1500), with DMPs in TSS1500 region being the instrumental variables. Extensive sensitivity analyses were conducted to assess the robustness of the results. The present study identified 906 DEGs for LUAD, among which 538 also had DMPs in the TSS1500 region. In addition, 1,543 DEGs were identified for LUSC, among which 1,053 also had DMPs in the TSS1500 region. Time-to-event instrumental variable analysis detected eight potential causal genes for LUAD survival, including aryl hydrocarbon receptor nuclear translocator like 2, semaphorin 3G, serum deprivation-response protein, chloride intracellular channel protein 5, LIM zinc finger domain containing 2, epithelial membrane protein 2, carbonic anhydrase 7 and LOC116437. The results also identified that phosphatidylinositol-3,4,5-trisphosphate-dependent Rac exchange factor 2 may be a potential causal gene for LUSC. Therefore, the results of the present study suggested that there was molecular heterogeneity between these two lung cancer subtypes. Such analysis framework can be extended to other cancer genomics research.
Collapse
Affiliation(s)
- Lu Liu
- Department of Biostatistics, School of Public Health, Shandong University, Jinan, Shandong 250012, P.R. China.,Institute for Medical Dataology, Shandong University, Jinan, Shandong 250012, P.R. China
| | - Ping Zeng
- Department of Epidemiology and Biostatistics, Xuzhou Medical University, Xuzhou, Jiangsu 221004, P.R. China
| | - Sheng Yang
- Department of Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, P.R. China
| | - Zhongshang Yuan
- Department of Biostatistics, School of Public Health, Shandong University, Jinan, Shandong 250012, P.R. China.,Institute for Medical Dataology, Shandong University, Jinan, Shandong 250012, P.R. China
| |
Collapse
|
4
|
Varghese RS, Zhou Y, Barefoot M, Chen Y, Di Poto C, Balla AK, Oliver E, Sherif ZA, Kumar D, Kroemer AH, Tadesse MG, Ressom HW. Identification of miRNA-mRNA associations in hepatocellular carcinoma using hierarchical integrative model. BMC Med Genomics 2020; 13:56. [PMID: 32228601 PMCID: PMC7106691 DOI: 10.1186/s12920-020-0706-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2019] [Accepted: 03/19/2020] [Indexed: 02/07/2023] Open
Abstract
Background The established role miRNA-mRNA regulation of gene expression has in oncogenesis highlights the importance of integrating miRNA with downstream mRNA targets. These findings call for investigations aimed at identifying disease-associated miRNA-mRNA pairs. Hierarchical integrative models (HIM) offer the opportunity to uncover the relationships between disease and the levels of different molecules measured in multiple omic studies. Methods The HIM model we formulated for analysis of mRNA-seq and miRNA-seq data can be specified with two levels: (1) a mechanistic submodel relating mRNAs to miRNAs, and (2) a clinical submodel relating disease status to mRNA and miRNA, while accounting for the mechanistic relationships in the first level. Results mRNA-seq and miRNA-seq data were acquired by analysis of tumor and normal liver tissues from 30 patients with hepatocellular carcinoma (HCC). We analyzed the data using HIM and identified 157 significant miRNA-mRNA pairs in HCC. The majority of these molecules have already been independently identified as being either diagnostic, prognostic, or therapeutic biomarker candidates for HCC. These pairs appear to be involved in processes contributing to the pathogenesis of HCC involving inflammation, regulation of cell cycle, apoptosis, and metabolism. For further evaluation of our method, we analyzed miRNA-seq and mRNA-seq data from TCGA network. While some of the miRNA-mRNA pairs we identified by analyzing both our and TCGA data are previously reported in the literature and overlap in regulation and function, new pairs have been identified that may contribute to the discovery of novel targets. Conclusion The results strongly support the hypothesis that miRNAs are important regulators of mRNAs in HCC. Furthermore, these results emphasize the biological relevance of studying miRNA-mRNA pairs.
Collapse
Affiliation(s)
- Rency S Varghese
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Room 175, Building D, 4000 Reservoir Rd NW, Washington, DC, 20057, USA
| | - Yuan Zhou
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Room 175, Building D, 4000 Reservoir Rd NW, Washington, DC, 20057, USA
| | - Megan Barefoot
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Room 175, Building D, 4000 Reservoir Rd NW, Washington, DC, 20057, USA
| | - Yifan Chen
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Room 175, Building D, 4000 Reservoir Rd NW, Washington, DC, 20057, USA
| | - Cristina Di Poto
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Room 175, Building D, 4000 Reservoir Rd NW, Washington, DC, 20057, USA
| | | | - Everett Oliver
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Room 175, Building D, 4000 Reservoir Rd NW, Washington, DC, 20057, USA
| | - Zaki A Sherif
- Department of Biochemistry & Molecular Biology, College of Medicine, Howard University, Washington DC, USA
| | - Deepak Kumar
- Department of Pharmaceutical Sciences, North Carolina Central University, Durham, NC, USA
| | | | - Mahlet G Tadesse
- Department of Mathematics and Statistics, Georgetown University, Washington DC, USA
| | - Habtom W Ressom
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Room 175, Building D, 4000 Reservoir Rd NW, Washington, DC, 20057, USA.
| |
Collapse
|