1
|
Gonzalez-Cárdenas M, Treviño V. The Impact of Mutational Hotspots on Cancer Survival. Cancers (Basel) 2024; 16:1072. [PMID: 38473427 DOI: 10.3390/cancers16051072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Revised: 02/11/2024] [Accepted: 02/15/2024] [Indexed: 03/14/2024] Open
Abstract
BACKGROUND Cofactors, biomarkers, and the mutational status of genes such as TP53, EGFR, IDH1/2, or PIK3CA have been used for patient stratification. However, many genes exhibit recurrent mutational positions known as hotspots, specifically linked to varying degrees of survival outcomes. Nevertheless, few hotspots have been analyzed (e.g., TP53 and EGFR). Thus, many other genes and hotspots remain unexplored. METHODS We systematically screened over 1400 hotspots across 33 TCGA cancer types. We compared the patients carrying a hotspot against (i) all cases, (ii) gene-mutated cases, (iii) other mutated hotspots, or (iv) specific hotspots. Due to the limited number of samples in hotspots and the inherent group imbalance, besides Cox models and the log-rank test, we employed VALORATE to estimate their association with survival precisely. RESULTS We screened 1469 hotspots in 6451 comparisons, where 314 were associated with survival. Many are discussed and linked to the current literature. Our findings demonstrate associations between known hotspots and survival while also revealing more potential hotspots. To enhance accessibility and promote further investigation, all the Kaplan-Meier curves, the log-rank tests, Cox statistics, and VALORATE-estimated null distributions are accessible on our website. CONCLUSIONS Our analysis revealed both known and putatively novel hotspots associated with survival, which can be used as biomarkers. Our web resource is a valuable tool for cancer research.
Collapse
Affiliation(s)
- Melissa Gonzalez-Cárdenas
- Tecnologico de Monterrey, Escuela de Medicina y Ciencias de la Salud, Ave. Morones Prieto 3000, Monterrey 64710, Nuevo León, Mexico
- Tecnologico de Monterrey, The Institute for Obesity Research, Eugenio Garza Sada Avenue 2501, Monterrey 64849, Nuevo León, Mexico
| | - Víctor Treviño
- Tecnologico de Monterrey, Escuela de Medicina y Ciencias de la Salud, Ave. Morones Prieto 3000, Monterrey 64710, Nuevo León, Mexico
- Tecnologico de Monterrey, The Institute for Obesity Research, Eugenio Garza Sada Avenue 2501, Monterrey 64849, Nuevo León, Mexico
- Tecnologico de Monterrey, oriGen Project, Eugenio Garza Sada Avenue 2501, Monterrey 64849, Nuevo León, Mexico
| |
Collapse
|
2
|
Pandey M, Gromiha MM. MutBLESS: A tool to identify disease-prone sites in cancer using deep learning. Biochim Biophys Acta Mol Basis Dis 2023; 1869:166721. [PMID: 37105446 DOI: 10.1016/j.bbadis.2023.166721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 04/07/2023] [Accepted: 04/12/2023] [Indexed: 04/29/2023]
Abstract
Understanding the molecular basis and impact of mutations at different stages of cancer are long-standing challenges in cancer biology. Identification of driver mutations from experiments is expensive and time intensive. In the present study, we collected the data for experimentally known driver mutations in 22 different cancer types and classified them into six categories: breast cancer (BRCA), acute myeloid leukaemia (LAML), endometrial carcinoma (EC), stomach cancer (STAD), skin cancer (SKCM), and other cancer types which contains 5747 disease prone and 5514 neutral sites in 516 proteins. The analysis of amino acid distribution along mutant sites revealed that the motifs AAA and LR are preferred in disease-prone sites whereas QPP and QF are dominant in neutral sites. Further, we developed a method using deep neural networks to predict disease-prone sites with amino acid sequence-based features such as physicochemical properties, secondary structure, tri-peptide motifs and conservation scores. We obtained an average AUC of 0.97 in five cancer types BRCA, LAML, EC, STAD and SKCM in a test dataset and 0.72 in all other cancer types together. Our method showed excellent performance for identifying cancer-specific mutations with an average sensitivity, specificity, and accuracy of 96.56 %, 97.39 %, and 97.64 %, respectively. We developed a web server for identifying cancer-prone sites, and it is available at https://web.iitm.ac.in/bioinfo2/MutBLESS/index.html. We suggest that our method can serve as an effective method to identify disease-prone sites and assist to develop therapeutic strategies.
Collapse
Affiliation(s)
- Medha Pandey
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India.
| |
Collapse
|
3
|
Grant SR, Rosario SR, Patentreger AD, Shary N, Fitzgerald ME, Singh PK, Foster BA, Huss WJ, Wei L, Paragh G. HotSPOT: A Computational Tool to Design Targeted Sequencing Panels to Assess Early Photocarcinogenesis. Cancers (Basel) 2023; 15:cancers15051612. [PMID: 36900402 PMCID: PMC10001346 DOI: 10.3390/cancers15051612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 02/24/2023] [Accepted: 03/03/2023] [Indexed: 03/08/2023] Open
Abstract
Mutations found in skin are acquired in specific patterns, clustering around mutation-prone genomic locations. The most mutation-prone genomic areas, mutation hotspots, first induce the growth of small cell clones in healthy skin. Mutations accumulate over time, and clones with driver mutations may give rise to skin cancer. Early mutation accumulation is a crucial first step in photocarcinogenesis. Therefore, a sufficient understanding of the process may help predict disease onset and identify avenues for skin cancer prevention. Early epidermal mutation profiles are typically established using high-depth targeted next-generation sequencing. However, there is currently a lack of tools for designing custom panels to capture mutation-enriched genomic regions efficiently. To address this issue, we created a computational algorithm that implements a pseudo-exhaustive approach to identify the best genomic areas to target. We benchmarked the current algorithm in three independent mutation datasets of human epidermal samples. Compared to the sequencing panel designs originally used in these publications, the mutation capture efficacy (number of mutations/base pairs sequenced) of our designed panel improved 9.6-12.1-fold. Mutation burden in the chronically sun-exposed and intermittently sun-exposed normal epidermis was measured within genomic regions identified by hotSPOT based on cutaneous squamous cell carcinoma (cSCC) mutation patterns. We found a significant increase in mutation capture efficacy and mutation burden in cSCC hotspots in chronically sun-exposed vs. intermittently sun-exposed epidermis (p < 0.0001). Our results show that our hotSPOT web application provides a publicly available resource for researchers to design custom panels, enabling efficient detection of somatic mutations in clinically normal tissues and other similar targeted sequencing studies. Moreover, hotSPOT also enables the comparison of mutation burden between normal tissues and cancer.
Collapse
Affiliation(s)
- Sydney R. Grant
- Department of Cell Stress Biology, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
- Department of Dermatology, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
| | - Spencer R. Rosario
- Department of Biostatistics and Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
| | - Andrew D. Patentreger
- Department of Cell Stress Biology, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
- Department of Dermatology, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
| | - Nico Shary
- Department of Dermatology, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
| | - Megan E. Fitzgerald
- Department of Cell Stress Biology, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
- Department of Dermatology, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
| | - Prashant K. Singh
- Department of Genetics and Genomics, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
| | - Barbara A. Foster
- Department of Pharmacology and Therapeutics, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
| | - Wendy J. Huss
- Department of Cell Stress Biology, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
- Department of Dermatology, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
| | - Lei Wei
- Department of Biostatistics and Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
| | - Gyorgy Paragh
- Department of Cell Stress Biology, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
- Department of Dermatology, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14203, USA
- Correspondence:
| |
Collapse
|
4
|
Jain M, Tivtikyan A, Kamalov D, Avdonin S, Rakhmatullin T, Pisarev E, Zvereva M, Samokhodskaya L, Kamalov A. Development of a Sensitive Digital Droplet PCR Screening Assay for the Detection of GPR126 Non-Coding Mutations in Bladder Cancer Urine Liquid Biopsies. Biomedicines 2023; 11:495. [PMID: 36831030 PMCID: PMC9953558 DOI: 10.3390/biomedicines11020495] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 02/06/2023] [Accepted: 02/07/2023] [Indexed: 02/11/2023] Open
Abstract
Recent whole-genome sequencing studies identified two novel recurrent mutations in the enhancer region of GPR126 in urothelial bladder cancer (UBC) tumor samples. This mutational hotspot is the second most common after the TERT promoter in UBC. The aim of the study was to develop a digital droplet PCR screening assay for the simultaneous detection of GPR126 mutations in a single tube. Its performance combined with TERT promoter mutation analysis was evaluated in urine of healthy volunteers (n = 50) and patients with cystitis (n = 22) and UBC (n = 70). The developed assay was validated using DNA constructs carrying the studied variants. None of the mutations were detected in control and cystitis group samples. GPR126 mutations were observed in the urine of 25/70 UBC patients (area under the ROC curve (AUC) of 0.679; mutant allele fraction (MAF) of 21.61 [8.30-44.52] %); TERT mutations-in 40/70 (AUC of 0.786; MAF = 28.29 [19.03-38.08] %); ≥1 mutation-in 47/70 (AUC of 0.836)). The simultaneous presence of GPR126 and TERT mutations was observed in 18/70 cases, with no difference in MAFs for the paired samples (31.96 [14.78-47.49] % vs. 27.13 [17.00-37.62] %, p = 0.349, respectively). The combined analysis of these common non-coding mutations in urine allows the sensitive and non-invasive detection of UBC.
Collapse
Affiliation(s)
- Mark Jain
- Medical Research and Educational Center, Lomonosov Moscow State University, 119992 Moscow, Russia
| | - Alexander Tivtikyan
- Medical Research and Educational Center, Lomonosov Moscow State University, 119992 Moscow, Russia
| | - David Kamalov
- Medical Research and Educational Center, Lomonosov Moscow State University, 119992 Moscow, Russia
| | - Savva Avdonin
- Department of Fundamental Medicine, Lomonosov Moscow State University, 119991 Moscow, Russia
| | - Tagir Rakhmatullin
- Department of Fundamental Medicine, Lomonosov Moscow State University, 119991 Moscow, Russia
| | - Eduard Pisarev
- Department of Bioinformatics and Bioengineering, Lomonosov Moscow State University, 119991 Moscow, Russia
- Department of Chemistry, Lomonosov Moscow State University, 119991 Moscow, Russia
| | - Maria Zvereva
- Department of Chemistry, Lomonosov Moscow State University, 119991 Moscow, Russia
| | - Larisa Samokhodskaya
- Medical Research and Educational Center, Lomonosov Moscow State University, 119992 Moscow, Russia
| | - Armais Kamalov
- Medical Research and Educational Center, Lomonosov Moscow State University, 119992 Moscow, Russia
| |
Collapse
|
5
|
Rubio C, Alfaro M, Mejia-Giraldo A, Fuertes G, Mosquera R, Vargas M. Multivariate analysis in data science for the geospatial distribution of the breast cancer mortality rate in Colombia. Front Oncol 2023; 12:1055655. [PMID: 36686819 PMCID: PMC9853892 DOI: 10.3389/fonc.2022.1055655] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 12/13/2022] [Indexed: 01/07/2023] Open
Abstract
This research is framed in the area of biomathematics and contributes to the epidemiological surveillance entities in Colombia to clarify how breast cancer mortality rate (BCM) is spatially distributed in relation to the forest area index (FA) and circulating vehicle index (CV). In this regard, the World Health Organization has highlighted the scarce generation of knowledge that relates mortality from tumor diseases to environmental factors. Quantitative methods based on geospatial data science are used with cross-sectional information from the 2018 census; it's found that the BCM in Colombia is not spatially randomly distributed, but follows cluster aggregation patterns. Under multivariate modeling methods, the research provides sufficient statistical evidence in terms of not rejecting the hypothesis that if a spatial unit has high FA and low CV, then it has significant advantages in terms of lower BCM.
Collapse
Affiliation(s)
- Carlos Rubio
- Facultad de Ingeniería, Universidad de San Buenaventura, Cali, Colombia
| | - Miguel Alfaro
- Industrial Engineering Department, University of Santiago de Chile, Santiago, Chile
| | | | - Guillermo Fuertes
- Industrial Engineering Department, University of Santiago de Chile, Santiago, Chile,Faculty of Engineering, Science and Technology, Universidad Bernardo O’Higgins, Santiago, Chile,*Correspondence: Guillermo Fuertes,
| | - Rodolfo Mosquera
- Escuela de Estudios Industriales y Empresariales, Universidad Industrial de Santander, Bucaramanga, Colombia
| | - Manuel Vargas
- Industrial Engineering Department, University of Santiago de Chile, Santiago, Chile
| |
Collapse
|
6
|
Li B, Jin B, Capra JA, Bush WS. Integration of Protein Structure and Population-Scale DNA Sequence Data for Disease Gene Discovery and Variant Interpretation. Annu Rev Biomed Data Sci 2022; 5:141-161. [PMID: 35508071 DOI: 10.1146/annurev-biodatasci-122220-112147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The experimental and computational techniques for capturing information about protein structures and genetic variation within the human genome have advanced dramatically in the past 20 years, generating extensive new data resources. In this review, we discuss these advances, along with new approaches for determining the impact a genetic variant has on protein function. We focus on the potential of new methods that integrate human genetic variation into protein structures to discover relationships to disease, including the discovery of mutational hotspots in cancer-related proteins, the localization of protein-altering variants within protein regions for common complex diseases, and the assessment of variants of unknown significance for Mendelian traits. We expect that approaches that integrate these data sources will play increasingly important roles in disease gene discovery and variant interpretation. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- Bian Li
- Department of Biological Sciences and Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, USA
| | - Bowen Jin
- Graduate Program in Systems Biology and Bioinformatics, Department of Nutrition, School of Medicine, Case Western Reserve University, Cleveland, Ohio, USA
| | - John A Capra
- Bakar Computational Health Sciences Institute and Department of Epidemiology and Biostatistics, University of California, San Francisco, California, USA;
| | - William S Bush
- Cleveland Institute for Computational Biology, Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA;
| |
Collapse
|
7
|
Zhuang H, Fan X, Ji D, Wang Y, Fan J, Li M, Ni D, Lu S, Li X, Chai Z. Elucidation of the conformational dynamics and assembly of Argonaute-RNA complexes by distinct yet coordinated actions of the supplementary microRNA. Comput Struct Biotechnol J 2022; 20:1352-1365. [PMID: 35356544 PMCID: PMC8933676 DOI: 10.1016/j.csbj.2022.03.001] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 03/04/2022] [Accepted: 03/04/2022] [Indexed: 02/07/2023] Open
Abstract
Argonaute (AGO) proteins, the core of RNA-induced silencing complex, are guided by microRNAs (miRNAs) to recognize target RNA for repression. The miRNA-target RNA recognition forms initially through pairing at the seed region while the additional supplementary pairing can enhance target recognition and compensate for seed mismatch. The extension of miRNA lengths can strengthen the target affinity when pairing both in the seed and supplementary regions. However, the mechanism underlying the effect of the supplementary pairing on the conformational dynamics and the assembly of AGO-RNA complex remains poorly understood. To address this, we performed large-scale molecular dynamics simulations of AGO-RNA complexes with different pairing patterns and miRNA lengths. The results reveal that the additional supplementary pairing can not only strengthen the interaction between miRNA and target RNA, but also induce the increased plasticity of the PAZ domain and enhance the domain connectivity among the PAZ, PIWI, N domains of the AGO protein. The strong community network between these domains tightens the mouth of the supplementary chamber of AGO protein, which prevents the escape of target RNA from the complex and shields it from solvent water attack. Importantly, the inner stronger matching pairs between the miRNA and target RNA can compensate for weaker mismatches at the edge of supplementary region. These findings provide guidance for the design of miRNA mimics and anti-miRNAs for both clinical and experimental use and open the way for further engineering of AGO proteins as a new tool in the field of gene regulation.
Collapse
Affiliation(s)
- Haiming Zhuang
- Department of Pathophysiology, Cell Differentiation and Apoptosis of Chinese Ministry of Education, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Xiaohua Fan
- Department of Anesthesiology, Changhai Hospital, Naval Medical University, Shanghai 200433, China
| | - Dong Ji
- Department of Anesthesiology, Changhai Hospital, Naval Medical University, Shanghai 200433, China
| | - Yuanhao Wang
- Department of Pathophysiology, Cell Differentiation and Apoptosis of Chinese Ministry of Education, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Jigang Fan
- Department of Pathophysiology, Cell Differentiation and Apoptosis of Chinese Ministry of Education, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Mingyu Li
- Department of Pathophysiology, Cell Differentiation and Apoptosis of Chinese Ministry of Education, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Duan Ni
- Department of Pathophysiology, Cell Differentiation and Apoptosis of Chinese Ministry of Education, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Shaoyong Lu
- Department of Pathophysiology, Cell Differentiation and Apoptosis of Chinese Ministry of Education, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Xiaolong Li
- Department of Orthopedics, Changhai Hospital, Naval Medical University, Shanghai 200433, China
| | - Zongtao Chai
- Department of Hepatic Surgery VI, Eastern Hepatobiliary Surgery Hospital, Navy Medical University, Shanghai 200438, China
- Department of Hepatic Surgery, Shanghai Geriatric Center, Shanghai 201104, China
| |
Collapse
|
8
|
In Silico Analysis of Dengue Virus Serotype 2 Mutations Detected at the Intrahost Level in Patients with Different Clinical Outcomes. Microbiol Spectr 2021; 9:e0025621. [PMID: 34468189 PMCID: PMC8557815 DOI: 10.1128/spectrum.00256-21] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Intrahost genetic diversity is thought to facilitate arbovirus adaptation to changing environments and hosts, and it may also be linked to viral pathogenesis. Intending to shed light on the viral determinants for severe dengue pathogenesis, we previously analyzed the DENV-2 intrahost genetic diversity in 68 patients clinically classified as dengue fever (n = 31), dengue with warning signs (n = 19), and severe dengue (n = 18), performing viral whole-genome deep sequencing from clinical samples with an amplicon-free approach. From it, we identified a set of 141 relevant mutations distributed throughout the viral genome that deserved further attention. Therefore, we employed molecular modeling to recreate three-dimensional models of the viral proteins and secondary RNA structures to map the mutations and assess their potential effects. Results showed that, in general lines, disruptive variants were identified primarily among dengue fever cases. In contrast, potential immune-escape variants were associated mainly with warning signs and severe cases, in line with the latter's longer intrahost evolution times. Furthermore, several mutations were located on protein-surface regions, with no associated function. They could represent sites of further investigation, as the interaction of viral and host proteins is critical for both host immunomodulation and virus hijacking of the cellular machinery. The present analysis provides new information about the implications of the intrahost genetic diversity of DENV-2, contributing to the knowledge about the viral factors possibly involved in its pathogenesis within the human host. Strengthening our results with functional studies could allow many of these variants to be considered in the design of therapeutic or prophylactic compounds and the improvement of diagnostic assays. IMPORTANCE Previous evidence showed that intrahost genetic diversity in arboviruses may be linked to viral pathogenesis and that one or a few amino acid replacements within a single protein are enough to modify a biological feature of an RNA virus. To assess dengue virus serotype 2 determinants potentially involved in pathogenesis, we previously analyzed the intrahost genetic diversity of the virus in patients with different clinical outcomes and identified a set of 141 mutations that deserved further study. Thus, through a molecular modeling approach, we showed that disruptive variants were identified primarily among cases with mild dengue fever, while potential immune-escape variants were mainly associated with cases of greater severity. We believe that some of the variants pointed out in this study were attractive enough to be potentially considered in future intelligent designs of therapeutic or prophylactic compounds or the improvement of diagnostic tools. The present analysis provides new information about DENV-2 viral factors possibly involved in its pathogenesis within the human host.
Collapse
|