Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Li J, Drubay D, Michiels S, Gautheret D. Mining the coding and non-coding genome for cancer drivers. Cancer Lett 2015;369:307-15. [PMID: 26433158 DOI: 10.1016/j.canlet.2015.09.015] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2015] [Revised: 09/24/2015] [Accepted: 09/24/2015] [Indexed: 12/20/2022]

For:	Li J, Drubay D, Michiels S, Gautheret D. Mining the coding and non-coding genome for cancer drivers. Cancer Lett 2015;369:307-15. [PMID: 26433158 DOI: 10.1016/j.canlet.2015.09.015] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2015] [Revised: 09/24/2015] [Accepted: 09/24/2015] [Indexed: 12/20/2022]

Number

Cited by Other Article(s)

Huang B, Fan C, Chen K, Rao J, Ou P, Tian C, Yang Y, Cooper DN, Zhao H. VCAT: an integrated variant function annotation tools. Hum Genet 2024:10.1007/s00439-024-02699-6. [PMID: 39192052 DOI: 10.1007/s00439-024-02699-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Accepted: 08/14/2024] [Indexed: 08/29/2024]

Morova T, Ding Y, Huang CCF, Sar F, Schwarz T, Giambartolomei C, Baca S, Grishin D, Hach F, Gusev A, Freedman M, Pasaniuc B, Lack N. Optimized high-throughput screening of non-coding variants identified from genome-wide association studies. Nucleic Acids Res 2022;51:e18. [PMID: 36546757 PMCID: PMC9943666 DOI: 10.1093/nar/gkac1198] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 11/19/2022] [Accepted: 12/06/2022] [Indexed: 12/24/2022] Open

Affiliation(s)

Tunc Morova Vancouver Prostate Centre, Vancouver, BC V6H 3Z6, Canada
Yi Ding Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA
Chia-Chi F Huang Vancouver Prostate Centre, Vancouver, BC V6H 3Z6, Canada
Funda Sar Vancouver Prostate Centre, Vancouver, BC V6H 3Z6, Canada
Tommer Schwarz Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA
Claudia Giambartolomei Central RNA Lab, Istituto Italiano di Tecnologia, Genova 16163, Italy,Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
Sylvan C Baca Department of Medical Oncology, The Center for Functional Cancer Epigenetics, Dana Farber Cancer Institute, Boston, MA 02215, USA
Dennis Grishin Department of Medical Oncology, The Center for Functional Cancer Epigenetics, Dana Farber Cancer Institute, Boston, MA 02215, USA
Faraz Hach Vancouver Prostate Centre, Vancouver, BC V6H 3Z6, Canada,Department of Urologic Science, University of British Columbia, Vancouver, BC V5Z 1M9, Canada
Alexander Gusev Department of Medical Oncology, The Center for Functional Cancer Epigenetics, Dana Farber Cancer Institute, Boston, MA 02215, USA,Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
Matthew L Freedman Department of Medical Oncology, The Center for Functional Cancer Epigenetics, Dana Farber Cancer Institute, Boston, MA 02215, USA,The Center for Cancer Genome Discovery, Dana Farber Cancer Institute, Boston, MA 02215, USA
Bogdan Pasaniuc Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA,Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA,Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA,Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
Nathan A Lack To whom correspondence should be addressed. Tel: +1 604 875 4411;

Collapse

Lange M, Begolli R, Giakountis A. Non-Coding Variants in Cancer: Mechanistic Insights and Clinical Potential for Personalized Medicine. Noncoding RNA 2021;7:47. [PMID: 34449663 PMCID: PMC8395730 DOI: 10.3390/ncrna7030047] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Revised: 07/26/2021] [Accepted: 08/01/2021] [Indexed: 12/11/2022] Open

Wang Y, Xue H, Pourcel C, Du Y, Gautheret D. 2-kupl: mapping-free variant detection from DNA-seq data of matched samples. BMC Bioinformatics 2021;22:304. [PMID: 34090332 PMCID: PMC8180056 DOI: 10.1186/s12859-021-04185-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Accepted: 05/11/2021] [Indexed: 11/10/2022] Open

Biggs H, Parthasarathy P, Gavryushkina A, Gardner PP. ncVarDB: a manually curated database for pathogenic non-coding variants and benign controls. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020;2020:6013764. [PMID: 33258967 PMCID: PMC7706182 DOI: 10.1093/database/baaa105] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Revised: 10/13/2020] [Accepted: 11/12/2020] [Indexed: 11/22/2022]

Ergoren MC, Cobanogulları H, Temel SG, Mocan G. Functional coding/non-coding variants in EGFR, ROS1 and ALK genes and their role in liquid biopsy as a personalized therapy. Crit Rev Oncol Hematol 2020;156:103113. [PMID: 33038629 DOI: 10.1016/j.critrevonc.2020.103113] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2020] [Revised: 09/17/2020] [Accepted: 09/18/2020] [Indexed: 02/06/2023] Open

Drubay D, Gautheret D, Michiels S. A benchmark study of scoring methods for non-coding mutations. Bioinformatics 2019;34:1635-1641. [PMID: 29340599 DOI: 10.1093/bioinformatics/bty008] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2017] [Accepted: 01/09/2018] [Indexed: 01/06/2023] Open

Agajanian S, Oluyemi O, Verkhivker GM. Integration of Random Forest Classifiers and Deep Convolutional Neural Networks for Classification and Biomolecular Modeling of Cancer Driver Mutations. Front Mol Biosci 2019;6:44. [PMID: 31245384 PMCID: PMC6579812 DOI: 10.3389/fmolb.2019.00044] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Accepted: 05/23/2019] [Indexed: 12/21/2022] Open

Abstract

Development of machine learning solutions for prediction of functional and clinical significance of cancer driver genes and mutations are paramount in modern biomedical research and have gained a significant momentum in a recent decade. In this work, we integrate different machine learning approaches, including tree based methods, random forest and gradient boosted tree (GBT) classifiers along with deep convolutional neural networks (CNN) for prediction of cancer driver mutations in the genomic datasets. The feasibility of CNN in using raw nucleotide sequences for classification of cancer driver mutations was initially explored by employing label encoding, one hot encoding, and embedding to preprocess the DNA information. These classifiers were benchmarked against their tree-based alternatives in order to evaluate the performance on a relative scale. We then integrated DNA-based scores generated by CNN with various categories of conservational, evolutionary and functional features into a generalized random forest classifier. The results of this study have demonstrated that CNN can learn high level features from genomic information that are complementary to the ensemble-based predictors often employed for classification of cancer mutations. By combining deep learning-generated score with only two main ensemble-based functional features, we can achieve a superior performance of various machine learning classifiers. Our findings have also suggested that synergy of nucleotide-based deep learning scores and integrated metrics derived from protein sequence conservation scores can allow for robust classification of cancer driver mutations with a limited number of highly informative features. Machine learning predictions are leveraged in molecular simulations, protein stability, and network-based analysis of cancer mutations in the protein kinase genes to obtain insights about molecular signatures of driver mutations and enhance the interpretability of cancer-specific classification models.

Collapse

Lowdon RF, Wang T. Epigenomic annotation of noncoding mutations identifies mutated pathways in primary liver cancer. PLoS One 2017;12:e0174032. [PMID: 28333948 PMCID: PMC5363827 DOI: 10.1371/journal.pone.0174032] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2016] [Accepted: 03/02/2017] [Indexed: 11/19/2022] Open

Li H, He Z, Gu Y, Fang L, Lv X. Prioritization of non-coding disease-causing variants and long non-coding RNAs in liver cancer. Oncol Lett 2016;12:3987-3994. [PMID: 27895760 DOI: 10.3892/ol.2016.5135] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2015] [Accepted: 06/16/2016] [Indexed: 01/10/2023] Open

Abstract

There are multiple bioinformatics tools available for the detection of coding driver mutations in cancers. However, the prioritization of pathogenic non-coding variants remains a challenging and demanding task. The present study was performed to discriminate non-coding disease-causing mutations and prioritize potential cancer-implicated long non-coding RNAs (lncRNAs) in liver cancer using a logistic regression model. A logistic regression model was constructed by combining 19,153 disease-associated ClinVar and human gene mutation database pathogenic variants as the response variable and non-coding features as the predictor variable. Genome-wide association study (GWAS) disease or trait-associated variants and recurrent somatic mutations were used to validate the model. Non-coding gene features with the highest fractions of load were characterized and potential cancer-associated lncRNA candidates were prioritized by combining the fraction of high-scoring regions and average score predicted by the logistic regression model. H3K9me3 and conserved regions were the most negatively and positively informative for the model, respectively. The area under the receiver operating characteristic curve of the model was 0.92. The average score of GWAS disease-associated variants was significantly increased compared with neutral single nucleotide polymorphisms (5.8642 vs. 5.4707; P<0.001), the average score of recurrent somatic mutations of liver cancer was significantly increased compared with non-recurrent somatic mutations (5.4101 vs. 5.2768; P=0.0125). The present study found regions in lncRNAs and introns/untranslated regions of protein coding genes where mutations are most likely to be damaging. In total, 847 lncRNAs were filtered out from the background. Characterization of this subset of lncRNAs showed that these lncRNAs are more conservative, less mutated and more highly expressed compared with other control lncRNAs. In addition, 23 of these lncRNAs were differentially expressed between 12 pairs of liver cancer and adjacent normal specimens. The logistic regression model is a useful tool to prioritize non-coding pathogenic variants and lncRNAs, and paves the way for the detection of non-coding driver lncRNAs in liver cancer.

Collapse

Li H, Lv X. Functional annotation of noncoding variants and prioritization of cancer-associated lncRNAs in lung cancer. Oncol Lett 2016;12:222-230. [PMID: 27347129 DOI: 10.3892/ol.2016.4604] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Accepted: 04/01/2016] [Indexed: 11/05/2022] Open

Abstract

Multiple computational tools have been widely applied to the detection of coding driver mutations in cancer; however, the prioritization of pathogenic non-coding variants remains a difficult and demanding task. The present study was performed to distinguish non-coding disease-causing mutations from neutral ones, and to prioritize potential cancer-associated long non-coding RNAs (lncRNAs) with a logistic regression model in lung cancer. A logistic regression model was constructed, combining 19,153 disease-associated ClinVar and Human Gene Mutation Database pathogenic variants as the response variable and non-coding features as the predictor variable. Validation of the model was conducted with genome-wide association study (GWAS) disease- or trait-associated single nucleotide polymorphisms (SNPs) and recurrent somatic mutations. High scoring regions were characterized with respect to their distribution in various features and gene classes; potential cancer-associated lncRNA candidates were prioritized, combining the fraction of high-scoring regions and average score predicted by the logistic regression model. H3K79me2 was the most negative factor that contributed to the model, while conserved regions were most positively informative to the model. The area under the receiver operating characteristic curve of the model was 0.89. The model assigned a significantly higher score to GWAS SNPs and recurrent somatic mutations compared with neutral SNPs (mean, 5.9012 vs. 5.5238; P<0.001, Mann-Whitney U test) and non-recurrent mutations (mean, 5.4677 vs. 5.2277, P<0.001, Mann-Whitney U test), respectively. It was observed that regions, including splicing sites and untranslated regions, and gene classes, including cancer genes and cancer-associated lncRNAs, had an increased enrichment of high-scoring regions. In total, 2,679 cancer-associated lncRNAs were determined and characterized. A total of 104 of these lncRNAs were differentially expressed between lung cancer and normal specimens. The logistic regression model is a useful and efficient scoring system to prioritize non-coding pathogenic variants and lncRNAs, and may provide the basis for detecting non-coding driver lncRNAs in lung cancer.

Collapse

Wang Q, Zhang J, Liu Y, Zhang W, Zhou J, Duan R, Pu P, Kang C, Han L. A novel cell cycle-associated lncRNA, HOXA11-AS, is transcribed from the 5-prime end of the HOXA transcript and is a biomarker of progression in glioma. Cancer Lett 2016;373:251-9. [PMID: 26828136 DOI: 10.1016/j.canlet.2016.01.039] [Citation(s) in RCA: 137] [Impact Index Per Article: 17.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Revised: 01/08/2016] [Accepted: 01/24/2016] [Indexed: 01/17/2023]

Affiliation(s)

Qixue Wang Department of Neurosurgery, Tianjin Medical University General Hospital, Tianjin 300052, China; Laboratory of Neuro-Oncology, Tianjin Neurological Institute, Tianjin 300052, China; Key Laboratory of Post-trauma Neuro-Repair and Regeneration in Central Nervous System, Ministry of Education, Tianjin 300052, China; Tianjin Key Laboratory of Injuries, Variations and Regeneration of Nervous System, Tianjin 300052, China; Chinese Glioma Cooperative Group (CGCG), 6 Tiantanxi Li, Beijing 100050, China
Junxia Zhang Chinese Glioma Cooperative Group (CGCG), 6 Tiantanxi Li, Beijing 100050, China; Department of Neurosurgery, The First Affiliated Hospital of Nanjing Medical University, Nanjing 210029, China
Yanwei Liu Chinese Glioma Cooperative Group (CGCG), 6 Tiantanxi Li, Beijing 100050, China; Glioma Center, Department of Neurosurgery, Beijing Tiantan Hospital, Capital Medical University, Beijing 100050, China
Wei Zhang Chinese Glioma Cooperative Group (CGCG), 6 Tiantanxi Li, Beijing 100050, China; Glioma Center, Department of Neurosurgery, Beijing Tiantan Hospital, Capital Medical University, Beijing 100050, China
Junhu Zhou Department of Neurosurgery, Tianjin Medical University General Hospital, Tianjin 300052, China; Laboratory of Neuro-Oncology, Tianjin Neurological Institute, Tianjin 300052, China; Key Laboratory of Post-trauma Neuro-Repair and Regeneration in Central Nervous System, Ministry of Education, Tianjin 300052, China; Tianjin Key Laboratory of Injuries, Variations and Regeneration of Nervous System, Tianjin 300052, China; Chinese Glioma Cooperative Group (CGCG), 6 Tiantanxi Li, Beijing 100050, China
Ran Duan Chinese Glioma Cooperative Group (CGCG), 6 Tiantanxi Li, Beijing 100050, China; Glioma Center, Department of Neurosurgery, Beijing Tiantan Hospital, Capital Medical University, Beijing 100050, China
Peiyu Pu Department of Neurosurgery, Tianjin Medical University General Hospital, Tianjin 300052, China; Laboratory of Neuro-Oncology, Tianjin Neurological Institute, Tianjin 300052, China; Key Laboratory of Post-trauma Neuro-Repair and Regeneration in Central Nervous System, Ministry of Education, Tianjin 300052, China; Tianjin Key Laboratory of Injuries, Variations and Regeneration of Nervous System, Tianjin 300052, China; Chinese Glioma Cooperative Group (CGCG), 6 Tiantanxi Li, Beijing 100050, China
Chunsheng Kang Department of Neurosurgery, Tianjin Medical University General Hospital, Tianjin 300052, China; Laboratory of Neuro-Oncology, Tianjin Neurological Institute, Tianjin 300052, China; Key Laboratory of Post-trauma Neuro-Repair and Regeneration in Central Nervous System, Ministry of Education, Tianjin 300052, China; Tianjin Key Laboratory of Injuries, Variations and Regeneration of Nervous System, Tianjin 300052, China; Chinese Glioma Cooperative Group (CGCG), 6 Tiantanxi Li, Beijing 100050, China
Lei Han Department of Neurosurgery, Tianjin Medical University General Hospital, Tianjin 300052, China; Laboratory of Neuro-Oncology, Tianjin Neurological Institute, Tianjin 300052, China; Key Laboratory of Post-trauma Neuro-Repair and Regeneration in Central Nervous System, Ministry of Education, Tianjin 300052, China; Tianjin Key Laboratory of Injuries, Variations and Regeneration of Nervous System, Tianjin 300052, China; Chinese Glioma Cooperative Group (CGCG), 6 Tiantanxi Li, Beijing 100050, China.

Collapse