Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Badal VD, Kundrotas PJ, Vakser IA. Text Mining for Protein Docking. PLoS Comput Biol 2015;11:e1004630. [PMID: 26650466 PMCID: PMC4674139 DOI: 10.1371/journal.pcbi.1004630] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Received: 06/08/2015] [Accepted: 10/29/2015] [Indexed: 11/18/2022] Open

Cited by Other Article(s)

Zhao N, Wu T, Wang W, Zhang L, Gong X. Review and Comparative Analysis of Methods and Advancements in Predicting Protein Complex Structure. Interdiscip Sci 2024:10.1007/s12539-024-00626-x. [PMID: 38955920 DOI: 10.1007/s12539-024-00626-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/26/2023] [Revised: 02/29/2024] [Accepted: 03/01/2024] [Indexed: 07/04/2024]

Abstract

Protein complexes perform diverse biological functions, and obtaining their three-dimensional structure is critical to understanding and grasping their functions. In many cases, it's not just two proteins interacting to form a dimer; instead, multiple proteins interact to form a multimer. Experimentally resolving protein complex structures can be quite challenging. Recently, there have been efforts and methods that build upon prior predictions of dimer structures to attempt to predict multimer structures. However, in comparison to monomeric protein structure prediction, the accuracy of protein complex structure prediction remains relatively low. This paper provides an overview of recent advancements in efficient computational models for predicting protein complex structures. We introduce protein-protein docking methods in detail and summarize their main ideas, applicable modes, and related information. To enhance prediction accuracy, other critical protein-related information is also integrated, such as predicting interchain residue contact, utilizing experimental data like cryo-EM experiments, and considering protein interactions and non-interactions. In addition, we comprehensively review computational approaches for end-to-end prediction of protein complex structures based on artificial intelligence (AI) technology and describe commonly used datasets and representative evaluation metrics in protein complexes. Finally, we analyze the formidable challenges faced in current protein complex structure prediction tasks, including the structure prediction of heteromeric complex, disordered regions in complex, antibody-antigen complex, and RNA-related complex, as well as the evaluation metrics for complex assessment. We hope that this work will provide comprehensive knowledge of complex structure predictions to contribute to future advanced predictions.

Arora S, Chettri S, Percha V, Kumar D, Latwal M. Artifical intelligence: a virtual chemist for natural product drug discovery. J Biomol Struct Dyn 2024;42:3826-3835. [PMID: 37232451 DOI: 10.1080/07391102.2023.2216295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/16/2023] [Accepted: 05/12/2023] [Indexed: 05/27/2023]

Robin V, Bodein A, Scott-Boyer MP, Leclercq M, Périn O, Droit A. Overview of methods for characterization and visualization of a protein–protein interaction network in a multi-omics integration context. Front Mol Biosci 2022;9:962799. [PMID: 36158572 PMCID: PMC9494275 DOI: 10.3389/fmolb.2022.962799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/06/2022] [Accepted: 08/16/2022] [Indexed: 11/26/2022] Open

Lu D, Pan R, Wu W, Zhang Y, Li S, Xu H, Huang J, Xia J, Wang Q, Luan X, Lv C, Zhang W, Meng G. FL-DTD: an integrated pipeline to predict the drug interacting targets by feedback loop-based network analysis. Brief Bioinform 2022;23:6632928. [PMID: 35794722 DOI: 10.1093/bib/bbac263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/28/2022] [Revised: 06/01/2022] [Accepted: 06/06/2022] [Indexed: 11/12/2022] Open

Affiliation(s)

Dong Lu Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
Rongrong Pan Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
Wenxuan Wu Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
Yanyan Zhang Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
Shensuo Li Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
Hong Xu Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
Jialan Huang Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
Jianhua Xia Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
Qun Wang Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
Xin Luan Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
Chao Lv Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
Weidong Zhang Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China
Guofeng Meng Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Cailun 1200, 201203, Shanghai, China

Saldívar-González FI, Aldas-Bulos VD, Medina-Franco JL, Plisson F. Natural product drug discovery in the artificial intelligence era. Chem Sci 2022;13:1526-1546. [PMID: 35282622 PMCID: PMC8827052 DOI: 10.1039/d1sc04471k] [Citation(s) in RCA: 50] [Impact Index Per Article: 25.0] [Received: 08/13/2021] [Accepted: 12/10/2021] [Indexed: 12/19/2022] Open

Badal VD, Kundrotas PJ, Vakser IA. Text mining for modeling of protein complexes enhanced by machine learning. Bioinformatics 2021;37:497-505. [PMID: 32960948 PMCID: PMC8088328 DOI: 10.1093/bioinformatics/btaa823] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 08/26/2019] [Revised: 09/04/2020] [Accepted: 09/08/2020] [Indexed: 11/14/2022] Open

Abstract

MOTIVATION

Procedures for structural modeling of protein-protein complexes (protein docking) produce a number of models which need to be further analyzed and scored. Scoring can be based on independently determined constraints on the structure of the complex, such as knowledge of amino acids essential for the protein interaction. Previously, we showed that text mining of residues in freely available PubMed abstracts of papers on studies of protein-protein interactions may generate such constraints. However, absence of post-processing of the spotted residues reduced usability of the constraints, as a significant number of the residues were not relevant for the binding of the specific proteins.

RESULTS

We explored filtering of the irrelevant residues by two machine learning approaches, Deep Recursive Neural Network (DRNN) and Support Vector Machine (SVM) models with different training/testing schemes. The results showed that the DRNN model is superior to the SVM model when training is performed on the PMC-OA full-text articles and applied to classification (interface or non-interface) of the residues spotted in the PubMed abstracts. When both training and testing are performed on full-text articles or on abstracts, the performance of these models is similar. Thus, in such cases, there is no need to utilize the DRNN approach, which is computationally expensive, especially at the training stage. The reason is that SVM success is often determined by the similarity in data/text patterns in the training and the testing sets, whereas the sentence structures in the abstracts are, in general, different from those in the full-text articles.

AVAILABILITY AND IMPLEMENTATION

The code and the datasets generated in this study are available at https://gitlab.ku.edu/vakser-lab-public/text-mining/-/tree/2020-09-04.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Duan R, Qiu L, Xu X, Ma Z, Merideth BR, Shyu CR, Zou X. Performance of human and server prediction in CAPRI rounds 38-45. Proteins 2020;88:1110-1120. [PMID: 32483825 DOI: 10.1002/prot.25956] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 01/13/2020] [Revised: 03/26/2020] [Accepted: 05/27/2020] [Indexed: 11/11/2022]

Nicholson DN, Greene CS. Constructing knowledge graphs and their biomedical applications. Comput Struct Biotechnol J 2020;18:1414-1428. [PMID: 32637040 PMCID: PMC7327409 DOI: 10.1016/j.csbj.2020.05.017] [Citation(s) in RCA: 76] [Impact Index Per Article: 19.0] [Received: 03/12/2020] [Revised: 05/22/2020] [Accepted: 05/23/2020] [Indexed: 12/31/2022] Open

de Ávila Berni G, Rabelo-da-Ponte FD, Librenza-Garcia D, V. Boeira M, Kauer-Sant’Anna M, Cavalcante Passos I, Kapczinski F. Potential use of text classification tools as signatures of suicidal behavior: A proof-of-concept study using Virginia Woolf's personal writings. PLoS One 2018;13:e0204820. [PMID: 30356303 PMCID: PMC6200194 DOI: 10.1371/journal.pone.0204820] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Received: 02/21/2018] [Accepted: 09/15/2018] [Indexed: 01/04/2023] Open

Affiliation(s)

Gabriela de Ávila Berni Bipolar Disorder Program and Laboratory of Molecular Psychiatry, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil Graduation Program in Psychiatry and Department of Psychiatry, Federal University of Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
Francisco Diego Rabelo-da-Ponte Bipolar Disorder Program and Laboratory of Molecular Psychiatry, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil Graduation Program in Psychiatry and Department of Psychiatry, Federal University of Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
Diego Librenza-Garcia Bipolar Disorder Program and Laboratory of Molecular Psychiatry, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil Graduation Program in Psychiatry and Department of Psychiatry, Federal University of Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
Manuela V. Boeira Bipolar Disorder Program and Laboratory of Molecular Psychiatry, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil Graduation Program in Psychiatry and Department of Psychiatry, Federal University of Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
Márcia Kauer-Sant’Anna Bipolar Disorder Program and Laboratory of Molecular Psychiatry, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil Graduation Program in Psychiatry and Department of Psychiatry, Federal University of Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
Ives Cavalcante Passos Bipolar Disorder Program and Laboratory of Molecular Psychiatry, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil Graduation Program in Psychiatry and Department of Psychiatry, Federal University of Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
Flávio Kapczinski Bipolar Disorder Program and Laboratory of Molecular Psychiatry, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil Department of Psychiatry and Behavioural Neurosciences, McMaster University, Hamilton, ON, Canada Department of Psychiatry and Behavioral Neurosciences, St. Joseph Health Hamilton, Hamilton, ON, Canada

Mura C, Draizen EJ, Bourne PE. Structural biology meets data science: does anything change? Curr Opin Struct Biol 2018;52:95-102. [PMID: 30267935 DOI: 10.1016/j.sbi.2018.09.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Received: 07/24/2018] [Revised: 08/31/2018] [Accepted: 09/07/2018] [Indexed: 01/22/2023]

Badal VD, Kundrotas PJ, Vakser IA. Natural language processing in text mining for structural modeling of protein complexes. BMC Bioinformatics 2018;19:84. [PMID: 29506465 PMCID: PMC5838950 DOI: 10.1186/s12859-018-2079-4] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Received: 10/24/2017] [Accepted: 02/20/2018] [Indexed: 12/04/2022] Open

Abstract

Background

Structural modeling of protein-protein interactions produces a large number of putative configurations of the protein complexes. Identification of the near-native models among them is a serious challenge. Publicly available results of biomedical research may provide constraints on the binding mode, which can be essential for the docking. Our text-mining (TM) tool, which extracts binding site residues from the PubMed abstracts, was successfully applied to protein docking (Badal et al., PLoS Comput Biol, 2015; 11: e1004630). Still, many extracted residues were not relevant to the docking.

Results

We present an extension of the TM tool, which utilizes natural language processing (NLP) for analyzing the context of the residue occurrence. The procedure was tested using generic and specialized dictionaries. The results showed that the keyword dictionaries designed for identification of protein interactions are not adequate for the TM prediction of the binding mode. However, our dictionary designed to distinguish keywords relevant to the protein binding sites led to considerable improvement in the TM performance. We investigated the utility of several methods of context analysis, based on dissection of the sentence parse trees. The machine learning-based NLP filtered the pool of the mined residues significantly more efficiently than the rule-based NLP. Constraints generated by NLP were tested in docking of unbound proteins from the DOCKGROUND X-ray benchmark set 4. The output of the global low-resolution docking scan was post-processed, separately, by constraints from the basic TM, constraints re-ranked by NLP, and the reference constraints. The quality of a match was assessed by the interface root-mean-square deviation. The results showed significant improvement of the docking output when using the constraints generated by the advanced TM with NLP.

Conclusions

The basic TM procedure for extracting protein-protein binding site residues from the PubMed abstracts was significantly advanced by the deep parsing (NLP techniques for contextual analysis) in purging of the initial pool of the extracted residues. Benchmarking showed a substantial increase of the docking success rate based on the constraints generated by the advanced TM with NLP.

Electronic supplementary material

The online version of this article (10.1186/s12859-018-2079-4) contains supplementary material, which is available to authorized users.

Prediction of protein-protein interactions by label propagation with protein evolutionary and chemical information derived from heterogeneous network. J Theor Biol 2017. [DOI: 10.1016/j.jtbi.2017.06.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Indexed: 11/20/2022]

Anishchenko I, Kundrotas PJ, Vakser IA. Modeling complexes of modeled proteins. Proteins 2017;85:470-478. [PMID: 27701777 PMCID: PMC5313347 DOI: 10.1002/prot.25183] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Received: 07/09/2016] [Revised: 09/22/2016] [Accepted: 10/02/2016] [Indexed: 12/21/2022]

Vreven T, Pierce BG, Borrman TM, Weng Z. Performance of ZDOCK and IRAD in CAPRI rounds 28-34. Proteins 2016;85:408-416. [PMID: 27718275 DOI: 10.1002/prot.25186] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Received: 08/01/2016] [Revised: 09/20/2016] [Accepted: 09/29/2016] [Indexed: 11/11/2022]

Anishchenko I, Badal V, Dauzhenka T, Das M, Tuzikov AV, Kundrotas PJ, Vakser IA. Genome-Wide Structural Modeling of Protein-Protein Interactions. Bioinformatics Research and Applications 2016. [DOI: 10.1007/978-3-319-38782-6_8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 12/29/2022]