Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Download

Total Articles

55
(from Reference Citation Analysis)

Article PDFs (11)

Cited by > 0 (46)

Searched Name

Petras J. Kundrotas

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Type

Show more Refine

Article Statistics

Refine

MESH Headings

Show more Refine

First Author

Show more Refine

First Author Affiliations

Show more Refine

Authors

Show more Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Countries/Regions

Show more Refine

Affiliations

Show more Refine

Corresponding Author Affiliations

Show more Refine

Category

Show more Refine

Number

Citation Analysis

Singh A, Copeland MM, Kundrotas PJ, Vakser IA. GRAMM Web Server for Protein Docking. Methods Mol Biol 2024;2714:101-112. [PMID: 37676594 DOI: 10.1007/978-1-0716-3441-7_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]

Singh A, Copeland MM, Kundrotas PJ, Vakser IA. Gramm: A webserver for free and template-based protein docking. Biophys J 2023;122:47a. [PMID: 36784471 DOI: 10.1016/j.bpj.2022.11.463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023] Open

Collins KW, Copeland MM, Kotthoff I, Singh A, Kundrotas PJ, Vakser IA. Dockground resource for protein recognition studies. Protein Sci 2022;31:e4481. [PMID: 36281025 PMCID: PMC9667896 DOI: 10.1002/pro.4481] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 10/19/2022] [Accepted: 10/20/2022] [Indexed: 12/13/2022]

Jenkins NW, Kundrotas PJ, Vakser IA. Size of the protein-protein energy funnel in crowded environment. Front Mol Biosci 2022;9:1031225. [PMID: 36425657 PMCID: PMC9679368 DOI: 10.3389/fmolb.2022.1031225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Accepted: 10/26/2022] [Indexed: 11/09/2022] Open

Kotthoff I, Kundrotas PJ, Vakser IA. Dockground scoring benchmarks for protein docking. Proteins 2022;90:1259-1266. [DOI: 10.1002/prot.26306] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Revised: 12/06/2021] [Accepted: 01/21/2022] [Indexed: 11/05/2022]

Malladi S, Powell HR, David A, Islam SA, Copeland MM, Kundrotas PJ, Sternberg MJ, Vakser IA. GWYRE: A resource for mapping variants onto experimental and modeled structures of human protein complexes. J Mol Biol 2022;434:167608. [PMID: 35662458 PMCID: PMC9188266 DOI: 10.1016/j.jmb.2022.167608] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 03/31/2022] [Accepted: 04/20/2022] [Indexed: 02/08/2023]

Lensink MF, Brysbaert G, Mauri T, Nadzirin N, Velankar S, Chaleil RAG, Clarence T, Bates PA, Kong R, Liu B, Yang G, Liu M, Shi H, Lu X, Chang S, Roy RS, Quadir F, Liu J, Cheng J, Antoniak A, Czaplewski C, Giełdoń A, Kogut M, Lipska AG, Liwo A, Lubecka EA, Maszota-Zieleniak M, Sieradzan AK, Ślusarz R, Wesołowski PA, Zięba K, Del Carpio Muñoz CA, Ichiishi E, Harmalkar A, Gray JJ, Bonvin AMJJ, Ambrosetti F, Vargas Honorato R, Jandova Z, Jiménez-García B, Koukos PI, Van Keulen S, Van Noort CW, Réau M, Roel-Touris J, Kotelnikov S, Padhorny D, Porter KA, Alekseenko A, Ignatov M, Desta I, Ashizawa R, Sun Z, Ghani U, Hashemi N, Vajda S, Kozakov D, Rosell M, Rodríguez-Lumbreras LA, Fernandez-Recio J, Karczynska A, Grudinin S, Yan Y, Li H, Lin P, Huang SY, Christoffer C, Terashi G, Verburgt J, Sarkar D, Aderinwale T, Wang X, Kihara D, Nakamura T, Hanazono Y, Gowthaman R, Guest JD, Yin R, Taherzadeh G, Pierce BG, Barradas-Bautista D, Cao Z, Cavallo L, Oliva R, Sun Y, Zhu S, Shen Y, Park T, Woo H, Yang J, Kwon S, Won J, Seok C, Kiyota Y, Kobayashi S, Harada Y, Takeda-Shitaka M, Kundrotas PJ, Singh A, Vakser IA, Dapkūnas J, Olechnovič K, Venclovas Č, Duan R, Qiu L, Xu X, Zhang S, Zou X, Wodak SJ. Prediction of protein assemblies, the next frontier: The CASP14-CAPRI experiment. Proteins 2021;89:1800-1823. [PMID: 34453465 PMCID: PMC8616814 DOI: 10.1002/prot.26222] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Revised: 07/24/2021] [Accepted: 08/05/2021] [Indexed: 12/19/2022]

Affiliation(s)

Marc F Lensink CNRS UMR8576 UGSF, Institute for Structural and Functional Glycobiology, University of Lille, Lille, France
Guillaume Brysbaert CNRS UMR8576 UGSF, Institute for Structural and Functional Glycobiology, University of Lille, Lille, France
Théo Mauri CNRS UMR8576 UGSF, Institute for Structural and Functional Glycobiology, University of Lille, Lille, France
Nurul Nadzirin Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
Sameer Velankar Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
Raphael A G Chaleil Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK
Tereza Clarence Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK
Paul A Bates Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK
Ren Kong Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Bin Liu Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Guangbo Yang Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Ming Liu Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Hang Shi Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Xufeng Lu Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Shan Chang Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Raj S Roy Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
Farhan Quadir Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
Jian Liu Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA
Jianlin Cheng Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, Missouri, USA Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri, USA
Anna Antoniak Faculty of Chemistry, University of Gdansk, Gdansk, Poland
Cezary Czaplewski Faculty of Chemistry, University of Gdansk, Gdansk, Poland
Artur Giełdoń Faculty of Chemistry, University of Gdansk, Gdansk, Poland
Mateusz Kogut Faculty of Chemistry, University of Gdansk, Gdansk, Poland
Agnieszka G Lipska Faculty of Chemistry, University of Gdansk, Gdansk, Poland
Adam Liwo Faculty of Chemistry, University of Gdansk, Gdansk, Poland
Emilia A Lubecka Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology, Gdansk, Poland
Martyna Maszota-Zieleniak Faculty of Chemistry, University of Gdansk, Gdansk, Poland
Adam K Sieradzan Faculty of Chemistry, University of Gdansk, Gdansk, Poland
Rafał Ślusarz Faculty of Chemistry, University of Gdansk, Gdansk, Poland
Patryk A Wesołowski Faculty of Chemistry, University of Gdansk, Gdansk, Poland Intercollegiate Faculty of Biotechnology, University of Gdansk and Medical University of Gdansk, Gdansk, Poland
Karolina Zięba Faculty of Chemistry, University of Gdansk, Gdansk, Poland
Carlos A Del Carpio Muñoz Graduate School of Medical Sciences, Nagoya City University, Nagoya, Japan
Eiichiro Ichiishi International University of Health and Welfare Hospital (IUHW Hospital), Nasushiobara City, Japan
Ameya Harmalkar Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, USA
Jeffrey J Gray Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, USA
Alexandre M J J Bonvin Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Francesco Ambrosetti Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Rodrigo Vargas Honorato Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Zuzana Jandova Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Brian Jiménez-García Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Panagiotis I Koukos Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Siri Van Keulen Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Charlotte W Van Noort Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Manon Réau Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Jorge Roel-Touris Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Sergei Kotelnikov Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA Innopolis University, Russia
Dzmitry Padhorny Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA
Kathryn A Porter Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
Andrey Alekseenko Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA Institute of Computer-Aided Design of the Russian Academy of Sciences, Moscow, Russia
Mikhail Ignatov Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA
Israel Desta Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
Ryota Ashizawa Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA
Zhuyezi Sun Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
Usman Ghani Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
Nasser Hashemi Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
Sandor Vajda Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA Department of Chemistry, Boston University, Boston, Massachusetts, USA
Dima Kozakov Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, USA
Mireia Rosell Instituto de Ciencias de la Vid y del Vino (ICVV), CSIC - Universidad de la Rioja - Gobierno de La Rioja, Logrono, Spain Barcelona Supercomputing Center (BSC), Barcelona, Spain
Luis A Rodríguez-Lumbreras Instituto de Ciencias de la Vid y del Vino (ICVV), CSIC - Universidad de la Rioja - Gobierno de La Rioja, Logrono, Spain Barcelona Supercomputing Center (BSC), Barcelona, Spain
Juan Fernandez-Recio Instituto de Ciencias de la Vid y del Vino (ICVV), CSIC - Universidad de la Rioja - Gobierno de La Rioja, Logrono, Spain Barcelona Supercomputing Center (BSC), Barcelona, Spain
Agnieszka Karczynska Université Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK, Grenoble, France
Sergei Grudinin Université Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK, Grenoble, France
Yumeng Yan School of Physics, Huazhong University of Science and Technology, Wuhan, China
Hao Li School of Physics, Huazhong University of Science and Technology, Wuhan, China
Peicong Lin School of Physics, Huazhong University of Science and Technology, Wuhan, China
Sheng-You Huang School of Physics, Huazhong University of Science and Technology, Wuhan, China
Charles Christoffer Department of Computer Science, Purdue University, West Lafayette, Indiana, USA
Genki Terashi Department of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
Jacob Verburgt Department of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
Daipayan Sarkar Department of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
Tunde Aderinwale Department of Computer Science, Purdue University, West Lafayette, Indiana, USA
Xiao Wang Department of Computer Science, Purdue University, West Lafayette, Indiana, USA
Daisuke Kihara Department of Computer Science, Purdue University, West Lafayette, Indiana, USA Department of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
Tsukasa Nakamura Graduate School of Information Sciences, Tohoku University, Sendai, Miyagi, Japan
Yuya Hanazono Institute for Quantum Life Science, National Institutes for Quantum and Radiological Science and Technology, Tokai, Ibaraki, Japan
Ragul Gowthaman University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA Department of Cell Biology and Molecular Genetics, University of Maryland, Maryland, USA
Johnathan D Guest University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA Department of Cell Biology and Molecular Genetics, University of Maryland, Maryland, USA
Rui Yin University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA Department of Cell Biology and Molecular Genetics, University of Maryland, Maryland, USA
Ghazaleh Taherzadeh University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA Department of Cell Biology and Molecular Genetics, University of Maryland, Maryland, USA
Brian G Pierce University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA Department of Cell Biology and Molecular Genetics, University of Maryland, Maryland, USA
Didier Barradas-Bautista King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Zhen Cao King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Luigi Cavallo King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Romina Oliva University of Naples "Parthenope", Napoli, Italy
Yuanfei Sun Department of Electrical and Computer Engineering, Texas A&M University, Texas, USA
Shaowen Zhu Department of Electrical and Computer Engineering, Texas A&M University, Texas, USA
Yang Shen Department of Electrical and Computer Engineering, Texas A&M University, Texas, USA
Taeyong Park Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Hyeonuk Woo Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Jinsol Yang Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Sohee Kwon Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Jonghun Won Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Chaok Seok Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Yasuomi Kiyota School of Pharmacy, Kitasato University, Minato-ku, Tokyo, Japan
Shinpei Kobayashi School of Pharmacy, Kitasato University, Minato-ku, Tokyo, Japan
Yoshiki Harada School of Pharmacy, Kitasato University, Minato-ku, Tokyo, Japan
Mayuko Takeda-Shitaka School of Pharmacy, Kitasato University, Minato-ku, Tokyo, Japan
Petras J Kundrotas Computational Biology Program and Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, USA
Amar Singh Computational Biology Program and Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, USA
Ilya A Vakser Computational Biology Program and Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, USA
Justas Dapkūnas Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
Kliment Olechnovič Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
Česlovas Venclovas Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
Rui Duan Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
Liming Qiu Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
Xianjin Xu Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
Shuang Zhang Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA
Xiaoqin Zou Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri, USA Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, USA Department of Physics and Astronomy, University of Missouri, Columbia, Missouri, USA Department of Biochemistry, University of Missouri, Columbia, Missouri, USA
Shoshana J Wodak Center for Structural Biology, VIB-VUB, Brussels, Belgium

Collapse

Badal VD, Kundrotas PJ, Vakser IA. Text mining for modeling of protein complexes enhanced by machine learning. Bioinformatics 2021;37:497-505. [PMID: 32960948 PMCID: PMC8088328 DOI: 10.1093/bioinformatics/btaa823] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 09/04/2020] [Accepted: 09/08/2020] [Indexed: 11/14/2022] Open

Abstract

MOTIVATION

Procedures for structural modeling of protein-protein complexes (protein docking) produce a number of models which need to be further analyzed and scored. Scoring can be based on independently determined constraints on the structure of the complex, such as knowledge of amino acids essential for the protein interaction. Previously, we showed that text mining of residues in freely available PubMed abstracts of papers on studies of protein-protein interactions may generate such constraints. However, absence of post-processing of the spotted residues reduced usability of the constraints, as a significant number of the residues were not relevant for the binding of the specific proteins.

RESULTS

We explored filtering of the irrelevant residues by two machine learning approaches, Deep Recursive Neural Network (DRNN) and Support Vector Machine (SVM) models with different training/testing schemes. The results showed that the DRNN model is superior to the SVM model when training is performed on the PMC-OA full-text articles and applied to classification (interface or non-interface) of the residues spotted in the PubMed abstracts. When both training and testing is performed on full-text articles or on abstracts, the performance of these models is similar. Thus, in such cases, there is no need to utilize computationally demanding DRNN approach, which is computationally expensive especially at the training stage. The reason is that SVM success is often determined by the similarity in data/text patterns in the training and the testing sets, whereas the sentence structures in the abstracts are, in general, different from those in the full text articles.

AVAILABILITYAND IMPLEMENTATION

The code and the datasets generated in this study are available at https://gitlab.ku.edu/vakser-lab-public/text-mining/-/tree/2020-09-04.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Hadarovich A, Chakravarty D, Tuzikov AV, Ben-Tal N, Kundrotas PJ, Vakser IA. Structural motifs in protein cores and at protein-protein interfaces are different. Protein Sci 2020;30:381-390. [PMID: 33166001 DOI: 10.1002/pro.3996] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 10/30/2020] [Accepted: 10/31/2020] [Indexed: 11/10/2022]

Singh A, Dauzhenka T, Kundrotas PJ, Sternberg MJE, Vakser IA. Application of docking methodologies to modeled proteins. Proteins 2020;88:1180-1188. [PMID: 32170770 DOI: 10.1002/prot.25889] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2019] [Revised: 02/15/2020] [Accepted: 03/07/2020] [Indexed: 12/12/2022]

Chakravarty D, McElfresh GW, Kundrotas PJ, Vakser IA. How to choose templates for modeling of protein complexes: Insights from benchmarking template-based docking. Proteins 2020;88:1070-1081. [PMID: 31994759 DOI: 10.1002/prot.25875] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Revised: 01/07/2020] [Accepted: 01/22/2020] [Indexed: 01/01/2023]

Kotthoff IP, Kundrotas PJ, Vakser IA. Docking Decoys for Modeled Proteins. Biophys J 2020. [DOI: 10.1016/j.bpj.2019.11.1729] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open

Kundrotas PJ, Kotthoff I, Choi SW, Copeland MM, Vakser IA. Dockground Tool for Development and Benchmarking of Protein Docking Procedures. Methods Mol Biol 2020;2165:289-300. [PMID: 32621232 DOI: 10.1007/978-1-0716-0708-4_17] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Lensink MF, Brysbaert G, Nadzirin N, Velankar S, Chaleil RAG, Gerguri T, Bates PA, Laine E, Carbone A, Grudinin S, Kong R, Liu RR, Xu XM, Shi H, Chang S, Eisenstein M, Karczynska A, Czaplewski C, Lubecka E, Lipska A, Krupa P, Mozolewska M, Golon Ł, Samsonov S, Liwo A, Crivelli S, Pagès G, Karasikov M, Kadukova M, Yan Y, Huang SY, Rosell M, Rodríguez-Lumbreras LA, Romero-Durana M, Díaz-Bueno L, Fernandez-Recio J, Christoffer C, Terashi G, Shin WH, Aderinwale T, Subraman SRMV, Kihara D, Kozakov D, Vajda S, Porter K, Padhorny D, Desta I, Beglov D, Ignatov M, Kotelnikov S, Moal IH, Ritchie DW, de Beauchêne IC, Maigret B, Devignes MD, Echartea MER, Barradas-Bautista D, Cao Z, Cavallo L, Oliva R, Cao Y, Shen Y, Baek M, Park T, Woo H, Seok C, Braitbard M, Bitton L, Scheidman-Duhovny D, Dapkūnas J, Olechnovič K, Venclovas Č, Kundrotas PJ, Belkin S, Chakravarty D, Badal VD, Vakser IA, Vreven T, Vangaveti S, Borrman T, Weng Z, Guest JD, Gowthaman R, Pierce BG, Xu X, Duan R, Qiu L, Hou J, Merideth BR, Ma Z, Cheng J, Zou X, Koukos PI, Roel-Touris J, Ambrosetti F, Geng C, Schaarschmidt J, Trellet ME, Melquiond ASJ, Xue L, Jiménez-García B, van Noort CW, Honorato RV, Bonvin AMJJ, Wodak SJ. Blind prediction of homo- and hetero-protein complexes: The CASP13-CAPRI experiment. Proteins 2019;87:1200-1221. [PMID: 31612567 PMCID: PMC7274794 DOI: 10.1002/prot.25838] [Citation(s) in RCA: 79] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2019] [Revised: 09/26/2019] [Accepted: 09/27/2019] [Indexed: 12/28/2022]

Affiliation(s)

Marc F. Lensink University of Lille, CNRS UMR8576 UGSF, Unité de Glycobiologie Structurale et Fonctionnelle, Lille, France
Guillaume Brysbaert University of Lille, CNRS UMR8576 UGSF, Unité de Glycobiologie Structurale et Fonctionnelle, Lille, France
Nurul Nadzirin European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
Sameer Velankar European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
Raphaël A. G. Chaleil Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK
Tereza Gerguri Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK
Paul A. Bates Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK
Elodie Laine CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Sorbonne Université, Paris, France
Alessandra Carbone CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Sorbonne Université, Paris, France Institut Universitaire de France (IUF), Paris, France
Sergei Grudinin Université Grenoble Alpes, CNRS, Inria, Grenoble INP, LJK, Grenoble, France
Ren Kong Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Ran-Ran Liu Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Xi-Ming Xu Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Hang Shi Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Shan Chang Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
Miriam Eisenstein Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
Agnieszka Karczynska Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland
Cezary Czaplewski Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland
Emilia Lubecka Institute of Informatics, Faculty of Mathematics, Physics, and Informatics, University of Gdańsk, Gdańsk, Poland
Agnieszka Lipska Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland
Paweł Krupa Polish Academy of Sciences, Institute of Physics, Warsaw, Poland
Magdalena Mozolewska Polish Academy of Sciences, Institute of Computer Science, Warsaw, Poland
Łukasz Golon Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland
Sergey Samsonov Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland
Adam Liwo Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland School of Computational Sciences, Korea Institute for Advanced Study, Seoul, South Korea
Silvia Crivelli Department of Computer Science, UC Davis, Davis, California
Guillaume Pagès Université Grenoble Alpes, CNRS, Inria, Grenoble INP, LJK, Grenoble, France
Mikhail Karasikov Department of Computer Science, ETH, Zurich, Switzerland
Maria Kadukova Université Grenoble Alpes, CNRS, Inria, Grenoble INP, LJK, Grenoble, France Moscow Institute of Physics and Technology, Dolgoprudniy, Russia
Yumeng Yan School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
Sheng-You Huang School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
Mireia Rosell Barcelona Supercomputing Center (BSC), Barcelona, Spain Instituto de Ciencias de la Vid y del Vino (ICVV-CSIC), Logroño, Spain
Luis A. Rodríguez-Lumbreras Barcelona Supercomputing Center (BSC), Barcelona, Spain Instituto de Ciencias de la Vid y del Vino (ICVV-CSIC), Logroño, Spain
Miguel Romero-Durana Barcelona Supercomputing Center (BSC), Barcelona, Spain
Lucía Díaz-Bueno Barcelona Supercomputing Center (BSC), Barcelona, Spain
Juan Fernandez-Recio Barcelona Supercomputing Center (BSC), Barcelona, Spain Instituto de Ciencias de la Vid y del Vino (ICVV-CSIC), Logroño, Spain Instituto de Biología Molecular de Barcelona (IBMB-CSIC), Barcelona, Spain
Charles Christoffer Department of Computer Science, Purdue University, West Lafayette, Indiana
Genki Terashi Department of Biological Sciences, Purdue University, West Lafayette, Indiana
Woong-Hee Shin Department of Biological Sciences, Purdue University, West Lafayette, Indiana
Tunde Aderinwale Department of Computer Science, Purdue University, West Lafayette, Indiana
Sai Raghavendra Maddhuri Venkata Subraman Department of Computer Science, Purdue University, West Lafayette, Indiana
Daisuke Kihara Department of Computer Science, Purdue University, West Lafayette, Indiana
Dima Kozakov Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York
Sandor Vajda Department of Biomedical Engineering, Boston University, Boston, Massachusetts Department of Chemistry, Boston University, Boston, Massachusetts
Kathryn Porter Department of Biomedical Engineering, Boston University, Boston, Massachusetts
Dzmitry Padhorny Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York
Israel Desta Department of Biomedical Engineering, Boston University, Boston, Massachusetts
Dmitri Beglov Department of Biomedical Engineering, Boston University, Boston, Massachusetts
Mikhail Ignatov Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York
Sergey Kotelnikov Moscow Institute of Physics and Technology, Dolgoprudniy, Russia Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York
Iain H. Moal European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
David W. Ritchie University of Lorraine, CNRS, Inria, LORIA, Nancy, France
Isaure Chauvot de Beauchêne University of Lorraine, CNRS, Inria, LORIA, Nancy, France
Bernard Maigret University of Lorraine, CNRS, Inria, LORIA, Nancy, France
Marie-Dominique Devignes University of Lorraine, CNRS, Inria, LORIA, Nancy, France
Maria E. Ruiz Echartea University of Lorraine, CNRS, Inria, LORIA, Nancy, France
Didier Barradas-Bautista Physical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
Zhen Cao Physical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
Luigi Cavallo Physical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
Romina Oliva Department of Sciences and Technologies, University of Naples “Parthenope”, Napoli, Italy
Yue Cao Department of Electrical and Computer Engineering, Texas A&M University, College Station, Texas
Yang Shen Department of Electrical and Computer Engineering, Texas A&M University, College Station, Texas
Minkyung Baek Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Taeyong Park Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Hyeonuk Woo Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Chaok Seok Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Merav Braitbard Department of Biological Chemistry, Institute of Live Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
Lirane Bitton School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
Dina Scheidman-Duhovny Department of Biological Chemistry, Institute of Live Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
Justas Dapkūnas Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
Kliment Olechnovič Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
Česlovas Venclovas Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania
Petras J. Kundrotas Computational Biology Program and Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas
Saveliy Belkin Computational Biology Program and Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas
Devlina Chakravarty Computational Biology Program and Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas
Varsha D. Badal Computational Biology Program and Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas
Ilya A. Vakser Computational Biology Program and Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas
Thom Vreven Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts
Sweta Vangaveti Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts
Tyler Borrman Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts
Zhiping Weng Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts
Johnathan D. Guest University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland
Ragul Gowthaman University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland
Brian G. Pierce University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland
Xianjin Xu Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri
Rui Duan Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri
Liming Qiu Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri
Jie Hou Department of Computer Science, University of Missouri, Columbia, Missouri
Benjamin Ryan Merideth Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri Informatics Institute, University of Missouri, Columbia, Missouri
Zhiwei Ma Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri Department of Physics and Astronomy, University of Missouri, Columbia, Missouri
Jianlin Cheng Department of Computer Science, University of Missouri, Columbia, Missouri Informatics Institute, University of Missouri, Columbia, Missouri
Xiaoqin Zou Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri Informatics Institute, University of Missouri, Columbia, Missouri Department of Physics and Astronomy, University of Missouri, Columbia, Missouri Department of Biochemistry, University of Missouri, Columbia, Missouri
Panagiotis I. Koukos Computational Structural Biology Group, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Jorge Roel-Touris Computational Structural Biology Group, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Francesco Ambrosetti Computational Structural Biology Group, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Cunliang Geng Computational Structural Biology Group, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Jörg Schaarschmidt Computational Structural Biology Group, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Mikael E. Trellet Computational Structural Biology Group, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Adrien S. J. Melquiond Computational Structural Biology Group, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Li Xue Computational Structural Biology Group, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Brian Jiménez-García Computational Structural Biology Group, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Charlotte W. van Noort Computational Structural Biology Group, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Rodrigo V. Honorato Computational Structural Biology Group, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Alexandre M. J. J. Bonvin Computational Structural Biology Group, Department of Chemistry, Faculty of Science, Utrecht University, Utrecht, The Netherlands
Shoshana J. Wodak VIB Structural Biology Research Center, VUB, Brussels, Belgium

Collapse

Hadarovich A, Anishchenko I, Tuzikov AV, Kundrotas PJ, Vakser IA. Gene ontology improves template selection in comparative protein docking. Proteins 2018;87:245-253. [PMID: 30520123 DOI: 10.1002/prot.25645] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2018] [Revised: 10/21/2018] [Accepted: 11/29/2018] [Indexed: 02/06/2023]

Dauzhenka T, Kundrotas PJ, Vakser IA. Computational Feasibility of an Exhaustive Search of Side-Chain Conformations in Protein-Protein Docking. J Comput Chem 2018;39:2012-2021. [PMID: 30226647 DOI: 10.1002/jcc.25381] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Revised: 03/24/2018] [Accepted: 05/26/2018] [Indexed: 11/07/2022]

Anishchenko I, Kundrotas PJ, Vakser IA. Contact Potential for Structure Prediction of Proteins and Protein Complexes from Potts Model. Biophys J 2018;115:809-821. [PMID: 30122295 DOI: 10.1016/j.bpj.2018.07.035] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2018] [Revised: 07/16/2018] [Accepted: 07/31/2018] [Indexed: 12/18/2022] Open

Badal VD, Kundrotas PJ, Vakser IA. Natural language processing in text mining for structural modeling of protein complexes. BMC Bioinformatics 2018;19:84. [PMID: 29506465 PMCID: PMC5838950 DOI: 10.1186/s12859-018-2079-4] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2017] [Accepted: 02/20/2018] [Indexed: 12/04/2022] Open

Abstract

Background

Structural modeling of protein-protein interactions produces a large number of putative configurations of the protein complexes. Identification of the near-native models among them is a serious challenge. Publicly available results of biomedical research may provide constraints on the binding mode, which can be essential for the docking. Our text-mining (TM) tool, which extracts binding site residues from the PubMed abstracts, was successfully applied to protein docking (Badal et al., PLoS Comput Biol, 2015; 11: e1004630). Still, many extracted residues were not relevant to the docking.

Results

We present an extension of the TM tool, which utilizes natural language processing (NLP) for analyzing the context of the residue occurrence. The procedure was tested using generic and specialized dictionaries. The results showed that the keyword dictionaries designed for identification of protein interactions are not adequate for the TM prediction of the binding mode. However, our dictionary designed to distinguish keywords relevant to the protein binding sites led to considerable improvement in the TM performance. We investigated the utility of several methods of context analysis, based on dissection of the sentence parse trees. The machine learning-based NLP filtered the pool of the mined residues significantly more efficiently than the rule-based NLP. Constraints generated by NLP were tested in docking of unbound proteins from the DOCKGROUND X-ray benchmark set 4. The output of the global low-resolution docking scan was post-processed, separately, by constraints from the basic TM, constraints re-ranked by NLP, and the reference constraints. The quality of a match was assessed by the interface root-mean-square deviation. The results showed significant improvement of the docking output when using the constraints generated by the advanced TM with NLP.

Conclusions

The basic TM procedure for extracting protein-protein binding site residues from the PubMed abstracts was significantly advanced by the deep parsing (NLP techniques for contextual analysis) in purging of the initial pool of the extracted residues. Benchmarking showed a substantial increase of the docking success rate based on the constraints generated by the advanced TM with NLP.

Electronic supplementary material

The online version of this article (10.1186/s12859-018-2079-4) contains supplementary material, which is available to authorized users.

Collapse

Kundrotas PJ, Anishchenko I, Badal VD, Das M, Dauzhenka T, Vakser IA. Modeling CAPRI targets 110-120 by template-based and free docking using contact potential and combined scoring function. Proteins 2018;86 Suppl 1:302-310. [PMID: 28905425 PMCID: PMC5820180 DOI: 10.1002/prot.25380] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2017] [Revised: 08/25/2017] [Accepted: 09/10/2017] [Indexed: 01/12/2023]

Dauzhenka T, Anishchenko I, Kundrotas PJ, Vakser IA. Relative Contribution of the Refinement Steps to the Protein-Protein Docking Success Rate. Biophys J 2018. [DOI: 10.1016/j.bpj.2017.11.3147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open

Kundrotas PJ, Anishchenko I, Dauzhenka T, Kotthoff I, Mnevets D, Copeland MM, Vakser IA. Dockground: A comprehensive data resource for modeling of protein complexes. Protein Sci 2017;27:172-181. [PMID: 28891124 DOI: 10.1002/pro.3295] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Revised: 09/06/2017] [Accepted: 09/07/2017] [Indexed: 12/28/2022]

Dauzhenka T, Anishchenko I, Kundrotas PJ, Vakser IA. Refinement of Protein Docking with Atom-Atom Contact Potentials, Backbone Flexibility and Side-Chain Repacking. Biophys J 2017. [DOI: 10.1016/j.bpj.2016.11.328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open

Anishchenko I, Kundrotas PJ, Vakser IA. Structural quality of unrefined models in protein docking. Proteins 2017;85:39-45. [PMID: 27756103 PMCID: PMC5167671 DOI: 10.1002/prot.25188] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Revised: 09/29/2016] [Accepted: 10/11/2016] [Indexed: 11/11/2022]

Anishchenko I, Kundrotas PJ, Vakser IA. Modeling complexes of modeled proteins. Proteins 2016;85:470-478. [PMID: 27701777 DOI: 10.1002/prot.25183] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2016] [Revised: 09/22/2016] [Accepted: 10/02/2016] [Indexed: 12/21/2022]

Zheng J, Kundrotas PJ, Vakser IA, Liu S. Template-Based Modeling of Protein-RNA Interactions. PLoS Comput Biol 2016;12:e1005120. [PMID: 27662342 PMCID: PMC5035060 DOI: 10.1371/journal.pcbi.1005120] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2016] [Accepted: 08/25/2016] [Indexed: 12/29/2022] Open

Abstract

Protein-RNA complexes formed by specific recognition between RNA and RNA-binding proteins play an important role in biological processes. More than a thousand of such proteins in human are curated and many novel RNA-binding proteins are to be discovered. Due to limitations of experimental approaches, computational techniques are needed for characterization of protein-RNA interactions. Although much progress has been made, adequate methodologies reliably providing atomic resolution structural details are still lacking. Although protein-RNA free docking approaches proved to be useful, in general, the template-based approaches provide higher quality of predictions. Templates are key to building a high quality model. Sequence/structure relationships were studied based on a representative set of binary protein-RNA complexes from PDB. Several approaches were tested for pairwise target/template alignment. The analysis revealed a transition point between random and correct binding modes. The results showed that structural alignment is better than sequence alignment in identifying good templates, suitable for generating protein-RNA complexes close to the native structure, and outperforms free docking, successfully predicting complexes where the free docking fails, including cases of significant conformational change upon binding. A template-based protein-RNA interaction modeling protocol PRIME was developed and benchmarked on a representative set of complexes.

Structures of protein-RNA complexes are important for characterization of biological processes. The number of experimentally determined protein-RNA complexes is limited. Thus modeling of these complexes is important. Reliable structural predictions of proteins and their complexes are provided by comparative modeling, which takes advantage of similar complexes with experimentally determined structures. Thus, in the case of protein-RNA complexes, it is important to determine if similar proteins and RNAs bind in a similar way. We show that, similarly to the earlier published results on protein-protein complexes, such correlation of the protein-RNA binding mode and the monomers similarity indeed exists, and is stronger when the similarity is determined by structure rather than sequence alignment. The data shows clear transition from random to similar binding mode with the increase of the structural similarity of the monomers. On the basis of the results we designed and implemented a predictive tool, which should be useful for the biological community interested in modeling of protein-RNA interactions.

Collapse

Lensink MF, Velankar S, Kryshtafovych A, Huang SY, Schneidman-Duhovny D, Sali A, Segura J, Fernandez-Fuentes N, Viswanath S, Elber R, Grudinin S, Popov P, Neveu E, Lee H, Baek M, Park S, Heo L, Rie Lee G, Seok C, Qin S, Zhou HX, Ritchie DW, Maigret B, Devignes MD, Ghoorah A, Torchala M, Chaleil RAG, Bates PA, Ben-Zeev E, Eisenstein M, Negi SS, Weng Z, Vreven T, Pierce BG, Borrman TM, Yu J, Ochsenbein F, Guerois R, Vangone A, Rodrigues JPGLM, van Zundert G, Nellen M, Xue L, Karaca E, Melquiond ASJ, Visscher K, Kastritis PL, Bonvin AMJJ, Xu X, Qiu L, Yan C, Li J, Ma Z, Cheng J, Zou X, Shen Y, Peterson LX, Kim HR, Roy A, Han X, Esquivel-Rodriguez J, Kihara D, Yu X, Bruce NJ, Fuller JC, Wade RC, Anishchenko I, Kundrotas PJ, Vakser IA, Imai K, Yamada K, Oda T, Nakamura T, Tomii K, Pallara C, Romero-Durana M, Jiménez-García B, Moal IH, Férnandez-Recio J, Joung JY, Kim JY, Joo K, Lee J, Kozakov D, Vajda S, Mottarella S, Hall DR, Beglov D, Mamonov A, Xia B, Bohnuud T, Del Carpio CA, Ichiishi E, Marze N, Kuroda D, Roy Burman SS, Gray JJ, Chermak E, Cavallo L, Oliva R, Tovchigrechko A, Wodak SJ. Prediction of homoprotein and heteroprotein complexes by protein docking and template-based modeling: A CASP-CAPRI experiment. Proteins 2016;84 Suppl 1:323-48. [PMID: 27122118 PMCID: PMC5030136 DOI: 10.1002/prot.25007] [Citation(s) in RCA: 116] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2015] [Revised: 12/30/2015] [Accepted: 02/02/2016] [Indexed: 12/26/2022]

Abstract

We present the results for CAPRI Round 30, the first joint CASP-CAPRI experiment, which brought together experts from the protein structure prediction and protein-protein docking communities. The Round comprised 25 targets from amongst those submitted for the CASP11 prediction experiment of 2014. The targets included mostly homodimers, a few homotetramers, and two heterodimers, and comprised protein chains that could readily be modeled using templates from the Protein Data Bank. On average 24 CAPRI groups and 7 CASP groups submitted docking predictions for each target, and 12 CAPRI groups per target participated in the CAPRI scoring experiment. In total more than 9500 models were assessed against the 3D structures of the corresponding target complexes. Results show that the prediction of homodimer assemblies by homology modeling techniques and docking calculations is quite successful for targets featuring large enough subunit interfaces to represent stable associations. Targets with ambiguous or inaccurate oligomeric state assignments, often featuring crystal contact-sized interfaces, represented a confounding factor. For those, a much poorer prediction performance was achieved, while nonetheless often providing helpful clues on the correct oligomeric state of the protein. The prediction performance was very poor for genuine tetrameric targets, where the inaccuracy of the homology-built subunit models and the smaller pair-wise interfaces severely limited the ability to derive the correct assembly mode. Our analysis also shows that docking procedures tend to perform better than standard homology modeling techniques and that highly accurate models of the protein components are not always required to identify their association modes with acceptable accuracy. Proteins 2016; 84(Suppl 1):323-348. © 2016 Wiley Periodicals, Inc.

Collapse

Affiliation(s)

Marc F Lensink University Lille, CNRS UMR8576 UGSF, Lille, F-59000, France.
Sameer Velankar European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
Andriy Kryshtafovych Genome Center, University of California, Davis, California, 95616
Shen-You Huang Research Support Computing, University of Missouri Bioinformatics Consortium, and Department of Computer Science, University of Missouri, Columbia, Missouri, 65211
Dina Schneidman-Duhovny Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, 94158 Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, California, 94158
Andrej Sali Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, 94158 Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, California, 94158 California Institute for Quantitative Biosciences (QB3), University of California San Francisco, San Francisco, California, 94158
Joan Segura GN7 of the National Institute for Bioinformatics (INB) and Biocomputing Unit, National Center of Biotechnology (CSIC), Madrid, 28049, Spain
Narcis Fernandez-Fuentes Institute of Biological, Environmental and Rural Sciences (IBERS), Aberystwyth University, Aberystwyth, SY233FG, United Kingdom
Shruthi Viswanath Department of Computer Science, University of Texas at Austin, Austin, Texas, 78712 Institute for Computational Engineering and Sciences, University of Texas at Austin, Austin, Texas, 78712
Ron Elber Institute for Computational Engineering and Sciences, University of Texas at Austin, Austin, Texas, 78712 Department of Chemistry, University of Texas at Austin, Austin, Texas, 78712
Sergei Grudinin LJK, University Grenoble Alpes, CNRS, Grenoble, 38000, France INRIA, Grenoble, 38000, France
Petr Popov LJK, University Grenoble Alpes, CNRS, Grenoble, 38000, France INRIA, Grenoble, 38000, France Moscow Institute of Physics and Technology, Dolgoprudniy, Russia
Emilie Neveu LJK, University Grenoble Alpes, CNRS, Grenoble, 38000, France INRIA, Grenoble, 38000, France
Hasup Lee Department of Chemistry, Seoul National University, Seoul, 151-747, Republic of Korea
Minkyung Baek Department of Chemistry, Seoul National University, Seoul, 151-747, Republic of Korea
Sangwoo Park Department of Chemistry, Seoul National University, Seoul, 151-747, Republic of Korea
Lim Heo Department of Chemistry, Seoul National University, Seoul, 151-747, Republic of Korea
Gyu Rie Lee Department of Chemistry, Seoul National University, Seoul, 151-747, Republic of Korea
Chaok Seok Department of Chemistry, Seoul National University, Seoul, 151-747, Republic of Korea
Sanbo Qin Department of Physics and Institute of Molecular Biophysics, Florida State University, Tallahassee, Florida, 32306, USA
Huan-Xiang Zhou Department of Physics and Institute of Molecular Biophysics, Florida State University, Tallahassee, Florida, 32306, USA
David W Ritchie INRIA Nancy-Grand Est, Villers-lès-Nancy, 54600, France
Bernard Maigret CNRS, LORIA, Campus Scientifique, BP 239, Vandœuvre-lès-Nancy, 54506, France
Marie-Dominique Devignes CNRS, LORIA, Campus Scientifique, BP 239, Vandœuvre-lès-Nancy, 54506, France
Anisah Ghoorah Department of Computer Science and Engineering, University of Mauritius, Reduit, Mauritius
Mieczyslaw Torchala Biomolecular Modelling Laboratory, the Francis Crick Institute, Lincoln's Inn Fields Laboratory, London, WC2A 3LY, United Kingdom
Raphaël A G Chaleil Biomolecular Modelling Laboratory, the Francis Crick Institute, Lincoln's Inn Fields Laboratory, London, WC2A 3LY, United Kingdom
Paul A Bates Biomolecular Modelling Laboratory, the Francis Crick Institute, Lincoln's Inn Fields Laboratory, London, WC2A 3LY, United Kingdom
Efrat Ben-Zeev G-INCPM, Weizmann Institute of Science, Rehovot, 7610001, Israel
Miriam Eisenstein Department of Chemical Research Support, Weizmann Institute of Science, Rehovot, 7610001, Israel
Surendra S Negi Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, 301 University Boulevard, Galveston, Texas, 77555-0857
Zhiping Weng Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts, 01605
Thom Vreven Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts, 01605
Brian G Pierce Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts, 01605
Tyler M Borrman Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts, 01605
Jinchao Yu Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, University Paris-Saclay, CEA-Saclay, Gif-sur-Yvette, 91191, France
Françoise Ochsenbein Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, University Paris-Saclay, CEA-Saclay, Gif-sur-Yvette, 91191, France
Raphaël Guerois Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, University Paris-Saclay, CEA-Saclay, Gif-sur-Yvette, 91191, France
Anna Vangone Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
João P G L M Rodrigues Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
Gydo van Zundert Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
Mehdi Nellen Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
Li Xue Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
Ezgi Karaca Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
Adrien S J Melquiond Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
Koen Visscher Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
Panagiotis L Kastritis Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
Alexandre M J J Bonvin Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
Xianjin Xu Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, 65211
Liming Qiu Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, 65211
Chengfei Yan Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, 65211 Department of Physics and Astronomy, University of Missouri, Columbia, Missouri, 65211
Jilong Li Department of Computer Science, University of Missouri, Columbia, Missouri, 65211
Zhiwei Ma Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, 65211 Department of Physics and Astronomy, University of Missouri, Columbia, Missouri, 65211
Jianlin Cheng Department of Computer Science, University of Missouri, Columbia, Missouri, 65211 Informatics Institute, University of Missouri, Columbia, Missouri, 65211
Xiaoqin Zou Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri, 65211 Department of Physics and Astronomy, University of Missouri, Columbia, Missouri, 65211 Informatics Institute, University of Missouri, Columbia, Missouri, 65211 Department of Biochemistry, University of Missouri, Columbia, Missouri, 65211
Yang Shen Toyota Technological Institute at Chicago, 6045 S Kenwood Avenue, Chicago, Illinois, 60637
Lenna X Peterson Department of Biological Sciences, Purdue University, West Lafayette, Indiana, 47907
Hyung-Rae Kim Department of Biological Sciences, Purdue University, West Lafayette, Indiana, 47907
Amit Roy Department of Biological Sciences, Purdue University, West Lafayette, Indiana, 47907 Bioinformatics and Computational Biosciences Branch, Rocky Mountain Laboratories, National Institutes of Health, Hamilton, Montano 59840
Xusi Han Department of Biological Sciences, Purdue University, West Lafayette, Indiana, 47907
Juan Esquivel-Rodriguez Department of Computer Science, Purdue University, West Lafayette, IN, USA, 47907
Daisuke Kihara Department of Biological Sciences, Purdue University, West Lafayette, Indiana, 47907 Department of Computer Science, Purdue University, West Lafayette, IN, USA, 47907
Xiaofeng Yu Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS), Heidelberg, Germany
Neil J Bruce Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS), Heidelberg, Germany
Jonathan C Fuller Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS), Heidelberg, Germany
Rebecca C Wade Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS), Heidelberg, Germany Center for Molecular Biology (ZMBH), DKFZ-ZMBH Alliance, Heidelberg University, Heidelberg, Germany Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University, Heidelberg, Germany
Ivan Anishchenko Center for Computational Biology, The University of Kansas, Lawrence, Kansas, 66047
Petras J Kundrotas Center for Computational Biology, The University of Kansas, Lawrence, Kansas, 66047
Ilya A Vakser Center for Computational Biology, The University of Kansas, Lawrence, Kansas, 66047 Department of Molecular Biosciences, The University of Kansas, Lawrence, Kansas, 66047
Kenichiro Imai Computational Biology Research Center (CBRC), National Institute of Advanced Industrial Science and Technology (AIST), Koto-Ku, Japan
Kazunori Yamada Computational Biology Research Center (CBRC), National Institute of Advanced Industrial Science and Technology (AIST), Koto-Ku, Japan
Toshiyuki Oda Computational Biology Research Center (CBRC), National Institute of Advanced Industrial Science and Technology (AIST), Koto-Ku, Japan
Tsukasa Nakamura Graduate School of Frontier Sciences, the University of Tokyo, Kashiwa, Japan
Kentaro Tomii Computational Biology Research Center (CBRC), National Institute of Advanced Industrial Science and Technology (AIST), Koto-Ku, Japan Graduate School of Frontier Sciences, the University of Tokyo, Kashiwa, Japan
Chiara Pallara Joint BSC-CRG-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, C/Jordi Girona 29, Barcelona, 08034, Spain
Miguel Romero-Durana Joint BSC-CRG-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, C/Jordi Girona 29, Barcelona, 08034, Spain
Brian Jiménez-García Joint BSC-CRG-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, C/Jordi Girona 29, Barcelona, 08034, Spain
Iain H Moal Joint BSC-CRG-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, C/Jordi Girona 29, Barcelona, 08034, Spain
Juan Férnandez-Recio Joint BSC-CRG-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, C/Jordi Girona 29, Barcelona, 08034, Spain
Jong Young Joung Center for in-Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea
Jong Yun Kim Center for in-Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea
Keehyoung Joo Center for in-Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea Center for Advanced Computation, Korea Institute for Advanced Study, Seoul, 130-722, Korea
Jooyoung Lee Center for in-Silico Protein Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea School of Computational Science, Korea Institute for Advanced Study, Seoul, 130-722, Korea
Dima Kozakov Department of Biomedical Engineering, Boston University, Boston, Massachusetts
Sandor Vajda Department of Biomedical Engineering, Boston University, Boston, Massachusetts Department of Chemistry, Boston University, Boston, Massachusetts
Scott Mottarella Department of Biomedical Engineering, Boston University, Boston, Massachusetts
David R Hall Department of Biomedical Engineering, Boston University, Boston, Massachusetts
Dmitri Beglov Department of Biomedical Engineering, Boston University, Boston, Massachusetts
Artem Mamonov Department of Biomedical Engineering, Boston University, Boston, Massachusetts
Bing Xia Department of Biomedical Engineering, Boston University, Boston, Massachusetts
Tanggis Bohnuud Department of Biomedical Engineering, Boston University, Boston, Massachusetts
Carlos A Del Carpio Institute of Biological Diversity, International Pacific Institute of Indiana, Bloomington, Indiana, 47401 Drosophila Genetic Resource Center, Kyoto Institute of Technology, Ukyo-Ku, 616-8354, Japan
Eichiro Ichiishi International University of Health and Welfare Hospital (IUHW Hospital), Asushiobara-City, Tochigi Prefecture, 329-2763, Japan
Nicholas Marze Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, 21218
Daisuke Kuroda Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, 21218
Shourya S Roy Burman Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, 21218
Jeffrey J Gray Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, 21218 Program in Molecular Biophysics, Johns Hopkins University, Baltimore, Maryland, 21218
Edrisse Chermak King Abdullah University of Science and Technology, Saudi Arabia
Luigi Cavallo King Abdullah University of Science and Technology, Saudi Arabia
Romina Oliva University of Naples "Parthenope", Napoli, Italy
Andrey Tovchigrechko J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, Maryland, 20850
Shoshana J Wodak Departments of Biochemistry and Molecular Genetics, University of Toronto, Toronto, Ontario, Canada. VIB Structural Biology Research Center, VUB Pleinlaan 2, Brussels, 1050, Belgium.

Collapse

Anishchenko I, Badal V, Dauzhenka T, Das M, Tuzikov AV, Kundrotas PJ, Vakser IA. Genome-Wide Structural Modeling of Protein-Protein Interactions. Bioinformatics Research and Applications 2016. [DOI: 10.1007/978-3-319-38782-6_8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]

Badal VD, Kundrotas PJ, Vakser IA. Text Mining for Protein Docking. PLoS Comput Biol 2015;11:e1004630. [PMID: 26650466 PMCID: PMC4674139 DOI: 10.1371/journal.pcbi.1004630] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2015] [Accepted: 10/29/2015] [Indexed: 11/18/2022] Open

Abstract

The rapidly growing amount of publicly available information from biomedical research is readily accessible on the Internet, providing a powerful resource for predictive biomolecular modeling. The accumulated data on experimentally determined structures transformed structure prediction of proteins and protein complexes. Instead of exploring the enormous search space, predictive tools can simply proceed to the solution based on similarity to the existing, previously determined structures. A similar major paradigm shift is emerging due to the rapidly expanding amount of information, other than experimentally determined structures, which still can be used as constraints in biomolecular structure prediction. Automated text mining has been widely used in recreating protein interaction networks, as well as in detecting small ligand binding sites on protein structures. Combining and expanding these two well-developed areas of research, we applied the text mining to structural modeling of protein-protein complexes (protein docking). Protein docking can be significantly improved when constraints on the docking mode are available. We developed a procedure that retrieves published abstracts on a specific protein-protein interaction and extracts information relevant to docking. The procedure was assessed on protein complexes from Dockground (http://dockground.compbio.ku.edu). The results show that correct information on binding residues can be extracted for about half of the complexes. The amount of irrelevant information was reduced by conceptual analysis of a subset of the retrieved abstracts, based on the bag-of-words (features) approach. Support Vector Machine models were trained and validated on the subset. The remaining abstracts were filtered by the best-performing models, which decreased the irrelevant information for ~ 25% complexes in the dataset. The extracted constraints were incorporated in the docking protocol and tested on the Dockground unbound benchmark set, significantly increasing the docking success rate.

Protein interactions are central for many cellular processes. Physical characterization of these interactions is essential for understanding of life processes and applications in biology and medicine. Because of the inherent limitations of experimental techniques and rapid development of computational power and methodology, computer modeling is a tool of choice in many studies. Publicly available information from biomedical research is readily accessible on the Internet, providing a powerful resource for modeling of proteins and protein complexes. A major paradigm shift in modeling of protein complexes is emerging due to the rapidly expanding amount of such information, which can be used as modeling constraints. Text mining has been widely used in recreating networks of protein interactions, as well as in detecting small molecule binding sites on proteins. Combining and expanding these two well-developed areas of research, we applied the text mining to physical modeling of protein complexes (protein docking). Our procedure retrieves published abstracts on a protein-protein interaction and extracts the relevant information. The results show that correct information on binding can be obtained for about half of protein complexes. The extracted constraints were incorporated in a modeling procedure, significantly improving its performance.

Collapse

Kirys T, Ruvinsky AM, Singla D, Tuzikov AV, Kundrotas PJ, Vakser IA. Simulated unbound structures for benchmarking of protein docking in the DOCKGROUND resource. BMC Bioinformatics 2015;16:243. [PMID: 26227548 PMCID: PMC4521349 DOI: 10.1186/s12859-015-0672-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2015] [Accepted: 07/10/2015] [Indexed: 11/10/2022] Open

Anishchenko I, Kundrotas PJ, Tuzikov AV, Vakser IA. Structural templates for comparative protein docking. Proteins 2015;83:1563-70. [PMID: 25488330 DOI: 10.1002/prot.24736] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2014] [Revised: 11/15/2014] [Accepted: 11/26/2014] [Indexed: 11/07/2022]

Anishchenko I, Kundrotas PJ, Tuzikov AV, Vakser IA. Protein models docking benchmark 2. Proteins 2015;83:891-7. [PMID: 25712716 DOI: 10.1002/prot.24784] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2014] [Revised: 01/30/2015] [Accepted: 02/14/2015] [Indexed: 12/28/2022]

Lensink MF, Moal IH, Bates PA, Kastritis PL, Melquiond ASJ, Karaca E, Schmitz C, van Dijk M, Bonvin AMJJ, Eisenstein M, Jiménez-García B, Grosdidier S, Solernou A, Pérez-Cano L, Pallara C, Fernández-Recio J, Xu J, Muthu P, Praneeth Kilambi K, Gray JJ, Grudinin S, Derevyanko G, Mitchell JC, Wieting J, Kanamori E, Tsuchiya Y, Murakami Y, Sarmiento J, Standley DM, Shirota M, Kinoshita K, Nakamura H, Chavent M, Ritchie DW, Park H, Ko J, Lee H, Seok C, Shen Y, Kozakov D, Vajda S, Kundrotas PJ, Vakser IA, Pierce BG, Hwang H, Vreven T, Weng Z, Buch I, Farkash E, Wolfson HJ, Zacharias M, Qin S, Zhou HX, Huang SY, Zou X, Wojdyla JA, Kleanthous C, Wodak SJ. Blind prediction of interfacial water positions in CAPRI. Proteins 2014;82:620-32. [PMID: 24155158 PMCID: PMC4582081 DOI: 10.1002/prot.24439] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2013] [Revised: 09/16/2013] [Accepted: 09/26/2013] [Indexed: 12/30/2022]

Kundrotas PJ, Vakser IA. Global and local structural similarity in protein-protein complexes: implications for template-based docking. Proteins 2013;81:2137-42. [PMID: 23946125 DOI: 10.1002/prot.24392] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2013] [Revised: 07/23/2013] [Accepted: 08/02/2013] [Indexed: 02/02/2023]

Anishchenko I, Kundrotas PJ, Tuzikov AV, Vakser IA. Protein models: the Grand Challenge of protein docking. Proteins 2013;82:278-87. [PMID: 23934791 DOI: 10.1002/prot.24385] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2013] [Revised: 07/16/2013] [Accepted: 07/26/2013] [Indexed: 12/28/2022]

Kundrotas PJ, Vakser IA, Janin J. Structural templates for modeling homodimers. Protein Sci 2013;22:1655-63. [PMID: 23996787 DOI: 10.1002/pro.2361] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2013] [Revised: 08/23/2013] [Accepted: 08/23/2013] [Indexed: 12/17/2022]

Kundrotas PJ, Vakser IA. Protein-protein alternative binding modes do not overlap. Protein Sci 2013;22:1141-5. [PMID: 23775945 DOI: 10.1002/pro.2295] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2013] [Revised: 06/01/2013] [Accepted: 06/03/2013] [Indexed: 11/09/2022]

Kundrotas PJ, Zhu Z, Vakser IA. GWIDD: a comprehensive resource for genome-wide structural modeling of protein-protein interactions. Hum Genomics 2012;6:7. [PMID: 23245398 PMCID: PMC3500202 DOI: 10.1186/1479-7364-6-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2012] [Accepted: 07/11/2012] [Indexed: 11/10/2022] Open

Sinha R, Kundrotas PJ, Vakser IA. Protein docking by the interface structure similarity: how much structure is needed? PLoS One 2012;7:e31349. [PMID: 22348074 PMCID: PMC3278447 DOI: 10.1371/journal.pone.0031349] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2011] [Accepted: 01/08/2012] [Indexed: 11/19/2022] Open

Sinha R, Kundrotas PJ, Vakser IA. Docking by structural similarity at protein-protein interfaces. Proteins 2011;78:3235-41. [PMID: 20715056 DOI: 10.1002/prot.22812] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Kundrotas PJ, Anishchenko I, Tuzikov AV, Vakser I. Docking Benchmark Set of Protein Models. Biophys J 2011. [DOI: 10.1016/j.bpj.2010.12.1947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open

Kundrotas PJ, Vakser IA. Accuracy of protein-protein binding sites in high-throughput template-based modeling. PLoS Comput Biol 2010;6:e1000727. [PMID: 20369011 PMCID: PMC2848539 DOI: 10.1371/journal.pcbi.1000727] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2009] [Accepted: 03/01/2010] [Indexed: 11/18/2022] Open

Abstract

The accuracy of protein structures, particularly their binding sites, is essential for the success of modeling protein complexes. Computationally inexpensive methodology is required for genome-wide modeling of such structures. For systematic evaluation of potential accuracy in high-throughput modeling of binding sites, a statistical analysis of target-template sequence alignments was performed for a representative set of protein complexes. For most of the complexes, alignments containing all residues of the interface were found. The full interface alignments were obtained even in the case of poor alignments where a relatively small part of the target sequence (as low as 40%) aligned to the template sequence, with a low overall alignment identity (<30%). Although such poor overall alignments might be considered inadequate for modeling of whole proteins, the alignment of the interfaces was strong enough for docking. In the set of homology models built on these alignments, one third of those ranked 1 by a simple sequence identity criteria had RMSD<5 Å, the accuracy suitable for low-resolution template free docking. Such models corresponded to multi-domain target proteins, whereas for single-domain proteins the best models had 5 Å<RMSD<10 Å, the accuracy suitable for less sensitive structure-alignment methods. Overall, ∼50% of complexes with the interfaces modeled by high-throughput techniques had accuracy suitable for meaningful docking experiments. This percentage will grow with the increasing availability of co-crystallized protein-protein complexes.

Protein-protein interactions play a central role in life processes at the molecular level. The structural information on these interactions is essential for our understanding of these processes and our ability to design drugs to cure diseases. Limitations of experimental techniques to determine the structure of protein-protein complexes leave the vast majority of these complexes to be determined by computational modeling. The modeling is also important for revealing the mechanisms of the complex formation. The 3D modeling of protein complexes (protein docking) relies on the structure of the individual proteins for the prediction of their assembly. Thus the structural accuracy of the individual proteins, which often are models themselves, is critical for the docking. For the docking purposes, the accuracy of the binding sites is obviously essential, whereas the accuracy of the non-binding regions is less critical. In our study, we systematically analyze the accuracy of the binding sites in protein models produced by high-throughput techniques suitable for large-scale (e.g., genome-wide) studies. The results indicate that this accuracy is adequate for the low- to medium-resolution docking of a significant part of known protein-protein complexes.

Collapse

Kundrotas PJ, Zhu Z, Vakser IA. GWIDD: Genome-wide protein docking database. Nucleic Acids Res 2009;38:D513-7. [PMID: 19900970 PMCID: PMC2808876 DOI: 10.1093/nar/gkp944] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Kundrotas PJ, Lensink MF, Alexov E. Homology-based modeling of 3D structures of protein–protein complexes using alignments of modified sequence profiles. Int J Biol Macromol 2008;43:198-208. [DOI: 10.1016/j.ijbiomac.2008.05.004] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2008] [Revised: 05/09/2008] [Accepted: 05/12/2008] [Indexed: 11/25/2022]

Kundrotas PJ, Alexov E. PROTCOM: searchable database of protein complexes enhanced with domain-domain structures. Nucleic Acids Res 2006;35:D575-9. [PMID: 17071962 PMCID: PMC1635331 DOI: 10.1093/nar/gkl768] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open

Kundrotas PJ, Alexov E. Predicting 3D structures of transient protein-protein complexes by homology. Biochim Biophys Acta 2006;1764:1498-511. [PMID: 16963323 DOI: 10.1016/j.bbapap.2006.08.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2006] [Revised: 07/27/2006] [Accepted: 08/03/2006] [Indexed: 11/26/2022]

Kundrotas PJ, Alexov E. Electrostatic properties of protein-protein complexes. Biophys J 2006;91:1724-36. [PMID: 16782791 PMCID: PMC1544298 DOI: 10.1529/biophysj.106.086025] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Kundrotas PJ. Statistical Studies of Flexible Nonhomogeneous Polypeptide Chains. Biomacromolecules 2005;6:3010-7. [PMID: 16283721 DOI: 10.1021/bm0503266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Kundrotas PJ, Karshikoff A. Charge sequence coding in statistical modeling of unfolded proteins. Biochim Biophys Acta 2004;1702:1-8. [PMID: 15450845 DOI: 10.1016/j.bbapap.2004.07.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/22/2004] [Revised: 06/18/2004] [Accepted: 07/01/2004] [Indexed: 11/30/2022]

Kundrotas PJ, Karshikoff A. Effects of charge–charge interactions on dimensions of unfolded proteins: A Monte Carlo study. J Chem Phys 2003. [DOI: 10.1063/1.1588996] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Kundrotas PJ, Karshikoff A. Modeling of denatured state for calculation of the electrostatic contribution to protein stability. Protein Sci 2002;11:1681-6. [PMID: 12070320 PMCID: PMC2373658 DOI: 10.1110/ps.4690102] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]