1
|
Geethu S, Vimina ER. Improved 3-D Protein Structure Predictions using Deep ResNet Model. Protein J 2021; 40:669-681. [PMID: 34510309 DOI: 10.1007/s10930-021-10016-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/09/2021] [Indexed: 10/20/2022]
Abstract
Protein Structure Prediction (PSP) is considered to be a complicated problem in computational biology. In spite of, the remarkable progress made by the co-evolution-based method in PSP, it is still a challenging and unresolved problem. Recently, along with co-evolutionary relationships, deep learning approaches have been introduced in PSP that lead to significant progress. In this paper a novel methodology using deep ResNet architecture for predicting inter-residue distance and dihedral angles is proposed, that aims to generate 125 homologous sequences in an average from a set of customized sequence database. These sequences are used to generate input features. As an outcome of neural networks, a pool of structures is generated from which the lowest potential structure is chosen as the final predicted 3-D protein structure. The proposed method is trained using 6521 protein sequences extracted from Protein Data Bank (PDB). For testing 48 protein sequences whose residue length is less than 400 residues are chosen from the 13th Critical Assessment of protein Structure Prediction (CASP 13) dataset are used. The model is compared with Alphafold, Zhang, and RaptorX. The template modeling (TM) score is used to evaluate the accuracy of the estimated structure. The proposed method produces better performances for 52% of the target sequences while that of Alphafold, Zhang, RaptorX were 10%, 22.9%, and 6% respectively. Additionally, for 37.5% target sequences, the proposed method was able to achieve accuracy greater than or equal to 0.80. The TM score obtained for the sequences under consideration were 0.69, 0.67, 0.65, and 0.58 respectively for the proposed method, Alphafold, Zhang, and RaptorX.
Collapse
Affiliation(s)
- S Geethu
- Department of Computer Science and IT, Amrita School of Arts and Sciences, Amrita Vishwa Vidyapeetham, Kochi Campus, Ernakulam, India.
| | - E R Vimina
- Department of Computer Science and IT, Amrita School of Arts and Sciences, Amrita Vishwa Vidyapeetham, Kochi Campus, Ernakulam, India
| |
Collapse
|
2
|
Bhat R, Kaushik R, Singh A, DasGupta D, Jayaraj A, Soni A, Shandilya A, Shekhar V, Shekhar S, Jayaram B. A comprehensive automated computer-aided discovery pipeline from genomes to hit molecules. Chem Eng Sci 2020. [DOI: 10.1016/j.ces.2020.115711] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
|
3
|
Singh SK, Mishra A, Yadav D, Budholiya N, Rathore AS. Understanding the mechanism of copurification of “difficult to remove” host cell proteins in rituximab biosimilar products. Biotechnol Prog 2020; 36:e2936. [DOI: 10.1002/btpr.2936] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Revised: 10/01/2019] [Accepted: 10/22/2019] [Indexed: 12/21/2022]
Affiliation(s)
- Sumit K. Singh
- Department of Chemical EngineeringIndian Institute of Technology Hauz Khas New Delhi India
| | - Avinash Mishra
- Department of Chemical EngineeringIndian Institute of Technology Hauz Khas New Delhi India
| | - Divyanshi Yadav
- Department of Chemical EngineeringIndian Institute of Technology Hauz Khas New Delhi India
| | - Niharika Budholiya
- Department of Chemical EngineeringIndian Institute of Technology Hauz Khas New Delhi India
| | - Anurag S. Rathore
- Department of Chemical EngineeringIndian Institute of Technology Hauz Khas New Delhi India
| |
Collapse
|
4
|
Borguesan B, e Silva MB, Grisci B, Inostroza-Ponta M, Dorn M. APL: An angle probability list to improve knowledge-based metaheuristics for the three-dimensional protein structure prediction. Comput Biol Chem 2015; 59 Pt A:142-57. [DOI: 10.1016/j.compbiolchem.2015.08.006] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2015] [Revised: 08/05/2015] [Accepted: 08/17/2015] [Indexed: 10/23/2022]
|
5
|
Márquez-Chamorro AE, Asencio-Cortés G, Santiesteban-Toca CE, Aguilar-Ruiz JS. Soft computing methods for the prediction of protein tertiary structures: A survey. Appl Soft Comput 2015. [DOI: 10.1016/j.asoc.2015.06.024] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
|
6
|
Thévenet P, Rey J, Moroy G, Tuffery P. De novo peptide structure prediction: an overview. Methods Mol Biol 2015; 1268:1-13. [PMID: 25555718 DOI: 10.1007/978-1-4939-2285-7_1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Peptide structure identification is an important contribution to the further characterization of the residues involved in functional interactions. De novo structure peptide prediction has, in the past few years, made significant progresses that make reasonable, for peptides up to 50 amino acids, its use for the fast identification of their structural topologies. Here, we introduce some of the concepts underlying approaches of the field, together with their limits.
Collapse
Affiliation(s)
- Pierre Thévenet
- Molécules Thérapeutiques In Silico, Inserm UMR-S 973, Université Paris Diderot, Sorbonne Paris Cité, 35 rue Helene Brion, 75013, Paris, France
| | | | | | | |
Collapse
|
7
|
Overview of computational vaccinology: vaccine development through information technology. J Appl Genet 2014; 56:381-91. [PMID: 25534541 DOI: 10.1007/s13353-014-0265-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2014] [Revised: 11/17/2014] [Accepted: 12/08/2014] [Indexed: 12/27/2022]
Abstract
Pathogenic organisms, causes of various infectious diseases, possess a rich repository of antigenic proteins that engender an immune response in a host. These types of diseases are usually treated with the use of pharmaceuticals; unfortunately, many of these also have a potential to induce fatal side effects, especially allergic responses in the diseased host. In addition, many pathogens evolve (by selective survival) single or multi-drug resistance (MDR). Therefore, a means to prevent the host from becoming susceptible to the pathogen from the onset, rather than trying to devise pharmacologic protocols to treat an ongoing infection, are increasingly seen as desirable to reduce the incidence of infectious diseases altogether. To this end, cost-effective development and use of "safe" vaccines is key. This paper provides an overview on the new and expanding area of computational vaccinology and a brief background on pathogen antigenicity, identification of pathogen-specific antigens, and screening of candidate antigens using various tools and databases developed in the recent past.
Collapse
|
8
|
Jayaram B, Dhingra P, Mishra A, Kaushik R, Mukherjee G, Singh A, Shekhar S. Bhageerath-H: a homology/ab initio hybrid server for predicting tertiary structures of monomeric soluble proteins. BMC Bioinformatics 2014; 15 Suppl 16:S7. [PMID: 25521245 PMCID: PMC4290660 DOI: 10.1186/1471-2105-15-s16-s7] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND The advent of human genome sequencing project has led to a spurt in the number of protein sequences in the databanks. Success of structure based drug discovery severely hinges on the availability of structures. Despite significant progresses in the area of experimental protein structure determination, the sequence-structure gap is continually widening. Data driven homology based computational methods have proved successful in predicting tertiary structures for sequences sharing medium to high sequence similarities. With dwindling similarities of query sequences, advanced homology/ ab initio hybrid approaches are being explored to solve structure prediction problem. Here we describe Bhageerath-H, a homology/ ab initio hybrid software/server for predicting protein tertiary structures with advancing drug design attempts as one of the goals. RESULTS Bhageerath-H web-server was validated on 75 CASP10 targets which showed TM-scores ≥ 0.5 in 91% of the cases and Cα RMSDs ≤ 5 Å from the native in 58% of the targets, which is well above the CASP10 water mark. Comparison with some leading servers demonstrated the uniqueness of the hybrid methodology in effectively sampling conformational space, scoring best decoys and refining low resolution models to high and medium resolution. CONCLUSION Bhageerath-H methodology is web enabled for the scientific community as a freely accessible web server. The methodology is fielded in the on-going CASP11 experiment.
Collapse
|
9
|
Mishra A, Rana PS, Mittal A, Jayaram B. D2N: Distance to the native. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2014; 1844:1798-807. [DOI: 10.1016/j.bbapap.2014.07.010] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/08/2014] [Revised: 07/03/2014] [Accepted: 07/15/2014] [Indexed: 12/26/2022]
|
10
|
B. Kumar N, Dhurandhar M, Aggarwal B, Anant S, Daniel K, Deng G, Djeu J, Dou J, Hawk E, Jayaram B, Jia L, Joshi R, Kararala M, Karunagaran D, Kucuk O, Kumar L, Malafa M, Samathanam GJ, Sarkar F, Siddiqi M, Singh RP, Srivastava A, White JD. Proceedings of the Indo-U.S. bilateral workshop on accelerating botanicals/biologics agent development research for cancer chemoprevention, treatment, and survival. Cancer Med 2014; 2:108-15. [PMID: 24279005 PMCID: PMC3797562 DOI: 10.1002/cam4.42] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
With the evolving evidence of the promise of botanicals/biologics for cancer chemoprevention and treatment, an Indo-U.S. collaborative Workshop focusing on “Accelerating Botanicals Agent Development Research for Cancer Chemoprevention and Treatment” was conducted at the Moffitt Cancer Center, 29–31 May 2012. Funded by the Indo-U.S. Science and Technology Forum, a joint initiative of Governments of India and the United States of America and the Moffitt Cancer Center, the overall goals of this workshop were to enhance the knowledge (agents, molecular targets, biomarkers, approaches, target populations, regulatory standards, priorities, resources) of a multinational, multidisciplinary team of researcher's to systematically accelerate the design, to conduct a successful clinical trials to evaluate botanicals/biologics for cancer chemoprevention and treatment, and to achieve efficient translation of these discoveries into the standards for clinical practice that will ultimately impact cancer morbidity and mortality. Expert panelists were drawn from a diverse group of stakeholders, representing the leadership from the National Cancer Institute's Office of Cancer Complementary and Alternative Medicine (OCCAM), NCI Experimental Therapeutics (NExT), Food and Drug Administration, national scientific leadership from India, and a distinguished group of population, basic and clinical scientists from the two countries, including leaders in bioinformatics, social sciences, and biostatisticians. At the end of the workshop, we established four Indo-U.S. working research collaborative teams focused on identifying and prioritizing agents targeting four cancers that are of priority to both countries. Presented are some of the key proceedings and future goals discussed in the proceedings of this workshop.
Collapse
Affiliation(s)
| | - Medha Dhurandhar
- Centre for Development of Advanced Computing, Pune UniversityPune, 411007, India
| | - Bharat Aggarwal
- The University of Texas, M.D. Anderson Cancer CenterHouston, Texas, 77054
| | - Shrikant Anant
- The University of Kansas Medical CenterKansas City, Kansas, 66160
| | | | - Gary Deng
- Memorial Sloan-Kettering Cancer CenterNew York, New York, 10021
| | - Julie Djeu
- Moffitt Cancer Center, Tampa, Florida, 33612-9497
| | - Jinhui Dou
- Food and Drug AdministrationSilver Springs, Maryland, 20993
| | - Ernest Hawk
- The University of Texas, M.D. Anderson Cancer CenterHouston, Texas, 77054
| | - B. Jayaram
- India Institute of Technology-DelhiNew Delhi, 110016, India
| | - Libin Jia
- National Cancer Institute, NIHBethesda, Maryland, 20892
| | - Rajendra Joshi
- Bioinformatics Scientific and Engineering Computing, Pune UniversityPune, 411007, India
| | | | - Devarajan Karunagaran
- Department of Biotechnology, India Institute of Technology – MadrasChennai, 600036, India
| | - Omer Kucuk
- Emory Healthcare, The Emory Clinic Winship Cancer InstituteNE Atlanta, Georgia, 30322
| | - Lalit Kumar
- Institute Rotary Cancer Hospital (IRCH), All India Institute of Medical SciencesNew Delhi, 110029, India
| | | | - G. J. Samathanam
- Department and Transfer DivisionDepartment of Science and Technology, Government of IndiaIndia
| | - Fazlul Sarkar
- Barbara Ann Karmanos Cancer InstituteDetroit, Michigan, 48201
| | | | - Rana P. Singh
- School of Life Sciences, Central University of GujaratGujarat, 382030, India
| | - Anil Srivastava
- Open Health Systems Laboratory at Johns Hopkins Montgomery County CampusRockville, Maryland, 20850
| | | |
Collapse
|
11
|
Moreira GMSG, Conceição FR, McBride AJA, Pinto LDS. Structure predictions of two Bauhinia variegata lectins reveal patterns of C-terminal properties in single chain legume lectins. PLoS One 2013; 8:e81338. [PMID: 24260572 PMCID: PMC3834338 DOI: 10.1371/journal.pone.0081338] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2013] [Accepted: 10/15/2013] [Indexed: 11/18/2022] Open
Abstract
Bauhinia variegata lectins (BVL-I and BVL-II) are single chain lectins isolated from the plant Bauhinia variegata. Single chain lectins undergo post-translational processing on its N-terminal and C-terminal regions, which determines their physiological targeting, carbohydrate binding activity and pattern of quaternary association. These two lectins are isoforms, BVL-I being highly glycosylated, and thus far, it has not been possible to determine their structures. The present study used prediction and validation algorithms to elucidate the likely structures of BVL-I and -II. The program Bhageerath-H was chosen from among three different structure prediction programs due to its better overall reliability. In order to predict the C-terminal region cleavage sites, other lectins known to have this modification were analysed and three rules were created: (1) the first amino acid of the excised peptide is small or hydrophobic; (2) the cleavage occurs after an acid, polar, or hydrophobic residue, but not after a basic one; and (3) the cleavage spot is located 5-8 residues after a conserved Leu amino acid. These rules predicted that BVL-I and -II would have fifteen C-terminal residues cleaved, and this was confirmed experimentally by Edman degradation sequencing of BVL-I. Furthermore, the C-terminal analyses predicted that only BVL-II underwent α-helical folding in this region, similar to that seen in SBA and DBL. Conversely, BVL-I and -II contained four conserved regions of a GS-I association, providing evidence of a previously undescribed X4+unusual oligomerisation between the truncated BVL-I and the intact BVL-II. This is the first report on the structural analysis of lectins from Bauhinia spp. and therefore is important for the characterisation C-terminal cleavage and patterns of quaternary association of single chain lectins.
Collapse
Affiliation(s)
- Gustavo M. S. G. Moreira
- Centro de Desenvolvimento Tecnológico, Núcleo de Biotecnologia, Universidade Federal de Pelotas, Pelotas, Rio Grande do Sul, Brazil
| | - Fabricio R. Conceição
- Centro de Desenvolvimento Tecnológico, Núcleo de Biotecnologia, Universidade Federal de Pelotas, Pelotas, Rio Grande do Sul, Brazil
| | - Alan J. A. McBride
- Centro de Desenvolvimento Tecnológico, Núcleo de Biotecnologia, Universidade Federal de Pelotas, Pelotas, Rio Grande do Sul, Brazil
| | - Luciano da S. Pinto
- Centro de Desenvolvimento Tecnológico, Núcleo de Biotecnologia, Universidade Federal de Pelotas, Pelotas, Rio Grande do Sul, Brazil
| |
Collapse
|
12
|
Dhingra P, Jayaram B. A homology/ab initio hybrid algorithm for sampling near-native protein conformations. J Comput Chem 2013; 34:1925-36. [PMID: 23728619 DOI: 10.1002/jcc.23339] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2012] [Revised: 03/09/2013] [Accepted: 04/21/2013] [Indexed: 12/19/2022]
Abstract
One of the major challenges for protein tertiary structure prediction strategies is the quality of conformational sampling algorithms, which can effectively and readily search the protein fold space to generate near-native conformations. In an effort to advance the field by making the best use of available homology as well as fold recognition approaches along with ab initio folding methods, we have developed Bhageerath-H Strgen, a homology/ab initio hybrid algorithm for protein conformational sampling. The methodology is tested on the benchmark CASP9 dataset of 116 targets. In 93% of the cases, a structure with TM-score ≥ 0.5 is generated in the pool of decoys. Further, the performance of Bhageerath-H Strgen was seen to be efficient in comparison with different decoy generation methods. The algorithm is web enabled as Bhageerath-H Strgen web tool which is made freely accessible for protein decoy generation (http://www.scfbio-iitd.res.in/software/Bhageerath-HStrgen1.jsp).
Collapse
Affiliation(s)
- Priyanka Dhingra
- Department of Chemistry, Indian Institute of Technology, Hauz Khas, New Delhi, 110016, India
| | | |
Collapse
|
13
|
Spectrum of MECP2 gene mutations in a cohort of Indian patients with Rett syndrome: Report of two novel mutations. Gene 2013; 515:78-83. [DOI: 10.1016/j.gene.2012.11.024] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2012] [Revised: 10/30/2012] [Accepted: 11/27/2012] [Indexed: 11/23/2022]
|
14
|
Das DK, Mehta B, Menon SR, Raha S, Udani V. Novel mutations in cyclin-dependent kinase-like 5 (CDKL5) gene in Indian cases of Rett syndrome. Neuromolecular Med 2012; 15:218-25. [PMID: 23242510 DOI: 10.1007/s12017-012-8212-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2012] [Accepted: 12/03/2012] [Indexed: 10/27/2022]
Abstract
Rett syndrome is a severe neurodevelopmental disorder, almost exclusively affecting females and characterized by a wide spectrum of clinical manifestations. Both the classic and atypical forms of Rett syndrome are primarily due to mutations in the methyl-CpG-binding protein 2 (MECP2) gene. Mutations in the X-linked cyclin-dependent kinase-like 5 (CDKL5) gene have been identified in patients with atypical Rett syndrome, X-linked infantile spasms sharing common features of generally early-onset seizures and mental retardation. CDKL5 is known as serine/threonine protein kinase 9 (STK9) and is mapped to the Xp22 region. It has a conserved serine/threonine kinase domain within its amino terminus and a large C-terminal region. Disease-causing mutations are distributed in both the amino terminal domain and in the large C-terminal domain. We have screened the CDKL5 gene in 44 patients with atypical Rett syndrome who had tested negative for MECP2 gene mutations and have identified 6 sequence variants, out of which three were novel and three known mutations. Two of these novel mutations p.V966I and p.A1011V were missense and p.H589H a silent mutation. Other known mutations identified were p.V999M, p.Q791P and p.T734A. Sequence homology for all the mutations revealed that the two mutations (p.Q791P and p.T734A) were conserved across species. This indicated the importance of these residues in structure and function of the protein. The damaging effects of these mutations were analysed in silico using PolyPhen-2 online software. The PolyPhen-2 scores of p.Q791P and p.T734A were 0.998 and 0.48, revealing that these mutations could be deleterious and might have potential functional effect. All other mutations had a low score suggesting that they might not alter the activity of CDKL5. We have also analysed the position of the mutations in the CDKL5 protein and found that all the mutations were present in the C-terminal domain of the protein. The C-terminal domain is required for cellular localization through protein-protein interaction; any mutations in this domain might alter this function of the protein. This is the first report from India showing the mutation in CDKL5 gene in Indian cases of Rett syndrome. Our study emphasizes the role of CDKL5 mutation screening in cases of atypical Rett syndrome with congenital seizure variant.
Collapse
Affiliation(s)
- Dhanjit Kumar Das
- Genetic Research Centre, National Institute for Research in Reproductive Health (ICMR), Jahangir Merwanji Street, Parel, Mumbai, 400 012, India.
| | | | | | | | | |
Collapse
|