1
|
Nithya C, Kiran M, Nagarajaram HA. Hubs and Bottlenecks in Protein-Protein Interaction Networks. Methods Mol Biol 2024; 2719:227-248. [PMID: 37803121 DOI: 10.1007/978-1-0716-3461-5_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/08/2023]
Abstract
Protein-protein interaction networks (PPINs) represent the physical interactions among proteins in a cell. These interactions are critical in all cellular processes, including signal transduction, metabolic regulation, and gene expression. In PPINs, centrality measures are widely used to identify the most critical nodes. The two most commonly used centrality measures in networks are degree and betweenness centralities. Degree centrality is the number of connections a node has in the network, and betweenness centrality is the measure of the extent to which a node lies on the shortest paths between pairs of other nodes in the network. In PPINs, proteins with high degree and betweenness centrality are referred to as hubs and bottlenecks respectively. Hubs and bottlenecks are topologically and functionally essential proteins that play crucial roles in maintaining the network's structure and function. This article comprehensively reviews essential literature on hubs and bottlenecks, including their properties and functions.
Collapse
Affiliation(s)
- Chandramohan Nithya
- Department of Biotechnology and Bioinformatics, School of Life Sciences, University of Hyderabad, Hyderabad, Telangana, India
| | - Manjari Kiran
- Department of Systems and Computational Biology, School of Life Sciences, University of Hyderabad, Hyderabad, Telangana, India
| | | |
Collapse
|
2
|
Rastogi N, Zarin S, Alam A, Konduru GV, Manjunath P, Mishra A, Kumar S, Nagarajaram HA, Hasnain SE, Ehtesham NZ. Structural and Biophysical properties of therapeutically important proteins Rv1509 and Rv2231A of Mycobacterium tuberculosis. Int J Biol Macromol 2023; 245:125455. [PMID: 37331537 DOI: 10.1016/j.ijbiomac.2023.125455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 06/12/2023] [Accepted: 06/15/2023] [Indexed: 06/20/2023]
Abstract
Through comparative analyses using BLASTp and BLASTn of the 25 target sequences, our research identified two unique post-transcriptional modifiers, Rv1509 and Rv2231A, which serve as distinctive and characteristic proteins of M.tb - the Signature Proteins. Here, we have characterized these two signature proteins associated with pathophysiology of M.tb which may prove to be therapeutically important targets. Dynamic Light Scattering and Analytical Gel Filtration Chromatography exhibited that Rv1509 exists as a monomer while Rv2231A as a dimer in solution. Secondary structures were determined using Circular Dichroism and further validated through Fourier Transform Infrared spectroscopy. Both the proteins are capable of withstanding a wide range of temperature and pH variations. Fluorescence spectroscopy based binding affinity experiments showed that Rv1509 binds to iron and may promote organism growth by chelating iron. In the case of Rv2231A, a high affinity for its substrate RNA was observed, which is facilitated in presence of Mg2+ suggesting it might have RNAse activity, supporting the prediction through in-silico studies. This is the first study on biophysical characterization of these two therapeutically important proteins, Rv1509 and Rv2231A, providing important insights into their structure -function correlations which are crucial for development of new drugs/ early diagnostics tools targeting these proteins.
Collapse
Affiliation(s)
- Nilisha Rastogi
- Cell Signaling and Inflammation Biology Lab, ICMR-National Institute of Pathology, New Delhi 110029, India
| | - Sheeba Zarin
- Institute of Molecular Medicine, Jamia Hamdard, Hamdard Nagar, New Delhi, India; Department of Life Science, School of Basic Sciences and Research, Sharda University, Knowledge Park III, Greater Noida, Uttar Pradesh 201310, India
| | - Anwar Alam
- Department of Biotechnology, School of Engineering Sciences and Technology, Sharda University, Knowledge Park III, Greater Noida, Uttar Pradesh 201310, India
| | - Guruprasad Varma Konduru
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics, Uppal, Hyderabad, India; Graduate Studies, Manipal Academy of Higher Education, Manipal, Karnataka, India
| | - P Manjunath
- Cell Signaling and Inflammation Biology Lab, ICMR-National Institute of Pathology, New Delhi 110029, India
| | - Abhay Mishra
- Department of Biophysics, All India Institute of Medical Sciences, New Delhi 110029, India
| | - Saroj Kumar
- Department of Biophysics, All India Institute of Medical Sciences, New Delhi 110029, India
| | - Hampapathalu Adimurthy Nagarajaram
- Laboratory of Computational Biology, Department of Systems and Computational Biology, School of Life Sciences, University of Hyderabad, Prof C.R. Rao Road, Hyderabad 500007, India
| | - Seyed Ehtesham Hasnain
- Department of Life Science, School of Basic Sciences and Research, Sharda University, Knowledge Park III, Greater Noida, Uttar Pradesh 201310, India; Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology, New Delhi 110016, India.
| | - Nasreen Zafar Ehtesham
- Department of Life Science, School of Basic Sciences and Research, Sharda University, Knowledge Park III, Greater Noida, Uttar Pradesh 201310, India.
| |
Collapse
|
3
|
Nithya C, Kiran M, Nagarajaram HA. Dissection of hubs and bottlenecks in a protein-protein interaction network. Comput Biol Chem 2023; 102:107802. [PMID: 36603332 DOI: 10.1016/j.compbiolchem.2022.107802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 11/20/2022] [Accepted: 12/08/2022] [Indexed: 12/23/2022]
Abstract
Analysis of degree centrality in conjunction with betweenness centrality of proteins in a human protein-protein interaction network revealed three categories of centrally important proteins: a) proteins with high degree and betweenness (hub-bottlenecks denoted as MX), b) proteins with high betweenness and low degree (non-hub-bottlenecks/pure bottlenecks denoted as PB) and c) proteins with high degree and low betweenness (hub-non-bottlenecks/pure hubs denoted as PH). When subjected to a detailed statistical analysis of their molecular-level properties, the proteins belonging to each of these categories were found to be associated with distinct canonical molecular properties, i.e., "molecular markers". The MX proteins are a) conformationally versatile, mainly comprising of essential proteins, b) the targets for interactions by the proteins of viral and bacterial pathogens, c) evolutionally constrained, involved in multiple pathways, enriched with disease genes and d) involved in the functions such as protein stabilization, phosphorylation, and mRNA slicing processes. PB proteins are a) enriched with extracellular and cancer-related proteins, b) enriched with the approved drug targets and c) involved in cell-cell signaling processes. Finally, PH are a) structurally versatile, b) enriched with essential proteins primarily involved in housekeeping processes (transcription and replication). The fact that the proteins belonging to these three categories form three distinct sets in terms of their molecular properties reveals the existence of trichotomy among hubs and bottlenecks, and this knowledge is of paramount importance while prioritizing protein targets for further studies such as drug design and disease association studies based on their network centrality values.
Collapse
Affiliation(s)
- Chandramohan Nithya
- Department of Biotechnology and Bioinformatics, School of Life Sciences, University of Hyderabad, Hyderabad, Telangana 500046, India
| | - Manjari Kiran
- Department of Systems and Computational Biology, School of Life Sciences, University of Hyderabad, Hyderabad, Telangana 500046, India
| | | |
Collapse
|
4
|
Konduru GV, Nagarajaram HA. Human TMPRSS2 non-catalytic ectodomain and SARS-CoV-2 S2' subunit interaction mediated SARS-CoV-2 endocytosis: a model proposal with virtual screening for potential drug molecules to inhibit this interaction. J Biomol Struct Dyn 2022:1-12. [PMID: 35912718 DOI: 10.1080/07391102.2022.2105956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
Abstract
This study proposes a novel model for integration of SARS-CoV-2 into host cell via endocytosis as a possible alternative to the prevailing direct fusion model. It is known that the SARS-CoV-2 spike protein undergoes proteolytic cleavage at S1-S2 cleavage site and the cleaved S2 domain is primed by the activated serine protease domain (SPD) of humanTMPRSS2 to become S2'. The activated SPD of TMPRSS2 is formed after it is cleaved by autocatalysis from the membrane bound non-catalytic ectodomain (hNECD) comprising of LDLRA CLASS-I repeat and a SRCR domain. It is known that the SRCR domains as well as LDLRA repeat harboring proteins mediate endocytosis of viruses and certain ligands. Based on this, we put forward a hypothesis that the exposed hNECD binds to the S2' as both are at an interaction proximity soon after S2 is processed by the SPD and this interaction may lead to the endocytosis of virus. Based on this hypothesis we have modelled the hNECD structure, followed by docking studies with the known 3D structure of S2'. The interaction interface of hNECD with S2' was further used for virtual screening of FDA-approved drug molecules and Indian medicinal plant-based compounds. We also mapped the known mutations of concern and mutations of interest on interaction interface of S2' and found that none of the known mutations map onto the interaction interface. This indicates that targeting the interaction between the hNECD of TMPRSS2 and S2' may serve as an attractive therapeutic target.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Guruprasad Varma Konduru
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics, Uppal, Hyderabad, India.,Graduate Studies, Manipal Academy of Higher Education, Manipal, Karnataka, India
| | | |
Collapse
|
5
|
Chaudhary AK, Gholse A, Nagarajaram HA, Dalal AB, Gupta N, Dutta AK, Danda S, Gupta R, Sankar HV, Bhavani GS, Girisha KM, Phadke SR, Ranganath P, Bashyam MD. Ectodysplasin pathogenic variants affecting the furin-cleavage site and unusual clinical features define X-linked hypohidrotic ectodermal dysplasia in India. Am J Med Genet A 2021; 188:788-805. [PMID: 34863015 DOI: 10.1002/ajmg.a.62579] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Revised: 09/20/2021] [Accepted: 11/02/2021] [Indexed: 11/09/2022]
Abstract
Hypohidrotic ectodermal dysplasia (HED) is a rare genetic disorder caused by mutational inactivation of a developmental pathway responsible for generation of tissues of ectodermal origin. The X-linked form accounts for the majority of HED cases and is caused by Ectodysplasin (EDA) pathogenic variants. We performed a combined analysis of 29 X-linked hypohidrotic ectodermal dysplasia (XLHED) families (including 12 from our previous studies). In addition to the classical triad of symptoms including loss (or reduction) of ectodermal structures, such as hair, teeth, and sweat glands, we detected additional HED-related clinical features including facial dysmorphism and hyperpigmentation in several patients. Interestingly, global developmental delay was identified as an unusual clinical symptom in many patients. More importantly, we identified 22 causal pathogenic variants that included 15 missense, four small in-dels, and one nonsense, splice site, and large deletion each. Interestingly, we detected 12 unique (India-specific) pathogenic variants. Of the 29 XLHED families analyzed, 11 (38%) harbored pathogenic variant localized to the furin cleavage site. A comparison with HGMD revealed significant differences in the frequency of missense pathogenic variants; involvement of specific exons and/or protein domains and transition/transversion ratios. A significantly higher proportion of missense pathogenic variants (33%) localized to the EDA furin cleavage when compared to HGMD (7%), of which p.R155C, p.R156C, and p.R156H were detected in three families each. Therefore, the first comprehensive analysis of XLHED from India has revealed several unique features including unusual clinical symptoms and high frequency of furin cleavage site pathogenic variants.
Collapse
Affiliation(s)
- Ajay Kumar Chaudhary
- Laboratory of Molecular Oncology, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India
| | - Aishwarya Gholse
- Laboratory of Computational Biology, Department of Systems and Computational Biology, School of Life Sciences, University of Hyderabad, Hyderabad, India
| | - Hampapathalu Adimurthy Nagarajaram
- Laboratory of Computational Biology, Department of Systems and Computational Biology, School of Life Sciences, University of Hyderabad, Hyderabad, India
| | - Ashwin Bhikaji Dalal
- Diagnostics Division, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India
| | - Neerja Gupta
- Division of Genetics, Department of Pediatrics, All India Institute of Medical Sciences, New Delhi, India
| | - Atanu Kumar Dutta
- Department of Clinical Genetics, Christian Medical College and Hospital, Vellore, India
| | - Sumita Danda
- Department of Clinical Genetics, Christian Medical College and Hospital, Vellore, India
| | - Rekha Gupta
- Department of Medical Genetics, Mahatma Gandhi Medical College and Hospital, Jaipur, India
| | - Hariharan V Sankar
- Department of Pediatrics, SAT Hospital, Medical College, Trivandrum, India
| | - Gandham SriLakshmi Bhavani
- Department of Medical Genetics, Kasturba Medical College, Manipal Academy of Higher Education, Manipal, India
| | - Katta M Girisha
- Department of Medical Genetics, Kasturba Medical College, Manipal Academy of Higher Education, Manipal, India
| | - Shubha Rao Phadke
- Department of Medical Genetics, Sanjay Gandhi Postgraduate Institute of Medical Sciences, Lucknow, India
| | - Prajnya Ranganath
- Diagnostics Division, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India.,Department of Medical Genetics, Nizam's Institute of Medical Sciences, Hyderabad, India
| | - Murali Dharan Bashyam
- Laboratory of Molecular Oncology, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India
| |
Collapse
|
6
|
Trivedi R, Nagarajaram HA. Substitution scoring matrices for proteins - An overview. Protein Sci 2020; 29:2150-2163. [PMID: 32954566 DOI: 10.1002/pro.3954] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Revised: 09/17/2020] [Accepted: 09/18/2020] [Indexed: 01/17/2023]
Abstract
Sequence analysis is the primary and simplest approach to discover structural, functional and evolutionary details of related proteins. All the alignment based approaches of sequence analysis make use of amino acid substitution matrices, and the accuracy of the results largely depends on the type of scoring matrices used to perform alignment tasks. An amino acid substitution matrix is a 20 × 20 matrix in which the individual elements encapsulate the rates at which each of the 20 amino acid residues in proteins are substituted by other amino acid residues over time. In contrast to most globular/ordered proteins whose amino acids composition is considered as standard, there are several classes of proteins (e.g., transmembrane proteins) in which certain types of amino acid (e.g., hydrophobic residues) are enriched. These compositional differences among various classes of proteins are manifested in their underlying residue substitution frequencies. Therefore, each of the compositionally distinct class of proteins or protein segments should be studied using specific scoring matrices that reflect their distinct residue substitution pattern. In this review, we describe the development and application of various substitution scoring matrices peculiar to proteins with standard and biased compositions. Along with most commonly used standard matrices (PAM, BLOSUM, MD and VTML) that act as default parameters in various homologs search and alignment tools, different substitution scoring matrices specific to compositionally distinct class of proteins are discussed in detail.
Collapse
Affiliation(s)
- Rakesh Trivedi
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics, Uppal, Hyderabad, Telangana, India.,Graduate School, Manipal Academy of Higher Education, Manipal, Karnataka, India
| | - Hampapathalu Adimurthy Nagarajaram
- Laboratory of Computational Biology, Department of Systems and Computational Biology, School of Life Sciences, University of Hyderabad, Hyderabad, Telangana, India.,Centre for Modelling, Simulation and Design, University of Hyderabad, Hyderabad, Telangana, India
| |
Collapse
|
7
|
Taware R, Taunk K, Kumar TVS, Pereira JAM, Câmara JS, Nagarajaram HA, Kundu GC, Rapole S. Extracellular volatilomic alterations induced by hypoxia in breast cancer cells. Metabolomics 2020; 16:21. [PMID: 31980945 DOI: 10.1007/s11306-020-1635-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Accepted: 01/03/2020] [Indexed: 12/24/2022]
Abstract
INTRODUCTION The metabolic shift induced by hypoxia in cancer cells has not been explored at volatilomic level so far. The volatile organic metabolites (VOMs) constitute an important part of the metabolome and their investigation could provide us crucial aspects of hypoxia driven metabolic reconfiguration in cancer cells. OBJECTIVE To identify the altered volatilomic response induced by hypoxia in metastatic/aggressive breast cancer (BC) cells. METHODS BC cells were cultured under normoxic and hypoxic conditions and VOMs were extracted using HS-SPME approach and profiled by standard GC-MS system. Univariate and multivariate statistical approaches (p < 0.05, Log2 FC ≥ 0.58/≤ - 0.58, PC1 > 0.13/< - 0.13) were applied to select the VOMs differentially altered after hypoxic treatment. Metabolic pathway analysis was also carried out in order to identify altered metabolic pathways induced by the hypoxia in the selected BC cells. RESULTS Overall, 20 VOMs were found to be significantly altered (p < 0.05, PC1 > 0.13/< - 0.13) upon hypoxic exposure to BC cells. Further, cell line specific volatilomic alterations were extracted by comparative metabolic analysis of aggressive (MDA-MB-231) vs. non-aggressive (MCF-7) cells incubated under hypoxia and normoxia. In this case, 15 and 12 VOMs each were found to be significantly altered in aggressive cells when exposed to hypoxic and normoxic condition respectively. Out of these, 9 VOMs were found to be uniquely associated with hypoxia, 6 were specific to normoxia and 6 were found common to both the conditions. Formic acid was identified as the most prominent molecule with higher abundance levels in aggressive as compared to non-aggressive cells in both conditions. Furthermore, metabolic pathway analyses revealed that fatty acid biosynthesis and nicotinate and nicotinamide metabolism were significantly altered in aggressive as compared to non-aggressive cells in normoxia and hypoxia respectively. CONCLUSIONS Higher formate overflow was observed in aggressive cells compared to non-aggressive cells incubated under both the conditions, reinforcing its correlation with aggressive and invasive cancer type. Moreover, under hypoxia, aggressive cells preferred to be bioenergetically more efficient whereas, under normoxia, fatty acid biosynthesis was favoured when compared to non-aggressive cells.
Collapse
Affiliation(s)
- Ravindra Taware
- Proteomics Lab, National Centre for Cell Science, Ganeshkhind, Pune, 411007, India
| | - Khushman Taunk
- Proteomics Lab, National Centre for Cell Science, Ganeshkhind, Pune, 411007, India
| | - Totakura V S Kumar
- Laboratory of Tumor Biology, Angiogenesis and Nanomedicine Research, National Centre for Cell Science (NCCS), Ganeshkhind, Pune, 411007, India
| | - Jorge A M Pereira
- CQM - Centro de Química da Madeira, Universidade da Madeira, Campus Universitário da Penteada, 9020105, Funchal, Portugal
| | - José S Câmara
- CQM - Centro de Química da Madeira, Universidade da Madeira, Campus Universitário da Penteada, 9020105, Funchal, Portugal
- Faculdade de Ciências Exatas E da Engenharia, Universidade da Madeira, Campus Universitário da Penteada, 9020-105, Funchal, Portugal
| | - H A Nagarajaram
- Department of Systems and Computational Biology, School of Life Sciences, University of Hyderabad, Hyderabad, 500046, India
| | - Gopal C Kundu
- Laboratory of Tumor Biology, Angiogenesis and Nanomedicine Research, National Centre for Cell Science (NCCS), Ganeshkhind, Pune, 411007, India
- School of Biotechnology, KIIT Deemed to be University, Institute of Eminence, Bhubaneswar, 751024, India
| | - Srikanth Rapole
- Proteomics Lab, National Centre for Cell Science, Ganeshkhind, Pune, 411007, India.
| |
Collapse
|
8
|
Trivedi R, Nagarajaram HA. Amino acid substitution scoring matrices specific to intrinsically disordered regions in proteins. Sci Rep 2019; 9:16380. [PMID: 31704957 PMCID: PMC6841959 DOI: 10.1038/s41598-019-52532-8] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Accepted: 10/15/2019] [Indexed: 01/09/2023] Open
Abstract
An amino acid substitution scoring matrix encapsulates the rates at which various amino acid residues in proteins are substituted by other amino acid residues, over time. Database search methods make use of substitution scoring matrices to identify sequences with homologous relationships. However, widely used substitution scoring matrices, such as BLOSUM series, have been developed using aligned blocks that are mostly devoid of disordered regions in proteins. Hence, these substitution-scoring matrices are mostly inappropriate for homology searches involving proteins enriched with disordered regions as the disordered regions have distinct amino acid compositional bias, and therefore expected to have undergone amino acid substitutions that are distinct from those in the ordered regions. We, therefore, developed a novel series of substitution scoring matrices referred to as EDSSMat by exclusively considering the substitution frequencies of amino acids in the disordered regions of the eukaryotic proteins. The newly developed matrices were tested for their ability to detect homologs of proteins enriched with disordered regions by means of SSEARCH tool. The results unequivocally demonstrate that EDSSMat matrices detect more number of homologs than the widely used BLOSUM, PAM and other standard matrices, indicating their utility value for homology searches of intrinsically disordered proteins.
Collapse
Affiliation(s)
- Rakesh Trivedi
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics, Uppal, Hyderabad, Telangana, 500039, India.,Graduate School, Manipal Academy of Higher Education, Manipal, Karnataka, 576104, India
| | - Hampapathalu Adimurthy Nagarajaram
- Department of Systems and Computational Biology, School of Life Sciences, University of Hyderabad, Hyderabad, Telangana, 500 046, India. .,Centre for Modelling, Simulation and Design, University of Hyderabad, Hyderabad, Telangana, 500 046, India.
| |
Collapse
|
9
|
Uttarilli A, Ranganath P, Jain SJMN, Prasad CK, Sinha A, Verma IC, Phadke SR, Puri RD, Danda S, Muranjan MN, Jevalikar G, Nagarajaram HA, Dalal AB. Novel mutations of the arylsulphatase B (ARSB) gene in Indian patients with mucopolysaccharidosis type VI. Indian J Med Res 2016; 142:414-25. [PMID: 26609033 PMCID: PMC4683826 DOI: 10.4103/0971-5916.169201] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Background & objectives: Mucopolysaccharidosis type VI (MPS VI) is a rare, autosomal recessive lysosomal storage disorder caused by deficient enzymatic activity of N-acetyl galactosamine-4-sulphatase resulting from mutations in the arylsulphatase B (ARSB) gene. The ARSB gene is located on chromosome 5q11-q13 and is composed of eight exons. More than hundred ARSB mutations have been reported so far, but the mutation spectrum of MPS VI in India is still unknown. Hence, the aim of the present study was to identify the mutational spectrum in patients with MPS VI in India and to study the genotype-phenotype association and functional outcomes of these mutations. Methods: Molecular characterization of the ARSB gene by Sanger sequencing was done for 15 patients (aged 15 months to 11 yr) who were enzymatically confirmed to have MPS VI. Age of onset, clinical progression and enzyme activity levels in each patient were studied to look for genotype-phenotype association. Haplotype analysis performed for unrelated patients with the recurring mutation W450C, was suggestive of a founder effect. Sequence and structural analyses of the ARSB protein using standard software were carried out to determine the impact of detected mutations on the function of the ARSB protein. Results: A total of 12 mutations were identified, of which nine were novel mutations namely, p.D53N, p.L98R, p.Y103SfsX9, p.W353X, p.H393R, p.F166fsX18, p.I220fsX5, p.W450L, and p.W450C, and three were known mutations (p.D54N, p.A237D and p.S320R). The nine novel sequence variants were confirmed not to be polymorphic variants by performing sequencing in 50 unaffected individuals from the same ethnic population. Interpretation & conclusions: Nine novel mutations were identified in MPS VI cases from India in the present study. The study also provides some insights into the genotype-phenotype association in MPS VI.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | - Ashwin B Dalal
- Diagnostics Division, Centre for DNA Fingerprinting & Diagnostics, Hyderabad, India
| |
Collapse
|
10
|
Chaudhary AK, Mohapatra R, Nagarajaram HA, Ranganath P, Dalal A, Dutta A, Danda S, Girisha KM, Bashyam MD. The novel EDAR p.L397H missense mutation causes autosomal dominant hypohidrotic ectodermal dysplasia. J Eur Acad Dermatol Venereol 2016; 31:e17-e20. [PMID: 27168349 DOI: 10.1111/jdv.13587] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Affiliation(s)
| | - R Mohapatra
- Laboratory of Computational biology, Hyderabad, India
| | | | - P Ranganath
- Diagnostics Division, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India.,Nizam's Institute of Medical Sciences, Hyderabad, India
| | - A Dalal
- Diagnostics Division, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India
| | - A Dutta
- Christian Medical College, Vellore, India
| | - S Danda
- Christian Medical College, Vellore, India
| | - K M Girisha
- Department of Medical Genetics, Kasturba Medical College, Manipal University, Manipal, India
| | - M D Bashyam
- Laboratory of Molecular Oncology, Hyderabad, India
| |
Collapse
|
11
|
Radha Rama Devi A, Ramesh VA, Nagarajaram HA, Satish SPS, Jayanthi U, Lingappa L. Spectrum of mutations in Glutaryl-CoA dehydrogenase gene in glutaric aciduria type I--Study from South India. Brain Dev 2016; 38:54-60. [PMID: 26071121 DOI: 10.1016/j.braindev.2015.05.013] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Revised: 05/25/2015] [Accepted: 05/26/2015] [Indexed: 11/19/2022]
Abstract
BACKGROUND Glutaric aciduria type I is an autosomal recessive organic acid disorder. The primary defect is the deficiency of Glutaryl-CoA dehydrogenase (EC number 1.3.99.7) enzyme that is involved in the catabolic pathways of the amino acids l-lysine, l-hydroxylysine, and l-tryptophan. It is a treatable neuro-metabolic disorder. Early diagnosis and treatment helps in preventing brain damage. METHODS The Glutaryl-CoA dehydrogenase gene (GCDH) gene was sequenced to identify disease causing mutations by direct sequencing of all the exons in twelve patients who were biochemically confirmed with GA I. RESULTS We identified eleven mutations of which nine are homozygous mutations, one heterozygous and two synonymous mutations. Among the eleven mutations, four mutations p.Q162R, p.P286S, p.W225X in two families and p.V410M are novel. A milder clinical presentation is observed in those families who are either heterozygous or with a benign synonymous SNP. Multiple sequence alignment (MSA) of GCDH with its homologues revealed that the observed novel mutations are not tolerated by protein structure and function. CONCLUSIONS The present study indicates genetic heterogeneity in GCDH gene mutations among South Indian population. Genetic analysis is useful in prenatal diagnosis and prevention. Mutation analysis is a useful tool in the absence of non-availability of enzyme assay in GA I.
Collapse
Affiliation(s)
- A Radha Rama Devi
- Genetic Metabolic Unit, Rainbow Children Hospital, India; Sandor Life Sciences, India.
| | - Vakkalagadda A Ramesh
- Laboratory of Computational Biology, CDFD, Gruhakalpa, Nampally, Hyderabad, India; Graduate Studies, Manipal University, India
| | - H A Nagarajaram
- Laboratory of Computational Biology, CDFD, Gruhakalpa, Nampally, Hyderabad, India
| | | | | | | |
Collapse
|
12
|
Bidchol AM, Dalal A, Trivedi R, Shukla A, Nampoothiri S, Sankar VH, Danda S, Gupta N, Kabra M, Hebbar SA, Bhat RY, Matta D, Ekbote AV, Puri RD, Phadke SR, Gowrishankar K, Aggarwal S, Ranganath P, Sharda S, Kamate M, Datar CA, Bhat K, Kamath N, Shah H, Krishna S, Gopinath PM, Verma IC, Nagarajaram HA, Satyamoorthy K, Girisha KM. Recurrent and novel GLB1 mutations in India. Gene 2015; 567:173-81. [PMID: 25936995 DOI: 10.1016/j.gene.2015.04.078] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2015] [Revised: 04/27/2015] [Accepted: 04/28/2015] [Indexed: 02/08/2023]
Abstract
GM1 gangliosidosis is a lysosomal storage disorder caused by mutations in the GLB1 gene, leading to the deficiency of the enzyme β-d-galactosidase. In this study, we report molecular findings in 50 Asian Indian families with GM1 gangliosidosis. We sequenced all the exons and flanking intronic sequences of GLB1 gene. We identified 33 different mutations (20 novel and 13 previously reported). The novel mutations include 12 missense (p.M1?, p.E129Q, p.G134R, p.L236P, p.G262E, p.L297F, p.Y331C, p.G414V, p.K493N, p.L514P, p.P597L, p.T600I), four splicing (c.246-2A>G, c.397-2A>G, c.552+1G>T, c.956-2A>G), three indels (p.R22Qfs*8, p.L24Cfs*47, p.I489Qfs*4) and one nonsense mutation (p.Q452*). Most common mutations identified in this study were c.75+2InsT (14%) and p.L337P (10%). Known mutations accounted for 67% of allele frequency in our cohort of patients, suggesting that these mutations in GLB1 are recurrent across different populations. Twenty three mutations were localized in the TIM barrel domain, β-domain 1 and β-domain 2. In silico sequence and structure analysis of GLB1 reveal that all the novel mutations affect the function and structure of the protein. We hereby report on the largest series of patients with GM1 gangliosidosis and the first from India.
Collapse
Affiliation(s)
- Abdul Mueed Bidchol
- Department of Medical Genetics, Kasturba Medical College, Manipal University, Manipal, Karnataka, India
| | - Ashwin Dalal
- Diagnostics Division, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, Telangana, India
| | - Rakesh Trivedi
- Laboratory of Computational Biology & Bioinformatics Facility, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, Telangana, India; Graduate Studies, Manipal University, Manipal, Karnataka, India
| | - Anju Shukla
- Department of Medical Genetics, Kasturba Medical College, Manipal University, Manipal, Karnataka, India
| | - Sheela Nampoothiri
- Department of Pediatric Genetics, Amrita Institute of Medical Sciences and Research Center, Kochi, AIMS Ponekkara, Kerala, India
| | - V H Sankar
- Genetic Clinic, Department of Pediatrics, SAT Hospital, Government Medical College, Thiruvananthapuram, Kerala, India
| | - Sumita Danda
- Department of Clinical Genetics, Christian Medical College and Hospital, Vellore, Tamil Nadu, India
| | - Neerja Gupta
- Genetics Unit, Department of Pediatrics, All India Institute of Medical Sciences, New Delhi, India
| | - Madhulika Kabra
- Genetics Unit, Department of Pediatrics, All India Institute of Medical Sciences, New Delhi, India
| | - Shrikiran A Hebbar
- Department of Pediatrics, Kasturba Medical College, Manipal University, Manipal, Karnataka, India
| | - Ramesh Y Bhat
- Department of Pediatrics, Kasturba Medical College, Manipal University, Manipal, Karnataka, India
| | - Divya Matta
- Diagnostics Division, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, Telangana, India
| | - Alka V Ekbote
- Department of Clinical Genetics, Christian Medical College and Hospital, Vellore, Tamil Nadu, India
| | - Ratna Dua Puri
- Center of Medical Genetics, Sir Ganga Ram Hospital, New Delhi, India
| | - Shubha R Phadke
- Department of Medical Genetics, Sanjay Gandhi Postgraduate Institute of Medical Sciences, Lucknow, Uttar Pradesh, India
| | - Kalpana Gowrishankar
- Department of Medical Genetics, Childs Trust Medical Research Foundation, Kanchi Kamakoti Childs Trust Hospital, Chennai, Tamil Nadu, India
| | - Shagun Aggarwal
- Diagnostics Division, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, Telangana, India; Department of Medical Genetics, Nizam's Institute of Medical Sciences, Hyderabad, Telangana, India
| | - Prajnya Ranganath
- Diagnostics Division, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, Telangana, India; Department of Medical Genetics, Nizam's Institute of Medical Sciences, Hyderabad, Telangana, India
| | - Sheetal Sharda
- Post Graduate Institute of Medical Education and Research, Chandigarh, Punjab, India
| | - Mahesh Kamate
- Pediatric Neurology, KLE University's J N Medical College, Belgaum, Karnataka, India
| | - Chaitanya A Datar
- Rare Genetic Disorder Clinic, Sahyadri Hospital, Pune, Maharashtra, India
| | - Kamalakshi Bhat
- Department of Pediatrics, Kasturba Medical College, Mangalore, Manipal University, Karnataka, India
| | - Nutan Kamath
- Department of Pediatrics, Kasturba Medical College, Mangalore, Manipal University, Karnataka, India
| | - Hitesh Shah
- Pediatric Orthopedics Services, Department of Orthopedics, Kasturba Medical College, Manipal, Manipal University, Karnataka, India
| | - Shuba Krishna
- Strand Life Sciences Pvt Ltd, Bengaluru, Karnataka, India
| | - Puthiya Mundyat Gopinath
- Division of Biotechnology, School of Life Sciences, Manipal University, Manipal, Karnataka, India
| | - Ishwar C Verma
- Center of Medical Genetics, Sir Ganga Ram Hospital, New Delhi, India
| | - H A Nagarajaram
- Laboratory of Computational Biology & Bioinformatics Facility, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, Telangana, India
| | - Kapaettu Satyamoorthy
- Division of Biotechnology, School of Life Sciences, Manipal University, Manipal, Karnataka, India
| | - Katta Mohan Girisha
- Department of Medical Genetics, Kasturba Medical College, Manipal University, Manipal, Karnataka, India.
| |
Collapse
|
13
|
Halehalli RR, Nagarajaram HA. Molecular principles of human virus protein-protein interactions. ACTA ACUST UNITED AC 2014; 31:1025-33. [PMID: 25417202 DOI: 10.1093/bioinformatics/btu763] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2014] [Accepted: 11/12/2014] [Indexed: 01/01/2023]
Abstract
MOTIVATION Viruses, from the human protein-protein interaction network perspective, target hubs, bottlenecks and interconnected nodes enriched in certain biological pathways. However, not much is known about the general characteristic features of the human proteins interacting with viral proteins (referred to as hVIPs) as well as the motifs and domains utilized by human-virus protein-protein interactions (referred to as Hu-Vir PPIs). RESULTS Our study has revealed that hVIPs are mostly disordered proteins, whereas viral proteins are mostly ordered proteins. Protein disorder in viral proteins and hVIPs varies from one subcellular location to another. In any given viral-human PPI pair, at least one of the two proteins is structurally disordered suggesting that disorder associated conformational flexibility as one of the characteristic features of virus-host interaction. Further analyses reveal that hVIPs are (i) slowly evolving proteins, (ii) associated with high centrality scores in human-PPI network, (iii) involved in multiple pathways, (iv) enriched in eukaryotic linear motifs (ELMs) associated with protein modification, degradation and regulatory processes, (v) associated with high number of splice variants and (vi) expressed abundantly across multiple tissues. These aforementioned findings suggest that conformational flexibility, spatial diversity, abundance and slow evolution are the characteristic features of the human proteins targeted by viral proteins. Hu-Vir PPIs are mostly mediated via domain-motif interactions (DMIs) where viral proteins employ motifs that mimic host ELMs to bind to domains in human proteins. DMIs are shared among viruses belonging to different families indicating a possible convergent evolution of these motifs to help viruses to adopt common strategies to subvert host cellular pathways. AVAILABILITY AND IMPLEMENTATION Hu-Vir PPI data, DDI and DMI data for human-virus PPI can be downloaded from http://cdfd.org.in/labpages/computational_biology_datasets.html. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Rachita Ramachandra Halehalli
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, Telangana, 500001, India and Graduate School, Manipal University, Manipal, 576104, Karnataka, India Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, Telangana, 500001, India and Graduate School, Manipal University, Manipal, 576104, Karnataka, India
| | - Hampapathalu Adimurthy Nagarajaram
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, Telangana, 500001, India and Graduate School, Manipal University, Manipal, 576104, Karnataka, India
| |
Collapse
|
14
|
Bidchol AM, Dalal A, Shah H, S S, Nampoothiri S, Kabra M, Gupta N, Danda S, Gowrishankar K, Phadke SR, Kapoor S, Kamate M, Verma IC, Puri RD, Sankar VH, Devi ARR, Patil SJ, Ranganath P, Jain SJMN, Agarwal M, Singh A, Mishra P, Tamhankar PM, Gopinath PM, Nagarajaram HA, Satyamoorthy K, Girisha KM. GALNS mutations in Indian patients with mucopolysaccharidosis IVA. Am J Med Genet A 2014; 164A:2793-801. [PMID: 25252036 DOI: 10.1002/ajmg.a.36735] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2013] [Accepted: 06/27/2014] [Indexed: 01/20/2023]
Abstract
Mucopolysaccharidosis IV A (Morquio syndrome A, MPS IVA) is a lysosomal storage disease caused by the deficiency of N-acetylgalactosamine-6-sulfatase (GALNS). The mutation spectrum in this condition is yet to be determined in Indians. We aimed to analyze the mutations in the GALNS gene in Asian Indians with MPS IVA. All the exons and the adjacent intronic regions of the gene were amplified and sequenced in sixty-eight unrelated Indian families. We identified 136 mutant alleles comprising of 40 different mutations. We report twenty-two novel mutations that comprise of seventeen missense (p.Asn32Thr, p.Leu36Arg, p.Pro52Leu, p.Pro77Ser, p.Cys79Arg, p.His142Pro, p.Tyr191Asp, p.Asn204Thr, p.Gly188Ser, p.Phe216Ser, p.Trp230Cys, p.Ala291Ser, p.Gly317Arg, p.His329Pro, p.Arg386Ser, p.Glu450Gly, p.Cys501Ser), three splice-site variants (c.120+1G>C, c.1003-3C>G, c.1139+1G>A), one nonsense mutation (p.Gln414*) and one frameshift mutation (p.Pro420Leufs*440). Eighteen mutations have been reported earlier. Among these p.Ser287Leu (8.82%), p.Phe216Ser (7.35%), p.Asn32Thr (6.61%) and p.Ala291Ser (5.88%) were the most frequent mutations in Indian patients but were rare in the mutational profiles reported in other populations. These results indicate that the Indian patients may have a distinct mutation spectrum compared to those of other populations. Mutant alleles in exon 1, 7 and 8 accounted for 44.8% of the mutations, and sequencing of these exons initially may be a cost-effective approach in Asian Indian patients. This is the largest study on molecular analysis of patients with MPS IVA reported in the literature, and the first report from India.
Collapse
Affiliation(s)
- Abdul Mueed Bidchol
- Department of Medical Genetics, Kasturba Medical College, Manipal University, Manipal, Karnataka, India
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Sinha A, Nagarajaram HA. Nodes occupying central positions in human tissue specific PPI networks are enriched with many splice variants. Proteomics 2014; 14:2242-8. [PMID: 25092398 DOI: 10.1002/pmic.201400249] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2014] [Revised: 07/04/2014] [Accepted: 08/01/2014] [Indexed: 12/22/2022]
Abstract
The functional repertoire of genes in the eukaryotic organisms is enhanced by the phenomenon of alternative splicing. Hence, a node in a tissue specific protein-protein interaction (TS PPIN) network can be thought of as an ensemble of various spliced protein products of the corresponding gene expressed in that tissue. Here we demonstrate that the nodes that occupy topologically central positions characterized by high degree, betweenness, closeness, and eigenvector centrality values in TS PPINs of Homo sapiens are associated with high number of splice variants. We also show that the high "centrality" of these genes/nodes could in part be explained by the presence of a large number of promiscuous domains.
Collapse
Affiliation(s)
- Anupam Sinha
- Laboratory of Computational Biology, Centre for DNA Fingerprinting & Diagnostics (CDFD), Hyderabad, Telangana, India
| | | |
Collapse
|
16
|
Patil SJ, Rai GK, Bhat V, Ramesh VA, Nagarajaram HA, Matalia J, Phadke SR. Distal arthrogryposis type 5D with a novel ECEL1 gene mutation. Am J Med Genet A 2014; 164A:2857-62. [PMID: 25099528 DOI: 10.1002/ajmg.a.36702] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2013] [Accepted: 06/27/2014] [Indexed: 11/07/2022]
Abstract
Distal arthrogryposis syndromes (DAs) show wide clinical variability and overlapping clinical findings with the other DAs classified by Bamshad et al. [1996]. Most of the DAs are inherited as autosomal dominant disorders. DA type 5D is a subtype of DA type 5 inherited as autosomal recessive disorder, clinically characterized by congenital distal joint contractures, knee extension contractures, congenital hip dislocation, club foot, ptosis and other eye findings, furrowed tongue, and scoliosis. Here, we report on a family with clinical features of DA type 5D with novel mutations in the ECEL1 gene.
Collapse
Affiliation(s)
- Siddaramappa J Patil
- Centre for Molecular and Metabolic Diagnostics & Research, Narayana Hrudayalaya Hospitals, Bangalore, India
| | | | | | | | | | | | | |
Collapse
|
17
|
Abstract
In this study, we have constructed tissue-specific protein-protein interaction networks for 70 human tissues and have identified three types of hubs based on their expression breadths: (a) tissue-specific hubs (TSHs) (proteins that are expressed in ≤ 10 tissues and also form hubs in ≤ 10 tissues), (b) tissue-preferred hubs (TPHs) (proteins expressed in ≥ 60 tissues but are highly connected in ≤ 10 tissues), and (c) housekeeping hubs (HKHs) (proteins that are expressed in ≥ 60 tissues and also form hubs in ≥ 60 tissues). Comparative analyses revealed significant differences between TSHs and HKHs and also revealed that TPHs behave more like HKHs. TSHs are lengthier, more disordered, and also quickly evolving proteins as compared with HKHs. Despite having a similar number of binding surfaces and interacting domains, TSHs are associated with a lower degree of centrality as compared with HKHs, suggesting that TSHs are "unsaturated" with regard to their binding capability and are perhaps evolving with regard to their interactions. TSHs are less abundantly expressed as compared with HKHs and are enriched with PEST motifs, indicating their tight regulation. All of these properties of TSHs and HKHs correlate with their distinct functional roles; TSHs are involved in tissue-specific functional roles, viz., secretors, receptors, and signaling proteins, whereas HKHs are involved in core-cellular functions such as transcription, translation, and so on. Our study, therefore, brings forth a clear and distinct classification of hubs simply based on their expression breadth and further assumes significance in the light of the highly debated dichotomy of date and party hubs, which is based on the coexpression pattern of hubs with their partners.
Collapse
Affiliation(s)
- Manjari Kiran
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics , Bldg.7, Gruhakalpa, Nampally, Hyderabad 500 001, Andhra Pradesh, India
| | | |
Collapse
|
18
|
Sinha A, Nagarajaram HA. Effect of Alternative Splicing on the Degree Centrality of Nodes in Protein–Protein Interaction Networks of Homo sapiens. J Proteome Res 2013; 12:1980-8. [DOI: 10.1021/pr4000812] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- Anupam Sinha
- Laboratory of Computational Biology, Centre for DNA Fingerprinting & Diagnostics (CDFD), Gruhakalpa, Nampally, Hyderabad 500001, Andhra Pradesh, India
| | - Hampapathalu Adimurthy Nagarajaram
- Laboratory of Computational Biology, Centre for DNA Fingerprinting & Diagnostics (CDFD), Gruhakalpa, Nampally, Hyderabad 500001, Andhra Pradesh, India
| |
Collapse
|
19
|
Kumar P, Nagarajaram HA. A study on mutational dynamics of simple sequence repeats in relation to mismatch repair system in prokaryotic genomes. J Mol Evol 2012; 74:127-39. [PMID: 22415400 DOI: 10.1007/s00239-012-9491-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2011] [Accepted: 02/15/2012] [Indexed: 11/30/2022]
Abstract
Mutational bias toward expansion or contraction of simple sequence repeats (SSRs) is referred to as directionality of SSR evolution. In this communication, we report the mutational bias exhibited by mononucleotide SSRs occurring in the non-coding regions of several prokaryotic genomes. Our investigations revealed that the strains or species lacking mismatch repair (MMR) system generally show higher number of polymorphic SSRs than those species/strains having MMR system. An exception to this observation was seen in the mycobacterial genomes that are MMR deficient where only a few SSR tracts were seen with mutations. This low incidence of SSR mutations even in the MMR-deficient background could be attributed to the high fidelity of the DNA polymerases as a consequence of high generation time of the mycobacteria. MMR system-deficient species generally did not show any bias toward mononucleotide SSR expansions or contractions indicating a neutral evolution of SSRs in these species. The MMR-proficient species in which the observed mutations correspond to secondary mutations showed bias toward contraction of polymononucleotide tracts, perhaps, indicating low efficiency of MMR system to repair SSR-induced slippage errors on template strands. This bias toward deletion in the mononucleotide SSR tracts might be a probable reason behind scarcity for long poly A|T and G|C tracts in prokaryotic systems which are mostly MMR proficient. In conclusion, our study clearly demonstrates mutational dynamics of SSRs in relation to the presence/absence of MMR system in the prokaryotic system.
Collapse
Affiliation(s)
- Pankaj Kumar
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics (CDFD), Tuljaguda Complex, 4-1-714, Mozamjahi Rd, Nampally, Hyderabad, 500 001, India
| | | |
Collapse
|
20
|
Mohammad TAS, Nagarajaram HA. SVM-based method for protein structural class prediction using secondary structural content and structural information of amino acids. J Bioinform Comput Biol 2011; 9:489-502. [PMID: 21776605 DOI: 10.1142/s0219720011005422] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2010] [Revised: 09/15/2010] [Accepted: 01/07/2011] [Indexed: 11/18/2022]
Abstract
The knowledge collated from the known protein structures has revealed that the proteins are usually folded into the four structural classes: all-α, all-β, α/β and α + β. A number of methods have been proposed to predict the protein's structural class from its primary structure; however, it has been observed that these methods fail or perform poorly in the cases of distantly related sequences. In this paper, we propose a new method for protein structural class prediction using low homology (twilight-zone) protein sequences dataset. Since protein structural class prediction is a typical classification problem, we have developed a Support Vector Machine (SVM)-based method for protein structural class prediction that uses features derived from the predicted secondary structure and predicted burial information of amino acid residues. The examination of different individual as well as feature combinations revealed that the combination of secondary structural content, secondary structural and solvent accessibility state frequencies of amino acids gave rise to the best leave-one-out cross-validation accuracy of ~81% which is comparable to the best accuracy reported in the literature so far.
Collapse
Affiliation(s)
- Tabrez Anwar Shamim Mohammad
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics (CDFD), Nampally, Hyderabad 500001, India.
| | | |
Collapse
|
21
|
Mohammad TAS, Nagarajaram HA. A hierarchical approach to protein fold prediction. J Integr Bioinform 2011; 8:185. [PMID: 22008449 DOI: 10.2390/biecoll-jib-2011-185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2011] [Revised: 08/18/2011] [Accepted: 10/19/2011] [Indexed: 05/31/2023] Open
Abstract
Fold recognition, assigning novel proteins to known structures, forms an important component of the overall protein structure discovery process. The available methods for protein fold recognition are limited by the low fold-coverage and/or low prediction accuracies. We describe here a new Support Vector Machine (SVM)-based method for protein fold prediction with high prediction accuracy and high fold-coverage. The new method of fold prediction with high fold-coverage was developed by training and testing on a large number of folds in order to make the method suitable for large scale fold predictions. However, presence of large number of folds in the training set made the classification task difficult as a consequence of increased complexity involved in binary classifications of SVMs. In order to overcome this complexity we adopted a hierarchical approach where fold-prediction is made in two steps. At the first step structural class of the query is predicted and at the second step fold is predicted within the predicted structural class. This decreased the complexity of the classification problem and also improved the overall fold prediction accuracy. To the best of our knowledge this is the first taxonomic fold recognition method to cover over 700 protein-folds and gives prediction accuracy of around 70% on a benchmark dataset. Since the new method gives rise to state of the art prediction performance and hence can be very useful for structural characterization of proteins discovered in various genomes.
Collapse
|
22
|
Mudunuri SB, Kumar P, Rao AA, Pallamsetty S, Nagarajaram HA. G-IMEx: A comprehensive software tool for detection of microsatellites from genome sequences. Bioinformation 2010; 5:221-3. [PMID: 21364802 PMCID: PMC3040503 DOI: 10.6026/97320630005221] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2010] [Accepted: 08/25/2010] [Indexed: 01/03/2023] Open
Abstract
Microsatellites are ubiquitous short tandem repeats found in all known genomes and are known to play a very important role in various studies and fields including DNA fingerprinting, paternity studies, evolutionary studies, virulence and adaptation of certain bacteria and viruses etc. Due to the sequencing of several genomes and the availability of enormous amounts of sequence data during the past few years, computational studies of microsatellites are of interest for many researchers. In this context, we developed a software tool called Imperfect Microsatellite Extractor (IMEx), to extract perfect, imperfect and compound microsatellites from genome sequences along with their complete statistics. Recently we developed a user-friendly graphical-interface using JAVA for IMEx to be used as a stand-alone software named G-IMEx. G-IMEx takes a nucleotide sequence as an input and the results are produced in both html and text formats. The Linux version of G-IMEx can be downloaded for free from http://www.cdfd.org.in/imex.
Collapse
Affiliation(s)
- Suresh B Mudunuri
- Department of Computer Science and Engineering, Aditya Engineering College (AEC), Surampalem 533 437, India
| | - Pankaj Kumar
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics (CDFD), Hyderabad 500 001, India
| | - Allam Appa Rao
- Jawaharlal Nehru Technological University (JNTU), Kakinada, 533 003, India
| | - S Pallamsetty
- Department of Computer Science and Systems Engineering, Andhra University College of Engineering (AUCE), Visakhapatnam 530 003, India
| | - H A Nagarajaram
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics (CDFD), Hyderabad 500 001, India
| |
Collapse
|
23
|
Purushotham G, Madhumohan K, Anwaruddin M, Nagarajaram HA, Hariram V, Narasimhan C, Bashyam MD. The MYH7 p.R787H mutation causes hypertrophic cardiomyopathy in two unrelated families. Exp Clin Cardiol 2010; 15:e1-e4. [PMID: 20664766 PMCID: PMC2907879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2009] [Accepted: 02/12/2010] [Indexed: 05/29/2023]
Abstract
BACKGROUND Familial hypertrophic cardiomyopathy (FHC) is a Mendelian disorder usually caused by mutations in any one of more than 12 genes, most of which encode sarcomere proteins. The disease exhibits extensive genetic heterogeneity, and it is important to identify mutations that result in adverse symptoms and/or lethality in affected individuals. An analysis of disease-causing mutations has been initiated in the Indian population to determine prevalent mutations. METHODS FHC was detected using echocardiography and by analysis of clinical symptoms and family history. The disease-causing mutation was identified using polymerase chain reaction DNA sequencing. RESULTS The p.R787H mutation was identified in the MYH7 gene in two FHC families. Sequence and structure analysis suggested impaired binding of the mutant protein to the myosin essential light chain. CONCLUSIONS Although the mutation results in variable clinical symptoms in the affected individuals, probably owing to the effect of modifier genes and/or environmental factors, it does not appear to be a lethal mutation.
Collapse
Affiliation(s)
| | | | - Mohammad Anwaruddin
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics (CDFD)
| | - HA Nagarajaram
- Laboratory of Computational Biology, Centre for DNA Fingerprinting and Diagnostics (CDFD)
| | | | | | | |
Collapse
|
24
|
Shamim MTA, Anwaruddin M, Nagarajaram HA. Support Vector Machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs. ACTA ACUST UNITED AC 2007; 23:3320-7. [PMID: 17989092 DOI: 10.1093/bioinformatics/btm527] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
MOTIVATION Fold recognition is a key step in the protein structure discovery process, especially when traditional sequence comparison methods fail to yield convincing structural homologies. Although many methods have been developed for protein fold recognition, their accuracies remain low. This can be attributed to insufficient exploitation of fold discriminatory features. RESULTS We have developed a new method for protein fold recognition using structural information of amino acid residues and amino acid residue pairs. Since protein fold recognition can be treated as a protein fold classification problem, we have developed a Support Vector Machine (SVM) based classifier approach that uses secondary structural state and solvent accessibility state frequencies of amino acids and amino acid pairs as feature vectors. Among the individual properties examined secondary structural state frequencies of amino acids gave an overall accuracy of 65.2% for fold discrimination, which is better than the accuracy by any method reported so far in the literature. Combination of secondary structural state frequencies with solvent accessibility state frequencies of amino acids and amino acid pairs further improved the fold discrimination accuracy to more than 70%, which is approximately 8% higher than the best available method. In this study we have also tested, for the first time, an all-together multi-class method known as Crammer and Singer method for protein fold classification. Our studies reveal that the three multi-class classification methods, namely one versus all, one versus one and Crammer and Singer method, yield similar predictions. AVAILABILITY Dataset and stand-alone program are available upon request.
Collapse
|
25
|
Cheeran A, Babu Suganthan R, Swapna G, Bandey I, Achary MS, Nagarajaram HA, Sen R. Escherichia coli RNA polymerase mutations located near the upstream edge of an RNA:DNA hybrid and the beginning of the RNA-exit channel are defective for transcription antitermination by the N protein from lambdoid phage H-19B. J Mol Biol 2005; 352:28-43. [PMID: 16061258 DOI: 10.1016/j.jmb.2005.06.052] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2005] [Revised: 05/08/2005] [Accepted: 06/20/2005] [Indexed: 12/01/2022]
Abstract
Transcription antitermination is an important mechanism that can control regulation of gene expression. The N protein of lambdoid phages modifies the transcription elongation complex (EC) and helps it to overcome downstream terminators. In this modified EC, the C-terminal domain of N makes specific interactions with RNA polymerase (RNAP). The interacting surface of RNAP for N is unknown. Here, we report five mutations in the beta (G1045D) and beta' (P251S, P254L, R270C and G336S) subunits of RNAP that are specifically defective for antitermination by N protein of the lambdoid phage, H-19B. A mutation in the C-terminal domain of N, L108F, suppresses the defect of beta'-P254L. Purified mutant holoenzymes exhibit less processive antitermination. The amino acid substitutions in the mutant RNAPs cluster very close to the RNA:DNA hybrid at the beginning of the RNA-exit channel of the EC. We suggest that the action of H-19B N is exerted through the region defined by these amino acids. Wild-type N stabilizes the EC at terminator sites and in this modified EC a part of the terminator hairpin may form but appears to be unstable. We propose that the action of N close to the active center alters the RNAP-nucleic acid interactions around the RNA:DNA hybrid, which impairs proper folding of the terminator hairpin or stabilizes the weak RNA:DNA hybrid, or both.
Collapse
Affiliation(s)
- Anoop Cheeran
- Laboratory of Transcription Biology, Center for DNA Fingerprinting and Diagnostics, Nacharam, Hyderabad-500076, India
| | | | | | | | | | | | | |
Collapse
|
26
|
Prasad MD, Muthulakshmi M, Arunkumar KP, Madhu M, Sreenu VB, Pavithra V, Bose B, Nagarajaram HA, Mita K, Shimada T, Nagaraju J. SilkSatDb: a microsatellite database of the silkworm, Bombyx mori. Nucleic Acids Res 2005; 33:D403-6. [PMID: 15608226 PMCID: PMC540053 DOI: 10.1093/nar/gki099] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
The SilkSatDb (silkmoth microsatellite database) (http://www.cdfd.org.in/silksatdb) is a relational database of microsatellites extracted from the available expressed sequence tags and whole genome shotgun sequences of the silkmoth, Bombyx mori. The database has been rendered with a simple and robust web-based search facility, developed using PHP. The SilkSatDb also stores information on primers developed and validated in the laboratory. Users can retrieve information on the microsatellite and the protocols used, along with informative figures and polymorphism status of those microsatellites. In addition, the interface is coupled with Autoprimer, a primer-designing program, using which users can design primers for the loci of interest.
Collapse
Affiliation(s)
- M D Prasad
- Laboratories of Molecular Genetics, Centre for DNA Fingerprinting and Diagnostics, ECIL Road, Nacharam, Hyderabad 500076, India
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Kamal A, Ramesh G, Srinivas O, Ramulu P, Laxman N, Rehana T, Deepak M, Achary MS, Nagarajaram HA. Design, synthesis, and evaluation of mixed imine-amine pyrrolobenzodiazepine dimers with efficient DNA binding affinity and potent cytotoxicity. Bioorg Med Chem 2005; 12:5427-36. [PMID: 15388169 DOI: 10.1016/j.bmc.2004.07.045] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2004] [Revised: 07/21/2004] [Accepted: 07/21/2004] [Indexed: 11/26/2022]
Abstract
Synthesis of mixed imine-amine pyrrolobenzodiazepine (PBD) dimers that are comprised of DC-81 and secondary amine (N10) of DC-81 subunits tethered to their C8 positions through alkanedioxy linkers (comprised of three and five carbons) is described. These noncross-linking unsymmetrical molecules exhibit significant DNA minor groove binding ability and one of them 5b linked through the pentanedioxy chain exhibits efficient DNA binding ability (DeltaTm=11.0 degrees C) when compared to naturally occurring DC-81, 1 (DeltaTm=0.7 degrees C). The imine-amine PBD dimers exhibit promising in vitro antitumor activity in a number of human cancer cell lines.
Collapse
Affiliation(s)
- Ahmed Kamal
- Biotransformation Laboratory, Division of Organic Chemistry, Indian Institute of Chemical Technology, Hyderabad 500 007, India.
| | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Majeed AA, Ahmed N, Rao KR, Ghousunnissa S, Kauser F, Bose B, Nagarajaram HA, Katoch VM, Cousins DV, Sechi LA, Gilman RH, Hasnain SE. AmpliBASE MT: a Mycobacterium tuberculosis diversity knowledgebase. Bioinformatics 2004; 20:989-92. [PMID: 14764550 DOI: 10.1093/bioinformatics/bth051] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
AmpliBASE MT is an online databank of high-resolution DNA fingerprints representing fluorescent amplified fragment length polymorphism (FAFLP) profiles or amplitypes developed for the Mycobacterium tuberculosis complex strains from 48 different countries. AmpliBASE MT is based on a relational database management system that is hyperlinked to visualize genotyping results in the form of DNA fingerprint images for individual strains. A flexible search system based on systematic comparisons of fragment sizes in base pairs allows inter-laboratory comparison of FAFLP profiles. Besides this, the database also displays previously published data on IS6110 profiles, spoligotypes, MIRU-VNTRs and large sequence polymorphisms along with the FAFLP records that will give the overall comparisons. Being the first of its kind, AmpliBASE MT is expected to be a very helpful tool in strengthening the concept of 'geographic genomics' and will be very helpful to molecular epidemiologists and those interested in diagnostic development for tuberculosis.
Collapse
Affiliation(s)
- Ahmed A Majeed
- Centre for DNA Fingerprinting and Diagnostics, Nacharam, Hyderabad 500076, India
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Abstract
The Ramachandran steric map and energy diagrams of the glycyl residue are symmetric. A plot of (phi,psi) angles of glycyl residues in 250 nonhomologous and high-resolution protein structures is also largely symmetric. However, there is a clear aberration in the symmetry. Although there is a cluster of points corresponding to the right-handed alpha-helical region, the "equivalent" cluster is clearly shifted to in and around the (phi,psi) values of (90 degrees, 0 degrees ) instead of being centered at the left-handed alpha-helical region of (60 degrees, 40 degrees ). This lack of symmetry exists even in the (phi,psi) distribution of residues from non-alpha-helical regions in proteins. Here we provide an explanation for this observation. An analysis of glycyl conformations in small peptide structures and in "coil" proteins, which are largely devoid of helical and sheet regions, shows that glycyl residues prefer to adopt conformations around (+/-90 degrees, 0 degrees ) instead of right- and left-handed alpha-helical regions. By using theoretical calculations, such conformations are shown to have highest solvent accessibility in a system of two-linked peptide units with glycyl residue at the central C(alpha) atom. This finding is consistent with the observations from 250 nonhomologous protein structures where glycyl residues with conformations close to (+/-90 degrees, 0 degrees ) are seen to have high solvent accessibility. Analysis of a subset of nonhomologous structures with very high resolution (1.5 A or better) shows that water molecules are indeed present at distances suitable for hydrogen bond interaction with glycyl residues possessing conformations close to (+/-90 degrees, 0 degrees ). It is suggested that water molecules play a key role in determining and stabilizing these conformations of glycyl residues and explain the aberration in the symmetry of glycyl conformations in proteins.
Collapse
Affiliation(s)
- Narayanan Eswar
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | | | | | | |
Collapse
|
30
|
Kamal A, Ramesh G, Laxman N, Ramulu P, Srinivas O, Neelima K, Kondapi AK, Sreenu VB, Nagarajaram HA. Design, synthesis, and evaluation of new noncross-linking pyrrolobenzodiazepine dimers with efficient DNA binding ability and potent antitumor activity. J Med Chem 2002; 45:4679-88. [PMID: 12361394 DOI: 10.1021/jm020124h] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
New sequence selective mixed imine-amide pyrrolobenzodiazepine (PBD) dimers have been developed that are comprised of DC-81 and dilactam of DC-81 subunits tethered to their C8 positions through alkanedioxy linkers (comprised of three to five and eight carbons). Thermal denaturation studies show that after 18 h of incubation with calf thymus DNA at a 5:1 DNA/ligand ratio, one of them (5c) increases the DeltaT(m) value by 17.0 degrees C. Therefore, these unsymmetrical molecules exhibit significant DNA minor groove binding affinity and 5c linked through the pentanedioxy chain exhibits efficient DNA binding ability that compares with the cross-linking DSB-120 PBD dimer (DeltaT(m) = 15.4 degrees C). Interestingly, this imine-amide PBD dimer has been linked with a five carbon chain linker unlike DSB-120, which has two DC-81 subunits with a three carbon chain linker, illustrating the effect of the noncross-linking aspect by introducing the noncovalent subunit. The binding affinity of the compounds has been measured by restriction endonuclease digestion assay based on inhibition of the restriction endonuclease BamHI. This study reveals the significance of noncovalent interactions in combination with covalent bonding aspects when two moieties of structural similarities are joined together. This allows the mixed imine-amide PBD dimer with a five carbon chain linker to achieve an isohelical fit within the DNA minor groove taking in to account both the covalent bonding and the noncovalent binding components. This has been supported by molecular modeling studies, which indicate that the PBD dimer with a five carbon chain linker gives rise to maximum stabilization of the complex with DNA at the minor groove as compared to the other PBD dimers with three, four, and eight carbon chain linkers. The energy of interaction in all of the complexes studied is correlated to the DeltaT(m) values. Furthermore, this dimer 5c has significant cytotoxicity in a number of human cancer cell lines.
Collapse
Affiliation(s)
- Ahmed Kamal
- Biotransformation Laboratory, Division of Organic Chemistry, Indian Institute of Chemical Technology, Hyderabad 500 007, India.
| | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Mendes J, Nagarajaram HA, Soares CM, Blundell TL, Carrondo MA. Incorporating knowledge-based biases into an energy-based side-chain modeling method: application to comparative modeling of protein structure. Biopolymers 2001; 59:72-86. [PMID: 11373721 DOI: 10.1002/1097-0282(200108)59:2<72::aid-bip1007>3.0.co;2-s] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The performance of the self-consistent mean field theory (SCMFT) method for side-chain modeling, employing rotamer energies calculated with the flexible rotamer model (FRM), is evaluated in the context of comparative modeling of protein structure. Predictions were carried out on a test set of 56 model backbones of varying accuracy, to allow side-chain prediction accuracy to be analyzed as a function of backbone accuracy. A progressive decrease in the accuracy of prediction was observed as backbone accuracy decreased. However, even for very low backbone accuracy, prediction was substantially higher than random, indicating that the FRM can, in part, compensate for the errors in the modeled tertiary environment. It was also investigated whether the introduction in the FRM-SCMFT method of knowledge-based biases, derived from a backbone-dependent rotamer library, could enhance its performance. A bias derived from the backbone-dependent rotamer conformations alone did not improve prediction accuracy. However, a bias derived from the backbone-dependent rotamer probabilities improved prediction accuracy considerably. This bias was incorporated through two different strategies. In one (the indirect strategy), rotamer probabilities were used to reject unlikely rotamers a priori, thus restricting prediction by FRM-SCMFT to a subset containing only the most probable rotamers in the library. In the other (the direct strategy), rotamer energies were transformed into pseudo-energies that were added to the average potential energies of the respective rotamers, thereby creating hybrid energy-based/knowledge-based average rotamer energies, which were used by the FRM-SCMFT method for prediction. For all degrees of backbone accuracy, an optimal strength of the knowledge-based bias existed for both strategies for which predictions were more accurate than pure energy-based predictions, and also than pure knowledge-based predictions. Hybrid knowledge-based/energy-based methods were obtained from both strategies and compared with the SCWRL method, a hybrid method based on the same backbone-dependent rotamer library. The accuracy of the indirect method was approximately the same as that of the SCWRL method, but that of the direct method was significantly higher.
Collapse
Affiliation(s)
- J Mendes
- Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Apartado 127, Av. da República, 2781-901, Oeiras, Portugal
| | | | | | | | | |
Collapse
|
32
|
Nagarajaram HA, Reddy BV, Blundell TL. Analysis and prediction of inter-strand packing distances between beta-sheets of globular proteins. Protein Eng 1999; 12:1055-62. [PMID: 10611399 DOI: 10.1093/protein/12.12.1055] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Any two beta-strands belonging to two different beta-sheets in a protein structure are considered to pack interactively if each beta-strand has at least one residue that undergoes a loss of one tenth or more of its solvent contact surface area upon packing. A data set of protein 3-D structures (determined at 2.5 A resolution or better), corresponding to 428 protein chains, contains 1986 non-identical pairs of beta-strands involved in interactive packing. The inter-axial distance between these is significantly correlated to the weighted sum of the volumes of the interacting residues at the packing interface. This correlation can be used to predict the changes in the inter-sheet distances in equivalent beta-sheets in homologous proteins and, therefore, is of value in comparative modelling of proteins.
Collapse
Affiliation(s)
- H A Nagarajaram
- Department of Biochemistry, 80, Tennis Court Road, Old Addenbrooks Site, Cambridge CB2 1GA, UK
| | | | | |
Collapse
|
33
|
Burke DF, Deane CM, Nagarajaram HA, Campillo N, Martin-Martinez M, Mendes J, Molina F, Perry J, Reddy BV, Soares CM, Steward RE, Williams M, Carrondo MA, Blundell TL, Mizuguchi K. An iterative structure-assisted approach to sequence alignment and comparative modeling. Proteins 1999; Suppl 3:55-60. [PMID: 10526352 DOI: 10.1002/(sici)1097-0134(1999)37:3+<55::aid-prot8>3.3.co;2-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Correct alignment of the sequence of a target protein with those of homologues of known three-dimensional structure is a key step in comparative modeling. Usually an iterative approach that takes account of the local and overall structural features is required. We describe such an approach that exploits databases of structural alignments of homologous proteins (HOMSTRAD, http:/(/)www-cryst.bioc.cam.ac.uk/ approximately homstrad) and protein superfamilies (CAMPASS, http:/(/)www-cryst.bioc.cam.ac.uk/ approximately campass), in which structure-based alignments are analyzed and formatted with the program JOY (http:/(/)www-cryst.bioc.cam.ac.uk/ approximately joy) to reveal conserved local structural features. The databases facilitate the recognition of a family or superfamily, they assist in the selection of useful parent structures, they are helpful in alignment of the target sequences with the parent set, and are useful for deriving relationships that can be used in validating models. In the iterative approach, a model is constructed on the basis of the proposed sequence alignment and this is then reexpressed in the JOY format and realigned with the parent set. This is repeated until the model and sequence alignment is optimized. We examine the case for comparison and use of multiple structures of family members, rather than a single parent structure. We use the targets attempted by our group in CASP3 to assess the value of such procedures.
Collapse
Affiliation(s)
- D F Burke
- Department of Biochemistry, University of Cambridge, United Kingdom
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Abstract
An alpha-helix and a beta-strand are said to be interactively packed if at least one residue in each of the secondary structural elements loses 10% of its solvent accessible contact area on association with the other secondary structural element. An analysis of all such 5,975 nonidentical alpha/beta units in protein structures, defined at < or = 2.5 A resolution, shows that the interaxial distance between the alpha-helix and the beta-strand is linearly correlated with the residue-dependent function, log[(V/nda)/n-int], where V is the volume of amino acid residues in the packing interface, nda is the normalized difference in solvent accessible contact area of the residues in packed and unpacked secondary structural elements, and n-int is the number of residues in the packing interface. The beta-sheet unit (beta u), defined as a pair of adjacent parallel or antiparallel hydrogen-bonded beta-strands, packing with an alpha-helix shows a better correlation between the interaxial distance and log(V/nda) for the residues in the packing interface. This packing relationship is shown to be useful in the prediction of interaxial distances in alpha/beta units using the interacting residue information of equivalent alpha/beta units of homologous proteins. It is, therefore, of value in comparative modeling of protein structures.
Collapse
Affiliation(s)
- B V Reddy
- Department of Biochemistry, University of Cambridge, United Kingdom
| | | | | |
Collapse
|
35
|
Sowdhamini R, Burke DF, Deane C, Huang JF, Mizuguchi K, Nagarajaram HA, Overington JP, Srinivasan N, Steward RE, Blundell TL. Protein three-dimensional structural databases: domains, structurally aligned homologues and superfamilies. Acta Crystallogr D Biol Crystallogr 1998; 54:1168-77. [PMID: 10089493 DOI: 10.1107/s0907444998007148] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
This paper reports the availability of a database of protein structural domains (DDBASE), an alignment database of homologous proteins (HOMSTRAD) and a database of structurally aligned superfamilies (CAMPASS) on the World Wide Web (WWW). DDBASE contains information on the organization of structural domains and their boundaries; it includes only one representative domain from each of the homologous families. This database has been derived by identifying the presence of structural domains in proteins on the basis of inter-secondary structural distances using the program DIAL [Sowdhamini & Blundell (1995), Protein Sci. 4, 506-520]. The alignment of proteins in superfamilies has been performed on the basis of the structural features and relationships of individual residues using the program COMPARER [Sali & Blundell (1990), J. Mol. Biol. 212, 403-428]. The alignment databases contain information on the conserved structural features in homologous proteins and those belonging to superfamilies. Available data include the sequence alignments in structure-annotated formats and the provision for viewing superposed structures of proteins using a graphical interface. Such information, which is freely accessible on the WWW, should be of value to crystallographers in the comparison of newly determined protein structures with previously identified protein domains or existing families.
Collapse
Affiliation(s)
- R Sowdhamini
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1QW, England
| | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Sowdhamini R, Burke DF, Huang JF, Mizuguchi K, Nagarajaram HA, Srinivasan N, Steward RE, Blundell TL. CAMPASS: a database of structurally aligned protein superfamilies. Structure 1998; 6:1087-94. [PMID: 9753697 DOI: 10.1016/s0969-2126(98)00110-5] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Affiliation(s)
- R Sowdhamini
- Department of Biochemistry University of Cambridge 80 Tennis Court Road, Cambridge, CB2 1GA, UK
| | | | | | | | | | | | | | | |
Collapse
|
37
|
Gunasekaran K, Nagarajaram HA, Ramakrishnan C, Balaram P. Stereochemical punctuation marks in protein structures: glycine and proline containing helix stop signals. J Mol Biol 1998; 275:917-32. [PMID: 9480777 DOI: 10.1006/jmbi.1997.1505] [Citation(s) in RCA: 110] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
An analysis on the nature of alpha-helix stop signals has been carried out, using a dataset of 1057 helices identified from 250 high resolution (</=2.0 A), non-homologous, protein crystal structures. The backbone dihedral angles (phi, psi) of the terminating residue (T) were found to cluster either in the left-handed helical region (alphaL: phi=20 degrees to 125 degrees and psi=-45 degrees to 90 degrees ; 469 helices (44%)) or in the extended region (E: phi=-180 degrees to -30 degrees and psi=60 degrees to 180 degrees and -180 degrees to -150 degrees ; 459 helices (43%)) of the Ramachandran map. These two broad categories of helix stop signals, alphaL and E-terminated helices, were further examined for sequence preferences. Gly residues were found to have an overwhelming preference to occur as the "alphaL-terminator (T)" resulting in the classical Schellman motif, with a strong preference for hydrophobic residues at position T-4 and T+1. In the case of E-terminated helices His, Asn, Leu and Phe were found to occur with high propensity at position T. Quite remarkably Pro residues, with single exception, were absent at position T, but had the highest propensity at position T+1. Examination of the frequencies of hydrophobic (h) and polar (p) residues at positions flanking Gly/Pro permitted delineation of exclusive patterns and predictive rules for Gly-terminated helices and Pro-terminated helices. The analysis reveals that Pro residues flanked by polar amino acids have a very strong tendency to terminate helices. Examination of a segment ranging from T-4 to T+3 appeared to be necessary to determine whether helix termination or continuation occur at Gly residues. The two types of helix termination (alphaL, E) signals also differed dramatically in their solvent accessibility. Gly and Pro residues at helix termini appeared to be strongly conserved in homologous sequences.
Collapse
|
38
|
Varadarajan R, Nagarajaram HA, Ramakrishnan C. A procedure for the prediction of temperature-sensitive mutants of a globular protein based solely on the amino acid sequence. Proc Natl Acad Sci U S A 1996; 93:13908-13. [PMID: 8943034 PMCID: PMC19465 DOI: 10.1073/pnas.93.24.13908] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Temperature-sensitive (Ts) mutants of a protein are an extremely powerful tool for studying protein function in vivo and in cell culture. We have devised a method to predict those residues in a protein sequence that, when appropriately mutated, are most likely to give rise to a Ts phenotype. Since substitutions of buried hydrophobic residues often result in significant destabilization of the protein, our method predicts those residues in the sequence that are likely to be buried in the protein structure. We also indicate a set of amino acid substitutions, which should be made to generate a Ts mutant of the protein. This method requires only the protein sequence. No structural information or homologous sequence information is required. This method was applied to a test data set of 30 nonhomologous protein structures from the Protein Data Bank. All of the residues predicted by the method to be > or = 95% buried were, in fact, buried in the protein crystal structure. In contrast, only 50% of all hydrophobic residues in this data set were > or = 95% buried. This method successfully predicts several known Ts and partially active mutants of T4 lysozyme, lambda repressor, gene V protein, and staphylococcal nuclease. This method also correctly predicts residues that form part of the hydrophobic cores of lambda repressor, myoglobin, and cytochrome b562.
Collapse
Affiliation(s)
- R Varadarajan
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India.
| | | | | |
Collapse
|
39
|
Nagarajaram HA, Ramakrishnan C. Stereochemical studies on cyclic peptides: Detailed energy minimization studies on hydrogen bonded all-trans cyclic pentapeptide backbones. J Biosci 1995. [DOI: 10.1007/bf02703300] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
40
|
Nagarajaram HA, Sowdhamini R, Ramakrishnan C, Balaram P. Termination of right handed helices in proteins by residues in left handed helical conformations. FEBS Lett 1993; 321:79-83. [PMID: 8467914 DOI: 10.1016/0014-5793(93)80625-5] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
An analysis of 636 helical segments, ranging in length from 4 to 32 residues, from 123 independent protein crystal structures reveals that helix termination by residues in left handed (alpha 1) helical conformations is a common occurrence. Gly and Asn residues are the most frequent alpha L helix terminators, with the former having a very high propensity to adopt such conformations. The alpha R-alpha R-alpha R-alpha L segment at the C termini of protein helices often possesses a 6--> 1 (pi-type) hydrogen bond between the CO of residue i and the NH of residue i + 5 with residue i + 4 occurring in the alpha L conformation. A stereochemical analysis of 216 examples shows that in 62 cases the 6-->1 hydrogen bond is absent. The present analysis provides a quantitative measure of the propensity of the 20 amino acids to adopt alpha L helix terminating conformations.
Collapse
Affiliation(s)
- H A Nagarajaram
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore
| | | | | | | |
Collapse
|
41
|
Nagarajaram HA, Paul PK, Ramanarayanan K, Soman KV, Ramakrishnan C. Conformational studies on beta-bend containing a cis peptide unit. Int J Pept Protein Res 1992; 40:383-94. [PMID: 1483833 DOI: 10.1111/j.1399-3011.1992.tb00315.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Conformational studies have been carried out on the X-cis-Pro tripeptide system (a system of three linked peptide units, in the trans-cis-trans configuration) using energy minimization techniques. For X, residues Gly, L-Ala, D-Ala and L-Pro have been used. The energy minima have been classified into different groups based upon the conformational similarity. There are 15, 20, 18 and 6 minima that are possible for the four cases respectively and these fall into 11 different groups. A study of these minima shows that, (i) some minima contain hydrogen bonds--either 4-->1 or 1-->2 type, (ii) the low energy minima qualify themselves as bend conformations, (iii) cis' and trans' conformations are possible for the prolyl residue as also the C gamma-endo and C gamma-exo puckerings, and (iv) for Pro-cis-Pro, cis' at the first prolyl residue is ruled out, due to the high energy. The available crystal structure data on proteins and peptides, containing cis-Pro segment have been examined with a view to find the minima that occur in solid state. The data from protein show that they fall under two groups. The conformation at X in X-cis-Pro is near extended when it is a non-glycyl residue. In both peptides and proteins there exists a preference for trans' conformation at prolyl residue over cis' when X is a non-glycyl residue. The minima obtained can be useful in modelling studies.
Collapse
Affiliation(s)
- H A Nagarajaram
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore
| | | | | | | | | |
Collapse
|