1
|
Basu S, Bahadur RP. Conservation and coevolution determine evolvability of different classes of disordered residues in human intrinsically disordered proteins. Proteins 2021; 90:632-644. [PMID: 34626492 DOI: 10.1002/prot.26261] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Revised: 10/07/2021] [Accepted: 10/07/2021] [Indexed: 12/19/2022]
Abstract
Structure, function, and evolution are interdependent properties of proteins. Diversity of protein functions arising from structural variations is a potential driving force behind protein evolvability. Intrinsically disordered proteins or regions (IDPs or IDRs) lack well-defined structure under normal physiological conditions, yet, they are highly functional. Increased occurrence of IDPs in eukaryotes compared to prokaryotes indicates strong correlation of protein evolution and disorderedness. IDPs generally have higher evolution rate compared to globular proteins. Structural pliability allows IDPs to accommodate multiple mutations without affecting their functional potential. Nevertheless, how evolutionary signals vary between different classes of disordered residues (DRs) in IDPs is poorly understood. This study addresses variation of evolutionary behavior in terms of residue conservation and intra-protein coevolution among structural and functional classes of DRs in IDPs. Analyses are performed on 579 human IDPs, which are classified based on length of IDRs, interacting partners and functional classes. We find short IDRs are less conserved than long IDRs or full IDPs. Functional classes which require flexibility and specificity to perform their activity comparatively evolve slower than others. Disorder promoting amino acids evolve faster than order promoting amino acids. Pro, Gly, Ile, and Phe have unique coevolving nature which further emphasizes on their roles in IDPs. This study sheds light on evolutionary footprints in different classes of DRs from human IDPs and enhances our understanding of the structural and functional potential of IDPs.
Collapse
Affiliation(s)
- Sushmita Basu
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, India
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, India
| |
Collapse
|
2
|
Lv C, Zhang X, Liu Y, Zhang T, Chen H, Zang J, Zheng B, Zhao G. Redesign of protein nanocages: the way from 0D, 1D, 2D to 3D assembly. Chem Soc Rev 2021; 50:3957-3989. [PMID: 33587075 DOI: 10.1039/d0cs01349h] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Compartmentalization is a hallmark of living systems. Through compartmentalization, ubiquitous protein nanocages such as viral capsids, ferritin, small heat shock proteins, and DNA-binding proteins from starved cells fulfill a variety of functions, while their shell-like structures hold great promise for various applications in the field of nanomedicine and nanotechnology. However, the number and structure of natural protein nanocages are limited, and these natural protein nanocages may not be suited for a given application, which might impede their further application as nanovehicles, biotemplates or building blocks. To overcome these shortcomings, different strategies have been developed by scientists to construct artificial protein nanocages, and 1D, 2D and 3D protein arrays with protein nanocages as building blocks through genetic and chemical modification to rival the size and functionality of natural protein nanocages. This review outlines the recent advances in the field of the design and construction of artificial protein nanocages and their assemblies with higher order, summarizes the strategies for creating the assembly of protein nanocages from zero-dimension to three dimensions, and introduces their corresponding applications in the preparation of nanomaterials, electrochemistry, and drug delivery. The review will highlight the roles of both the inter-subunit/intermolecular interactions at the key interface and the protein symmetry in constructing and controlling protein nanocage assemblies with different dimensions.
Collapse
Affiliation(s)
- Chenyan Lv
- College of Food Science and Nutritional Engineering, China Agricultural University, Beijing Key Laboratory of Functional Food from Plant Resources, Beijing 100083, China.
| | | | | | | | | | | | | | | |
Collapse
|
3
|
Pilla SP, Bahadur RP. Residue conservation elucidates the evolution of r-proteins in ribosomal assembly and function. Int J Biol Macromol 2019; 140:323-329. [PMID: 31421176 DOI: 10.1016/j.ijbiomac.2019.08.127] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2019] [Revised: 08/14/2019] [Accepted: 08/14/2019] [Indexed: 02/08/2023]
Abstract
Ribosomes are the translational machineries having two unequal subunits, small subunit (SSU) and large subunit (LSU) across all the domains of life. Origin and evolution of ribosome are encoded in its structure, and the core of the ribosome is highly conserved. Here, we have used Shannon entropy to analyze the evolution of ribosomal proteins (r-proteins) across the three domains of life. Moreover, we have analyzed the residue conservation at protein-protein (PP) and protein-RNA (PR) interfaces in SSU and LSU. Furthermore, we have studied the evolution of early, intermediate and late binding r-proteins. We show that the r-proteins of Thermus thermophilus are better conserved during the evolution. Furthermore, we find the late binders are better conserved than the early and the intermediate binders. The residues at the interior of the r-proteins are the most conserved followed by those at the interface and the solvent accessible surface. Additionally, we show that the residues at the PP interfaces are better conserved than those at the PR interfaces. However, between PR and PP interfaces, the multi-interface residues at the former are better conserved than those at the latter ones. Our findings may provide insights into the evolution of r-proteins in ribosomal assembly and function.
Collapse
Affiliation(s)
- Smita P Pilla
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, India.
| |
Collapse
|
4
|
Das S, Smith K, Sarker S, Peters A, Adriaanse K, Eden P, Ghorashi SA, Forwood JK, Raidal SR. Assessing circovirus gene flow in multiple spill-over events. Virus Genes 2019; 55:802-814. [PMID: 31463770 DOI: 10.1007/s11262-019-01702-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2019] [Accepted: 08/19/2019] [Indexed: 11/29/2022]
Abstract
The establishment of viral pathogens in new host environments following spillover events probably requires adaptive changes within both the new host and pathogen. After many generations, signals for ancient cross-species transmission may become lost and a strictly host-adapted phylogeny may mimic true co-divergence while the virus may retain an inherent ability to jump host species. The mechanistic basis for such processes remains poorly understood. To study the dynamics of virus-host co-divergence and the arbitrary chances of spillover in various reservoir hosts with equal ecological opportunity, we examined structural constraints of capsid protein in extant populations of Beak and feather disease virus (BFDV) during known spillover events. By assessing reservoir-based genotype stratification, we identified co-divergence defying signatures in the evolution BFDV which highlighted primordial processes of cryptic host adaptation and competing forces of host co-divergence and cross-species transmission. We demonstrate that, despite extensive surface plasticity gathered over a longer span of evolution, structural constraints of the capsid protein allow opportunistic host switching in host-adapted populations. This study provides new insights into how small populations of endangered psittacine species may face multidirectional forces of infection from reservoirs with apparently co-diverging genotypes.
Collapse
Affiliation(s)
- Shubhagata Das
- School of Animal and Veterinary Sciences, Charles Sturt University, Boorooma Street, Wagga Wagga, NSW, 2650, Australia
| | - Kate Smith
- School of Biomedical Sciences, Charles Sturt University, Boorooma Street, Wagga Wagga, NSW, 2650, Australia
| | - Subir Sarker
- Department of Physiology, Anatomy and Microbiology, School of Life Sciences, La Trobe University, Melbourne, 3086, Australia
| | - Andrew Peters
- School of Animal and Veterinary Sciences, Charles Sturt University, Boorooma Street, Wagga Wagga, NSW, 2650, Australia
| | - Katherine Adriaanse
- Healesville Sanctuary, Zoos Victoria, Badger Creek Road, Healesville, VIC, 3777, Australia
| | - Paul Eden
- Healesville Sanctuary, Zoos Victoria, Badger Creek Road, Healesville, VIC, 3777, Australia
| | - Seyed A Ghorashi
- School of Animal and Veterinary Sciences, Charles Sturt University, Boorooma Street, Wagga Wagga, NSW, 2650, Australia
| | - Jade K Forwood
- School of Biomedical Sciences, Charles Sturt University, Boorooma Street, Wagga Wagga, NSW, 2650, Australia
| | - Shane R Raidal
- School of Animal and Veterinary Sciences, Charles Sturt University, Boorooma Street, Wagga Wagga, NSW, 2650, Australia.
| |
Collapse
|
5
|
Pilla SP, Thomas A, Bahadur RP. Dissecting macromolecular recognition sites in ribosome: implication to its self-assembly. RNA Biol 2019; 16:1300-1312. [PMID: 31179876 DOI: 10.1080/15476286.2019.1629767] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Interactions between macromolecules play a crucial role in ribosome assembly that follows a highly coordinated process involving RNA folding and binding of ribosomal proteins (r-proteins). Although extensive studies have been carried out to understand macromolecular interactions in ribosomes, most of them are confined to either large or small ribosomal-subunit of few species. A comparative analysis of macromolecular interactions across different domains is still missing. We have analyzed the structural and physicochemical properties of protein-protein (PP), protein-RNA (PR) and RNA-RNA (RR) interfaces in small and large subunits of ribosomes, as well as in between the two subunits. Additionally, we have also developed Random Forest (RF) classifier to catalog the r-proteins. We find significant differences as well as similarities in macromolecular recognition sites between ribosomal assemblies of prokaryotes and eukaryotes. PR interfaces are substantially larger and have more ionic interactions than PP and RR interfaces in both prokaryotes and eukaryotes. PP, PR and RR interfaces in eukaryotes are well packed compared to those in prokaryotes. However, the packing density between the large and the small subunit interfaces in the entire assembly is strikingly low in both prokaryotes and eukaryotes, indicating the periodic association and dissociation of the two subunits during the translation. The structural and physicochemical properties of PR interfaces are used to predict the r-proteins in the assembly pathway into early, intermediate and late binders using RF classifier with an accuracy of 80%. The results provide new insights into the classification of r-proteins in the assembly pathway.
Collapse
Affiliation(s)
- Smita P Pilla
- a Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur , Kharagpur , India
| | - Amal Thomas
- a Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur , Kharagpur , India
| | - Ranjit Prasad Bahadur
- a Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur , Kharagpur , India
| |
Collapse
|
6
|
Pilla SP, R B, Bahadur RP. Dissecting protein‐protein interactions in proteasome assembly: Implication to its self‐assembly. J Mol Recognit 2019; 32:e2784. [DOI: 10.1002/jmr.2784] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2018] [Revised: 03/07/2019] [Accepted: 03/19/2019] [Indexed: 01/18/2023]
Affiliation(s)
- Smita P. Pilla
- Computational Structural Biology Laboratory, Department of BiotechnologyIndian Institute of Technology Kharagpur Kharagpur India
| | - Babu R
- Computational Structural Biology Laboratory, Department of BiotechnologyIndian Institute of Technology Kharagpur Kharagpur India
| | - Ranjit P. Bahadur
- Computational Structural Biology Laboratory, Department of BiotechnologyIndian Institute of Technology Kharagpur Kharagpur India
| |
Collapse
|
7
|
Montiel-García DJ, Mannige RV, Reddy VS, Carrillo-Tripp M. Structure based sequence analysis of viral and cellular protein assemblies. J Struct Biol 2016; 196:299-308. [PMID: 27480508 DOI: 10.1016/j.jsb.2016.07.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2016] [Accepted: 07/18/2016] [Indexed: 10/21/2022]
Abstract
It is well accepted that, in general, protein structural similarity is strongly related to the amino acid sequence identity. To analyze in great detail the correlation, distribution and variation levels of conserved residues in the protein structure, we analyzed all available high-resolution structural data of 5245 cellular complex-forming proteins and 293 spherical virus capsid proteins (VCPs). We categorized and compare them in terms of protein structural regions. In all cases, the buried core residues are the most conserved, followed by the residues at the protein-protein interfaces. The solvent-exposed surface shows greater sequence variations. Our results provide evidence that cellular monomers and VCPs could be two extremes in the quaternary structural space, with cellular dimers and oligomers in between. Moreover, based on statistical analysis, we detected a distinct group of icosahedral virus families whose capsid proteins seem to evolve much slower than the rest of the protein complexes analyzed in this work.
Collapse
Affiliation(s)
- Daniel J Montiel-García
- Biomolecular Diversity Laboratory, Centro de Investigación y Estudios Avanzados del Instituto Politécnico Nacional, Mexico
| | - Ranjan V Mannige
- Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, USA
| | - Vijay S Reddy
- Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, USA
| | - Mauricio Carrillo-Tripp
- Biomolecular Diversity Laboratory, Centro de Investigación y Estudios Avanzados del Instituto Politécnico Nacional, Mexico.
| |
Collapse
|
8
|
Barik A, Nithin C, Karampudi NBR, Mukherjee S, Bahadur RP. Probing binding hot spots at protein-RNA recognition sites. Nucleic Acids Res 2015; 44:e9. [PMID: 26365245 PMCID: PMC4737170 DOI: 10.1093/nar/gkv876] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2015] [Accepted: 08/23/2015] [Indexed: 01/30/2023] Open
Abstract
We use evolutionary conservation derived from structure alignment of polypeptide sequences along with structural and physicochemical attributes of protein–RNA interfaces to probe the binding hot spots at protein–RNA recognition sites. We find that the degree of conservation varies across the RNA binding proteins; some evolve rapidly compared to others. Additionally, irrespective of the structural class of the complexes, residues at the RNA binding sites are evolutionary better conserved than those at the solvent exposed surfaces. For recognitions involving duplex RNA, residues interacting with the major groove are better conserved than those interacting with the minor groove. We identify multi-interface residues participating simultaneously in protein–protein and protein–RNA interfaces in complexes where more than one polypeptide is involved in RNA recognition, and show that they are better conserved compared to any other RNA binding residues. We find that the residues at water preservation site are better conserved than those at hydrated or at dehydrated sites. Finally, we develop a Random Forests model using structural and physicochemical attributes for predicting binding hot spots. The model accurately predicts 80% of the instances of experimental ΔΔG values in a particular class, and provides a stepping-stone towards the engineering of protein–RNA recognition sites with desired affinity.
Collapse
Affiliation(s)
- Amita Barik
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| | - Chandran Nithin
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| | | | - Sunandan Mukherjee
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India Advanced Technology Development Centre, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| |
Collapse
|
9
|
Hopkins M, Kailasan S, Cohen A, Roux S, Tucker KP, Shevenell A, Agbandje-McKenna M, Breitbart M. Diversity of environmental single-stranded DNA phages revealed by PCR amplification of the partial major capsid protein. THE ISME JOURNAL 2014; 8:2093-103. [PMID: 24694711 PMCID: PMC4184009 DOI: 10.1038/ismej.2014.43] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/24/2013] [Accepted: 02/24/2014] [Indexed: 11/15/2022]
Abstract
The small single-stranded DNA (ssDNA) bacteriophages of the subfamily Gokushovirinae were traditionally perceived as narrowly targeted, niche-specific viruses infecting obligate parasitic bacteria, such as Chlamydia. The advent of metagenomics revealed gokushoviruses to be widespread in global environmental samples. This study expands knowledge of gokushovirus diversity in the environment by developing a degenerate PCR assay to amplify a portion of the major capsid protein (MCP) gene of gokushoviruses. Over 500 amplicons were sequenced from 10 environmental samples (sediments, sewage, seawater and freshwater), revealing the ubiquity and high diversity of this understudied phage group. Residue-level conservation data generated from multiple alignments was combined with a predicted 3D structure, revealing a tendency for structurally internal residues to be more highly conserved than surface-presenting protein-protein or viral-host interaction domains. Aggregating this data set into a phylogenetic framework, many gokushovirus MCP clades contained samples from multiple environments, although distinct clades dominated the different samples. Antarctic sediment samples contained the most diverse gokushovirus communities, whereas freshwater springs from Florida were the least diverse. Whether the observed diversity is being driven by environmental factors or host-binding interactions remains an open question. The high environmental diversity of this previously overlooked ssDNA viral group necessitates further research elucidating their natural hosts and exploring their ecological roles.
Collapse
Affiliation(s)
- Max Hopkins
- College of Marine Science, University of South Florida, Saint Petersburg, FL, USA
| | - Shweta Kailasan
- Department of Biochemistry and Molecular Biology, University of Florida, Gainesville, FL, USA
| | - Allison Cohen
- College of Marine Science, University of South Florida, Saint Petersburg, FL, USA
| | - Simon Roux
- Laboratoire ‘Microorganismes: Génome et Environnement', Clermont Université, Université Blaise Pascal, Clermont-Ferrand, France
- CNRS, UMR 6023, LMGE, Aubière, France
| | | | - Amelia Shevenell
- College of Marine Science, University of South Florida, Saint Petersburg, FL, USA
| | - Mavis Agbandje-McKenna
- Department of Biochemistry and Molecular Biology, University of Florida, Gainesville, FL, USA
| | - Mya Breitbart
- College of Marine Science, University of South Florida, Saint Petersburg, FL, USA
| |
Collapse
|
10
|
Cheng S, Brooks CL. Viral capsid proteins are segregated in structural fold space. PLoS Comput Biol 2013; 9:e1002905. [PMID: 23408879 PMCID: PMC3567143 DOI: 10.1371/journal.pcbi.1002905] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2012] [Accepted: 12/16/2012] [Indexed: 02/04/2023] Open
Abstract
Viral capsid proteins assemble into large, symmetrical architectures that are not found in complexes formed by their cellular counterparts. Given the prevalence of the signature jelly-roll topology in viral capsid proteins, we are interested in whether these functionally unique capsid proteins are also structurally unique in terms of folds. To explore this question, we applied a structure-alignment based clustering of all protein chains in VIPERdb filtered at 40% sequence identity to identify distinct capsid folds, and compared the cluster medoids with a non-redundant subset of protein domains in the SCOP database, not including the viral capsid entries. This comparison, using Template Modeling (TM)-score, identified 2078 structural "relatives" of capsid proteins from the non-capsid set, covering altogether 210 folds following the definition in SCOP. The statistical significance of the 210 folds shared by two sets of the same sizes, estimated from 10,000 permutation tests, is less than 0.0001, which is an upper bound on the p-value. We thus conclude that viral capsid proteins are segregated in structural fold space. Our result provides novel insight on how structural folds of capsid proteins, as opposed to their surface chemistry, might be constrained during evolution by requirement of the assembled cage-like architecture. Also importantly, our work highlights a guiding principle for virus-based nanoplatform design in a wide range of biomedical applications and materials science.
Collapse
Affiliation(s)
- Shanshan Cheng
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Charles L. Brooks
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Biophysics, University of Michigan, Ann Arbor, Michigan, United States of America
| |
Collapse
|
11
|
Abstract
AbstractProtein–protein recognition plays an essential role in structure and function. Specific non-covalent interactions stabilize the structure of macromolecular assemblies, exemplified in this review by oligomeric proteins and the capsids of icosahedral viruses. They also allow proteins to form complexes that have a very wide range of stability and lifetimes and are involved in all cellular processes. We present some of the structure-based computational methods that have been developed to characterize the quaternary structure of oligomeric proteins and other molecular assemblies and analyze the properties of the interfaces between the subunits. We compare the size, the chemical and amino acid compositions and the atomic packing of the subunit interfaces of protein–protein complexes, oligomeric proteins, viral capsids and protein–nucleic acid complexes. These biologically significant interfaces are generally close-packed, whereas the non-specific interfaces between molecules in protein crystals are loosely packed, an observation that gives a structural basis to specific recognition. A distinction is made within each interface between a core that contains buried atoms and a solvent accessible rim. The core and the rim differ in their amino acid composition and their conservation in evolution, and the distinction helps correlating the structural data with the results of site-directed mutagenesis and in vitro studies of self-assembly.
Collapse
|