1
|
Dong S, Chen C, Li J, Liu Y, Bayer EA, Lamed R, Mizrahi I, Cui Q, Feng Y. Unique Fn3-like biosensor in σ I/anti-σ I factors for regulatory expression of major cellulosomal scaffoldins in Pseudobacteroides cellulosolvens. Protein Sci 2024; 33:e5193. [PMID: 39470320 PMCID: PMC11520246 DOI: 10.1002/pro.5193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Revised: 09/26/2024] [Accepted: 10/01/2024] [Indexed: 10/30/2024]
Abstract
Lignocellulolytic clostridia employ multiple pairs of alternative σ/anti-σ (SigI/RsgI) factors to regulate cellulosomal components for substrate-specific degradation of cellulosic biomass. The current model has proposed that RsgIs use a sensor domain to bind specific extracellular lignocellulosic components and activate cognate SigIs to initiate expression of corresponding cellulosomal enzyme genes, while expression of scaffoldins can be initiated by several different SigIs. Pseudobacteroides cellulosolvens contains the most complex known cellulosome system and the highest number of SigI-RsgI regulons yet discovered. However, the function of many RsgI sensor domains and their relationship with the various enzyme types are not fully understood. Here, we report that RsgI4 from P. cellulosolvens employs a C-terminal module that bears distant similarity to the fibronectin type III (Fn3) domain and serves as the sensor domain. Substrate-binding analysis revealed that the Fn3-like domain of RsgI4 represents a novel carbohydrate-binding module (CBM) that binds to a wide range of polysaccharide types. Structure determination further revealed that the Fn3-like domain belongs to the type B group of CBMs with a predicted concave face for substrate binding. Promoter sequence analysis of cellulosomal genes revealed that SigI4 is responsible for cellulosomal regulation of major scaffoldins rather than enzymes, consistent with the broad substrate specificity of the RsgI4 sensor domain. Notably, scaffoldins are invariably required as cellulosome components regardless of the substrate type. These findings suggest that the intricate cellulosome system of P. cellulosolvens comprises a more elaborate regulation mechanism than other bacteria and thus expands the paradigm of cellulosome regulation.
Collapse
Affiliation(s)
- Sheng Dong
- CAS Key Laboratory of Biofuels, Shandong Provincial Key Laboratory of Synthetic Biology, Qingdao Institute of Bioenergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoChina
- Qingdao Engineering Laboratory of Single Cell Oil, Shandong Engineering Laboratory of Single Cell Oil, Qingdao New Energy Shandong Laboratory, Qingdao Institute of Bioenergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoChina
- Shandong Energy InstituteQingdaoChina
- University of Chinese Academy of SciencesBeijingChina
| | - Chao Chen
- CAS Key Laboratory of Biofuels, Shandong Provincial Key Laboratory of Synthetic Biology, Qingdao Institute of Bioenergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoChina
- Qingdao Engineering Laboratory of Single Cell Oil, Shandong Engineering Laboratory of Single Cell Oil, Qingdao New Energy Shandong Laboratory, Qingdao Institute of Bioenergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoChina
- Shandong Energy InstituteQingdaoChina
- University of Chinese Academy of SciencesBeijingChina
| | - Jie Li
- CAS Key Laboratory of Biofuels, Shandong Provincial Key Laboratory of Synthetic Biology, Qingdao Institute of Bioenergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoChina
- Qingdao Engineering Laboratory of Single Cell Oil, Shandong Engineering Laboratory of Single Cell Oil, Qingdao New Energy Shandong Laboratory, Qingdao Institute of Bioenergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoChina
- Shandong Energy InstituteQingdaoChina
- University of Chinese Academy of SciencesBeijingChina
- Present address:
Department of BiochemistryUniversity of Texas Southwestern Medical CenterDallasTexasUSA
| | - Ya‐Jun Liu
- CAS Key Laboratory of Biofuels, Shandong Provincial Key Laboratory of Synthetic Biology, Qingdao Institute of Bioenergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoChina
- Qingdao Engineering Laboratory of Single Cell Oil, Shandong Engineering Laboratory of Single Cell Oil, Qingdao New Energy Shandong Laboratory, Qingdao Institute of Bioenergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoChina
- Shandong Energy InstituteQingdaoChina
- University of Chinese Academy of SciencesBeijingChina
| | - Edward A. Bayer
- Department of Biomolecular SciencesThe Weizmann Institute of ScienceRehovotIsrael
- Department of Life Sciences and the National Institute for Biotechnology in the NegevBen‐Gurion University of the NegevBeershebaIsrael
| | - Raphael Lamed
- Department of Molecular Microbiology and BiotechnologyTel Aviv UniversityTel AvivIsrael
| | - Itzhak Mizrahi
- Department of Life Sciences and the National Institute for Biotechnology in the NegevBen‐Gurion University of the NegevBeershebaIsrael
| | - Qiu Cui
- CAS Key Laboratory of Biofuels, Shandong Provincial Key Laboratory of Synthetic Biology, Qingdao Institute of Bioenergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoChina
- Qingdao Engineering Laboratory of Single Cell Oil, Shandong Engineering Laboratory of Single Cell Oil, Qingdao New Energy Shandong Laboratory, Qingdao Institute of Bioenergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoChina
- Shandong Energy InstituteQingdaoChina
- University of Chinese Academy of SciencesBeijingChina
- State Key Laboratory of Microbial TechnologyShandong UniversityQingdaoShandongChina
| | - Yingang Feng
- CAS Key Laboratory of Biofuels, Shandong Provincial Key Laboratory of Synthetic Biology, Qingdao Institute of Bioenergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoChina
- Qingdao Engineering Laboratory of Single Cell Oil, Shandong Engineering Laboratory of Single Cell Oil, Qingdao New Energy Shandong Laboratory, Qingdao Institute of Bioenergy and Bioprocess TechnologyChinese Academy of SciencesQingdaoChina
- Shandong Energy InstituteQingdaoChina
- University of Chinese Academy of SciencesBeijingChina
| |
Collapse
|
2
|
Kumar Murmu A, Pal A, Debnath M, Chakraborty A, Pal S, Banerjee S, Pal A, Ghosh N, Karmakar U, Samanta R. Role of mucin 2 gene for growth in Anas platyrhynchos: a novel report. Front Vet Sci 2023; 10:1089451. [PMID: 38026626 PMCID: PMC10666069 DOI: 10.3389/fvets.2023.1089451] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 07/17/2023] [Indexed: 12/01/2023] Open
Abstract
Introduction The mucin gene is expressed in the mucous membrane of the inner layer of the internal organs. Intestinalmucin 2 (MUC2), amajor gel-formingmucin, represents a primary barrier component of mucus layers. Materials and methods This is the first report on the role of mucin genes in growth traits in animals. In this study, we randomly studied Bengal ducks (Anas platyrhynchos) reared from day old to 10 weeks of age under an organized farm and studied the growth parameters as well as body weight and average daily body weight gain. Result and discussion We characterized the mucin gene for Bengal ducks and observed glycosylation and EGF1 (EGF-like domain signature) as important domains for growth traits in ducks. We observed a better expression profile for the mucin gene in high-growing ducks in comparison to that of low-growing ducks with real-time PCR. Hence, the mucin gene may be employed as a marker for growth traits.
Collapse
Affiliation(s)
- Anuj Kumar Murmu
- Department of Livestock Production and Management, West Bengal University of Animal and Fishery Sciences, Kolkata, West Bengal, India
| | - Aruna Pal
- Department of Livestock Farm Complex, West Bengal University of Animal and Fishery Sciences, Kolkata, West Bengal, India
| | - Manti Debnath
- Department of Livestock Farm Complex, West Bengal University of Animal and Fishery Sciences, Kolkata, West Bengal, India
| | - Argha Chakraborty
- Department of Livestock Farm Complex, West Bengal University of Animal and Fishery Sciences, Kolkata, West Bengal, India
| | - Subhamoy Pal
- Department of Animal Science, Visva-Bharati University, Santiniketan, West Bengal, India
| | - Samiddha Banerjee
- Department of Livestock Farm Complex, West Bengal University of Animal and Fishery Sciences, Kolkata, West Bengal, India
| | - Abantika Pal
- Department of Computer Science, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal, India
- University of California, San Francisco, San Francisco, CA, United States
| | - Nilotpal Ghosh
- Department of Livestock Production and Management, West Bengal University of Animal and Fishery Sciences, Kolkata, West Bengal, India
| | - Utpal Karmakar
- Department of Animal Resource Development, Government of West Bengal, Kolkata, India
| | - Rajarshi Samanta
- Department of Livestock Production and Management, West Bengal University of Animal and Fishery Sciences, Kolkata, West Bengal, India
| |
Collapse
|
3
|
Pal A, Pal A, Baviskar P. RIGI, TLR7, and TLR3 Genes Were Predicted to Have Immune Response Against Avian Influenza in Indigenous Ducks. Front Mol Biosci 2022; 8:633283. [PMID: 34970593 PMCID: PMC8712727 DOI: 10.3389/fmolb.2021.633283] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 09/29/2021] [Indexed: 12/28/2022] Open
Abstract
Avian influenza is a disease with every possibility to evolve as a human-to-human pandemic arising out of frequent mutations and genetic reassortment or recombination of avian influenza (AI) virus. The greatest concern is that till date, no satisfactory medicine or vaccines are available, leading to massive culling of poultry birds, causing huge economic loss and ban on export of chicken products, which emphasizes the need to develop an alternative strategy for control of AI. In the current study, we attempt to explore the molecular mechanism of innate immune potential of ducks against avian influenza. In the present study, we have characterized immune response molecules such as duck TLR3, TLR7, and RIGI that are predicted to have potent antiviral activities against the identified strain of avian influenza through in silico studies (molecular docking) followed by experimental validation with differential mRNA expression analysis. Future exploitation may include immunomodulation with the recombinant protein, and transgenic or gene-edited chicken resistant to bird flu.
Collapse
Affiliation(s)
- Aruna Pal
- West Bengal University of Animal and Fishery Sciences, Kolkata, India
| | - Abantika Pal
- Indian Institute of Technology Kharagpur, Kharagpur, India
| | | |
Collapse
|
4
|
Rawat K, Pal A, Banerjee S, Pal A, Mandal SC, Batabyal S. Ovine CD14- an Immune Response Gene Has a Role Against Gastrointestinal Nematode Haemonchus contortus-A Novel Report. Front Immunol 2021; 12:664877. [PMID: 34335567 PMCID: PMC8324245 DOI: 10.3389/fimmu.2021.664877] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2021] [Accepted: 04/28/2021] [Indexed: 12/12/2022] Open
Abstract
CD14 (also known as the monocyte differentiation antigen) is an important immune response gene known to be primarily responsible for innate immunity against bacterial pathogens, and as a pattern recognition receptor (PRR), binds with LPS (endotoxin), lipoproteins, and lipotechoic acid of bacteria. So far very limited work has been conducted in parasitic immunology. In the current study, we reported the role of CD14 in parasitic immunology in livestock species (sheep) for the first time. Ovine CD14 is characterized as a horse-shoe shaped bent solenoid with a hydrophobic amino-terminal pocket for CD14 along with domains. High mutation frequency was observed, out of total 41 mutations identified, 23 mutations were observed to be thermodynamically unstable and 11 mutations were deleterious in nature, causing major functional alteration of important domains of CD14, an indication of variations in individual susceptibility for sheep against Haemonchus contortus infestations. In silico studies with molecular docking reveal a role of immune response against Haemonchus contortus in sheep, which is later confirmed with experimental evidence through differential mRNA expression analysis for sheep, which revealed better expression of CD14 in Haemonchus contortus infected sheep compared to that of non-infected sheep. We confirmed the above findings with supportive evidence through haematological and biochemical analyses. Phylogenetic analysis was conducted to assess the evolutionary relationship with respect to humans and it was observed that sheep may well be used as model organisms due to better genetic closeness compared to that of mice.
Collapse
Affiliation(s)
- Kavita Rawat
- Department of BioChemistry, West Bengal University of Animal and Fishery Sciences, Kolkata, India
| | - Aruna Pal
- Department of LFC, West Bengal University of Animal and Fishery Sciences, Kolkata, India
| | - Samiddha Banerjee
- Department of Animal Science, Visva Bharati University, Bolpur, India
| | - Abantika Pal
- Department of Computer Science, Indian Institute of Technology, Kharagpur, India
| | - Subhas Chandra Mandal
- Department of Parasitology, West Bengal University of Animal and Fishery Sciences, Kolkata, India
| | - Subhasis Batabyal
- Department of BioChemistry, West Bengal University of Animal and Fishery Sciences, Kolkata, India
| |
Collapse
|
5
|
Sellers RA, Robertson DL, Tassabehji M. Ancestry of the AUTS2 family-A novel group of polycomb-complex proteins involved in human neurological disease. PLoS One 2020; 15:e0232101. [PMID: 33306672 PMCID: PMC7732068 DOI: 10.1371/journal.pone.0232101] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2019] [Accepted: 04/07/2020] [Indexed: 01/10/2023] Open
Abstract
Autism susceptibility candidate 2 (AUTS2) is a neurodevelopmental regulator associated with an autosomal dominant intellectual disability syndrome, AUTS2 syndrome, and is implicated as an important gene in human-specific evolution. AUTS2 exists as part of a tripartite gene family, the AUTS2 family, which includes two relatively undefined proteins, Fibrosin (FBRS) and Fibrosin-like protein 1 (FBRSL1). Evolutionary ancestors of AUTS2 have not been formally identified outside of the Animalia clade. A Drosophila melanogaster protein, Tay bridge, with a role in neurodevelopment, has been shown to display limited similarity to the C-terminal of AUTS2, suggesting that evolutionary ancestors of the AUTS2 family may exist within other Protostome lineages. Here we present an evolutionary analysis of the AUTS2 family, which highlights ancestral homologs of AUTS2 in multiple Protostome species, implicates AUTS2 as the closest human relative to the progenitor of the AUTS2 family, and demonstrates that Tay bridge is a divergent ortholog of the ancestral AUTS2 progenitor gene. We also define regions of high relative sequence identity, with potential functional significance, shared by the extended AUTS2 protein family. Using structural predictions coupled with sequence conservation and human variant data from 15,708 individuals, a putative domain structure for AUTS2 was produced that can be used to aid interpretation of the consequences of nucleotide variation on protein structure and function in human disease. To assess the role of AUTS2 in human-specific evolution, we recalculated allele frequencies at previously identified human derived sites using large population genome data, and show a high prevalence of ancestral alleles, suggesting that AUTS2 may not be a rapidly evolving gene, as previously thought.
Collapse
Affiliation(s)
- Robert A. Sellers
- Evolution & Genomic Sciences, School of Biological Sciences, University of Manchester, Manchester, United Kingdom
| | - David L. Robertson
- MRC-University of Glasgow Centre for Virus Research, Garscube Campus, Glasgow, United Kingdom
| | - May Tassabehji
- Evolution & Genomic Sciences, School of Biological Sciences, University of Manchester, Manchester, United Kingdom
- * E-mail:
| |
Collapse
|
6
|
Shi Q, Chen W, Huang S, Jin F, Dong Y, Wang Y, Xue Z. DNN-Dom: predicting protein domain boundary from sequence alone by deep neural network. Bioinformatics 2020; 35:5128-5136. [PMID: 31197306 DOI: 10.1093/bioinformatics/btz464] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Revised: 05/07/2019] [Accepted: 06/05/2019] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Accurate delineation of protein domain boundary plays an important role for protein engineering and structure prediction. Although machine-learning methods are widely used to predict domain boundary, these approaches often ignore long-range interactions among residues, which have been proven to improve the prediction performance. However, how to simultaneously model the local and global interactions to further improve domain boundary prediction is still a challenging problem. RESULTS This article employs a hybrid deep learning method that combines convolutional neural network and gate recurrent units' models for domain boundary prediction. It not only captures the local and non-local interactions, but also fuses these features for prediction. Additionally, we adopt balanced Random Forest for classification to deal with high imbalance of samples and high dimensions of deep features. Experimental results show that our proposed approach (DNN-Dom) outperforms existing machine-learning-based methods for boundary prediction. We expect that DNN-Dom can be useful for assisting protein structure and function prediction. AVAILABILITY AND IMPLEMENTATION The method is available as DNN-Dom Server at http://isyslab.info/DNN-Dom/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Qiang Shi
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Weiya Chen
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Siqi Huang
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Fanglin Jin
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Yinghao Dong
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Yan Wang
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Zhidong Xue
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| |
Collapse
|
7
|
Pal A, Pal A, Mallick AI, Biswas P, Chatterjee PN. Molecular characterization of Bu-1 and TLR2 gene in Haringhata Black chicken. Genomics 2020; 112:472-483. [PMID: 30902756 DOI: 10.1016/j.ygeno.2019.03.010] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Revised: 02/17/2019] [Accepted: 03/18/2019] [Indexed: 11/21/2022]
Abstract
Haringhata Black is the only registered indigenous poultry genetic resource of West Bengal till date. Molecular characterization of HB revealed that Bu-1 to be highly glycoylated transmembrane protein unlike mammalian Bu-1, whereas TLR2 of HB chicken was observed to be rich in Leucine rich repeat. HB chicken was observed to be genetically close to chicken of Japan, while distant to chicken breed of UK and Chicago. Avian species wise evolution study indicates genetic closeness of HB chicken with turkey. Differential mRNA expression profile for the immune response genes (TLR2, TLR4 and Bu1 gene) were studied for HB chicken with respect to other chicken breed and poultry birds, which reveals that HB chicken were better in terms of B cell mediated immunity and hence better response to vaccination. Hence HB chicken is one of the best poultry genetic resources to be reared under backyard system where biosecurity measures are almost lacking.
Collapse
Affiliation(s)
- Aruna Pal
- West Bengal University of Animal and Fishery Sciences, 37, K.B. Sarani, Kolkata 37, India.
| | - Abantika Pal
- Indian Institute of technology, Kharagpur, West Bengal, India
| | | | - P Biswas
- West Bengal University of Animal and Fishery Sciences, 37, K.B. Sarani, Kolkata 37, India
| | - P N Chatterjee
- West Bengal University of Animal and Fishery Sciences, 37, K.B. Sarani, Kolkata 37, India
| |
Collapse
|
8
|
Bacteriophage φEf11 ORF28 Endolysin, a Multifunctional Lytic Enzyme with Properties Distinct from All Other Identified Enterococcus faecalis Phage Endolysins. Appl Environ Microbiol 2019; 85:AEM.00555-19. [PMID: 30979842 DOI: 10.1128/aem.00555-19] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2019] [Accepted: 04/09/2019] [Indexed: 02/01/2023] Open
Abstract
ϕEf11 is a temperate Siphoviridae bacteriophage that infects strains of Enterococcus faecalis The ϕEf11 genome, encompassing 65 open reading frames (ORFs), is contained within 42,822 bp of DNA. Within this genome, a module of six lysis-related genes was identified. Based upon sequence homology, one of these six genes, ORF28, was predicted to code for an N-acetylmuramoyl-l-alanine amidase endolysin of 46.133 kDa, composed of 421 amino acids. The PCR-amplified ORF28 was cloned and expressed, and the resulting gene product was affinity purified to homogeneity. The purified protein was obtained from a fusion protein that exhibited a molecular mass of 72.5 kDa, consistent with a 46.1-kDa protein combined with a fused 26.5-kDa glutathione S-transferase tag. It produced rapid, profound lysis in E. faecalis populations and was active against 73 of 103 (71%) E. faecalis strains tested. In addition, it caused substantial destruction of E. faecalis biofilms. The lysin was quite stable, retaining its activity for three years in refrigerated storage, was stable over a wide range of pHs, and was unaffected by the presence of a reducing agent; however, it was inhibited by increasing concentrations of Ca2+ Liquid chromatography-mass spectrometry analysis of E. faecalis cell wall digestion products produced by the ORF28 endolysin indicated that the lysin acted as an N-acetylmuramidase, an endo-β-N-acetylglucosaminidase, and an endopeptidase, rather than an N-acetylmuramoyl-l-alanine amidase. The ϕEf11 ORF28 lysin shared 10% to 37% amino acid identity with the lytic enzymes of all other characterized E. faecalis bacteriophages.IMPORTANCE The emergence of multidrug-resistant pathogenic microorganisms has brought increasing attention to the urgent need for the development of alternative antimicrobial strategies. One such alternative to conventional antibiotics employs lytic enzymes (endolysins) that are produced by bacteriophages in the course of lytic infection. During lytic infection by a bacteriophage, these enzymes hydrolyze the cell wall peptidoglycan, resulting in the lysis of the host cell. However, external endolysin application can result in lysis from without. In this study, we have cloned, expressed, purified, and characterized an endolysin produced by a bacteriophage infecting strains of Enterococcus faecalis The lysin is broadly active against most of the tested E. faecalis strains and exhibits multifunctional enzymatic specificities that differ from all other characterized endolysins produced by E. faecalis bacteriophages.
Collapse
|
9
|
Wang Y, Wang J, Li R, Shi Q, Xue Z, Zhang Y. ThreaDomEx: a unified platform for predicting continuous and discontinuous protein domains by multiple-threading and segment assembly. Nucleic Acids Res 2019; 45:W400-W407. [PMID: 28498994 PMCID: PMC5793814 DOI: 10.1093/nar/gkx410] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2017] [Accepted: 04/28/2017] [Indexed: 12/21/2022] Open
Abstract
We develop a hierarchical pipeline, ThreaDomEx, for both continuous domain (CD) and discontinuous domain (DCD) structure predictions. Starting from a query sequence, ThreaDomEx first threads it through the PDB to identify multiple structure templates, where a profile of domain conservation score (DC-score) is derived for domain-segment assignment. To further detect DCDs that consist of separated segments along the sequence, a boundary-clustering algorithm is used to refine the DCD-linker locations. In case that the templates do not contain DCDs, a domain-segment assembly process, guided by symmetry comparison, is applied for further DCD detections. ThreaDomEx was tested a set of 1111 proteins and achieved a normalized domain overlap score of 89.3% compared to experimental data, which is significantly higher than other state-of-the-art methods. It also recalls 26.7% of DCDs with 72.7% precision on the proteins for which threading failed to detect any DCDs. The server provides facilities for users to interactively refine the domain models by adjusting DC-score threshold, deleting and adding domain linkers, and assembling domain segments, which are particularly helpful for the hard targets for which current methods have a low accuracy while human-expert knowledge and experimental insights can be used for refining models. ThreaDomEX server is available at http://zhanglab.ccmb.med.umich.edu/ThreaDomEx.
Collapse
Affiliation(s)
- Yan Wang
- Key Laboratory of Molecular Biophysics of the Ministry of Education, School of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China.,Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Jian Wang
- Key Laboratory of Molecular Biophysics of the Ministry of Education, School of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Ruiming Li
- School of Software, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Qiang Shi
- School of Software, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Zhidong Xue
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.,School of Software, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.,Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
10
|
Talà A, Damiano F, Gallo G, Pinatel E, Calcagnile M, Testini M, Fico D, Rizzo D, Sutera A, Renzone G, Scaloni A, De Bellis G, Siculella L, De Benedetto GE, Puglia AM, Peano C, Alifano P. Pirin: A novel redox-sensitive modulator of primary and secondary metabolism in Streptomyces. Metab Eng 2018; 48:254-268. [PMID: 29944936 DOI: 10.1016/j.ymben.2018.06.008] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Revised: 06/21/2018] [Accepted: 06/22/2018] [Indexed: 10/28/2022]
Abstract
Pirins are evolutionarily conserved iron-containing proteins that are found in all kingdoms of life, and have been implicated in diverse molecular processes, mostly associated with cellular stress. In the present study, we started from the evidence that the insertional inactivation of pirin-like gene SAM23877_RS18305 (pirA) by ΦC31 Att/Int system-based vectors in spiramycin-producing strain Streptomyces ambofaciens ATCC 23877 resulted in marked effects on central carbon and energy metabolism gene expression, high sensitivity to oxidative injury and repression of polyketide antibiotic production. By using integrated transcriptomic, proteomic and metabolite profiling, together with genetic complementation, we here show that most of these effects could be traced to the inability of the pirA-defective strain to modulate beta-oxidation pathway, leading to an unbalanced supply of precursor monomers for polyketide biosynthesis. Indeed, in silico protein-protein interaction modeling and in vitro experimental validation allowed us to demonstrate that PirA is a novel redox-sensitive negative modulator of very long-chain acyl-CoA dehydrogenase, which catalyzes the first committed step of the beta-oxidation pathway.
Collapse
Affiliation(s)
- Adelfia Talà
- Department of Biological and Environmental Sciences and Technologies, University of Salento, Lecce, Italy
| | - Fabrizio Damiano
- Department of Biological and Environmental Sciences and Technologies, University of Salento, Lecce, Italy
| | - Giuseppe Gallo
- Department of Biological, Chemical and Pharmaceutical Sciences and Technologies (STEBICEF), University of Palermo, Palermo, Italy; Advanced Technologies Network (ATeN) Center, University of Palermo, Palermo, Italy
| | - Eva Pinatel
- Institute of Biomedical Technologies, National Research Council, Segrate, Italy
| | - Matteo Calcagnile
- Department of Biological and Environmental Sciences and Technologies, University of Salento, Lecce, Italy
| | - Mariangela Testini
- Department of Biological and Environmental Sciences and Technologies, University of Salento, Lecce, Italy
| | - Daniela Fico
- Laboratory of Analytical and Isotopic Mass Spectrometry, Department of Cultural Heritage, University of Salento, Lecce, Italy
| | - Daniela Rizzo
- Laboratory of Analytical and Isotopic Mass Spectrometry, Department of Cultural Heritage, University of Salento, Lecce, Italy
| | - Alberto Sutera
- Department of Biological, Chemical and Pharmaceutical Sciences and Technologies (STEBICEF), University of Palermo, Palermo, Italy; Advanced Technologies Network (ATeN) Center, University of Palermo, Palermo, Italy
| | - Giovanni Renzone
- Proteomics & Mass Spectrometry Laboratory, ISPAAM, National Research Council, Naples, Italy
| | - Andrea Scaloni
- Proteomics & Mass Spectrometry Laboratory, ISPAAM, National Research Council, Naples, Italy
| | - Gianluca De Bellis
- Institute of Biomedical Technologies, National Research Council, Segrate, Italy
| | - Luisa Siculella
- Department of Biological and Environmental Sciences and Technologies, University of Salento, Lecce, Italy
| | - Giuseppe Egidio De Benedetto
- Laboratory of Analytical and Isotopic Mass Spectrometry, Department of Cultural Heritage, University of Salento, Lecce, Italy
| | - Anna Maria Puglia
- Department of Biological, Chemical and Pharmaceutical Sciences and Technologies (STEBICEF), University of Palermo, Palermo, Italy
| | - Clelia Peano
- Institute of Biomedical Technologies, National Research Council, Segrate, Italy; Institute of Genetic and Biomedical Research, UoS Milan, National Research Council, Rozzano, Milan, Italy; Humanitas Clinical and Research Center, Rozzano, Milan, Italy
| | - Pietro Alifano
- Department of Biological and Environmental Sciences and Technologies, University of Salento, Lecce, Italy.
| |
Collapse
|
11
|
Vijayaraghavan J, Kumar V, Krishnan NP, Kaufhold RT, Zeng X, Lin J, van den Akker F. Structural studies and molecular dynamics simulations suggest a processive mechanism of exolytic lytic transglycosylase from Campylobacter jejuni. PLoS One 2018; 13:e0197136. [PMID: 29758058 PMCID: PMC5951611 DOI: 10.1371/journal.pone.0197136] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2018] [Accepted: 04/26/2018] [Indexed: 11/21/2022] Open
Abstract
The bacterial soluble lytic transglycosylase (LT) breaks down the peptidoglycan (PG) layer during processes such as cell division. We present here crystal structures of the soluble LT Cj0843 from Campylobacter jejuni with and without bulgecin A inhibitor in the active site. Cj0843 has a doughnut shape similar but not identical to that of E. coli SLT70. The C-terminal catalytic domain is preceded by an L-domain, a large helical U-domain, a flexible linker, and a small N-terminal NU-domain. The flexible linker allows the NU-domain to reach over and complete the circular shape, using residues conserved in the Epsilonproteobacteria LT family. The inner surface of the Cj0843 doughnut is mostly positively charged including a pocket that has 8 Arg/Lys residues. Molecular dynamics simulations with PG strands revealed a potential functional role for this pocket in anchoring the negatively charged terminal tetrapeptide of the PG during several steps in the reaction including homing and aligning the PG strand for exolytic cleavage, and subsequent ratcheting of the PG strand to enhance processivity in degrading PG strands.
Collapse
Affiliation(s)
- Jagamya Vijayaraghavan
- Department of Biochemistry, Case Western Reserve University, Cleveland, OH, United States of America
| | - Vijay Kumar
- Department of Biochemistry, Case Western Reserve University, Cleveland, OH, United States of America
| | - Nikhil P. Krishnan
- Department of Biochemistry, Case Western Reserve University, Cleveland, OH, United States of America
| | - Ross T. Kaufhold
- Department of Biochemistry, Case Western Reserve University, Cleveland, OH, United States of America
| | - Ximin Zeng
- Institute of agriculture, University of Tennessee, Knoxville, TN, United States of America
| | - Jun Lin
- Institute of agriculture, University of Tennessee, Knoxville, TN, United States of America
| | - Focco van den Akker
- Department of Biochemistry, Case Western Reserve University, Cleveland, OH, United States of America
- * E-mail:
| |
Collapse
|
12
|
Richa T, Ide S, Suzuki R, Ebina T, Kuroda Y. Fast H-DROP: A thirty times accelerated version of H-DROP for interactive SVM-based prediction of helical domain linkers. J Comput Aided Mol Des 2016; 31:237-244. [PMID: 28028736 DOI: 10.1007/s10822-016-9999-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2016] [Accepted: 12/10/2016] [Indexed: 10/20/2022]
Abstract
Efficient and rapid prediction of domain regions from amino acid sequence information alone is often required for swift structural and functional characterization of large multi-domain proteins. Here we introduce Fast H-DROP, a thirty times accelerated version of our previously reported H-DROP (Helical Domain linker pRediction using OPtimal features), which is unique in specifically predicting helical domain linkers (boundaries). Fast H-DROP, analogously to H-DROP, uses optimum features selected from a set of 3000 ones by combining a random forest and a stepwise feature selection protocol. We reduced the computational time from 8.5 min per sequence in H-DROP to 14 s per sequence in Fast H-DROP on an 8 Xeon processor Linux server by using SWISS-PROT instead of Genbank non-redundant (nr) database for generating the PSSMs. The sensitivity and precision of Fast H-DROP assessed by cross-validation were 33.7 and 36.2%, which were merely ~2% lower than that of H-DROP. The reduced computational time of Fast H-DROP, without affecting prediction performances, makes it more interactive and user-friendly. Fast H-DROP and H-DROP are freely available from http://domserv.lab.tuat.ac.jp/ .
Collapse
Affiliation(s)
- Tambi Richa
- Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology, 12-24-16 Nakamachi, Koganei-shi, Tokyo, 184-8588, Japan
| | - Soichiro Ide
- Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology, 12-24-16 Nakamachi, Koganei-shi, Tokyo, 184-8588, Japan
| | - Ryosuke Suzuki
- Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology, 12-24-16 Nakamachi, Koganei-shi, Tokyo, 184-8588, Japan
| | - Teppei Ebina
- Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology, 12-24-16 Nakamachi, Koganei-shi, Tokyo, 184-8588, Japan.,Department of Physiology, Graduate school of Medicine, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-0033, Japan
| | - Yutaka Kuroda
- Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology, 12-24-16 Nakamachi, Koganei-shi, Tokyo, 184-8588, Japan.
| |
Collapse
|
13
|
Leis B, Heinze S, Angelov A, Pham VTT, Thürmer A, Jebbar M, Golyshin PN, Streit WR, Daniel R, Liebl W. Functional Screening of Hydrolytic Activities Reveals an Extremely Thermostable Cellulase from a Deep-Sea Archaeon. Front Bioeng Biotechnol 2015; 3:95. [PMID: 26191525 PMCID: PMC4486847 DOI: 10.3389/fbioe.2015.00095] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2015] [Accepted: 06/17/2015] [Indexed: 01/27/2023] Open
Abstract
Extreme habitats serve as a source of enzymes that are active under extreme conditions and are candidates for industrial applications. In this work, six large-insert mixed genomic libraries were screened for hydrolase activities in a broad temperature range (8-70°C). Among a variety of hydrolytic activities, one fosmid clone, derived from a library of pooled isolates of hyperthermophilic archaea from deep sea vents, displayed hydrolytic activity on carboxymethyl cellulose substrate plates at 70°C but not at lower temperatures. Sequence analysis of the fosmid insert revealed a gene encoding a novel glycoside hydrolase family 12 (GHF12) endo-1,4-β-glucanase, termed Cel12E. The enzyme shares 45% sequence identity with a protein from the archaeon Thermococcus sp. AM4 and displays a unique multidomain architecture. Biochemical characterization of Cel12E revealed a remarkably thermostable protein, which appears to be of archaeal origin. The enzyme displayed maximum activity at 92°C and was active on a variety of linear 1,4-β-glucans like carboxymethyl cellulose, β-glucan, lichenan, and phosphoric acid swollen cellulose. The protein is able to bind to various insoluble β-glucans. Product pattern analysis indicated that Cel12E is an endo-cleaving β-glucanase. Cel12E expands the toolbox of hyperthermostable archaeal cellulases with biotechnological potential.
Collapse
Affiliation(s)
- Benedikt Leis
- Department of Microbiology, School of Life Sciences Weihenstephan, Technische Universität München, Freising-Weihenstephan, Germany
| | - Simon Heinze
- Department of Microbiology, School of Life Sciences Weihenstephan, Technische Universität München, Freising-Weihenstephan, Germany
| | - Angel Angelov
- Department of Microbiology, School of Life Sciences Weihenstephan, Technische Universität München, Freising-Weihenstephan, Germany
| | - Vu Thuy Trang Pham
- Department of Microbiology, School of Life Sciences Weihenstephan, Technische Universität München, Freising-Weihenstephan, Germany
| | - Andrea Thürmer
- Göttingen Genomics Laboratory, Department of Genomic and Applied Microbiology, Georg-August University Göttingen, Göttingen, Germany
| | - Mohamed Jebbar
- Laboratoire de Microbiologie des Environnements Extrêmes-UMR 6197 (CNRS-Ifremer-UBO), Institut Universitaire Européen de la Mer, Université de Bretagne Occidentale, Plouzané, France
| | | | - Wolfgang R. Streit
- Fakultät für Mathematik, Informatik und Naturwissenschaften Biologie, Biozentrum Klein Flottbek, Universität Hamburg, Hamburg, Germany
| | - Rolf Daniel
- Göttingen Genomics Laboratory, Department of Genomic and Applied Microbiology, Georg-August University Göttingen, Göttingen, Germany
| | - Wolfgang Liebl
- Department of Microbiology, School of Life Sciences Weihenstephan, Technische Universität München, Freising-Weihenstephan, Germany
| |
Collapse
|
14
|
Wu N, Rathnayaka T, Kuroda Y. Bacterial expression and re-engineering of Gaussia princeps luciferase and its use as a reporter protein. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2015; 1854:1392-9. [PMID: 26025768 DOI: 10.1016/j.bbapap.2015.05.008] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Revised: 04/24/2015] [Accepted: 05/15/2015] [Indexed: 12/21/2022]
Abstract
Bioluminescence, the generation of visible light in a living organism, is widely observed in nature, and a large variety of bioluminescent proteins have been discovered and characterized. Luciferase is a generic term for bioluminescent enzymes that catalyze the emission of light through the oxidization of a luciferin (also a generic term). Luciferase are not necessarily evolutionary related and do not share sequence or structural similarities. Some luciferases, such as those from fireflies and Renilla, have been thoroughly characterized and are being used in a wide range of applications in bio-imaging. Gaussia luciferase (GLuc) from the marine copepod Gaussia princeps is the smallest known luciferase, and it is attracting much attention as a potential reporter protein. GLuc identification is relatively recent, and its structure and its biophysical properties remain to be fully characterized. Here, we review the bacterial production of natively folded GLuc with special emphasis on its disulfide bond formation and the re-engineering of its bioluminescence properties. We also compare the bioluminescent properties under a strictly controlled in vitro condition of selected GLuc's variants using extensively purified proteins with native disulfide bonds. Furthermore, we discuss and predict the domain structure and location of the catalytic core based on literature and on bioinformatics analysis. Finally, we review some examples of GLuc's emerging use in biomolecular imaging and biochemical assay systems.
Collapse
Affiliation(s)
- Nan Wu
- Department of Biotechnology and Life Science, Graduate School of Engineering, Tokyo University of Agriculture and Technology, 2-24-16 Nakamachi, Koganei-shi, Tokyo 184-8588, Japan
| | - Tharangani Rathnayaka
- Department of Biotechnology and Life Science, Graduate School of Engineering, Tokyo University of Agriculture and Technology, 2-24-16 Nakamachi, Koganei-shi, Tokyo 184-8588, Japan
| | - Yutaka Kuroda
- Department of Biotechnology and Life Science, Graduate School of Engineering, Tokyo University of Agriculture and Technology, 2-24-16 Nakamachi, Koganei-shi, Tokyo 184-8588, Japan.
| |
Collapse
|
15
|
van den Biggelaar M, Madsen JJ, Faber JH, Zuurveld MG, van der Zwaan C, Olsen OH, Stennicke HR, Mertens K, Meijer AB. Factor VIII Interacts with the Endocytic Receptor Low-density Lipoprotein Receptor-related Protein 1 via an Extended Surface Comprising "Hot-Spot" Lysine Residues. J Biol Chem 2015; 290:16463-76. [PMID: 25903134 DOI: 10.1074/jbc.m115.650911] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2015] [Indexed: 11/06/2022] Open
Abstract
Lysine residues are implicated in driving the ligand binding to the LDL receptor family. However, it has remained unclear how specificity is regulated. Using coagulation factor VIII as a model ligand, we now study the contribution of individual lysine residues in the interaction with the largest member of the LDL receptor family, low-density lipoprotein receptor-related protein (LRP1). Using hydrogen-deuterium exchange mass spectrometry (HDX-MS) and SPR interaction analysis on a library of lysine replacement variants as two independent approaches, we demonstrate that the interaction between factor VIII (FVIII) and LRP1 occurs over an extended surface containing multiple lysine residues. None of the individual lysine residues account completely for LRP1 binding, suggesting an additive binding model. Together with structural docking studies, our data suggest that FVIII interacts with LRP1 via an extended surface of multiple lysine residues that starts at the bottom of the C1 domain and winds around the FVIII molecule.
Collapse
Affiliation(s)
- Maartje van den Biggelaar
- From the Department of Plasma Proteins, Sanquin Blood Supply Foundation, 1066 CX Amsterdam, The Netherlands,
| | - Jesper J Madsen
- Global Research, Novo Nordisk A/S, DK-2760 Måløv, Denmark, and
| | - Johan H Faber
- Global Research, Novo Nordisk A/S, DK-2760 Måløv, Denmark, and
| | - Marleen G Zuurveld
- From the Department of Plasma Proteins, Sanquin Blood Supply Foundation, 1066 CX Amsterdam, The Netherlands
| | - Carmen van der Zwaan
- From the Department of Plasma Proteins, Sanquin Blood Supply Foundation, 1066 CX Amsterdam, The Netherlands
| | - Ole H Olsen
- Global Research, Novo Nordisk A/S, DK-2760 Måløv, Denmark, and
| | | | - Koen Mertens
- From the Department of Plasma Proteins, Sanquin Blood Supply Foundation, 1066 CX Amsterdam, The Netherlands, the Department of Pharmaceutics, Utrecht Institute for Pharmaceutical Sciences, Utrecht University, 3508 TC Utrecht, The Netherlands
| | - Alexander B Meijer
- From the Department of Plasma Proteins, Sanquin Blood Supply Foundation, 1066 CX Amsterdam, The Netherlands, the Department of Pharmaceutics, Utrecht Institute for Pharmaceutical Sciences, Utrecht University, 3508 TC Utrecht, The Netherlands
| |
Collapse
|
16
|
Shatnawi M, Zaki N. Inter-domain linker prediction using amino acid compositional index. Comput Biol Chem 2015; 55:23-30. [DOI: 10.1016/j.compbiolchem.2015.01.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2014] [Revised: 01/22/2015] [Accepted: 01/22/2015] [Indexed: 10/24/2022]
|
17
|
EC300: a phage-based, bacteriolysin-like protein with enhanced antibacterial activity against Enterococcus faecalis. Appl Microbiol Biotechnol 2015; 99:5137-49. [DOI: 10.1007/s00253-015-6483-7] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2015] [Revised: 02/12/2015] [Accepted: 02/13/2015] [Indexed: 10/23/2022]
|
18
|
Shatnawi M, Zaki N, Yoo PD. Protein inter-domain linker prediction using Random Forest and amino acid physiochemical properties. BMC Bioinformatics 2014; 15 Suppl 16:S8. [PMID: 25521329 PMCID: PMC4290662 DOI: 10.1186/1471-2105-15-s16-s8] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Protein chains are generally long and consist of multiple domains. Domains are distinct structural units of a protein that can evolve and function independently. The accurate prediction of protein domain linkers and boundaries is often regarded as the initial step of protein tertiary structure and function predictions. Such information not only enhances protein-targeted drug development but also reduces the experimental cost of protein analysis by allowing researchers to work on a set of smaller and independent units. In this study, we propose a novel and accurate domain-linker prediction approach based on protein primary structure information only. We utilize a nature-inspired machine-learning model called Random Forest along with a novel domain-linker profile that contains physiochemical and domain-linker information of amino acid sequences. RESULTS The proposed approach was tested on two well-known benchmark protein datasets and achieved 68% sensitivity and 99% precision, which is better than any existing protein domain-linker predictor. Without applying any data balancing technique such as class weighting and data re-sampling, the proposed approach is able to accurately classify inter-domain linkers from highly imbalanced datasets. CONCLUSION Our experimental results prove that the proposed approach is useful for domain-linker identification in highly imbalanced single- and multi-domain proteins.
Collapse
|
19
|
H-DROP: an SVM based helical domain linker predictor trained with features optimized by combining random forest and stepwise selection. J Comput Aided Mol Des 2014; 28:831-9. [PMID: 24965847 DOI: 10.1007/s10822-014-9763-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2014] [Accepted: 06/09/2014] [Indexed: 10/25/2022]
Abstract
Domain linker prediction is attracting much interest as it can help identifying novel domains suitable for high throughput proteomics analysis. Here, we report H-DROP, an SVM-based Helical Domain linker pRediction using OPtimal features. H-DROP is, to the best of our knowledge, the first predictor for specifically and effectively identifying helical linkers. This was made possible first because a large training dataset became available from IS-Dom, and second because we selected a small number of optimal features from a huge number of potential ones. The training helical linker dataset, which included 261 helical linkers, was constructed by detecting helical residues at the boundary regions of two independent structural domains listed in our previously reported IS-Dom dataset. 45 optimal feature candidates were selected from 3,000 features by random forest, which were further reduced to 26 optimal features by stepwise selection. The prediction sensitivity and precision of H-DROP were 35.2 and 38.8%, respectively. These values were over 10.7% higher than those of control methods including our previously developed DROP, which is a coil linker predictor, and PPRODO, which is trained with un-differentiated domain boundary sequences. Overall, these results indicated that helical linkers can be predicted from sequence information alone by using a strictly curated training data set for helical linkers and carefully selected set of optimal features. H-DROP is available at http://domserv.lab.tuat.ac.jp.
Collapse
|
20
|
Bhaskara RM, de Brevern AG, Srinivasan N. Understanding the role of domain–domain linkers in the spatial orientation of domains in multi-domain proteins. J Biomol Struct Dyn 2013; 31:1467-80. [DOI: 10.1080/07391102.2012.743438] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
21
|
Lee K, Chiu I, Santos-Cortez RLP, Basit S, Khan S, Azeem Z, Andrade PB, Kim SS, Ahmad W, Leal SM. Novel OTOA mutations cause autosomal recessive non-syndromic hearing impairment in Pakistani families. Clin Genet 2013; 84:294-6. [PMID: 23173898 PMCID: PMC6220893 DOI: 10.1111/cge.12047] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
22
|
Ebina T, Umezawa Y, Kuroda Y. IS-Dom: a dataset of independent structural domains automatically delineated from protein structures. J Comput Aided Mol Des 2013; 27:419-26. [PMID: 23715893 DOI: 10.1007/s10822-013-9654-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2012] [Accepted: 05/07/2013] [Indexed: 11/25/2022]
Abstract
Protein domains that can fold in isolation are significant targets in diverse area of proteomics research as they are often readily analyzed by high-throughput methods. Here, we report IS-Dom, a dataset of Independent Structural Domains (ISDs) that are most likely to fold in isolation. IS-Dom was constructed by filtering domains from SCOP, CATH, and DomainParser using quantitative structural measures, which were calculated by estimating inter-domain hydrophobic clusters and hydrogen bonds from the full length protein's atomic coordinates. The ISD detection protocol is fully automated, and all of the computed interactions are stored in the server which enables rapid update of IS-Dom. We also prepared a standard IS-Dom using parameters optimized by maximizing the Youden's index. The standard IS-Dom, contained 54,860 ISDs, of which 25.5 % had high sequence identity and termini overlap with a Protein Data Bank (PDB) cataloged sequence and are thus experimentally shown to fold in isolation [coined autonomously folded domain (AFDs)]. Furthermore, our ISD detection protocol missed less than 10 % of the AFDs, which corroborated our protocol's ability to define structural domains that are able to fold independently. IS-Dom is available through the web server ( http://domserv.lab.tuat.ac.jp/IS-Dom.html ), and users can either, download the standard IS-Dom dataset, construct their own IS-Dom by interactively varying the parameters, or assess the structural independence of newly defined putative domains.
Collapse
Affiliation(s)
- Teppei Ebina
- Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology, 12-24-16 Nakamachi, Koganei-shi, Tokyo 184-8588, Japan.
| | | | | |
Collapse
|
23
|
Böth D, Steiner EM, Stadler D, Lindqvist Y, Schnell R, Schneider G. Structure of LdtMt2, an L,D-transpeptidase from Mycobacterium tuberculosis. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2013; 69:432-41. [PMID: 23519418 PMCID: PMC3605044 DOI: 10.1107/s0907444912049268] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/09/2012] [Accepted: 11/30/2012] [Indexed: 05/12/2023]
Abstract
The transpeptidase LtdMt2 catalyzes the formation of the (3-3) cross-links characteristic of the peptidoglycan layer in the Mycobacterium tuberculosis cell wall. Bioinformatics analysis suggests that the extramembrane part of the enzyme consists of three domains: two smaller domains (denoted as A and B domains) and a transpeptidase domain (the C domain) at the C-terminus. The crystal structures of two fragments comprising the AB domains and the BC domains have been determined. The structure of the BC module, which was determined to 1.86 Å resolution using Se-SAD phasing, consists of the B domain with an immunoglobulin-related fold and the catalytic domain belonging to the ErfK/YbiS/YbnG fold family. The structure of the AB-domain fragment, which was solved by molecular replacement to 1.45 Å resolution, reveals that despite a lack of overall sequence identity the A domain is structurally very similar to the B domain. Combining the structures of the two fragments provides a view of the complete three-domain extramembrane part of LdtMt2 and shows that the protein extends at least 80-100 Å from the plasma membrane into the peptidoglycan layer and thus defines the maximal distance at which cross-links are formed by this enzyme. The LdtMt-related transpeptidases contain one or two immunoglobulin domains, which suggests that these might serve as extender units to position the catalytic domain at an appropriate distance from the membrane in the peptidoglycan layer.
Collapse
Affiliation(s)
- Dominic Böth
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, S-17 177 Stockholm, Sweden
| | - Eva Maria Steiner
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, S-17 177 Stockholm, Sweden
| | - Daniela Stadler
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, S-17 177 Stockholm, Sweden
| | - Ylva Lindqvist
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, S-17 177 Stockholm, Sweden
| | - Robert Schnell
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, S-17 177 Stockholm, Sweden
| | - Gunter Schneider
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, S-17 177 Stockholm, Sweden
| |
Collapse
|
24
|
Rosseto FR, Puhl AC, Andrade MO, Polikarpov I. Crystallization and preliminary diffraction analysis of the catalytic domain of major extracellular endoglucanase from Xanthomonas campestris pv. campestris. Acta Crystallogr Sect F Struct Biol Cryst Commun 2013; 69:137-40. [PMID: 23385754 PMCID: PMC3564615 DOI: 10.1107/s1744309112051408] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2012] [Accepted: 12/20/2012] [Indexed: 11/10/2022]
Abstract
Cellulases, such as endoglucanases, exoglucanases and β-glucosidases, are important enzymes used in the process of enzymatic hydrolysis of plant biomass. The bacteria Xanthomonas campestris pv. campestris expresses a large number of hydrolases and the major endoglucanase (XccEG), a member of glycoside hydrolase family 5 (GH5), is the most strongly secreted extracellularly. In this work, the native XccEG was purified from the extracellular extract and crystallization assays were performed on its catalytic domain. A complete data set was collected on an in-house X-ray source. The crystal diffracted to 2.7 Å resolution and belonged to space group C2, with unit-cell parameters a = 174.66, b = 141.53, c = 108.00 Å, β = 110.49°. The Matthews coefficient suggests a solvent content of 70.1% and the presence of four protein subunits in the asymmetric unit.
Collapse
Affiliation(s)
- Flávio R. Rosseto
- Instituto de Física de São Carlos, Universidade de São Paulo, Avenida Trabalhador Saocarlense 400, São Carlos-SP 13560-970, Brazil
| | - Ana C. Puhl
- Instituto de Física de São Carlos, Universidade de São Paulo, Avenida Trabalhador Saocarlense 400, São Carlos-SP 13560-970, Brazil
| | - Maxuel O. Andrade
- Instituto de Química, Universidade de São Paulo, Avenida Prof. Lineu Prestes 748, Bloco Química Fina, Sala 10, Cidade Universitária, São Paulo-SP 05508-000, Brazil
| | - Igor Polikarpov
- Instituto de Física de São Carlos, Universidade de São Paulo, Avenida Trabalhador Saocarlense 400, São Carlos-SP 13560-970, Brazil
| |
Collapse
|
25
|
Sadowski MI. Prediction of protein domain boundaries from inverse covariances. Proteins 2013; 81:253-60. [PMID: 22987736 PMCID: PMC3563215 DOI: 10.1002/prot.24181] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2012] [Revised: 08/10/2012] [Accepted: 09/04/2012] [Indexed: 01/04/2023]
Abstract
It has been known even since relatively few structures had been solved that longer protein chains often contain multiple domains, which may fold separately and play the role of reusable functional modules found in many contexts. In many structural biology tasks, in particular structure prediction, it is of great use to be able to identify domains within the structure and analyze these regions separately. However, when using sequence data alone this task has proven exceptionally difficult, with relatively little improvement over the naive method of choosing boundaries based on size distributions of observed domains. The recent significant improvement in contact prediction provides a new source of information for domain prediction. We test several methods for using this information including a kernel smoothing-based approach and methods based on building alpha-carbon models and compare performance with a length-based predictor, a homology search method and four published sequence-based predictors: DOMCUT, DomPRO, DLP-SVM, and SCOOBY-DOmain. We show that the kernel-smoothing method is significantly better than the other ab initio predictors when both single-domain and multidomain targets are considered and is not significantly different to the homology-based method. Considering only multidomain targets the kernel-smoothing method outperforms all of the published methods except DLP-SVM. The kernel smoothing method therefore represents a potentially useful improvement to ab initio domain prediction.
Collapse
Affiliation(s)
- Michael I Sadowski
- MRC National Institute for Medical Research, The Ridgeway, Mill Hill, London, United Kingdom.
| |
Collapse
|
26
|
Proença D, Fernandes S, Leandro C, Silva FA, Santos S, Lopes F, Mato R, Cavaco-Silva P, Pimentel M, São-José C. Phage Endolysins with Broad Antimicrobial Activity AgainstEnterococcus faecalisClinical Strains. Microb Drug Resist 2012; 18:322-32. [DOI: 10.1089/mdr.2012.0024] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
| | - Sofia Fernandes
- Technophage, SA, Lisboa, Portugal
- Centro de Patogénese Molecular, Unidade de Retrovírus e Infecções Associadas (CPM-URIA), Faculdade de Farmácia da Universidade de Lisboa, Lisboa, Portugal
| | | | | | - Sofia Santos
- Instituto de Biologia Experimental e Tecnológica (IBET), Oeiras, Portugal
- Instituto de Tecnologia Química e Biológica (ITQB), Universidade Nova de Lisboa, Oeiras, Portugal
| | - Fátima Lopes
- Instituto de Biologia Experimental e Tecnológica (IBET), Oeiras, Portugal
- Instituto de Tecnologia Química e Biológica (ITQB), Universidade Nova de Lisboa, Oeiras, Portugal
| | - Rosario Mato
- Laboratory of Molecular Genetics, Instituto de Tecnologia Química e Biológica (ITQB), Universidade Nova de Lisboa, Oeiras, Portugal
- Dept. de Ciências da Vida da Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Monte de Caparica, Portugal
| | - Patrícia Cavaco-Silva
- Technophage, SA, Lisboa, Portugal
- Center for Interdisciplinary Research Egas Moniz, Monte de Caparica, Portugal
| | - Madalena Pimentel
- Centro de Patogénese Molecular, Unidade de Retrovírus e Infecções Associadas (CPM-URIA), Faculdade de Farmácia da Universidade de Lisboa, Lisboa, Portugal
| | - Carlos São-José
- Centro de Patogénese Molecular, Unidade de Retrovírus e Infecções Associadas (CPM-URIA), Faculdade de Farmácia da Universidade de Lisboa, Lisboa, Portugal
- Institute of Molecular Medicine, Lisboa, Portugal
| |
Collapse
|
27
|
Fernandes S, Proença D, Cantante C, Silva FA, Leandro C, Lourenço S, Milheiriço C, de Lencastre H, Cavaco-Silva P, Pimentel M, São-José C. Novel Chimerical Endolysins with Broad Antimicrobial Activity Against Methicillin-ResistantStaphylococcus aureus. Microb Drug Resist 2012; 18:333-43. [DOI: 10.1089/mdr.2012.0025] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Sofia Fernandes
- Technophage, SA, Lisboa, Portugal
- Centro de Patogénese Molecular, Unidade de Retrovírus e Infecções Associadas (CPM-URIA), Faculdade de Farmácia da Universidade de Lisboa, Lisboa, Portugal
| | | | | | | | | | | | - Catarina Milheiriço
- Laboratory of Molecular Genetics, Instituto de Tecnologia Química e Biológica (ITQB), Universidade Nova de Lisboa, Oeiras, Portugal
| | - Hermínia de Lencastre
- Laboratory of Molecular Genetics, Instituto de Tecnologia Química e Biológica (ITQB), Universidade Nova de Lisboa, Oeiras, Portugal
- Laboratory of Microbiology, The Rockefeller University, New York, New York
| | - Patrícia Cavaco-Silva
- Technophage, SA, Lisboa, Portugal
- Center for Interdisciplinary Research Egas Moniz, Monte de Caparica, Portugal
| | - Madalena Pimentel
- Centro de Patogénese Molecular, Unidade de Retrovírus e Infecções Associadas (CPM-URIA), Faculdade de Farmácia da Universidade de Lisboa, Lisboa, Portugal
| | - Carlos São-José
- Centro de Patogénese Molecular, Unidade de Retrovírus e Infecções Associadas (CPM-URIA), Faculdade de Farmácia da Universidade de Lisboa, Lisboa, Portugal
- Institute of Molecular Medicine, Lisboa, Portugal
| |
Collapse
|
28
|
Pal A, Sharma A, Bhattacharya TK, Chatterjee PN, Chakravarty AK. Molecular Characterization and SNP Detection of CD14 Gene of Crossbred Cattle. Mol Biol Int 2011; 2011:507346. [PMID: 22132326 PMCID: PMC3205722 DOI: 10.4061/2011/507346] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2011] [Revised: 07/19/2011] [Accepted: 07/19/2011] [Indexed: 12/02/2022] Open
Abstract
CD14 is an important molecule for innate immunity that can act against a wide range of pathogens. The present paper has characterized CD14 gene of crossbred (CB) cattle (Bos indicus×Bos taurus). Cloning and sequence analysis of CD14 cDNA revealed 1119 nucleotide long open reading frame encoding 373 amino acids protein and 20 amino acids signal peptide. CB cattle CD14 gene exhibited a high percentage of nucleotide identity (59.3–98.1%) with the corresponding mammalian homologs. Cattle and buffalo appear to have diverged from a common ancestor in phylogenetic analysis. 25 SNPs with 17 amino acid changes were newly reported and the site for mutational hot-spot was detected in CB cattle CD14 gene. Non-synonymous substitutions exceeding synonymous substitutions indicate the evolution of this protein through positive selection among domestic animals. Predicted protein structures obtained from deduced amino acid sequence indicated CB cattle CD14 molecule to be a receptor with horse shoe-shaped structure. The sites for LPS binding, LPS signalling, leucine-rich repeats, putative N-linked glycosylation, O-linked glycosylation, glycosyl phosphatidyl inositol anchor, disulphide bridges, alpha helix, beta strand, leucine rich nuclear export signal, leucine zipper and domain linker were predicted. Most of leucine and cysteine residues remain conserved across the species.
Collapse
Affiliation(s)
- Aruna Pal
- Animal Genetics Division, Indian Veterinary Research Institute, Izatnagar, Pin-243122, India
| | | | | | | | | |
Collapse
|
29
|
Hasegawa J, Tokuda E, Tenno T, Tsujita K, Sawai H, Hiroaki H, Takenawa T, Itoh T. SH3YL1 regulates dorsal ruffle formation by a novel phosphoinositide-binding domain. ACTA ACUST UNITED AC 2011; 193:901-16. [PMID: 21624956 PMCID: PMC3105542 DOI: 10.1083/jcb.201012161] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Reversible interactions between cytosolic proteins and membrane lipids such as phosphoinositides play important roles in membrane morphogenesis driven by actin polymerization. In this paper, we identify a novel lipid-binding module, which we call the SYLF domain (after the SH3YL1, Ysc84p/Lsb4p, Lsb3p, and plant FYVE proteins that contain it), that is highly conserved from bacteria to mammals. SH3YL1 (SH3 domain containing Ysc84-like 1) strongly bound to phosphatidylinositol 3,4,5-triphosphate (PI(3,4,5)P(3)) and several D5-phosphorylated phosphoinositides through its SYLF domain and was localized to circular dorsal ruffles induced by platelet-derived growth factor stimulation. Interestingly, SHIP2 (the PI(3,4,5)P(3) 5-phosphatase, src-homology 2-containing inositol-5-phosphatase 2) was identified as a binding partner of SH3YL1, and knockdown of these proteins significantly suppressed dorsal ruffle formation. Phosphatidylinositol 3,4-bisphosphate (PI(3,4)P(2)), which is mainly synthesized from PI(3,4,5)P(3) by the action of SHIP2, was enriched in dorsal ruffles, and PI(3,4)P(2) synthesis strongly correlated with formation of the circular membrane structure. These results provide new insight into the molecular mechanism of dorsal ruffle formation and its regulation by phosphoinositide metabolism.
Collapse
Affiliation(s)
- Junya Hasegawa
- Division of Membrane Biology, Department of Biochemistry and Molecular Biology, Kobe University Graduate School of Medicine, Kobe, Hyogo 650-0017, Japan
| | | | | | | | | | | | | | | |
Collapse
|
30
|
Västermark Å, Almén MS, Simmen MW, Fredriksson R, Schiöth HB. Functional specialization in nucleotide sugar transporters occurred through differentiation of the gene cluster EamA (DUF6) before the radiation of Viridiplantae. BMC Evol Biol 2011; 11:123. [PMID: 21569384 PMCID: PMC3111387 DOI: 10.1186/1471-2148-11-123] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2011] [Accepted: 05/12/2011] [Indexed: 02/04/2023] Open
Abstract
Background The drug/metabolite transporter superfamily comprises a diversity of protein domain families with multiple functions including transport of nucleotide sugars. Drug/metabolite transporter domains are contained in both solute carrier families 30, 35 and 39 proteins as well as in acyl-malonyl condensing enzyme proteins. In this paper, we present an evolutionary analysis of nucleotide sugar transporters in relation to the entire superfamily of drug/metabolite transporters that considers crucial intra-protein duplication events that have shaped the transporters. We use a method that combines the strengths of hidden Markov models and maximum likelihood to find relationships between drug/metabolite transporter families, and branches within families. Results We present evidence that the triose-phosphate transporters, domain unknown function 914, uracil-diphosphate glucose-N-acetylglucosamine, and nucleotide sugar transporter families have evolved from a domain duplication event before the radiation of Viridiplantae in the EamA family (previously called domain unknown function 6). We identify previously unknown branches in the solute carrier 30, 35 and 39 protein families that emerged simultaneously as key physiological developments after the radiation of Viridiplantae, including the "35C/E" branch of EamA, which formed in the lineage of T. adhaerens (Animalia). We identify a second cluster of DMTs, called the domain unknown function 1632 cluster, which has non-cytosolic N- and C-termini, and thus appears to have been formed from a different domain duplication event. We identify a previously uncharacterized motif, G-X(6)-G, which is overrepresented in the fifth transmembrane helix of C-terminal domains. We present evidence that the family called fatty acid elongases are homologous to transporters, not enzymes as had previously been thought. Conclusions The nucleotide sugar transporters families were formed through differentiation of the gene cluster EamA (domain unknown function 6) before Viridiplantae, showing for the first time the significance of EamA.
Collapse
Affiliation(s)
- Åke Västermark
- Department of Neuroscience, Functional Pharmacology, Uppsala University, BMC, Box 593, 751 24, Uppsala, Sweden.
| | | | | | | | | |
Collapse
|
31
|
Ebina T, Toh H, Kuroda Y. DROP: an SVM domain linker predictor trained with optimal features selected by random forest. ACTA ACUST UNITED AC 2010; 27:487-94. [PMID: 21169376 DOI: 10.1093/bioinformatics/btq700] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Biologically important proteins are often large, multidomain proteins, which are difficult to characterize by high-throughput experimental methods. Efficient domain/boundary predictions are thus increasingly required in diverse area of proteomics research for computationally dissecting proteins into readily analyzable domains. RESULTS We constructed a support vector machine (SVM)-based domain linker predictor, DROP (Domain linker pRediction using OPtimal features), which was trained with 25 optimal features. The optimal combination of features was identified from a set of 3000 features using a random forest algorithm complemented with a stepwise feature selection. DROP demonstrated a prediction sensitivity and precision of 41.3 and 49.4%, respectively. These values were over 19.9% higher than those of control SVM predictors trained with non-optimized features, strongly suggesting the efficiency of our feature selection method. In addition, the mean NDO-Score of DROP for predicting novel domains in seven CASP8 FM multidomain proteins was 0.760, which was higher than any of the 12 published CASP8 DP servers. Overall, these results indicate that the SVM prediction of domain linkers can be improved by identifying optimal features that best distinguish linker from non-linker regions.
Collapse
Affiliation(s)
- Teppei Ebina
- Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology, Koganei-shi, Tokyo 184-8588, Japan
| | | | | |
Collapse
|
32
|
Motono C, Nakata J, Koike R, Shimizu K, Shirota M, Amemiya T, Tomii K, Nagano N, Sakaya N, Misoo K, Sato M, Kidera A, Hiroaki H, Shirai T, Kinoshita K, Noguchi T, Ota M. SAHG, a comprehensive database of predicted structures of all human proteins. Nucleic Acids Res 2010; 39:D487-93. [PMID: 21051360 PMCID: PMC3013665 DOI: 10.1093/nar/gkq1057] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Most proteins from higher organisms are known to be multi-domain proteins and contain substantial numbers of intrinsically disordered (ID) regions. To analyse such protein sequences, those from human for instance, we developed a special protein-structure-prediction pipeline and accumulated the products in the Structure Atlas of Human Genome (SAHG) database at http://bird.cbrc.jp/sahg. With the pipeline, human proteins were examined by local alignment methods (BLAST, PSI-BLAST and Smith–Waterman profile–profile alignment), global–local alignment methods (FORTE) and prediction tools for ID regions (POODLE-S) and homology modeling (MODELLER). Conformational changes of protein models upon ligand-binding were predicted by simultaneous modeling using templates of apo and holo forms. When there were no suitable templates for holo forms and the apo models were accurate, we prepared holo models using prediction methods for ligand-binding (eF-seek) and conformational change (the elastic network model and the linear response theory). Models are displayed as animated images. As of July 2010, SAHG contains 42 581 protein-domain models in approximately 24 900 unique human protein sequences from the RefSeq database. Annotation of models with functional information and links to other databases such as EzCatDB, InterPro or HPRD are also provided to facilitate understanding the protein structure-function relationships.
Collapse
Affiliation(s)
- Chie Motono
- Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology, Tokyo 135-0064, Japan.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|