1
|
Pan Y, Zhao H, Huang W, Liu S, Qi Y, Huang Y. Metal-Protein Hybrid Materials: Unlocking New Frontiers in Biomedical Applications. Adv Healthc Mater 2025:e2404405. [PMID: 39778029 DOI: 10.1002/adhm.202404405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2024] [Revised: 12/24/2024] [Indexed: 01/11/2025]
Abstract
Metal-protein hybrid materials represent a novel class of functional materials that exhibit exceptional physicochemical properties and tunable structures, rendering them remarkable applications in diverse fields, including materials engineering, biocatalysis, biosensing, and biomedicine. The design and development of multifunctional and biocompatible metal-protein hybrid materials have been the subject of extensive research and a key aspiration for practical applications in clinical settings. This review provides a comprehensive analysis of the design strategies, intrinsic properties, and biomedical applications of these hybrid materials, with a specific emphasis on their potential in cancer therapy, drug and vaccine delivery, antibacterial treatments, and tissue regeneration. Through rational design, stable metal-protein hybrid materials can be synthesized using straightforward methods, enabling them with therapeutic, delivery, immunomodulatory, and other desired functionalities. Finally, the review outlines the existing limitations and challenges associated with metal-protein hybrid materials and evaluates their potential for clinical translation, providing insights into their practical implementation within biomedical applications.
Collapse
Affiliation(s)
- Yong Pan
- Faculty of Chemistry, Northeast Normal University, Changchun, 130024, P.R. China
| | - Han Zhao
- Faculty of Chemistry, Northeast Normal University, Changchun, 130024, P.R. China
| | - Wenyong Huang
- Faculty of Chemistry, Northeast Normal University, Changchun, 130024, P.R. China
| | - Siyang Liu
- Faculty of Chemistry, Northeast Normal University, Changchun, 130024, P.R. China
| | - Yanxin Qi
- Faculty of Chemistry, Northeast Normal University, Changchun, 130024, P.R. China
| | - Yubin Huang
- Faculty of Chemistry, Northeast Normal University, Changchun, 130024, P.R. China
| |
Collapse
|
2
|
Wang W, Shuai Y, Zeng M, Fan W, Li M. DPFunc: accurately predicting protein function via deep learning with domain-guided structure information. Nat Commun 2025; 16:70. [PMID: 39746897 PMCID: PMC11697396 DOI: 10.1038/s41467-024-54816-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Accepted: 11/21/2024] [Indexed: 01/04/2025] Open
Abstract
Computational methods for predicting protein function are of great significance in understanding biological mechanisms and treating complex diseases. However, existing computational approaches of protein function prediction lack interpretability, making it difficult to understand the relations between protein structures and functions. In this study, we propose a deep learning-based solution, named DPFunc, for accurate protein function prediction with domain-guided structure information. DPFunc can detect significant regions in protein structures and accurately predict corresponding functions under the guidance of domain information. It outperforms current state-of-the-art methods and achieves a significant improvement over existing structure-based methods. Detailed analyses demonstrate that the guidance of domain information contributes to DPFunc for protein function prediction, enabling our method to detect key residues or regions in protein structures, which are closely related to their functions. In summary, DPFunc serves as an effective tool for large-scale protein function prediction, which pushes the border of protein understanding in biological systems.
Collapse
Affiliation(s)
- Wenkang Wang
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China
| | - Yunyan Shuai
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China
| | - Min Zeng
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China
| | - Wei Fan
- Nuffield Department of Women's and Reproductive Health, University of Oxford, Oxford, OX39DU, UK
| | - Min Li
- School of Computer Science and Engineering, Central South University, Changsha, 410083, China.
| |
Collapse
|
3
|
Dai J, Zhang Y, Gao T, Lin Y, Tang Y, Jiang Z, Zhu Y, Li L, Ni H. A comparative study of two α-L-rhamnosidases with high sequence identity. Int J Biol Macromol 2024; 277:134174. [PMID: 39084418 DOI: 10.1016/j.ijbiomac.2024.134174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 06/20/2024] [Accepted: 07/24/2024] [Indexed: 08/02/2024]
Abstract
The GH78 α-L-rhamnosidase from Aspergillus tubingensis (AT-Rha) was proved to be a new clade of Aspergillus α-L-rhamnosidases in the previous study. A putative α-L-rhamnosidase from A. kawachii IFO 4308 (AK-Rha) has 92 % identity in amino acid sequence with AT-Rha. In this study, AK-Rha was expressed in P. pastoris and characterized. Similar to AT-rRha, the recombinant AK-Rha (AK-rRha) showed a narrow substrate specificity to naringin. Interestingly, the enzyme activity of AK-rRha was 0.816 U/mg toward naringin, significantly lower than 125.142 U/mg of AT-rRha. Their large differences in catalytic efficiency was mainly due to their differences in kcat values between AK-rRha (0.67 s-1) and AT-rRha (4.89 × 104 s-1). The molecular dynamics simulation exhibited that the overall conformation of AK-Rha was rigid and that of AT-Rha was flexible; the Loop Y-L located above the catalytic domain formed different steric hindrances to naringin, and interacted with the flavonoid matrices at different strengths. The polar solvation energy analysis implied that the glycosidic bond was more easily hydrolysed in AT-Rha. The comparative study verified that the main feature of AK-Rha and AT-Rha represented Aspergillus α-L-rhamnosidase was the narrow substrate specificity toward naringin, and provided an insight of the relationships between their catalytic abilities and structures.
Collapse
Affiliation(s)
- Jiayuan Dai
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen 361021, China
| | - Yichun Zhang
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen 361021, China
| | - Ting Gao
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen 361021, China
| | - Yanling Lin
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen 361021, China
| | - Yiling Tang
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen 361021, China
| | - Zedong Jiang
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen 361021, China; Fujian Provincial Key Laboratory of Food Microbiology and Enzyme Engineering, Xiamen 361021, China; Research Center of Food Biotechnology of Xiamen City, Xiamen 361021, China
| | - Yanbing Zhu
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen 361021, China; Fujian Provincial Key Laboratory of Food Microbiology and Enzyme Engineering, Xiamen 361021, China; Research Center of Food Biotechnology of Xiamen City, Xiamen 361021, China
| | - Lijun Li
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen 361021, China; Fujian Provincial Key Laboratory of Food Microbiology and Enzyme Engineering, Xiamen 361021, China; Research Center of Food Biotechnology of Xiamen City, Xiamen 361021, China.
| | - Hui Ni
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen 361021, China; Xiamen Ocean Vocational College, Xiamen 361102, China
| |
Collapse
|
4
|
Yi HB, Lee S, Seo K, Kim H, Kim M, Lee HS. Cellular and Biophysical Applications of Genetic Code Expansion. Chem Rev 2024; 124:7465-7530. [PMID: 38753805 DOI: 10.1021/acs.chemrev.4c00112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/18/2024]
Abstract
Despite their diverse functions, proteins are inherently constructed from a limited set of building blocks. These compositional constraints pose significant challenges to protein research and its practical applications. Strategically manipulating the cellular protein synthesis system to incorporate novel building blocks has emerged as a critical approach for overcoming these constraints in protein research and application. In the past two decades, the field of genetic code expansion (GCE) has achieved significant advancements, enabling the integration of numerous novel functionalities into proteins across a variety of organisms. This technological evolution has paved the way for the extensive application of genetic code expansion across multiple domains, including protein imaging, the introduction of probes for protein research, analysis of protein-protein interactions, spatiotemporal control of protein function, exploration of proteome changes induced by external stimuli, and the synthesis of proteins endowed with novel functions. In this comprehensive Review, we aim to provide an overview of cellular and biophysical applications that have employed GCE technology over the past two decades.
Collapse
Affiliation(s)
- Han Bin Yi
- Department of Chemistry, Sogang University, 35 Baekbeom-ro, Mapo-gu, Seoul 04107, Republic of Korea
| | - Seungeun Lee
- Department of Chemistry, Sogang University, 35 Baekbeom-ro, Mapo-gu, Seoul 04107, Republic of Korea
| | - Kyungdeok Seo
- Department of Chemistry, Sogang University, 35 Baekbeom-ro, Mapo-gu, Seoul 04107, Republic of Korea
| | - Hyeongjo Kim
- Department of Chemistry, Sogang University, 35 Baekbeom-ro, Mapo-gu, Seoul 04107, Republic of Korea
| | - Minah Kim
- Department of Chemistry, Sogang University, 35 Baekbeom-ro, Mapo-gu, Seoul 04107, Republic of Korea
| | - Hyun Soo Lee
- Department of Chemistry, Sogang University, 35 Baekbeom-ro, Mapo-gu, Seoul 04107, Republic of Korea
| |
Collapse
|
5
|
Chen S, Liu J, Gao G, Li M, Cao L, Liu T, Li G, Ma T. An NAD +-dependent group Ⅲ alcohol dehydrogenase involved in long-chain alkane degradation in Acinetobacter venetianus RAG-1. Enzyme Microb Technol 2024; 172:110343. [PMID: 37890395 DOI: 10.1016/j.enzmictec.2023.110343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 10/06/2023] [Accepted: 10/07/2023] [Indexed: 10/29/2023]
Abstract
Alcohol dehydrogenases (ADHs) are a class of key enzymes responsible for the oxidation of alkyl alcohols in the aerobic alkane metabolic pathway. Currently, the degradation mechanisms of short- and medium-chain alkanes are commonly reported, while those of long-chain alkanes have received less attention. In this work, a putative long-chain ADH was screened from Acinetobacter venetianus RAG-1 via RNA-seq with n-octacosane (C28) as the sole carbon source. Conserved sequence analysis revealed that it is a group III (Fe-containing/activated) ADH, which is widespread in the genus Acinetobacter. The deletion of adhA led to a significant reduction in the degradation of C28. AdhA exhibited optimal oxidative activity at pH 8.0 and 50 °C with NAD+ as coenzyme, while showing better tolerability to chemical reagents. Enzyme activity assay showed that AdhA owed the oxidative activity to a wide range of substrates including alkyl alcohols (C1-C32) and some isomeric alcohols, such as isopropanol, isobutanol, isoamyl alcohol, and propanetriol, and could reduce the alkyl aldehyde (C1-C12). Meanwhile, the binding of AdhA to different alkyl alcohols was mediated by different amino acids. AdhA is an ADH with an extremely broad substrate utilization range and excellent biochemical characteristics. These results provided important insights in the subsequent investigation of long-chain alkane degradation and petroleum pollution bioremediation.
Collapse
Affiliation(s)
- Shuai Chen
- Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, College of Life Sciences, Nankai University, Tianjin, China
| | - Jia Liu
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Ge Gao
- Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, College of Life Sciences, Nankai University, Tianjin, China
| | - Mingchang Li
- Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, College of Life Sciences, Nankai University, Tianjin, China
| | - Lu Cao
- Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, College of Life Sciences, Nankai University, Tianjin, China
| | - Tongtong Liu
- Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, College of Life Sciences, Nankai University, Tianjin, China
| | - Guoqiang Li
- Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, College of Life Sciences, Nankai University, Tianjin, China; Tianjin Engineering Technology Center of Green Manufacturing Biobased Materials, Tianjin, China.
| | - Ting Ma
- Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, College of Life Sciences, Nankai University, Tianjin, China; Tianjin Engineering Technology Center of Green Manufacturing Biobased Materials, Tianjin, China.
| |
Collapse
|
6
|
Xu SY, Zhou L, Xu Y, Hong HY, Dai C, Wang YJ, Zheng YG. Recent advances in structure-based enzyme engineering for functional reconstruction. Biotechnol Bioeng 2023; 120:3427-3445. [PMID: 37638646 DOI: 10.1002/bit.28540] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 07/27/2023] [Accepted: 08/15/2023] [Indexed: 08/29/2023]
Abstract
Structural information can help engineer enzymes. Usually, specific amino acids in particular regions are targeted for functional reconstruction to enhance the catalytic performance, including activity, stereoselectivity, and thermostability. Appropriate selection of target sites is the key to structure-based design, which requires elucidation of the structure-function relationships. Here, we summarize the mutations of residues in different specific regions, including active center, access tunnels, and flexible loops, on fine-tuning the catalytic performance of enzymes, and discuss the effects of altering the local structural environment on the functions. In addition, we keep up with the recent progress of structure-based approaches for enzyme engineering, aiming to provide some guidance on how to take advantage of the structural information.
Collapse
Affiliation(s)
- Shen-Yuan Xu
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- Engineering Research Center of Bioconversion and Biopurification of the Ministry of Education, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- The National and Local Joint Engineering Research Center for Biomanufacturing of Chiral Chemicals, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
| | - Lei Zhou
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- Engineering Research Center of Bioconversion and Biopurification of the Ministry of Education, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- The National and Local Joint Engineering Research Center for Biomanufacturing of Chiral Chemicals, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
| | - Ying Xu
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- Engineering Research Center of Bioconversion and Biopurification of the Ministry of Education, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- The National and Local Joint Engineering Research Center for Biomanufacturing of Chiral Chemicals, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
| | - Han-Yue Hong
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- Engineering Research Center of Bioconversion and Biopurification of the Ministry of Education, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- The National and Local Joint Engineering Research Center for Biomanufacturing of Chiral Chemicals, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
| | - Chen Dai
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- Engineering Research Center of Bioconversion and Biopurification of the Ministry of Education, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- The National and Local Joint Engineering Research Center for Biomanufacturing of Chiral Chemicals, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
| | - Ya-Jun Wang
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- Engineering Research Center of Bioconversion and Biopurification of the Ministry of Education, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- The National and Local Joint Engineering Research Center for Biomanufacturing of Chiral Chemicals, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
| | - Yu-Guo Zheng
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- Engineering Research Center of Bioconversion and Biopurification of the Ministry of Education, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
- The National and Local Joint Engineering Research Center for Biomanufacturing of Chiral Chemicals, Zhejiang University of Technology, Hangzhou, Zhejiang, People's Republic of China
| |
Collapse
|
7
|
Vasileiou D, Karapiperis C, Baltsavia I, Chasapi A, Ahrén D, Janssen PJ, Iliopoulos I, Promponas VJ, Enright AJ, Ouzounis CA. CGG toolkit: Software components for computational genomics. PLoS Comput Biol 2023; 19:e1011498. [PMID: 37934729 PMCID: PMC10629618 DOI: 10.1371/journal.pcbi.1011498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 09/07/2023] [Indexed: 11/09/2023] Open
Abstract
Public-domain availability for bioinformatics software resources is a key requirement that ensures long-term permanence and methodological reproducibility for research and development across the life sciences. These issues are particularly critical for widely used, efficient, and well-proven methods, especially those developed in research settings that often face funding discontinuities. We re-launch a range of established software components for computational genomics, as legacy version 1.0.1, suitable for sequence matching, masking, searching, clustering and visualization for protein family discovery, annotation and functional characterization on a genome scale. These applications are made available online as open source and include MagicMatch, GeneCAST, support scripts for CoGenT-like sequence collections, GeneRAGE and DifFuse, supported by centrally administered bioinformatics infrastructure funding. The toolkit may also be conceived as a flexible genome comparison software pipeline that supports research in this domain. We illustrate basic use by examples and pictorial representations of the registered tools, which are further described with appropriate documentation files in the corresponding GitHub release.
Collapse
Affiliation(s)
- Dimitrios Vasileiou
- Biological Computation & Process Laboratory, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica, Greece
| | - Christos Karapiperis
- Biological Computation & Process Laboratory, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica, Greece
- Biological Computation & Computational Biology Group, AIIA Lab, School of Informatics, Aristotle University of Thessalonica, Thessalonica, Greece
| | - Ismini Baltsavia
- Computational Biology Group, Faculty of Medicine, University of Crete, Heraklion, Greece
| | - Anastasia Chasapi
- Biological Computation & Process Laboratory, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica, Greece
| | - Dag Ahrén
- Department of Biology, Microbial Ecology Group, Lund University, Lund, Sweden
| | - Paul J. Janssen
- Nuclear Medical Applications, Belgian Nuclear Research Centre SCK CEN, Mol, Belgium
| | - Ioannis Iliopoulos
- Computational Biology Group, Faculty of Medicine, University of Crete, Heraklion, Greece
| | - Vasilis J. Promponas
- Bioinformatics Research Laboratory, Department of Biological Sciences, New Campus, University of Cyprus, Nicosia, Cyprus
| | - Anton J. Enright
- Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge, United Kingdom
| | - Christos A. Ouzounis
- Biological Computation & Process Laboratory, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica, Greece
- Biological Computation & Computational Biology Group, AIIA Lab, School of Informatics, Aristotle University of Thessalonica, Thessalonica, Greece
- SysBioBio.info (SBBI), Thessalonica, Greece
| |
Collapse
|
8
|
Jiang T, Hu XF, Guan YF, Chen JJ, Yu HQ. Molecular insights into complexation between protein and silica: Spectroscopic and simulation investigations. WATER RESEARCH 2023; 246:120681. [PMID: 37801982 DOI: 10.1016/j.watres.2023.120681] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 09/25/2023] [Accepted: 09/28/2023] [Indexed: 10/08/2023]
Abstract
The synergistic effect of protein-silica complexation leads to exacerbated membrane fouling in the membrane desalination process, exceeding the individual impacts of silica scaling or protein fouling. However, the molecular-level dynamics of silica binding to proteins and the resulting structural changes in both proteins and silica remain poorly understood. This study investigates the complexation process between silica and proteins-negatively charged bovine serum albumin (BSA) and positively charged lysozyme (LYZ) at neutral pH-using infrared spectroscopy (IR), in situ attenuated total reflection Fourier transform infrared spectroscopy (ATR-FTIR), and multiple computational simulations. The findings reveal that both protein and silica structures undergo changes during the complexation process, with calcium ions in the solution significantly exacerbating these alterations. In particular, in situ ATR-FTIR combined with two-dimensional correlation spectroscopy analysis shows that BSA experiences more pronounced unfolding, providing additional binding sites for silica adsorption compared to LYZ. The adsorbed proteins promote silica polymerization from lower-polymerized to higher-polymerized species. Furthermore, molecular dynamics simulations demonstrate greater conformational variation in BSA through root-mean-square-deviation analysis and the bridging role of calcium ions via mean square displacement analysis. Molecular docking and density functional theory calculations identify the binding sites and energy of silica on proteins. In summary, this research offers a comprehensive understanding of the protein-silica complexation process, contributing to the knowledge of synergistic behaviors of inorganic scaling and organic fouling on membrane surfaces. The integrated approach used here may also be applicable for exploring other complex complexation processes in various environments.
Collapse
Affiliation(s)
- Ting Jiang
- CAS Key Laboratory of Urban Pollutant Conversion, Department of Environmental Science and Technology, University of Science and Technology of China, Hefei 230026, China
| | - Xiao-Fan Hu
- CAS Key Laboratory of Urban Pollutant Conversion, Department of Environmental Science and Technology, University of Science and Technology of China, Hefei 230026, China
| | - Yan-Fang Guan
- CAS Key Laboratory of Urban Pollutant Conversion, Department of Environmental Science and Technology, University of Science and Technology of China, Hefei 230026, China.
| | - Jie-Jie Chen
- CAS Key Laboratory of Urban Pollutant Conversion, Department of Environmental Science and Technology, University of Science and Technology of China, Hefei 230026, China
| | - Han-Qing Yu
- CAS Key Laboratory of Urban Pollutant Conversion, Department of Environmental Science and Technology, University of Science and Technology of China, Hefei 230026, China.
| |
Collapse
|
9
|
Gu J, Xu Y, Nie Y. Role of distal sites in enzyme engineering. Biotechnol Adv 2023; 63:108094. [PMID: 36621725 DOI: 10.1016/j.biotechadv.2023.108094] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 11/15/2022] [Accepted: 01/01/2023] [Indexed: 01/06/2023]
Abstract
The limitations associated with natural enzyme catalysis have triggered the rise of the field of protein engineering. Traditional rational design was based on the analysis of protein structural information and catalytic mechanisms to identify key active sites or ligand binding sites to reshape the substrate pocket. The role and significance of functional sites in the active center have been studied extensively. With a deeper understanding of the structure-catalysis relationship map, the entire protein molecule can be filled with residues that play a substantial role in its structure and function. However, the catalytic mechanism underlying distal mutations remains unclear. The aim of this review was to highlight the criticality of the distal site in enzyme engineering based on the following three aspects: What can distal mutations exert on function from mutability landscape? How do distal sites influence enzyme function? How to predict and design distal mutations? This review provides insights into the catalytic mechanism of enzymes from the global interaction network, knowledge from sequence-structure-dynamics-function relationships, and strategies for distal mutation-based protein engineering.
Collapse
Affiliation(s)
- Jie Gu
- Lab of Brewing Microbiology and Applied Enzymology, School of Biotechnology and Key laboratory of Industrial Biotechnology of Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Yan Xu
- Lab of Brewing Microbiology and Applied Enzymology, School of Biotechnology and Key laboratory of Industrial Biotechnology of Ministry of Education, Jiangnan University, Wuxi 214122, China; State Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
| | - Yao Nie
- Lab of Brewing Microbiology and Applied Enzymology, School of Biotechnology and Key laboratory of Industrial Biotechnology of Ministry of Education, Jiangnan University, Wuxi 214122, China; Suqian Industrial Technology Research Institute of Jiangnan University, Suqian 223814, China.
| |
Collapse
|
10
|
Kouros CE, Makri V, Ouzounis CA, Chasapi A. Disease association and comparative genomics of compositional bias in human proteins. F1000Res 2023; 12:198. [PMID: 37082000 PMCID: PMC10111144 DOI: 10.12688/f1000research.129929.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/02/2023] [Indexed: 02/22/2023] Open
Abstract
Background: The evolutionary rate of disordered proteins varies greatly due to the lack of structural constraints. So far, few studies have investigated the presence/absence patterns of intrinsically disordered regions (IDRs) across phylogenies in conjunction with human disease. In this study, we report a genome-wide analysis of compositional bias association with disease in human proteins and their taxonomic distribution. Methods: The human genome protein set provided by the Ensembl database was annotated and analysed with respect to both disease associations and the detection of compositional bias. The Uniprot Reference Proteome dataset, containing 11297 proteomes was used as target dataset for the comparative genomics of a well-defined subset of the Human Genome, including 100 characteristic, compositionally biased proteins, some linked to disease. Results: Cross-evaluation of compositional bias and disease-association in the human genome reveals a significant bias towards low complexity regions in disease-associated genes, with charged, hydrophilic amino acids appearing as over-represented. The phylogenetic profiling of 17 disease-associated, low complexity proteins across 11297 proteomes captures characteristic taxonomic distribution patterns. Conclusions: This is the first time that a combined genome-wide analysis of low complexity, disease-association and taxonomic distribution of human proteins is reported, covering structural, functional, and evolutionary properties. The reported framework can form the basis for large-scale, follow-up projects, encompassing the entire human genome and all known gene-disease associations.
Collapse
Affiliation(s)
- Christos E. Kouros
- BCCB-AIIA, School of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Vasiliki Makri
- BCCB-AIIA, School of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Christos A. Ouzounis
- BCCB-AIIA, School of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece
- BCPL, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas (CERTH), Thessaloniki, Greece
| | - Anastasia Chasapi
- BCPL, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas (CERTH), Thessaloniki, Greece
| |
Collapse
|
11
|
Kouros CE, Makri V, Ouzounis CA, Chasapi A. Disease association and comparative genomics of compositional bias in human proteins. F1000Res 2023; 12:198. [PMID: 37082000 PMCID: PMC10111144 DOI: 10.12688/f1000research.129929.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 04/12/2023] [Indexed: 04/25/2023] Open
Abstract
Background: The evolutionary rate of disordered protein regions varies greatly due to the lack of structural constraints. So far, few studies have investigated the presence/absence patterns of compositional bias, indicative of disorder, across phylogenies in conjunction with human disease. In this study, we report a genome-wide analysis of compositional bias association with disease in human proteins and their taxonomic distribution. Methods: The human genome protein set provided by the Ensembl database was annotated and analysed with respect to both disease associations and the detection of compositional bias. The Uniprot Reference Proteome dataset, containing 11297 proteomes was used as target dataset for the comparative genomics of a well-defined subset of the Human Genome, including 100 characteristic, compositionally biased proteins, some linked to disease. Results: Cross-evaluation of compositional bias and disease-association in the human genome reveals a significant bias towards biased regions in disease-associated genes, with charged, hydrophilic amino acids appearing as over-represented. The phylogenetic profiling of 17 disease-associated, proteins with compositional bias across 11297 proteomes captures characteristic taxonomic distribution patterns. Conclusions: This is the first time that a combined genome-wide analysis of compositional bias, disease-association and taxonomic distribution of human proteins is reported, covering structural, functional, and evolutionary properties. The reported framework can form the basis for large-scale, follow-up projects, encompassing the entire human genome and all known gene-disease associations.
Collapse
Affiliation(s)
- Christos E. Kouros
- BCCB-AIIA, School of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Vasiliki Makri
- BCCB-AIIA, School of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Christos A. Ouzounis
- BCCB-AIIA, School of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece
- BCPL, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas (CERTH), Thessaloniki, Greece
| | - Anastasia Chasapi
- BCPL, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas (CERTH), Thessaloniki, Greece
| |
Collapse
|
12
|
Jana P, Samanta K, Ehlers M, Zellermann E, Bäcker S, Stauber RH, Schmuck C, Knauer SK. Impact of Peptide Sequences on Their Structure and Function: Mimicking of Virus-Like Nanoparticles for Nucleic Acid Delivery. Chembiochem 2023; 24:e202200519. [PMID: 36314419 PMCID: PMC10099937 DOI: 10.1002/cbic.202200519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 10/30/2022] [Indexed: 01/05/2023]
Abstract
We rationally designed a series of amphiphilic hepta-peptides enriched with a chemically conjugated guanidiniocarbonylpyrrole (GCP) unit at the lysine side chain. All peptides are composed of polar (GCP) and non-polar (cyclohexyl alanine) residues but differ in their sequence periodicity, resulting in different secondary as well as supramolecular structures. CD spectra revealed the assembly of β-sheet-, α-helical and random structures for peptides 1, 2 and 3, respectively. Consequently, this enabled the formation of distinct supramolecular assemblies such as fibres, nanorod-like or spherical aggregates. Notably, all three cationic peptides are equipped with the anion-binding GCP unit and thus possess a nucleic acid-binding centre. However, only the helical (2) and the unstructured (3) peptide were able to assemble into small virus-like DNA-polyplexes and effectively deliver DNA into cells. Notably, as both peptides (2 and 3) were also capable of siRNA-delivery, they could be utilized to downregulate expression of the caner-relevant protein Survivin.
Collapse
Affiliation(s)
- Poulami Jana
- Department of Chemistry, Kaliachak College Sultanganj, Malda, 732201-, West Bengal, India
| | - Krishnananda Samanta
- Department of Chemistry, Balurghat College Dakshin Dinajpur, 733101-, West Bengal, India
| | - Martin Ehlers
- Organic Chemistry, University of Duisburg-Essen, 45117, Essen, Germany
| | - Elio Zellermann
- Organic Chemistry, University of Duisburg-Essen, 45117, Essen, Germany
| | - Sandra Bäcker
- Molecular Biology, University of Duisburg-Essen, 45117, Essen, Germany
| | - Roland H Stauber
- Molecular and Cellular Oncology, ENT Department, University Mainz Medical Center, 55131, Mainz, Germany
| | - Carsten Schmuck
- Organic Chemistry, University of Duisburg-Essen, 45117, Essen, Germany
| | - Shirley K Knauer
- Molecular Biology, University of Duisburg-Essen, 45117, Essen, Germany
| |
Collapse
|
13
|
Chou HH, Hsu CT, Hsu CW, Yao KH, Wang HC, Hsieh SY. Novel Algorithm for Improved Protein Classification Using Graph Similarity. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:3135-3143. [PMID: 34748498 DOI: 10.1109/tcbb.2021.3125836] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Considerable sequence data are produced in genome annotation projects that relate to molecular levels, structural similarities, and molecular and biological functions. In structural genomics, the most essential task involves resolving protein structures efficiently with hardware or software, understanding these structures, and assigning their biological functions. Understanding the characteristics and functions of proteins enables the exploration of the molecular mechanisms of life. In this paper, we examine the problems of protein classification. Because they perform similar biological functions, proteins in the same family usually share similar structural characteristics. We employed this premise in designing a classification algorithm. In this algorithm, auxiliary graphs are used to represent proteins, with every amino acid in a protein to a vertex in a graph. Moreover, the links between amino acids correspond to the edges between the vertices. The proposed algorithm classifies proteins according to the similarities in their graphical structures. The proposed algorithm is efficient and accurate in distinguishing proteins from different families and outperformed related algorithms experimentally.
Collapse
|
14
|
Goudey B, Geard N, Verspoor K, Zobel J. Propagation, detection and correction of errors using the sequence database network. Brief Bioinform 2022; 23:6764545. [PMID: 36266246 PMCID: PMC9677457 DOI: 10.1093/bib/bbac416] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Revised: 07/31/2022] [Accepted: 08/28/2022] [Indexed: 12/14/2022] Open
Abstract
Nucleotide and protein sequences stored in public databases are the cornerstone of many bioinformatics analyses. The records containing these sequences are prone to a wide range of errors, including incorrect functional annotation, sequence contamination and taxonomic misclassification. One source of information that can help to detect errors are the strong interdependency between records. Novel sequences in one database draw their annotations from existing records, may generate new records in multiple other locations and will have varying degrees of similarity with existing records across a range of attributes. A network perspective of these relationships between sequence records, within and across databases, offers new opportunities to detect-or even correct-erroneous entries and more broadly to make inferences about record quality. Here, we describe this novel perspective of sequence database records as a rich network, which we call the sequence database network, and illustrate the opportunities this perspective offers for quantification of database quality and detection of spurious entries. We provide an overview of the relevant databases and describe how the interdependencies between sequence records across these databases can be exploited by network analyses. We review the process of sequence annotation and provide a classification of sources of error, highlighting propagation as a major source. We illustrate the value of a network perspective through three case studies that use network analysis to detect errors, and explore the quality and quantity of critical relationships that would inform such network analyses. This systematic description of a network perspective of sequence database records provides a novel direction to combat the proliferation of errors within these critical bioinformatics resources.
Collapse
Affiliation(s)
- Benjamin Goudey
- Corresponding author. Benjamin Goudey, School of Computing and Information Systems, University of Melbourne Parkville, Victoria, 3010,
| | - Nicholas Geard
- School of Computing and Information Systems, University of Melbourne Parkville, Victoria, 3010
| | - Karin Verspoor
- School of Computing Technologies, RMIT University Melbourne, Victoria, 3000
| | - Justin Zobel
- School of Computing and Information Systems, University of Melbourne Parkville, Victoria, 3010
| |
Collapse
|
15
|
Kuang D, Issakova D, Kim J. Learning Proteome Domain Folding Using LSTMs in an Empirical Kernel Space. J Mol Biol 2022; 434:167686. [PMID: 35716781 DOI: 10.1016/j.jmb.2022.167686] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Revised: 06/08/2022] [Accepted: 06/10/2022] [Indexed: 11/30/2022]
Abstract
The recognition of protein structural folds is the starting point for protein function inference and for many structural prediction tools. We previously introduced the idea of using empirical comparisons to create a data-augmented feature space called PESS (Protein Empirical Structure Space)1 as a novel approach for protein structure prediction. Here, we extend the previous approach by generating the PESS feature space over fixed-length subsequences of query peptides, and applying a sequential neural network model, with one long short-term memory cell layer followed by a fully connected layer. Using this approach, we show that only a small group of domains as a training set is needed to achieve near state-of-the-art accuracy on fold recognition. Our method improves on the previous approach by reducing the training set required and improving the model's ability to generalize across species, which will help fold prediction for newly discovered proteins.
Collapse
Affiliation(s)
- Da Kuang
- Department of Computer and Information Science, University of Pennsylvania, Philadelphia, USA.
| | - Dina Issakova
- Department of Biology, University of Pennsylvania, Philadelphia, USA.
| | - Junhyong Kim
- Department of Computer and Information Science, University of Pennsylvania, Philadelphia, USA; Department of Biology, University of Pennsylvania, Philadelphia, USA.
| |
Collapse
|
16
|
Körner P. Hydrothermal Degradation of Amino Acids. CHEMSUSCHEM 2021; 14:4947-4957. [PMID: 34498812 PMCID: PMC9292971 DOI: 10.1002/cssc.202101487] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Revised: 09/09/2021] [Indexed: 05/05/2023]
Abstract
Within the past years, hydrothermal processes have gathered much attention as promising conversion technologies for especially wet biomass. Amino acids are an integral component of biomass, zoo biomass in particular. However, what happens to them during hydrothermal treatment? Reviewing the available literature going back to the mid of the 20th century revealed an astonishing, but still fragmentary view. In fact, two universal degradation reactions could be identified (i. e., deamination and decarboxylation), competing with each other. Thereby, small structural differences may obviously have huge impacts on the fate of individual amino acids. Nevertheless, the amount of available experimental data is relatively scarce in many cases. In this work, the available knowledge about the degradation of 20 proteinogenic amino acids under hydrothermal conditions was presented and discussed critically. The hydrothermal conversion of proteinaceus biomass as well as the Maillard reaction, both extensively reviewed elsewhere, were only touched on.
Collapse
Affiliation(s)
- Paul Körner
- DBFZ - Deutsches Biomasseforschungszentrum gemeinnützige GmbH Biorefineries DepartmentTorgauer Straße 11604347LeipzigGermany
| |
Collapse
|
17
|
Re-entrant swelling and redissolution of polyelectrolytes arises from an increased electrostatic decay length at high salt concentrations. J Colloid Interface Sci 2020; 579:369-378. [DOI: 10.1016/j.jcis.2020.06.072] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Revised: 06/15/2020] [Accepted: 06/16/2020] [Indexed: 11/24/2022]
|
18
|
Rayamajhi S, Aryal S. Surface functionalization strategies of extracellular vesicles. J Mater Chem B 2020; 8:4552-4569. [PMID: 32377649 DOI: 10.1039/d0tb00744g] [Citation(s) in RCA: 76] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Extracellular vesicles (EVs) are lipid-protein bilayer vesicular constructs secreted to the extracellular spaces by cells. All cells secrete EVs as a regular biological process that appears to be conserved throughout the evolution. Owing to the rich molecular cargo of EVs with specific lipid and protein content and documented role in cellular communication, EVs have been exploited as a versatile agent in the biomedical arena, including as diagnostic, drug delivery, immunomodulatory, and therapeutic agents. With these multifaceted applications in the biomedical field, the functionalization of EVs to add diverse functionality has garnered rapid attention. EVs can be functionalized with an exogenous imaging and targeting moiety that allows for the target specificity and the real-time tracking of EVs for diagnostic and therapeutic applications. Importantly, such added functionalities can be used to explore EVs' biogenesis pathway and their role in cellular communication, which can lead to a better understanding of EVs' cellular mechanisms and processes. In this report, we have reviewed the existing surface functionalization strategies of EVs and broadly classified them into three major approaches: physical, biological, and chemical approaches. The physical approach of EV functionalization includes methods like sonication, extrusion, and freeze-thaw that can change the surface properties of EVs via membrane rearrangements. The biological approach includes genetically and metabolically engineering cells to express protein or cargo molecules of interest in secreted EVs. The chemical approach includes different facile click type chemistries that can be used to covalently conjugate the EV lipid or protein construct with different linker groups for diverse functionality. Different chemistries like thiol-maleimide, EDC/NHS, azide-alkyne cycloaddition, and amidation chemistry have been discussed to functionalize EVs. Finally, a comparative discussion of all approaches has been done focusing on the significance of each approach. The collective knowledge of the major approach of surface functionalization can be used to improve the limitation of one technique by combining it with another. An optimized surface functionalization approach developed accordingly can efficiently add required functionality to EVs while maintaining their natural integrity.
Collapse
Affiliation(s)
- Sagar Rayamajhi
- Department of Chemistry, Nanotechnology Innovation Center of Kansas State (NICKS), Kansas State University, Manhattan, KS 66506, USA.
| | | |
Collapse
|
19
|
Chasapi A, Aivaliotis M, Angelis L, Chanalaris A, Iliopoulos I, Kappas I, Karapiperis C, Kyrpides NC, Pafilis E, Panteris E, Topalis P, Tsiamis G, Vizirianakis IS, Vlassi M, Promponas VJ, Ouzounis CA. Establishment of computational biology in Greece and Cyprus: Past, present, and future. PLoS Comput Biol 2019; 15:e1007532. [PMID: 31856214 PMCID: PMC6922331 DOI: 10.1371/journal.pcbi.1007532] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Affiliation(s)
- Anastasia Chasapi
- Biological Computation & Process Lab, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica, Greece
| | - Michalis Aivaliotis
- School of Medicine, Faculty of Health Sciences, Aristotle University of Thessaloniki, Thessalonica, Greece
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology Hellas, Heraklion, Greece
| | - Lefteris Angelis
- School of Informatics, Aristotle University of Thessaloniki, Thessalonica, Greece
| | - Anastasios Chanalaris
- Botnar Research Centre, NDORMS, Medical Sciences Division, University of Oxford, Oxford, United Kingdom
| | - Ioannis Iliopoulos
- Division of Basic Sciences, School of Medicine, University of Crete, Heraklion, Greece
| | - Ilias Kappas
- School of Biology, Aristotle University of Thessaloniki, Thessalonica, Greece
| | - Christos Karapiperis
- School of Informatics, Aristotle University of Thessaloniki, Thessalonica, Greece
| | - Nikos C. Kyrpides
- Department of Energy, Joint Genome Institute, Walnut Creek, California, United States of America
| | - Evangelos Pafilis
- Institute of Marine Biology Biotechnology and Aquaculture, Hellenic Centre for Marine Research, Heraklion, Greece
| | - Eleftherios Panteris
- First Psychiatric Clinic, Papageorgiou General Hospital, Aristotle University of Thessaloniki, Thessalonica, Greece
| | - Pantelis Topalis
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology Hellas, Heraklion, Greece
| | - George Tsiamis
- Department of Environmental Engineering, School of Engineering, University of Patras, Patras, Greece
| | - Ioannis S. Vizirianakis
- Department of Pharmacology, School of Pharmacy, Aristotle University of Thessaloniki, Thessalonica, Greece
| | - Metaxia Vlassi
- Institute of Biosciences & Applications, National Centre for Scientific Research Demokritos, Athens, Greece
| | - Vasilis J. Promponas
- Bioinformatics Research Laboratory, Department of Biological Sciences, University of Cyprus, Nicosia, Cyprus
- * E-mail: (VJP); (CAO)
| | - Christos A. Ouzounis
- Biological Computation & Process Lab, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica, Greece
- * E-mail: (VJP); (CAO)
| |
Collapse
|
20
|
Hosoyama K, Lazurko C, Muñoz M, McTiernan CD, Alarcon EI. Peptide-Based Functional Biomaterials for Soft-Tissue Repair. Front Bioeng Biotechnol 2019; 7:205. [PMID: 31508416 PMCID: PMC6716508 DOI: 10.3389/fbioe.2019.00205] [Citation(s) in RCA: 73] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Accepted: 08/09/2019] [Indexed: 11/15/2022] Open
Abstract
Synthetically derived peptide-based biomaterials are in many instances capable of mimicking the structure and function of their full-length endogenous counterparts. Combine this with the fact that short mimetic peptides are easier to produce when compared to full length proteins, show enhanced processability and ease of modification, and have the ability to be prepared under well-defined and controlled conditions; it becomes obvious why there has been a recent push to develop regenerative biomaterials from these molecules. There is increasing evidence that the incorporation of peptides within regenerative scaffolds can result in the generation of structural recognition motifs that can enhance cell attachment or induce cell signaling pathways, improving cell infiltration or promote a variety of other modulatory biochemical responses. By highlighting the current approaches in the design and application of short mimetic peptides, we hope to demonstrate their potential in soft-tissue healing while at the same time drawing attention to the advances made to date and the problems which need to be overcome to advance these materials to the clinic for applications in heart, skin, and cornea repair.
Collapse
Affiliation(s)
- Katsuhiro Hosoyama
- Division of Cardiac Surgery Research, University of Ottawa Heart Institute, Ottawa, ON, Canada
| | - Caitlin Lazurko
- Division of Cardiac Surgery Research, University of Ottawa Heart Institute, Ottawa, ON, Canada.,Biochemistry, Microbiology and Immunology Department, Faculty of Medicine, University of Ottawa, Ottawa, ON, Canada
| | - Marcelo Muñoz
- Division of Cardiac Surgery Research, University of Ottawa Heart Institute, Ottawa, ON, Canada
| | - Christopher D McTiernan
- Division of Cardiac Surgery Research, University of Ottawa Heart Institute, Ottawa, ON, Canada
| | - Emilio I Alarcon
- Division of Cardiac Surgery Research, University of Ottawa Heart Institute, Ottawa, ON, Canada.,Biochemistry, Microbiology and Immunology Department, Faculty of Medicine, University of Ottawa, Ottawa, ON, Canada
| |
Collapse
|
21
|
Azad A, Pavlopoulos GA, Ouzounis CA, Kyrpides NC, Buluç A. HipMCL: a high-performance parallel implementation of the Markov clustering algorithm for large-scale networks. Nucleic Acids Res 2019; 46:e33. [PMID: 29315405 PMCID: PMC5888241 DOI: 10.1093/nar/gkx1313] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2017] [Accepted: 01/02/2018] [Indexed: 11/13/2022] Open
Abstract
Biological networks capture structural or functional properties of relevant entities such as molecules, proteins or genes. Characteristic examples are gene expression networks or protein–protein interaction networks, which hold information about functional affinities or structural similarities. Such networks have been expanding in size due to increasing scale and abundance of biological data. While various clustering algorithms have been proposed to find highly connected regions, Markov Clustering (MCL) has been one of the most successful approaches to cluster sequence similarity or expression networks. Despite its popularity, MCL’s scalability to cluster large datasets still remains a bottleneck due to high running times and memory demands. Here, we present High-performance MCL (HipMCL), a parallel implementation of the original MCL algorithm that can run on distributed-memory computers. We show that HipMCL can efficiently utilize 2000 compute nodes and cluster a network of ∼70 million nodes with ∼68 billion edges in ∼2.4 h. By exploiting distributed-memory environments, HipMCL clusters large-scale networks several orders of magnitude faster than MCL and enables clustering of even bigger networks. HipMCL is based on MPI and OpenMP and is freely available under a modified BSD license.
Collapse
Affiliation(s)
- Ariful Azad
- Computational Research Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720-8150, USA
| | - Georgios A Pavlopoulos
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Christos A Ouzounis
- Biological Computation & Process Laboratory, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica 57001, Greece
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Aydin Buluç
- Computational Research Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720-8150, USA.,Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720, USA
| |
Collapse
|
22
|
Chen C, Wang Q, Huang H, Vinayaka CR, Garavelli JS, Arighi CN, Natale DA, Wu CH. PIRSitePredict for protein functional site prediction using position-specific rules. Database (Oxford) 2019; 2019:baz026. [PMID: 30805646 PMCID: PMC6389862 DOI: 10.1093/database/baz026] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Revised: 01/24/2019] [Accepted: 02/04/2019] [Indexed: 11/14/2022]
Abstract
Methods focused on predicting 'global' annotations for proteins (such as molecular function, biological process and presence of domains or membership in a family) have reached a relatively mature stage. Methods to provide fine-grained 'local' annotation of functional sites (at the level of individual amino acid) are now coming to the forefront, especially in light of the rapid accumulation of genetic variant data. We have developed a computational method and workflow that predicts functional sites within proteins using position-specific conditional template annotation rules (namely PIR Site Rules or PIRSRs for short). Such rules are curated through review of known protein structural and other experimental data by structural biologists and are used to generate high-quality annotations for the UniProt Knowledgebase (UniProtKB) unreviewed section. To share the PIRSR functional site prediction method with the broader scientific community, we have streamlined our workflow and developed a stand-alone Java software package named PIRSitePredict. We demonstrate the use of PIRSitePredict for functional annotation of de novo assembled genome/transcriptome by annotating uncharacterized proteins from Trinity RNA-seq assembly of embryonic transcriptomes of the following three cartilaginous fishes: Leucoraja erinacea (Little Skate), Scyliorhinus canicula (Small-spotted Catshark) and Callorhinchus milii (Elephant Shark). On average about 1200 lines of annotations were predicted for each species.
Collapse
Affiliation(s)
- Chuming Chen
- Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE, USA
- Department of Computer and Information Sciences, University of Delaware, Newark, DE, USA
| | - Qinghua Wang
- Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE, USA
- Department of Computer and Information Sciences, University of Delaware, Newark, DE, USA
| | - Hongzhan Huang
- Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE, USA
- Department of Computer and Information Sciences, University of Delaware, Newark, DE, USA
| | | | - John S Garavelli
- Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE, USA
| | - Cecilia N Arighi
- Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE, USA
- Department of Computer and Information Sciences, University of Delaware, Newark, DE, USA
| | - Darren A Natale
- Protein Information Resource, Georgetown University Medical Center, Washington, DC, USA
| | - Cathy H Wu
- Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE, USA
- Department of Computer and Information Sciences, University of Delaware, Newark, DE, USA
- Protein Information Resource, Georgetown University Medical Center, Washington, DC, USA
| |
Collapse
|
23
|
Affiliation(s)
- Samin Seddigh
- Department of Plant Protection, College of Agriculture, Varamin-Pishva Branch, Islamic Azad University, Varamin, Iran
| |
Collapse
|
24
|
Li Z, Shao S, Ren X, Sun J, Guo Z, Wang S, Song MM, Chang CEA, Xue M. Construction of a Sequenceable Protein Mimetic Peptide Library with a True 3D Diversifiable Chemical Space. J Am Chem Soc 2018; 140:14552-14556. [PMID: 30362722 DOI: 10.1021/jacs.8b08338] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
We present here a library of protein mimetic bicyclic peptides. These nanosized structures exhibit rigid backbones and spatially diversifiable side chains. They present modular amino acids on all three linkages, providing access to a true 3D diversifiable chemical space. These peptides are synthesized through a Cu-catalyzed click reaction and a Ru-catalyzed ring-closing metathesis reaction. Their bicyclic topology can be reduced to a linear one, using Edman degradation and Pd-catalyzed deallylation reactions. The linearization approaches allow de novo sequencing through mass spectrometry methods. We demonstrate the function of a particular peptide that was identified through a high throughput screening against the E363-R378 epitope on the intrinsically disordered c-Myc oncoprotein. Intracellular delivery of this peptide could interfere with the c-Myc-mediated transcription and inhibit proliferation in a human glioblastoma cell line.
Collapse
Affiliation(s)
- Zhonghan Li
- Department of Chemistry , University of California, Riverside , Riverside , California 92521 , United States
| | - Shiqun Shao
- Department of Chemistry , University of California, Riverside , Riverside , California 92521 , United States
| | - Xiaodong Ren
- Department of Chemistry , University of California, Riverside , Riverside , California 92521 , United States
| | - Jianan Sun
- Department of Chemistry , University of California, Riverside , Riverside , California 92521 , United States
| | - Zhili Guo
- Department of Chemistry , University of California, Riverside , Riverside , California 92521 , United States
| | - Siwen Wang
- Department of Chemistry , University of California, Riverside , Riverside , California 92521 , United States
| | - Michelle M Song
- Martin Luther King High School , Riverside , California 92508 , United States
| | - Chia-En A Chang
- Department of Chemistry , University of California, Riverside , Riverside , California 92521 , United States
| | - Min Xue
- Department of Chemistry , University of California, Riverside , Riverside , California 92521 , United States
| |
Collapse
|
25
|
Silveira MC, Azevedo da Silva R, Faria da Mota F, Catanho M, Jardim R, R Guimarães AC, de Miranda AB. Systematic Identification and Classification of β-Lactamases Based on Sequence Similarity Criteria: β-Lactamase Annotation. Evol Bioinform Online 2018; 14:1176934318797351. [PMID: 30210232 PMCID: PMC6131288 DOI: 10.1177/1176934318797351] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Accepted: 08/08/2018] [Indexed: 12/11/2022] Open
Abstract
β-lactamases, the enzymes responsible for resistance to β-lactam antibiotics, are
widespread among prokaryotic genera. However, current β-lactamase classification
schemes do not represent their present diversity. Here, we propose a workflow to
identify and classify β-lactamases. Initially, a set of curated sequences was
used as a model for the construction of profiles Hidden Markov Models (HMM),
specific for each β-lactamase class. An extensive, nonredundant set of
β-lactamase sequences was constructed from 7 different resistance proteins
databases to test the methodology. The profiles HMM were improved for their
specificity and sensitivity and then applied to fully assembled genomes. Five
hierarchical classification levels are described, and a new class of
β-lactamases with fused domains is proposed. Our profiles HMM provide a better
annotation of β-lactamases, with classes and subclasses defined by objective
criteria such as sequence similarity. This classification offers a solid base to
the elaboration of studies on the diversity, dispersion, prevalence, and
evolution of the different classes and subclasses of this critical enzymatic
activity.
Collapse
Affiliation(s)
- Melise Chaves Silveira
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| | - Rangeline Azevedo da Silva
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| | - Fábio Faria da Mota
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| | - Marcos Catanho
- Laboratório de Genômica Funcional e Bioinformática, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| | - Rodrigo Jardim
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| | - Ana Carolina R Guimarães
- Laboratório de Genômica Funcional e Bioinformática, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| | - Antonio B de Miranda
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| |
Collapse
|
26
|
Danchin A, Ouzounis C, Tokuyasu T, Zucker JD. No wisdom in the crowd: genome annotation in the era of big data - current status and future prospects. Microb Biotechnol 2018; 11:588-605. [PMID: 29806194 PMCID: PMC6011933 DOI: 10.1111/1751-7915.13284] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Science and engineering rely on the accumulation and dissemination of knowledge to make discoveries and create new designs. Discovery-driven genome research rests on knowledge passed on via gene annotations. In response to the deluge of sequencing big data, standard annotation practice employs automated procedures that rely on majority rules. We argue this hinders progress through the generation and propagation of errors, leading investigators into blind alleys. More subtly, this inductive process discourages the discovery of novelty, which remains essential in biological research and reflects the nature of biology itself. Annotation systems, rather than being repositories of facts, should be tools that support multiple modes of inference. By combining deduction, induction and abduction, investigators can generate hypotheses when accurate knowledge is extracted from model databases. A key stance is to depart from 'the sequence tells the structure tells the function' fallacy, placing function first. We illustrate our approach with examples of critical or unexpected pathways, using MicroScope to demonstrate how tools can be implemented following the principles we advocate. We end with a challenge to the reader.
Collapse
Affiliation(s)
- Antoine Danchin
- Integromics, Institute of Cardiometabolism and Nutrition, Hôpital de la Pitié-Salpêtrière, 47 Boulevard de l'Hôpital, 75013, Paris, France
- School of Biomedical Sciences, Li KaShing Faculty of Medicine, Hong Kong University, 21 Sassoon Road, Pokfulam, Hong Kong
| | - Christos Ouzounis
- Biological Computation and Process Laboratory, Centre for Research and Technology Hellas, Chemical Process and Energy Resources Institute, Thessalonica, 57001, Greece
| | - Taku Tokuyasu
- Shenzhen Institutes of Advanced Technology, Institute of Synthetic Biology, Shenzhen University Town, 1068 Xueyuan Avenue, Shenzhen, China
| | - Jean-Daniel Zucker
- Integromics, Institute of Cardiometabolism and Nutrition, Hôpital de la Pitié-Salpêtrière, 47 Boulevard de l'Hôpital, 75013, Paris, France
| |
Collapse
|
27
|
Yuan S, Chan HCS, Filipek S, Vogel H. PyMOL and Inkscape Bridge the Data and the Data Visualization. Structure 2018; 24:2041-2042. [PMID: 27926832 DOI: 10.1016/j.str.2016.11.012] [Citation(s) in RCA: 135] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Affiliation(s)
- Shuguang Yuan
- Laboratory of Physical Chemistry of Polymers and Membranes, Ecole Polytechnique Fédérale de Lausanne (EPFL), CH B3 495 (Bâtiment CH) Station 6, Lausanne CH-1015, Switzerland.
| | - H C Stephen Chan
- Faculty of Life Sciences, University of Bradford, Bradford, West Yorkshire BD7 1DP, UK
| | - Slawomir Filipek
- Laboratory of Biomodeling, Faculty of Chemistry & Biological and Chemical Research Centre, University of Warsaw, ul. Pasteura 1, Warsaw 02-093, Poland
| | - Horst Vogel
- Laboratory of Physical Chemistry of Polymers and Membranes, Ecole Polytechnique Fédérale de Lausanne (EPFL), CH B3 495 (Bâtiment CH) Station 6, Lausanne CH-1015, Switzerland
| |
Collapse
|
28
|
Seddigh S. Comprehensive comparison of two protein family of P-ATPases (13A1 and 13A3) in insects. Comput Biol Chem 2017; 68:266-281. [DOI: 10.1016/j.compbiolchem.2017.04.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Revised: 01/06/2017] [Accepted: 04/12/2017] [Indexed: 01/22/2023]
|
29
|
Seddigh S, Darabi M. Functional, structural, and phylogenetic analysis of mitochondrial cytochrome b (cytb) in insects. Mitochondrial DNA A DNA Mapp Seq Anal 2017; 29:236-249. [DOI: 10.1080/24701394.2016.1275596] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Affiliation(s)
- Samin Seddigh
- Department of Plant Protection, College of Agriculture, Varamin-Pishva Branch, Islamic Azad University, Varamin, Iran
| | - Maryam Darabi
- Department of Agronomy and Plant Breeding Sciences, College of Aboureihan, University of Tehran, Tehran, Iran
| |
Collapse
|
30
|
Baiesi M, Orlandini E, Trovato A, Seno F. Linking in domain-swapped protein dimers. Sci Rep 2016; 6:33872. [PMID: 27659606 PMCID: PMC5034241 DOI: 10.1038/srep33872] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Accepted: 09/05/2016] [Indexed: 11/24/2022] Open
Abstract
The presence of knots has been observed in a small fraction of single-domain proteins and related to their thermodynamic and kinetic properties. The exchanging of identical structural elements, typical of domain-swapped proteins, makes such dimers suitable candidates to validate the possibility that mutual entanglement between chains may play a similar role for protein complexes. We suggest that such entanglement is captured by the linking number. This represents, for two closed curves, the number of times that each curve winds around the other. We show that closing the curves is not necessary, as a novel parameter G', termed Gaussian entanglement, is strongly correlated with the linking number. Based on 110 non redundant domain-swapped dimers, our analysis evidences a high fraction of chains with a significant intertwining, that is with |G'| > 1. We report that Nature promotes configurations with negative mutual entanglement and surprisingly, it seems to suppress intertwining in long protein dimers. Supported by numerical simulations of dimer dissociation, our results provide a novel topology-based classification of protein-swapped dimers together with some preliminary evidence of its impact on their physical and biological properties.
Collapse
Affiliation(s)
- Marco Baiesi
- Department of Physics and Astronomy, University of Padova, Padova, Italy
- INFN-Sezione di Padova, Padova, Italy
| | - Enzo Orlandini
- Department of Physics and Astronomy, University of Padova, Padova, Italy
- INFN-Sezione di Padova, Padova, Italy
| | - Antonio Trovato
- Department of Physics and Astronomy, University of Padova, Padova, Italy
- CNISM-Unità di Padova, Padova, Italy
| | - Flavio Seno
- Department of Physics and Astronomy, University of Padova, Padova, Italy
- CNISM-Unità di Padova, Padova, Italy
| |
Collapse
|
31
|
Kingsley LJ, Lill MA. Substrate tunnels in enzymes: structure-function relationships and computational methodology. Proteins 2015; 83:599-611. [PMID: 25663659 DOI: 10.1002/prot.24772] [Citation(s) in RCA: 112] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2014] [Revised: 01/08/2015] [Accepted: 01/14/2015] [Indexed: 12/14/2022]
Abstract
In enzymes, the active site is the location where incoming substrates are chemically converted to products. In some enzymes, this site is deeply buried within the core of the protein, and, in order to access the active site, substrates must pass through the body of the protein via a tunnel. In many systems, these tunnels act as filters and have been found to influence both substrate specificity and catalytic mechanism. Identifying and understanding how these tunnels exert such control has been of growing interest over the past several years because of implications in fields such as protein engineering and drug design. This growing interest has spurred the development of several computational methods to identify and analyze tunnels and how ligands migrate through these tunnels. The goal of this review is to outline how tunnels influence substrate specificity and catalytic efficiency in enzymes with buried active sites and to provide a brief summary of the computational tools used to identify and evaluate these tunnels.
Collapse
Affiliation(s)
- Laura J Kingsley
- Department of Medicinal Chemistry and Molecular Pharmacology, College of Pharmacy, Purdue University, West Lafayette, Indiana
| | | |
Collapse
|
32
|
Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, Simonovic M, Roth A, Santos A, Tsafou KP, Kuhn M, Bork P, Jensen LJ, von Mering C. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res 2014; 43:D447-52. [PMID: 25352553 PMCID: PMC4383874 DOI: 10.1093/nar/gku1003] [Citation(s) in RCA: 7421] [Impact Index Per Article: 674.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The many functional partnerships and interactions that occur between proteins are at the core of cellular processing and their systematic characterization helps to provide context in molecular systems biology. However, known and predicted interactions are scattered over multiple resources, and the available data exhibit notable differences in terms of quality and completeness. The STRING database (http://string-db.org) aims to provide a critical assessment and integration of protein–protein interactions, including direct (physical) as well as indirect (functional) associations. The new version 10.0 of STRING covers more than 2000 organisms, which has necessitated novel, scalable algorithms for transferring interaction information between organisms. For this purpose, we have introduced hierarchical and self-consistent orthology annotations for all interacting proteins, grouping the proteins into families at various levels of phylogenetic resolution. Further improvements in version 10.0 include a completely redesigned prediction pipeline for inferring protein–protein associations from co-expression data, an API interface for the R computing environment and improved statistical analysis for enrichment tests in user-provided networks.
Collapse
Affiliation(s)
- Damian Szklarczyk
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| | - Andrea Franceschini
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| | - Stefan Wyder
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| | | | - Davide Heller
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| | | | - Milan Simonovic
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| | - Alexander Roth
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| | - Alberto Santos
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Kalliopi P Tsafou
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Michael Kuhn
- Biotechnology Center, Technische Universität Dresden, 01062 Dresden, Germany Max Planck Institute of Molecular Cell Biology and Genetics, 01062 Dresden, Germany
| | - Peer Bork
- European Molecular Biology Laboratory, 69117 Heidelberg, Germany
| | - Lars J Jensen
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, 2200 Copenhagen N, Denmark
| | - Christian von Mering
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, 8057 Zurich, Switzerland
| |
Collapse
|
33
|
Ben-Tal N, Kolodny R. Representation of the Protein Universe using Classifications, Maps, and Networks. Isr J Chem 2014. [DOI: 10.1002/ijch.201400001] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
34
|
Katsani KR, Irimia M, Karapiperis C, Scouras ZG, Blencowe BJ, Promponas VJ, Ouzounis CA. Functional genomics evidence unearths new moonlighting roles of outer ring coat nucleoporins. Sci Rep 2014; 4:4655. [PMID: 24722254 PMCID: PMC3983603 DOI: 10.1038/srep04655] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2013] [Accepted: 02/21/2014] [Indexed: 01/03/2023] Open
Abstract
There is growing evidence for the involvement of Y-complex nucleoporins (Y-Nups) in cellular processes beyond the inner core of nuclear pores of eukaryotes. To comprehensively assess the range of possible functions of Y-Nups, we delimit their structural and functional properties by high-specificity sequence profiles and tissue-specific expression patterns. Our analysis establishes the presence of Y-Nups across eukaryotes with novel composite domain architectures, supporting new moonlighting functions in DNA repair, RNA processing, signaling and mitotic control. Y-Nups associated with a select subset of the discovered domains are found to be under tight coordinated regulation across diverse human and mouse cell types and tissues, strongly implying that they function in conjunction with the nuclear pore. Collectively, our results unearth an expanded network of Y-Nup interactions, thus supporting the emerging view of the Y-complex as a dynamic protein assembly with diverse functional roles in the cell.
Collapse
Affiliation(s)
- Katerina R Katsani
- Department of Molecular Biology & Genetics, Democritus University of Thrace, GR-68100 Alexandroupolis, Greece
| | - Manuel Irimia
- Donnelly Centre for Cellular & Biomolecular Research, University of Toronto, 160 College Street, Toronto, Ontario M5S 3E1, Canada
| | - Christos Karapiperis
- Department of Genetics, Development & Molecular Biology, School of Biology, Faculty of Sciences, Aristotle University of Thessaloniki, GR-54124 Thessalonica, Greece
| | - Zacharias G Scouras
- Department of Genetics, Development & Molecular Biology, School of Biology, Faculty of Sciences, Aristotle University of Thessaloniki, GR-54124 Thessalonica, Greece
| | - Benjamin J Blencowe
- Donnelly Centre for Cellular & Biomolecular Research, University of Toronto, 160 College Street, Toronto, Ontario M5S 3E1, Canada
| | - Vasilis J Promponas
- Bioinformatics Research Laboratory, Department of Biological Sciences, University of Cyprus, PO Box 20537, CY-1678 Nicosia, Cyprus
| | - Christos A Ouzounis
- 1] Donnelly Centre for Cellular & Biomolecular Research, University of Toronto, 160 College Street, Toronto, Ontario M5S 3E1, Canada [2] Bioinformatics Research Laboratory, Department of Biological Sciences, University of Cyprus, PO Box 20537, CY-1678 Nicosia, Cyprus [3] Institute of Applied Biosciences, Centre for Research & Technology, PO Box 361, GR-57001 Thessalonica, Greece [4]
| |
Collapse
|
35
|
Psomopoulos FE, Mitkas PA, Ouzounis CA. Detection of genomic idiosyncrasies using fuzzy phylogenetic profiles. PLoS One 2013; 8:e52854. [PMID: 23341912 PMCID: PMC3544837 DOI: 10.1371/journal.pone.0052854] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2012] [Accepted: 11/22/2012] [Indexed: 11/18/2022] Open
Abstract
Phylogenetic profiles express the presence or absence of genes and their homologs across a number of reference genomes. They have emerged as an elegant representation framework for comparative genomics and have been used for the genome-wide inference and discovery of functionally linked genes or metabolic pathways. As the number of reference genomes grows, there is an acute need for faster and more accurate methods for phylogenetic profile analysis with increased performance in speed and quality. We propose a novel, efficient method for the detection of genomic idiosyncrasies, i.e. sets of genes found in a specific genome with peculiar phylogenetic properties, such as intra-genome correlations or inter-genome relationships. Our algorithm is a four-step process where genome profiles are first defined as fuzzy vectors, then discretized to binary vectors, followed by a de-noising step, and finally a comparison step to generate intra- and inter-genome distances for each gene profile. The method is validated with a carefully selected benchmark set of five reference genomes, using a range of approaches regarding similarity metrics and pre-processing stages for noise reduction. We demonstrate that the fuzzy profile method consistently identifies the actual phylogenetic relationship and origin of the genes under consideration for the majority of the cases, while the detected outliers are found to be particular genes with peculiar phylogenetic patterns. The proposed method provides a time-efficient and highly scalable approach for phylogenetic stratification, with the detected groups of genes being either similar to their own genome profile or different from it, thus revealing atypical evolutionary histories.
Collapse
Affiliation(s)
- Fotis E. Psomopoulos
- Department of Electrical and Computer Engineering, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Pericles A. Mitkas
- Department of Electrical and Computer Engineering, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Christos A. Ouzounis
- Centre for Bioinformatics, Department of Informatics, School of Natural and Mathematical Sciences, King’s College London, Strand, London, United Kingdom
- * E-mail:
| |
Collapse
|
36
|
Manoharan M, Sankar K, Offmann B, Ramanathan S. Association of Putative Members to Family of Mosquito Odorant Binding Proteins: Scoring Scheme Using Fuzzy Functional Templates and Cys Residue Positions. Bioinform Biol Insights 2013; 7:231-51. [PMID: 23908587 PMCID: PMC3728099 DOI: 10.4137/bbi.s11096] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Proteins may be related to each other very specifically as homologous subfamilies. Proteins can also be related to diverse proteins at the super family level. It has become highly important to characterize the existing sequence databases by their signatures to facilitate the function annotation of newly added sequences. The algorithm described here uses a scheme for the classification of odorant binding proteins on the basis of functional residues and Cys-pairing. The cysteine-based scoring scheme not only helps in unambiguously identifying families like odorant binding proteins (OBPs), but also aids in their classification at the subfamily level with reliable accuracy. The algorithm was also applied to yet another cysteine-rich family, where similar accuracy was observed that ensures the application of the protocol to other families.
Collapse
Affiliation(s)
- Malini Manoharan
- Université de La Reunion, DSIMB, INSERM UMR-S 665, La Reunion, France
- National Centre for Biological Sciences, Tata Institute for Fundamental Research, GKVK campus, Bangalore, INDIA
- Manipal University, Madhav Nagar, Manipal, Karnataka, India
| | - Kannan Sankar
- National Centre for Biological Sciences, Tata Institute for Fundamental Research, GKVK campus, Bangalore, INDIA
- Birla Institute of Technology, Pilani, Rajasthan, India
- Current address: Iowa State University, Ames, IA, USA
| | - Bernard Offmann
- Université de La Reunion, DSIMB, INSERM UMR-S 665, La Reunion, France
- Université de Nantes, UFIP CNRS FRE 3478, Nantes, France
| | - Sowdhamini Ramanathan
- National Centre for Biological Sciences, Tata Institute for Fundamental Research, GKVK campus, Bangalore, INDIA
| |
Collapse
|
37
|
Abstract
The observation of a limited secondary-structural alphabet in native proteins, with significant sequence preferences, has profoundly influenced the fields of protein design and structure prediction (Simons, Kooperberg, Huang, & Baker, 1997; Verschueren et al., 2011). In the era of structural genomics, as the size of the structural dataset continues to grow rapidly, it is becoming possible to extend this analysis to tertiary structural motifs and their sequences. For a hypothetical tertiary motif, the rate of its utilization in natural proteins may be used to assess its designability-the ease with which the motif can be realized with natural amino acids. This requires a structural similarity search methodology, which rather than looking for global topological agreement (more appropriate for categorization of full proteins or domains), identifies detailed geometric matches. In this chapter, we introduce such a method, called MaDCaT, and demonstrate its use by assessing the designability landscapes of two tertiary structural motifs. We also show that such analysis can establish structure/sequence links by providing the sequence constraints necessary to encode designable motifs. As logical extension of their secondary-structure counterparts, tertiary structural preferences will likely prove extremely useful in de novo protein design and structure prediction.
Collapse
Affiliation(s)
- Jian Zhang
- Department of Computer Science, Dartmouth College, Fax: 603-646-1672, 6211 Sudikoff Lab, Room 210, Hanover, NH 03755-3510, USA
| | - Gevorg Grigoryan
- Adjunct Professor of Biology, Dartmouth College, Phone: 603-646-3173, Fax: 603-646-1672, 6211 Sudikoff Lab, Room 113, Hanover, NH 03755-3510, USA
| |
Collapse
|
38
|
Abstract
Background Computational sequence analysis, that is, prediction of local sequence properties, homologs, spatial structure and function from the sequence of a protein, offers an efficient way to obtain needed information about proteins under study. Since reliable prediction is usually based on the consensus of many computer programs, meta-severs have been developed to fit such needs. Most meta-servers focus on one aspect of sequence analysis, while others incorporate more information, such as PredictProtein for local sequence feature predictions, SMART for domain architecture and sequence motif annotation, and GeneSilico for secondary and spatial structure prediction. However, as predictions of local sequence properties, three-dimensional structure and function are usually intertwined, it is beneficial to address them together. Results We developed a MEta-Server for protein Sequence Analysis (MESSA) to facilitate comprehensive protein sequence analysis and gather structural and functional predictions for a protein of interest. For an input sequence, the server exploits a number of select tools to predict local sequence properties, such as secondary structure, structurally disordered regions, coiled coils, signal peptides and transmembrane helices; detect homologous proteins and assign the query to a protein family; identify three-dimensional structure templates and generate structure models; and provide predictive statements about the protein's function, including functional annotations, Gene Ontology terms, enzyme classification and possible functionally associated proteins. We tested MESSA on the proteome of Candidatus Liberibacter asiaticus. Manual curation shows that three-dimensional structure models generated by MESSA covered around 75% of all the residues in this proteome and the function of 80% of all proteins could be predicted. Availability MESSA is free for non-commercial use at http://prodata.swmed.edu/MESSA/
Collapse
|
39
|
Kim BH, Chitturi B, Grishin NV. Self consistency grouping: a stringent clustering method. BMC Bioinformatics 2012; 13 Suppl 13:S3. [PMID: 23320864 PMCID: PMC3426801 DOI: 10.1186/1471-2105-13-s13-s3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Numerous types of clustering like single linkage and K-means have been widely studied and applied to a variety of scientific problems. However, the existing methods are not readily applicable for the problems that demand high stringency. Methods Our method, self consistency grouping, i.e. SCG, yields clusters whose members are closer in rank to each other than to any member outside the cluster. We do not define a distance metric; we use the best known distance metric and presume that it measures the correct distance. SCG does not impose any restriction on the size or the number of the clusters that it finds. The boundaries of clusters are determined by the inconsistencies in the ranks. In addition to the direct implementation that finds the complete structure of the (sub)clusters we implemented two faster versions. The fastest version is guaranteed to find only the clusters that are not subclusters of any other clusters and the other version yields the same output as the direct implementation but does so more efficiently. Results Our tests have demonstrated that SCG yields very few false positives. This was accomplished by introducing errors in the distance measurement. Clustering of protein domain representatives by structural similarity showed that SCG could recover homologous groups with high precision. Conclusions SCG has potential for finding biological relationships under stringent conditions.
Collapse
|
40
|
Shameer K, Shingate PN, Manjunath SCP, Karthika M, Pugalenthi G, Sowdhamini R. 3DSwap: curated knowledgebase of proteins involved in 3D domain swapping. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2011; 2011:bar042. [PMID: 21959866 PMCID: PMC3294423 DOI: 10.1093/database/bar042] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Three-dimensional domain swapping is a unique protein structural phenomenon where two or more protein chains in a protein oligomer share a common structural segment between individual chains. This phenomenon is observed in an array of protein structures in oligomeric conformation. Protein structures in swapped conformations perform diverse functional roles and are also associated with deposition diseases in humans. We have performed in-depth literature curation and structural bioinformatics analyses to develop an integrated knowledgebase of proteins involved in 3D domain swapping. The hallmark of 3D domain swapping is the presence of distinct structural segments such as the hinge and swapped regions. We have curated the literature to delineate the boundaries of these regions. In addition, we have defined several new concepts like ‘secondary major interface’ to represent the interface properties arising as a result of 3D domain swapping, and a new quantitative measure for the ‘extent of swapping’ in structures. The catalog of proteins reported in 3DSwap knowledgebase has been generated using an integrated structural bioinformatics workflow of database searches, literature curation, by structure visualization and sequence–structure–function analyses. The current version of the 3DSwap knowledgebase reports 293 protein structures, the analysis of such a compendium of protein structures will further the understanding molecular factors driving 3D domain swapping. Database URL:http://caps.ncbs.res.in/3dswap
Collapse
Affiliation(s)
- Khader Shameer
- National Centre for Biological Sciences, GKVK Campus, Bangalore, Karnataka, India
| | | | | | | | | | | |
Collapse
|
41
|
Gomes MR, Guimarães ACR, de Miranda AB. Specific and nonhomologous isofunctional enzymes of the genetic information processing pathways as potential therapeutical targets for tritryps. Enzyme Res 2011; 2011:543912. [PMID: 21808726 PMCID: PMC3145330 DOI: 10.4061/2011/543912] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2011] [Revised: 03/22/2011] [Accepted: 05/05/2011] [Indexed: 12/03/2022] Open
Abstract
Leishmania major, Trypanosoma brucei, and Trypanosoma cruzi (Tritryps) are unicellular protozoa that cause leishmaniasis, sleeping sickness and Chagas' disease, respectively. Most drugs against them were discovered through the screening of large numbers of compounds against whole parasites. Nonhomologous isofunctional enzymes (NISEs) may present good opportunities for the identification of new putative drug targets because, though sharing the same enzymatic activity, they possess different three-dimensional structures thus allowing the development of molecules against one or other isoform. From public data of the Tritryps' genomes, we reconstructed the Genetic Information Processing Pathways (GIPPs). We then used AnEnPi to look for the presence of these enzymes between Homo sapiens and Tritryps, as well as specific enzymes of the parasites. We identified three candidates (ECs 3.1.11.2 and 6.1.1.-) in these pathways that may be further studied as new therapeutic targets for drug development against these parasites.
Collapse
Affiliation(s)
- Monete Rajão Gomes
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz/FIOCRUZ, 21045-900 Rio de Janeiro, RJ, Brazil
| | | | | |
Collapse
|
42
|
Shortridge MD, Triplet T, Revesz P, Griep MA, Powers R. Bacterial protein structures reveal phylum dependent divergence. Comput Biol Chem 2011; 35:24-33. [PMID: 21315656 DOI: 10.1016/j.compbiolchem.2010.12.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2010] [Revised: 12/28/2010] [Accepted: 12/29/2010] [Indexed: 01/26/2023]
Abstract
Protein sequence space is vast compared to protein fold space. This raises important questions about how structures adapt to evolutionary changes in protein sequences. A growing trend is to regard protein fold space as a continuum rather than a series of discrete structures. From this perspective, homologous protein structures within the same functional classification should reveal a constant rate of structural drift relative to sequence changes. The clusters of orthologous groups (COG) classification system was used to annotate homologous bacterial protein structures in the Protein Data Bank (PDB). The structures and sequences of proteins within each COG were compared against each other to establish their relatedness. As expected, the analysis demonstrates a sharp structural divergence between the bacterial phyla Firmicutes and Proteobacteria. Additionally, each COG had a distinct sequence/structure relationship, indicating that different evolutionary pressures affect the degree of structural divergence. However, our analysis also shows the relative drift rate between sequence identity and structure divergence remains constant.
Collapse
Affiliation(s)
- Matthew D Shortridge
- Department of Chemistry, University of Nebraska-Lincoln, 68588-0304, United States
| | | | | | | | | |
Collapse
|
43
|
Jain P, Hirst JD. Exploring protein structural dissimilarity to facilitate structure classification. BMC STRUCTURAL BIOLOGY 2009; 9:60. [PMID: 19765314 PMCID: PMC2754988 DOI: 10.1186/1472-6807-9-60] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/27/2009] [Accepted: 09/19/2009] [Indexed: 12/04/2022]
Abstract
BACKGROUND Classification of newly resolved protein structures is important in understanding their architectural, evolutionary and functional relatedness to known protein structures. Among various efforts to improve the database of Structural Classification of Proteins (SCOP), automation has received particular attention. Herein, we predict the deepest SCOP structural level that an unclassified protein shares with classified proteins with an equal number of secondary structure elements (SSEs). RESULTS We compute a coefficient of dissimilarity (Omega) between proteins, based on structural and sequence-based descriptors characterising the respective constituent SSEs. For a set of 1,661 pairs of proteins with sequence identity up to 35%, the performance of Omega in predicting shared Class, Fold and Super-family levels is comparable to that of DaliLite Z score and shows a greater than four-fold increase in the true positive rate (TPR) for proteins sharing the Family level. On a larger set of 600 domains representing 200 families, the performance of Z score improves in predicting a shared Family, but still only achieves about half of the TPR of Omega. The TPR for structures sharing a Super-family is lower than in the first dataset, but Omega performs slightly better than Z score. Overall, the sensitivity of Omega in predicting common Fold level is higher than that of the DaliLite Z score. CONCLUSION Classification to a deeper level in the hierarchy is specific and difficult. So the efficiency of Omega may be attractive to the curators and the end-users of SCOP. We suggest Omega may be a better measure for structure classification than the DaliLite Z score, with the caveat that currently we are restricted to comparing structures with equal number of SSEs.
Collapse
Affiliation(s)
- Pooja Jain
- School of Chemistry, The University of Nottingham, University Park, Nottingham, NG7 2RD, UK
| | - Jonathan D Hirst
- School of Chemistry, The University of Nottingham, University Park, Nottingham, NG7 2RD, UK
| |
Collapse
|
44
|
Hvidsten TR, Lægreid A, Kryshtafovych A, Andersson G, Fidelis K, Komorowski J. A comprehensive analysis of the structure-function relationship in proteins based on local structure similarity. PLoS One 2009; 4:e6266. [PMID: 19603073 PMCID: PMC2705683 DOI: 10.1371/journal.pone.0006266] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2008] [Accepted: 06/10/2009] [Indexed: 12/22/2022] Open
Abstract
Background Sequence similarity to characterized proteins provides testable functional hypotheses for less than 50% of the proteins identified by genome sequencing projects. With structural genomics it is believed that structural similarities may give functional hypotheses for many of the remaining proteins. Methodology/Principal Findings We provide a systematic analysis of the structure-function relationship in proteins using the novel concept of local descriptors of protein structure. A local descriptor is a small substructure of a protein which includes both short- and long-range interactions. We employ a library of commonly reoccurring local descriptors general enough to assemble most existing protein structures. We then model the relationship between these local shapes and Gene Ontology using rule-based learning. Our IF-THEN rule model offers legible, high resolution descriptions that combine local substructures and is able to discriminate functions even for functionally versatile folds such as the frequently occurring TIM barrel and Rossmann fold. By evaluating the predictive performance of the model, we provide a comprehensive quantification of the structure-function relationship based only on local structure similarity. Our findings are, among others, that conserved structure is a stronger prerequisite for enzymatic activity than for binding specificity, and that structure-based predictions complement sequence-based predictions. The model is capable of generating correct hypotheses, as confirmed by a literature study, even when no significant sequence similarity to characterized proteins exists. Conclusions/Significance Our approach offers a new and complete description and quantification of the structure-function relationship in proteins. By demonstrating how our predictions offer higher sensitivity than using global structure, and complement the use of sequence, we show that the presented ideas could advance the development of meta-servers in function prediction.
Collapse
Affiliation(s)
- Torgeir R. Hvidsten
- The Linnaeus Centre for Bioinformatics, Uppsala University and The Swedish University for Agricultural Sciences, Uppsala, Sweden
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Astrid Lægreid
- Department of Cancer Research and Molecular Medicine, Norwegian University of Science and Technology, St. Olavs Hospital HF, Trondheim, Norway
| | | | - Gunnar Andersson
- The Linnaeus Centre for Bioinformatics, Uppsala University and The Swedish University for Agricultural Sciences, Uppsala, Sweden
- Department of Chemistry, Environment and Feed Hygiene, National Veterinary Institute, Uppsala, Sweden
| | | | - Jan Komorowski
- The Linnaeus Centre for Bioinformatics, Uppsala University and The Swedish University for Agricultural Sciences, Uppsala, Sweden
- Interdisciplinary Centre for Mathematical and Computational Modelling, University of Warsaw, Warszawa, Poland
- * E-mail:
| |
Collapse
|
45
|
Abstract
One of the unique insights provided by the growing number of fully sequenced genomes is the pervasiveness of gene duplication and gene loss. Indeed, several metrics now suggest that rates of gene birth and death per gene are only 10-40% lower than nucleotide substitutions per site, and that per nucleotide, the consequent lineage-specific expansion and contraction of gene families may play at least as large a role in adaptation as changes in orthologous sequences. While gene family evolution is pervasive, it may be especially important in our own evolution since it appears that the "revolving door" of gene duplication and loss has undergone multiple accelerations in the lineage leading to humans. In this paper, we review current understanding of gene family evolution including: methods for inferring copy number change, evidence for adaptive expansion and adaptive contraction of gene families, the origins of new families and deaths of previously established ones, and finally we conclude with a perspective on challenges and promising directions for future research.
Collapse
|
46
|
Cantacessi C, Campbell BE, Visser A, Geldhof P, Nolan MJ, Nisbet AJ, Matthews JB, Loukas A, Hofmann A, Otranto D, Sternberg PW, Gasser RB. A portrait of the "SCP/TAPS" proteins of eukaryotes--developing a framework for fundamental research and biotechnological outcomes. Biotechnol Adv 2009; 27:376-88. [PMID: 19239923 DOI: 10.1016/j.biotechadv.2009.02.005] [Citation(s) in RCA: 120] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2008] [Revised: 02/05/2009] [Accepted: 02/11/2009] [Indexed: 01/17/2023]
Abstract
A wide range of proteins belonging to the SCP/TAPS "family" has been described for various eukaryotic organisms, including plants and animals (vertebrates and invertebrates, such as helminths). Although SCP/TAPS proteins have been proposed to play key roles in a number of fundamental biological processes, such as host-pathogen interactions and defence mechanisms, there is a paucity of information on their genetic relationships, structures and functions, and there is no standardised nomenclature for these proteins. A detailed analysis of the relationships of members of the SCP/TAPS family of proteins, based on key protein signatures, could provide a foundation for investigating these areas. In this article, we review the current state of knowledge of key SCP/TAPS proteins of eukaryotes, with an emphasis on those from parasitic helminths, and undertake a comprehensive, systematic phylogenetic analysis of currently available full-length protein sequence data (considering characteristic protein signatures or motifs) to infer relationships and provide a framework (based on statistical support) for the naming of these proteins. This framework is intended to guide genomic and molecular biological explorations of key SCP/TAPS molecules associated with infectious diseases of plants and animals. In particular, fundamental investigations of these molecules in parasites and the integration of structural and functional data could lead to new and innovative approaches for the control of parasitic diseases, with important biotechnological outcomes.
Collapse
Affiliation(s)
- C Cantacessi
- Department of Veterinary Science, The University of Melbourne, Australia
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Juncker AS, Jensen LJ, Pierleoni A, Bernsel A, Tress ML, Bork P, von Heijne G, Valencia A, Ouzounis CA, Casadio R, Brunak S. Sequence-based feature prediction and annotation of proteins. Genome Biol 2009; 10:206. [PMID: 19226438 PMCID: PMC2688272 DOI: 10.1186/gb-2009-10-2-206] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
The combination of prediction tools in complex workflows and pipelines facilitates prediction of protein features from sequence. A recent trend in computational methods for annotation of protein function is that many prediction tools are combined in complex workflows and pipelines to facilitate the analysis of feature combinations, for example, the entire repertoire of kinase-binding motifs in the human proteome.
Collapse
Affiliation(s)
- Agnieszka S Juncker
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, DK-2800 Lyngby, Denmark
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Protein Sequence Databases. Bioinformatics 2009. [DOI: 10.1007/978-0-387-92738-1_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
|
49
|
Jiang Z. Protein Function Predictions Based on the Phylogenetic Profile Method. Crit Rev Biotechnol 2008; 28:233-8. [PMID: 19051102 DOI: 10.1080/07388550802512633] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
50
|
Lee EY, Choi DS, Kim KP, Gho YS. Proteomics in gram-negative bacterial outer membrane vesicles. MASS SPECTROMETRY REVIEWS 2008; 27:535-555. [PMID: 18421767 DOI: 10.1002/mas.20175] [Citation(s) in RCA: 223] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Gram-negative bacteria constitutively secrete outer membrane vesicles (OMVs) into the extracellular milieu. Recent research in this area has revealed that OMVs may act as intercellular communicasomes in polyspecies communities by enhancing bacterial survival and pathogenesis in hosts. However, the mechanisms of vesicle formation and the pathophysiological roles of OMVs have not been clearly defined. While it is obvious that mass spectrometry-based proteomics offers great opportunities for improving our knowledge of bacterial OMVs, limited proteomic data are available for OMVs. The present review aims to give an overview of the previous biochemical, biological, and proteomic studies in the emerging field of bacterial OMVs, and to give future directions for high-throughput and comparative proteomic studies of OMVs that originate from diverse Gram-negative bacteria under various environmental conditions. This article will hopefully stimulate further efforts to construct a comprehensive proteome database of bacterial OMVs that will help us not only to elucidate the biogenesis and functions of OMVs but also to develop diagnostic tools, vaccines, and antibiotics effective against pathogenic bacteria.
Collapse
Affiliation(s)
- Eun-Young Lee
- Department of Life Science and Division of Molecular and Life Sciences, Pohang University of Science and Technology, Pohang, Republic of Korea
| | | | | | | |
Collapse
|