Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Ma J, Peng J, Wang S, Xu J. A conditional neural fields model for protein threading. ACTA ACUST UNITED AC 2013;28:i59-66. [PMID: 22689779 PMCID: PMC3371845 DOI: 10.1093/bioinformatics/bts213] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

For:	Ma J, Peng J, Wang S, Xu J. A conditional neural fields model for protein threading. ACTA ACUST UNITED AC 2013;28:i59-66. [PMID: 22689779 PMCID: PMC3371845 DOI: 10.1093/bioinformatics/bts213] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Number

Cited by Other Article(s)

Baker K, Hughes N, Bhattacharya S. An interactive visualization tool for educational outreach in protein contact map overlap analysis. FRONTIERS IN BIOINFORMATICS 2024;4:1358550. [PMID: 38562910 PMCID: PMC10982686 DOI: 10.3389/fbinf.2024.1358550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Accepted: 03/04/2024] [Indexed: 04/04/2024] Open

Huang B, Kong L, Wang C, Ju F, Zhang Q, Zhu J, Gong T, Zhang H, Yu C, Zheng WM, Bu D. Protein Structure Prediction: Challenges, Advances, and the Shift of Research Paradigms. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023;21:913-925. [PMID: 37001856 PMCID: PMC10928435 DOI: 10.1016/j.gpb.2022.11.014] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Revised: 11/23/2022] [Accepted: 11/30/2022] [Indexed: 03/31/2023]

Affiliation(s)

Bin Huang Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China
Lupeng Kong Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; Changping Laboratory, Beijing 102206, China
Chao Wang Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
Fusong Ju Microsoft Research AI4Science, Beijing 100080, China
Qi Zhang Huawei Noah's Ark Lab, Wuhan 430206, China
Jianwei Zhu Microsoft Research AI4Science, Beijing 100080, China
Tiansu Gong Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China
Haicang Zhang Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China; Zhongke Big Data Academy, Zhengzhou 450046, China.
Chungong Yu Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China; Zhongke Big Data Academy, Zhengzhou 450046, China.
Wei-Mou Zheng Institute of Theoretical Physics, Chinese Academy of Sciences, Beijing 100190, China.
Dongbo Bu Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China; Zhongke Big Data Academy, Zhengzhou 450046, China.

Collapse

Jamali Langeroudi A, Sabet MS, Jalali-Javaran M, Zamani K, Lohrasebi T, Malboobi MA. Functional assessment of AtPAP17; encoding a purple acid phosphatase involved in phosphate metabolism in Arabidopsis thaliana. Biotechnol Lett 2023;45:719-739. [PMID: 37074554 DOI: 10.1007/s10529-023-03375-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 03/05/2023] [Accepted: 04/03/2023] [Indexed: 04/20/2023]

Bhattacharya S, Roche R, Shuvo MH, Moussad B, Bhattacharya D. Contact-Assisted Threading in Low-Homology Protein Modeling. Methods Mol Biol 2023;2627:41-59. [PMID: 36959441 DOI: 10.1007/978-1-0716-2974-1_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/25/2023]

Homology Modeling and Analysis of Vacuolar Aspartyl Protease from a Novel Yeast Expression Host Meyerozyma guilliermondii Strain SO. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2022. [DOI: 10.1007/s13369-022-07153-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

Lee SJ, Joo K, Sim S, Lee J, Lee IH, Lee J. CRFalign: A Sequence-Structure Alignment of Proteins Based on a Combination of HMM-HMM Comparison and Conditional Random Fields. MOLECULES (BASEL, SWITZERLAND) 2022;27:molecules27123711. [PMID: 35744836 PMCID: PMC9231382 DOI: 10.3390/molecules27123711] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 06/03/2022] [Accepted: 06/07/2022] [Indexed: 11/16/2022]

Villegas-Morcillo A, Gomez AM, Sanchez V. An analysis of protein language model embeddings for fold prediction. Brief Bioinform 2022;23:6571527. [PMID: 35443054 DOI: 10.1093/bib/bbac142] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 03/21/2022] [Accepted: 03/28/2022] [Indexed: 11/13/2022] Open

Byadi S, Oblak D, Kassmi Y, Sadik K, Hachim ME, Podlipnik Č, Aboulmouhajir A. In silico discovery of novel inhibitors from Northern African natural products database against main protease (Mpro) of SARS-CoV-2. J Biomol Struct Dyn 2022;41:2900-2910. [PMID: 35168469 DOI: 10.1080/07391102.2022.2040594] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Kong L, Ju F, Zheng WM, Zhu J, Sun S, Xu J, Bu D. ProALIGN: Directly Learning Alignments for Protein Structure Prediction via Exploiting Context-Specific Alignment Motifs. J Comput Biol 2022;29:92-105. [PMID: 35073170 PMCID: PMC8892980 DOI: 10.1089/cmb.2021.0430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open

Abstract

Template-based modeling (TBM), including homology modeling and protein threading, is one of the most reliable techniques for protein structure prediction. It predicts protein structure by building an alignment between the query sequence under prediction and the templates with solved structures. However, it is still very challenging to build the optimal sequence-template alignment, especially when only distantly related templates are available. Here we report a novel deep learning approach ProALIGN that can predict much more accurate sequence-template alignment. Like protein sequences consisting of sequence motifs, protein alignments are also composed of frequently occurring alignment motifs with characteristic patterns. Alignment motifs are context-specific as their characteristic patterns are tightly related to sequence contexts of the aligned regions. Inspired by this observation, we represent a protein alignment as a binary matrix (in which 1 denotes an aligned residue pair) and then use a deep convolutional neural network to predict the optimal alignment from the query protein and its template. The trained neural network implicitly but effectively encodes an alignment scoring function, which reduces inaccuracies in the handcrafted scoring functions widely used by the current threading approaches. For a query protein and a template, we apply the neural network to directly infer likelihoods of all possible residue pairs in their entirety, which could effectively consider the correlations among multiple residues. We further construct the alignment with maximum likelihood, and finally build a structure model according to the alignment. Tested on three independent data sets with a total of 6688 protein alignment targets and 80 CASP13 TBM targets, our method achieved much better alignments and 3D structure models than the existing methods, including HHpred, CNFpred, CEthreader, and DeepThreader. These results clearly demonstrate the effectiveness of exploiting the context-specific alignment motifs by deep learning for protein threading.

Collapse

Bhattacharya S, Roche R, Moussad B, Bhattacharya D. DisCovER: distance- and orientation-based covariational threading for weakly homologous proteins. Proteins 2022;90:579-588. [PMID: 34599831 PMCID: PMC8738102 DOI: 10.1002/prot.26254] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Revised: 09/22/2021] [Accepted: 09/28/2021] [Indexed: 02/03/2023]

Ju F, Zhu J, Zhang Q, Wei G, Sun S, Zheng WM, Bu D. Seq-SetNet: directly exploiting multiple sequence alignment for protein secondary structure prediction. Bioinformatics 2022;38:990-996. [PMID: 34849579 DOI: 10.1093/bioinformatics/btab777] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 10/22/2021] [Accepted: 11/04/2021] [Indexed: 02/03/2023] Open

Abstract

MOTIVATION

Accurate prediction of protein structure relies heavily on exploiting multiple sequence alignment (MSA) for residue mutations and correlations as this information specifies protein tertiary structure. The widely used prediction approaches usually transform MSA into inter-mediate models, say position-specific scoring matrix or profile hidden Markov model. These inter-mediate models, however, cannot fully represent residue mutations and correlations carried by MSA; hence, an effective way to directly exploit MSAs is highly desirable.

RESULTS

Here, we report a novel sequence set network (called Seq-SetNet) to directly and effectively exploit MSA for protein structure prediction. Seq-SetNet uses an 'encoding and aggregation' strategy that consists of two key elements: (i) an encoding module that takes a component homologue in MSA as input, and encodes residue mutations and correlations into context-specific features for each residue; and (ii) an aggregation module to aggregate the features extracted from all component homologues, which are further transformed into structural properties for residues of the query protein. As Seq-SetNet encodes each homologue protein individually, it could consider both insertions and deletions, as well as long-distance correlations among residues, thus representing more information than the inter-mediate models. Moreover, the encoding module automatically learns effective features and thus avoids manual feature engineering. Using symmetric aggregation functions, Seq-SetNet processes the homologue proteins as a sequence set, making its prediction results invariable to the order of these proteins. On popular benchmark sets, we demonstrated the successful application of Seq-SetNet to predict secondary structure and torsion angles of residues with improved accuracy and efficiency.

AVAILABILITY AND IMPLEMENTATION

The code and datasets are available through https://github.com/fusong-ju/Seq-SetNet.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Tran NH, Xu J, Li M. A tale of solving two computational challenges in protein science: neoantigen prediction and protein structure prediction. Brief Bioinform 2022;23:bbab493. [PMID: 34891158 PMCID: PMC8769896 DOI: 10.1093/bib/bbab493] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Revised: 10/11/2021] [Accepted: 10/26/2021] [Indexed: 12/30/2022] Open

New highly antigenic linear B cell epitope peptides from PvAMA-1 as potential vaccine candidates. PLoS One 2021;16:e0258637. [PMID: 34727117 PMCID: PMC8562794 DOI: 10.1371/journal.pone.0258637] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 10/01/2021] [Indexed: 11/19/2022] Open

Villegas-Morcillo A, Gomez AM, Morales-Cordovilla JA, Sanchez V. Protein Fold Recognition From Sequences Using Convolutional and Recurrent Neural Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021;18:2848-2854. [PMID: 32750896 DOI: 10.1109/tcbb.2020.3012732] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Villegas-Morcillo A, Sanchez V, Gomez AM. FoldHSphere: deep hyperspherical embeddings for protein fold recognition. BMC Bioinformatics 2021;22:490. [PMID: 34641786 PMCID: PMC8507389 DOI: 10.1186/s12859-021-04419-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 09/29/2021] [Indexed: 12/01/2022] Open

Kong L, Ju F, Zhang H, Sun S, Bu D. FALCON2: a web server for high-quality prediction of protein tertiary structures. BMC Bioinformatics 2021;22:439. [PMID: 34525939 PMCID: PMC8444573 DOI: 10.1186/s12859-021-04353-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 09/01/2021] [Indexed: 11/21/2022] Open

Abstract

BACKGROUND

Accurate prediction of protein tertiary structures is highly desired as the knowledge of protein structures provides invaluable insights into protein functions. We have designed two approaches to protein structure prediction, including a template-based modeling approach (called ProALIGN) and an ab initio prediction approach (called ProFOLD). Briefly speaking, ProALIGN aligns a target protein with templates through exploiting the patterns of context-specific alignment motifs and then builds the final structure with reference to the homologous templates. In contrast, ProFOLD uses an end-to-end neural network to estimate inter-residue distances of target proteins and builds structures that satisfy these distance constraints. These two approaches emphasize different characteristics of target proteins: ProALIGN exploits structure information of homologous templates of target proteins while ProFOLD exploits the co-evolutionary information carried by homologous protein sequences. Recent progress has shown that the combination of template-based modeling and ab initio approaches is promising.

RESULTS

In the study, we present FALCON2, a web server that integrates ProALIGN and ProFOLD to provide high-quality protein structure prediction service. For a target protein, FALCON2 executes ProALIGN and ProFOLD simultaneously to predict possible structures and selects the most likely one as the final prediction result. We evaluated FALCON2 on widely-used benchmarks, including 104 CASP13 (the 13th Critical Assessment of protein Structure Prediction) targets and 91 CASP14 targets. In-depth examination suggests that when high-quality templates are available, ProALIGN is superior to ProFOLD and in other cases, ProFOLD shows better performance. By integrating these two approaches with different emphasis, FALCON2 server outperforms the two individual approaches and also achieves state-of-the-art performance compared with existing approaches.

CONCLUSIONS

By integrating template-based modeling and ab initio approaches, FALCON2 provides an easy-to-use and high-quality protein structure prediction service for the community and we expect it to enable insights into a deep understanding of protein functions.

Collapse

Shen T, Wu J, Lan H, Zheng L, Pei J, Wang S, Liu W, Huang J. When homologous sequences meet structural decoys: Accurate contact prediction by tFold in CASP14-(tFold for CASP14 contact prediction). Proteins 2021;89:1901-1910. [PMID: 34473376 DOI: 10.1002/prot.26232] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 08/16/2021] [Accepted: 08/20/2021] [Indexed: 12/29/2022]

Arsiccio A, Beavis J, Raut S, Coxon CH. FVIII inhibitors display FV-neutralizing activity in the prothrombin time assay. J Thromb Haemost 2021;19:1907-1913. [PMID: 33914406 PMCID: PMC8360109 DOI: 10.1111/jth.15355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Accepted: 04/16/2021] [Indexed: 11/28/2022]

Bhattacharya S, Roche R, Shuvo MH, Bhattacharya D. Recent Advances in Protein Homology Detection Propelled by Inter-Residue Interaction Map Threading. Front Mol Biosci 2021;8:643752. [PMID: 34046429 PMCID: PMC8148041 DOI: 10.3389/fmolb.2021.643752] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Accepted: 04/21/2021] [Indexed: 11/13/2022] Open

Wu F, Xu J. Deep template-based protein structure prediction. PLoS Comput Biol 2021;17:e1008954. [PMID: 33939695 PMCID: PMC8118551 DOI: 10.1371/journal.pcbi.1008954] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Revised: 05/13/2021] [Accepted: 04/11/2021] [Indexed: 11/19/2022] Open

The Protective A673T Mutation of Amyloid Precursor Protein (APP) in Alzheimer's Disease. Mol Neurobiol 2021;58:4038-4050. [PMID: 33914267 DOI: 10.1007/s12035-021-02385-y] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Accepted: 04/05/2021] [Indexed: 10/21/2022]

Peñaloza HF, Olonisakin TF, Bain WG, Qu Y, van der Geest R, Zupetic J, Hulver M, Xiong Z, Newstead MW, Zou C, Alder JK, Ybe JA, Standiford TJ, Lee JS. Thrombospondin-1 Restricts Interleukin-36γ-Mediated Neutrophilic Inflammation during Pseudomonas aeruginosa Pulmonary Infection. mBio 2021;12:e03336-20. [PMID: 33824208 PMCID: PMC8092289 DOI: 10.1128/mbio.03336-20] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Accepted: 02/25/2021] [Indexed: 01/05/2023] Open

Abstract

Interleukin-36γ (IL-36γ), a member of the IL-1 cytokine superfamily, amplifies lung inflammation and impairs host defense during acute pulmonary Pseudomonas aeruginosa infection. To be fully active, IL-36γ is cleaved at its N-terminal region by proteases such as neutrophil elastase (NE) and cathepsin S (CatS). However, it remains unclear whether limiting extracellular proteolysis restrains the inflammatory cascade triggered by IL-36γ during P. aeruginosa infection. Thrombospondin-1 (TSP-1) is a matricellular protein with inhibitory activity against NE and the pathogen-secreted Pseudomonas elastase LasB-both proteases implicated in amplifying inflammation. We hypothesized that TSP-1 tempers the inflammatory response during lung P. aeruginosa infection by inhibiting the proteolytic environment required for IL-36γ activation. Compared to wild-type (WT) mice, TSP-1-deficient (Thbs1-/-) mice exhibited a hyperinflammatory response in the lungs during P. aeruginosa infection, with increased cytokine production and an unrestrained extracellular proteolytic environment characterized by higher free NE and LasB, but not CatS activity. LasB cleaved IL-36γ proximally to M19 at a cleavage site distinct from those generated by NE and CatS, which cleave IL-36γ proximally to Y16 and S18, respectively. N-terminal truncation experiments in silico predicted that the M19 and the S18 isoforms bind the IL-36R complex almost identically. IL-36γ neutralization ameliorated the hyperinflammatory response and improved lung immunity in Thbs1-/- mice during P. aeruginosa infection. Moreover, administration of cleaved IL-36γ induced cytokine production and neutrophil recruitment and activation that was accentuated in Thbs1-/- mice lungs. Collectively, our data show that TSP-1 regulates lung neutrophilic inflammation and facilitates host defense by restraining the extracellular proteolytic environment required for IL-36γ activation.IMPORTANCEPseudomonas aeruginosa pulmonary infection can lead to exaggerated neutrophilic inflammation and tissue destruction, yet host factors that regulate the neutrophilic response are not fully known. IL-36γ is a proinflammatory cytokine that dramatically increases in bioactivity following N-terminal processing by proteases. Here, we demonstrate that thrombospondin-1, a host matricellular protein, limits N-terminal processing of IL-36γ by neutrophil elastase and the Pseudomonas aeruginosa-secreted protease LasB. Thrombospondin-1-deficient mice (Thbs1-/-) exhibit a hyperinflammatory response following infection. Whereas IL-36γ neutralization reduces inflammatory cytokine production, limits neutrophil activation, and improves host defense in Thbs1-/- mice, cleaved IL-36γ administration amplifies neutrophilic inflammation in Thbs1-/- mice. Our findings indicate that thrombospondin-1 guards against feed-forward neutrophilic inflammation mediated by IL-36γ in the lung by restraining the extracellular proteolytic environment.

Collapse

Affiliation(s)

Hernán F Peñaloza Acute Lung Injury Center of Excellence, Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Tolani F Olonisakin Acute Lung Injury Center of Excellence, Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
William G Bain Acute Lung Injury Center of Excellence, Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Yanyan Qu Acute Lung Injury Center of Excellence, Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Rick van der Geest Acute Lung Injury Center of Excellence, Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Jill Zupetic Acute Lung Injury Center of Excellence, Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Mei Hulver Acute Lung Injury Center of Excellence, Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Zeyu Xiong Acute Lung Injury Center of Excellence, Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Michael W Newstead Pulmonary and Critical Care Medicine, Department of Medicine, University of Michigan, Ann Arbor, Michigan, USA
Chunbin Zou Acute Lung Injury Center of Excellence, Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Jonathan K Alder Acute Lung Injury Center of Excellence, Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Joel A Ybe Department of Environmental and Occupational Health, School of Public Health, Indiana University, Bloomington, Indiana, USA
Theodore J Standiford Pulmonary and Critical Care Medicine, Department of Medicine, University of Michigan, Ann Arbor, Michigan, USA
Janet S Lee Acute Lung Injury Center of Excellence, Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA

Collapse

Zhang H, Shen Y. Template-based prediction of protein structure with deep learning. BMC Genomics 2020;21:878. [PMID: 33372607 PMCID: PMC7771081 DOI: 10.1186/s12864-020-07249-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Accepted: 11/18/2020] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Accurate prediction of protein structure is fundamentally important to understand biological function of proteins. Template-based modeling, including protein threading and homology modeling, is a popular method for protein tertiary structure prediction. However, accurate template-query alignment and template selection are still very challenging, especially for the proteins with only distant homologs available.

RESULTS

We propose a new template-based modelling method called ThreaderAI to improve protein tertiary structure prediction. ThreaderAI formulates the task of aligning query sequence with template as the classical pixel classification problem in computer vision and naturally applies deep residual neural network in prediction. ThreaderAI first employs deep learning to predict residue-residue aligning probability matrix by integrating sequence profile, predicted sequential structural features, and predicted residue-residue contacts, and then builds template-query alignment by applying a dynamic programming algorithm on the probability matrix. We evaluated our methods both in generating accurate template-query alignment and protein threading. Experimental results show that ThreaderAI outperforms currently popular template-based modelling methods HHpred, CNFpred, and the latest contact-assisted method CEthreader, especially on the proteins that do not have close homologs with known structures. In particular, in terms of alignment accuracy measured with TM-score, ThreaderAI outperforms HHpred, CNFpred, and CEthreader by 56, 13, and 11%, respectively, on template-query pairs at the similarity of fold level from SCOPe data. And on CASP13's TBM-hard data, ThreaderAI outperforms HHpred, CNFpred, and CEthreader by 16, 9 and 8% in terms of TM-score, respectively.

CONCLUSIONS

These results demonstrate that with the help of deep learning, ThreaderAI can significantly improve the accuracy of template-based structure prediction, especially for distant-homology proteins.

Collapse

Mirzaei S, Razmara J, Lotfi S. GADP-align: A genetic algorithm and dynamic programming-based method for structural alignment of proteins. BIOIMPACTS 2020;11:271-279. [PMID: 34631489 PMCID: PMC8494253 DOI: 10.34172/bi.2021.37] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Revised: 06/10/2020] [Accepted: 06/16/2020] [Indexed: 11/16/2022]

Xu J, Wang S. Analysis of distance-based protein structure prediction by deep learning in CASP13. Proteins 2019;87:1069-1081. [PMID: 31471916 DOI: 10.1002/prot.25810] [Citation(s) in RCA: 92] [Impact Index Per Article: 18.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Revised: 07/24/2019] [Accepted: 08/27/2019] [Indexed: 12/30/2022]

Zhu J, Wang S, Bu D, Xu J. Protein threading using residue co-variation and deep learning. Bioinformatics 2019;34:i263-i273. [PMID: 29949980 PMCID: PMC6022550 DOI: 10.1093/bioinformatics/bty278] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open

Holt MC, Ho CS, Morano MI, Barrett SD, Stein AJ. Improved homology modeling of the human & rat EP₄ prostanoid receptors. BMC Mol Cell Biol 2019;20:37. [PMID: 31455205 PMCID: PMC6712885 DOI: 10.1186/s12860-019-0212-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Accepted: 07/11/2019] [Indexed: 12/02/2022] Open

Abstract

Background

The EP₄ prostanoid receptor is one of four GPCRs that mediate the diverse actions of prostaglandin E₂ (PGE₂). Novel selective EP₄ receptor agonists would assist to further elucidate receptor sub-type function and promote development of therapeutics for bone healing, heart failure, and other receptor associated conditions. The rat EP₄ (rEP₄) receptor has been used as a surrogate for the human EP₄ (hEP₄) receptor in multiple SAR studies. To better understand the validity of this traditional approach, homology models were generated by threading for both receptors using the RaptorX server. These models were fit to an implicit membrane using the PPM server and OPM database with refinement of intra and extracellular loops by Prime (Schrödinger). To understand the interaction between the receptors and known agonists, induced-fit docking experiments were performed using Glide and Prime (Schrödinger), with both endogenous agonists and receptor sub-type selective, small-molecule agonists. The docking scores and observed interactions were compared with radioligand displacement experiments and receptor (rat & human) activation assays monitoring cAMP.

Results

Rank-ordering of in silico compound docking scores aligned well with in vitro activity assay EC₅₀ and radioligand binding K_i. We observed variations between rat and human EP₄ binding pockets that have implications in future small-molecule receptor-modulator design and SAR, specifically a S103G mutation within the rEP4 receptor. Additionally, these models helped identify key interactions between the EP₄ receptor and ligands including PGE₂ and several known sub-type selective agonists while serving as a marked improvement over the previously reported models.

Conclusions

This work has generated a set of novel homology models of the rEP₄ and hEP₄ receptors. The homology models provide an improvement upon the previously reported model, largely due to improved solvation. The hEP₄ docking scores correlates best with the cAMP activation data, where both data sets rank order Rivenprost>CAY10684 > PGE₁ ≈ PGE₂ > 11-deoxy-PGE₁ ≈ 11-dexoy-PGE₂ > 8-aza-11-deoxy-PGE₁. This rank-ordering matches closely with the rEP₄ receptor as well. Species-specific differences were noted for the weak agonists Sulprostone and Misoprostol, which appear to dock more readily within human receptor versus rat receptor.

Electronic supplementary material

The online version of this article (10.1186/s12860-019-0212-5) contains supplementary material, which is available to authorized users.

Collapse

Distance-based protein folding powered by deep learning. Proc Natl Acad Sci U S A 2019;116:16856-16865. [PMID: 31399549 DOI: 10.1073/pnas.1821309116] [Citation(s) in RCA: 234] [Impact Index Per Article: 46.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Durr-e-Shahwar S, Atia-tul-Wahab, Choudhary MI, Jabeen A. Cloning, purification, structural, and functional characterization of methicillin-resistant Staphylococcus aureus (MRSA252) RsbV protein. Int J Biol Macromol 2019;134:962-966. [DOI: 10.1016/j.ijbiomac.2019.05.034] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2019] [Revised: 05/04/2019] [Accepted: 05/05/2019] [Indexed: 02/04/2023]

Vizcaíno-Castillo A, Osorio-Méndez JF, Rubio-Ortiz M, Manning-Cela RG, Hernández R, Cevallos AM. Trypanosoma cruzi actins: Expression analysis of actin 2. Biochem Biophys Res Commun 2019;513:347-353. [DOI: 10.1016/j.bbrc.2019.04.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2019] [Accepted: 04/01/2019] [Indexed: 10/27/2022]

Bhattacharya S, Bhattacharya D. Does inclusion of residue-residue contact information boost protein threading? Proteins 2019;87:596-606. [PMID: 30882932 DOI: 10.1002/prot.25684] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2018] [Revised: 02/20/2019] [Accepted: 03/13/2019] [Indexed: 12/26/2022]

Palomo-Ligas L, Gutiérrez-Gutiérrez F, Ochoa-Maganda VY, Cortés-Zárate R, Charles-Niño CL, Castillo-Romero A. Identification of a novel potassium channel (GiK) as a potential drug target in Giardia lamblia: Computational descriptions of binding sites. PeerJ 2019;7:e6430. [PMID: 30834181 PMCID: PMC6397635 DOI: 10.7717/peerj.6430] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Accepted: 01/10/2019] [Indexed: 12/12/2022] Open

Petegrosso R, Li Z, Srour MA, Saad Y, Zhang W, Kuang R. Scalable remote homology detection and fold recognition in massive protein networks. Proteins 2019;87:478-491. [PMID: 30714638 DOI: 10.1002/prot.25669] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2018] [Revised: 12/19/2018] [Accepted: 01/31/2019] [Indexed: 11/10/2022]

Identification of the novel role of butyrate as AhR ligand in human intestinal epithelial cells. Sci Rep 2019;9:643. [PMID: 30679727 PMCID: PMC6345974 DOI: 10.1038/s41598-018-37019-2] [Citation(s) in RCA: 105] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2018] [Accepted: 11/28/2018] [Indexed: 12/18/2022] Open

Riber L, Koch BM, Kruse LR, Germain E, Løbner-Olesen A. HipA-Mediated Phosphorylation of SeqA Does not Affect Replication Initiation in Escherichia coli. Front Microbiol 2018;9:2637. [PMID: 30450091 PMCID: PMC6225831 DOI: 10.3389/fmicb.2018.02637] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2018] [Accepted: 10/16/2018] [Indexed: 11/20/2022] Open

Skotnicová P, Sobotka R, Shepherd M, Hájek J, Hrouzek P, Tichý M. The cyanobacterial protoporphyrinogen oxidase HemJ is a new b-type heme protein functionally coupled with coproporphyrinogen III oxidase. J Biol Chem 2018;293:12394-12404. [PMID: 29925590 DOI: 10.1074/jbc.ra118.003441] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2018] [Revised: 06/14/2018] [Indexed: 12/27/2022] Open

Morales-Cordovilla JA, Sanchez V, Ratajczak M. Protein alignment based on higher order conditional random fields for template-based modeling. PLoS One 2018;13:e0197912. [PMID: 29856860 PMCID: PMC5983487 DOI: 10.1371/journal.pone.0197912] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Accepted: 05/10/2018] [Indexed: 11/19/2022] Open

Abdel Azim A, Rittmann SKMR, Fino D, Bochmann G. The physiological effect of heavy metals and volatile fatty acids on Methanococcus maripaludis S2. BIOTECHNOLOGY FOR BIOFUELS 2018;11:301. [PMID: 30410576 PMCID: PMC6214177 DOI: 10.1186/s13068-018-1302-x] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2018] [Accepted: 10/25/2018] [Indexed: 05/16/2023]

Abstract

BACKGROUND

Methanogenic archaea are of importance to the global C-cycle and to biological methane (CH₄) production through anaerobic digestion and pure culture. Here, the individual and combined effects of copper (Cu), zinc (Zn), acetate, and propionate on the metabolism of the autotrophic, hydrogenotrophic methanogen Methanococcus maripaludis S2 were investigated. Cu, Zn, acetate, and propionate may interfere directly and indirectly with the acetyl-CoA synthesis and biological CH₄ production. Thus, these compounds can compromise or improve the performance of M. maripaludis, an organism which can be applied as biocatalyst in the carbon dioxide (CO₂)-based biological CH₄ production (CO₂-BMP) process or of methanogenic organisms applied in anaerobic digestion.

RESULTS

Here, we show that Cu concentration of 1.9 µmol L^-1 reduced growth of M. maripaludis, whereas 4.4 and 6.3 µmol L^-1 of Cu even further retarded biomass production. However, 1.0 mmol L^-1 of Zn enhanced growth, but at Zn concentrations > 2.4 mmol L^-1 no growth could be observed. When both, Cu and Zn, were supplemented to the medium, growth and CH₄ production could even be observed at the highest tested concentration of Cu (6.3 µmol L^-1). Hence, it seems that the addition of 1 mmol L^-1 of Zn enhanced the ability of M. maripaludis to counteract the toxic effect of Cu. The physiological effect to rising concentrations of acetate (12.2, 60.9, 121.9 mmol L^-1) and/or propionate (10.3, 52.0, 104.1 mmol L^-1) was also investigated. When instead of acetate 10.3 mmol L^-1 propionate was provided in the growth medium, M. maripaludis could grow without reduction of the specific growth rate (µ) or the specific CH₄ productivity (qCH₄). A combination of inorganic and/or organic compounds resulted in an increase of µ and qCH₄ for Zn/Cu and Zn/acetate beyond the values that were observed if only the individual concentrations of Zn, Cu, acetate were used.

CONCLUSIONS

Our study sheds light on the physiological effect of VFAs and heavy metals on M. maripaludis. Differently from µ and qCH₄, MER was not influenced by the presence of these compounds. This indicated that each of these compounds directly interacted with the C-fixation machinery of M. maripaludis. Until now, the uptake of VFAs other than acetate was not considered to enhance growth and CH₄ production of methanogens. The finding of propionate uptake by M. maripaludis is important for the interpretation of VFA cycling in anaerobic microenvironments. Due to the importance of methanogens in natural and artificial anaerobic environments, our results help to enhance the understanding the physiological and biotechnological importance with respect to anaerobic digestion, anaerobic wastewater treatment, and CO₂-BMP. Finally, we propose a possible mechanism for acetate uptake into M. maripaludis supported by in silico analyses.

Collapse

Zhu J, Zhang H, Li SC, Wang C, Kong L, Sun S, Zheng WM, Bu D. Improving protein fold recognition by extracting fold-specific features from predicted residue–residue contacts. Bioinformatics 2017;33:3749-3757. [DOI: 10.1093/bioinformatics/btx514] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2017] [Accepted: 08/09/2017] [Indexed: 01/05/2023] Open

Middleton SA, Illuminati J, Kim J. Complete fold annotation of the human proteome using a novel structural feature space. Sci Rep 2017;7:46321. [PMID: 28406174 PMCID: PMC5390313 DOI: 10.1038/srep46321] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2017] [Accepted: 03/14/2017] [Indexed: 11/11/2022] Open

Vaitinadapoule A, Etchebest C. Molecular Modeling of Transporters: From Low Resolution Cryo-Electron Microscopy Map to Conformational Exploration. The Example of TSPO. Methods Mol Biol 2017;1635:383-416. [PMID: 28755381 DOI: 10.1007/978-1-4939-7151-0_21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Cui X, Lu Z, Wang S, Jing-Yan Wang J, Gao X. CMsearch: simultaneous exploration of protein sequence space and structure space improves not only protein homology detection but also protein structure prediction. Bioinformatics 2016;32:i332-i340. [PMID: 27307635 PMCID: PMC4908355 DOI: 10.1093/bioinformatics/btw271] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Abstract

MOTIVATION

Protein homology detection, a fundamental problem in computational biology, is an indispensable step toward predicting protein structures and understanding protein functions. Despite the advances in recent decades on sequence alignment, threading and alignment-free methods, protein homology detection remains a challenging open problem. Recently, network methods that try to find transitive paths in the protein structure space demonstrate the importance of incorporating network information of the structure space. Yet, current methods merge the sequence space and the structure space into a single space, and thus introduce inconsistency in combining different sources of information.

METHOD

We present a novel network-based protein homology detection method, CMsearch, based on cross-modal learning. Instead of exploring a single network built from the mixture of sequence and structure space information, CMsearch builds two separate networks to represent the sequence space and the structure space. It then learns sequence-structure correlation by simultaneously taking sequence information, structure information, sequence space information and structure space information into consideration.

RESULTS

We tested CMsearch on two challenging tasks, protein homology detection and protein structure prediction, by querying all 8332 PDB40 proteins. Our results demonstrate that CMsearch is insensitive to the similarity metrics used to define the sequence and the structure spaces. By using HMM-HMM alignment as the sequence similarity metric, CMsearch clearly outperforms state-of-the-art homology detection methods and the CASP-winning template-based protein structure prediction methods.

AVAILABILITY AND IMPLEMENTATION

Our program is freely available for download from http://sfb.kaust.edu.sa/Pages/Software.aspx

CONTACT

: xin.gao@kaust.edu.sa

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Wang S, Li W, Liu S, Xu J. RaptorX-Property: a web server for protein structure property prediction. Nucleic Acids Res 2016;44:W430-5. [PMID: 27112573 PMCID: PMC4987890 DOI: 10.1093/nar/gkw306] [Citation(s) in RCA: 331] [Impact Index Per Article: 41.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2016] [Accepted: 04/12/2016] [Indexed: 11/14/2022] Open

Lhota J, Xie L. Protein-fold recognition using an improved single-source K diverse shortest paths algorithm. Proteins 2016;84:467-72. [PMID: 26800480 DOI: 10.1002/prot.24993] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2015] [Revised: 01/10/2016] [Accepted: 01/12/2016] [Indexed: 11/11/2022]

Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields. Sci Rep 2016;6:18962. [PMID: 26752681 PMCID: PMC4707437 DOI: 10.1038/srep18962] [Citation(s) in RCA: 255] [Impact Index Per Article: 31.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2015] [Accepted: 11/26/2015] [Indexed: 12/29/2022] Open

Tong J, Pei J, Grishin NV. SFESA: a web server for pairwise alignment refinement by secondary structure shifts. BMC Bioinformatics 2015;16:282. [PMID: 26335387 PMCID: PMC4558796 DOI: 10.1186/s12859-015-0711-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2015] [Accepted: 08/19/2015] [Indexed: 12/01/2022] Open

AcconPred: Predicting Solvent Accessibility and Contact Number Simultaneously by a Multitask Learning Framework under the Conditional Neural Fields Model. BIOMED RESEARCH INTERNATIONAL 2015;2015:678764. [PMID: 26339631 PMCID: PMC4538422 DOI: 10.1155/2015/678764] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/27/2014] [Accepted: 03/11/2015] [Indexed: 12/14/2022]

DeepCNF-D: Predicting Protein Order/Disorder Regions by Weighted Deep Convolutional Neural Fields. Int J Mol Sci 2015;16:17315-30. [PMID: 26230689 PMCID: PMC4581195 DOI: 10.3390/ijms160817315] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2015] [Revised: 07/15/2015] [Accepted: 07/16/2015] [Indexed: 12/14/2022] Open

Kozma D, Tusnády GE. TMFoldRec: a statistical potential-based transmembrane protein fold recognition tool. BMC Bioinformatics 2015;16:201. [PMID: 26123059 PMCID: PMC4486421 DOI: 10.1186/s12859-015-0638-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2015] [Accepted: 06/06/2015] [Indexed: 12/26/2022] Open

Bawono P, van der Velde A, Abeln S, Heringa J. Quantifying the displacement of mismatches in multiple sequence alignment benchmarks. PLoS One 2015;10:e0127431. [PMID: 25993129 PMCID: PMC4438059 DOI: 10.1371/journal.pone.0127431] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2014] [Accepted: 04/14/2015] [Indexed: 11/18/2022] Open

Abstract

Multiple Sequence Alignment (MSA) methods are typically benchmarked on sets of reference alignments. The quality of the alignment can then be represented by the sum-of-pairs (SP) or column (CS) scores, which measure the agreement between a reference and corresponding query alignment. Both the SP and CS scores treat mismatches between a query and reference alignment as equally bad, and do not take the separation into account between two amino acids in the query alignment, that should have been matched according to the reference alignment. This is significant since the magnitude of alignment shifts is often of relevance in biological analyses, including homology modeling and MSA refinement/manual alignment editing. In this study we develop a new alignment benchmark scoring scheme, SPdist, that takes the degree of discordance of mismatches into account by measuring the sequence distance between mismatched residue pairs in the query alignment. Using this new score along with the standard SP score, we investigate the discriminatory behavior of the new score by assessing how well six different MSA methods perform with respect to BAliBASE reference alignments. The SP score and the SPdist score yield very similar outcomes when the reference and query alignments are close. However, for more divergent reference alignments the SPdist score is able to distinguish between methods that keep alignments approximately close to the reference and those exhibiting larger shifts. We observed that by using SPdist together with SP scoring we were able to better delineate the alignment quality difference between alternative MSA methods. With a case study we exemplify why it is important, from a biological perspective, to consider the separation of mismatches. The SPdist scoring scheme has been implemented in the VerAlign web server (http://www.ibi.vu.nl/programs/veralignwww/). The code for calculating SPdist score is also available upon request.

Collapse