Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Xu J. Distance-based protein folding powered by deep learning. Proc Natl Acad Sci U S A 2019;116:16856-65. [PMID: 31399549 DOI: 10.1073/pnas.1821309116] [Citation(s) in RCA: 234] [Impact Index Per Article: 46.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Number

Cited by Other Article(s)

Han Y, Lu Y, Yan X, Cui H, Cheng S, Zheng J, Zhou Y, Wang S, Li Z. Atom-ProteinQA: Atom-level protein model quality assessment through fine-grained joint learning. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024;249:108078. [PMID: 38537495 DOI: 10.1016/j.cmpb.2024.108078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 12/26/2023] [Accepted: 02/10/2024] [Indexed: 04/21/2024]

Palamiuc L, Johnson JL, Haratipour Z, Loughran RM, Choi WJ, Arora GK, Tieu V, Ly K, Llorente A, Crabtree S, Wong JCY, Ravi A, Wiederhold T, Murad R, Blind RD, Emerling BM. Hippo and PI5P4K signaling intersect to control the transcriptional activation of YAP. Sci Signal 2024;17:eado6266. [PMID: 38805583 DOI: 10.1126/scisignal.ado6266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2024] [Accepted: 05/09/2024] [Indexed: 05/30/2024]

Sawhney A, Li J, Liao L. Improving AlphaFold Predicted Contacts for Alpha-Helical Transmembrane Proteins Using Structural Features. Int J Mol Sci 2024;25:5247. [PMID: 38791287 PMCID: PMC11121315 DOI: 10.3390/ijms25105247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Revised: 05/06/2024] [Accepted: 05/09/2024] [Indexed: 05/26/2024] Open

Xie T, Huang J. Can Protein Structure Prediction Methods Capture Alternative Conformations of Membrane Transporters? J Chem Inf Model 2024;64:3524-3536. [PMID: 38564295 DOI: 10.1021/acs.jcim.3c01936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]

Jing X, Wu F, Luo X, Xu J. Single-sequence protein structure prediction by integrating protein language models. Proc Natl Acad Sci U S A 2024;121:e2308788121. [PMID: 38507445 PMCID: PMC10990103 DOI: 10.1073/pnas.2308788121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 02/05/2024] [Indexed: 03/22/2024] Open

Jänes J, Beltrao P. Deep learning for protein structure prediction and design-progress and applications. Mol Syst Biol 2024;20:162-169. [PMID: 38291232 PMCID: PMC10912668 DOI: 10.1038/s44320-024-00016-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 12/21/2023] [Accepted: 01/11/2024] [Indexed: 02/01/2024] Open

Wuyun Q, Chen Y, Shen Y, Cao Y, Hu G, Cui W, Gao J, Zheng W. Recent Progress of Protein Tertiary Structure Prediction. Molecules 2024;29:832. [PMID: 38398585 PMCID: PMC10893003 DOI: 10.3390/molecules29040832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 02/06/2024] [Accepted: 02/08/2024] [Indexed: 02/25/2024] Open

Ran C, Pu K. Molecularly generated light and its biomedical applications. Angew Chem Int Ed Engl 2024;63:e202314468. [PMID: 37955419 DOI: 10.1002/anie.202314468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 11/01/2023] [Accepted: 11/10/2023] [Indexed: 11/14/2023]

Darden C, Donkor JE, Korolkova O, Barozai MYK, Chaudhuri M. Distinct structural motifs are necessary for targeting and import of Tim17 in Trypanosoma brucei mitochondrion. mSphere 2024;9:e0055823. [PMID: 38193679 PMCID: PMC10871166 DOI: 10.1128/msphere.00558-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 11/28/2023] [Indexed: 01/10/2024] Open

Abstract

Nuclear-encoded mitochondrial proteins are correctly translocated to their proper sub-mitochondrial destination using location-specific mitochondrial targeting signals and via multi-protein import machineries (translocases) in the outer and inner mitochondrial membranes (TOM and TIMs, respectively). However, targeting signals of multi-pass Tims are less defined. Here, we report the characterization of the targeting signals of Trypanosoma brucei Tim17 (TbTim17), an essential component of the most divergent TIM complex. TbTim17 possesses a characteristic secondary structure including four predicted transmembrane (TM) domains in the center with hydrophilic N- and C-termini. After examining mitochondrial localization of various deletion and site-directed mutants of TbTim17 in T. brucei using subcellular fractionation and confocal microscopy, we located at least two internal targeting signals (ITS): (i) within TM1 (31-50 AAs) and (ii) TM4 + loop 3 (120-136 AAs). Both signals are required for proper targeting and integration of TbTim17 in the membrane. Furthermore, a positively charged residue (K122) is critical for mitochondrial localization of TbTim17. This is the first report of characterizing the ITS for a multipass inner membrane protein in a divergent eukaryote, like T. brucei.IMPORTANCEAfrican trypanosomiasis (AT) is a deadly disease in human and domestic animals, caused by the parasitic protozoan Trypanosoma brucei. Therefore, AT is not only a concern for human health but also for economic development in the vast area of sub-Saharan Africa. T. brucei possesses a single mitochondrion per cell that imports hundreds of nuclear-encoded mitochondrial proteins for its functions. T. brucei Tim17 (TbTim17), an essential component of the TbTIM17 complex, is a nuclear-encoded protein; thus, it is necessary to be imported from the cytosol to form the TbTIM17 complex. Here, we demonstrated that the internal targeting signals within the transmembrane 1 (TM1) and TM4 with loop 3, and residue K122 are required collectively for import and integration of TbTim17 in the T. brucei mitochondrion. This information could be utilized to block TbTim17 function and parasite growth.

Collapse

Peng CX, Liang F, Xia YH, Zhao KL, Hou MH, Zhang GJ. Recent Advances and Challenges in Protein Structure Prediction. J Chem Inf Model 2024;64:76-95. [PMID: 38109487 DOI: 10.1021/acs.jcim.3c01324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2023]

Li J, Wang L, Zhu Z, Song C. Exploring the Alternative Conformation of a Known Protein Structure Based on Contact Map Prediction. J Chem Inf Model 2024;64:301-315. [PMID: 38117138 PMCID: PMC10777399 DOI: 10.1021/acs.jcim.3c01381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 12/03/2023] [Accepted: 12/05/2023] [Indexed: 12/21/2023]

Krokidis MG, Dimitrakopoulos GN, Vrahatis AG, Exarchos TP, Vlamos P. Challenges and limitations in computational prediction of protein misfolding in neurodegenerative diseases. Front Comput Neurosci 2024;17:1323182. [PMID: 38250244 PMCID: PMC10796696 DOI: 10.3389/fncom.2023.1323182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Accepted: 12/19/2023] [Indexed: 01/23/2024] Open

Salgado B, Rivas RB, Pinto D, Sonstegard TS, Carlson DF, Martins K, Bostrom JR, Sinebo Y, Rowland RRR, Brandariz-Nuñez A. Genetically modified pigs lacking CD163 PSTII-domain-coding exon 13 are completely resistant to PRRSV infection. Antiviral Res 2024;221:105793. [PMID: 38184111 DOI: 10.1016/j.antiviral.2024.105793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 12/18/2023] [Accepted: 01/02/2024] [Indexed: 01/08/2024]

Curatolo AI, Kimchi O, Goodrich CP, Krueger RK, Brenner MP. A computational toolbox for the assembly yield of complex and heterogeneous structures. Nat Commun 2023;14:8328. [PMID: 38097568 PMCID: PMC10721878 DOI: 10.1038/s41467-023-43168-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Accepted: 11/02/2023] [Indexed: 12/17/2023] Open

Zhou Y, Litfin T, Zhan J. 3 = 1 + 2: how the divide conquered de novo protein structure prediction and what is next? Natl Sci Rev 2023;10:nwad259. [PMID: 38033736 PMCID: PMC10684263 DOI: 10.1093/nsr/nwad259] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 09/18/2023] [Indexed: 12/02/2023] Open

Zhang J, Liu S, Chen M, Chu H, Wang M, Wang Z, Yu J, Ni N, Yu F, Chen D, Yang YI, Xue B, Yang L, Liu Y, Gao YQ. Unsupervisedly Prompting AlphaFold2 for Accurate Few-Shot Protein Structure Prediction. J Chem Theory Comput 2023;19:8460-8471. [PMID: 37947474 DOI: 10.1021/acs.jctc.3c00528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2023]

Pérez S. Computational modeling of protein-carbohydrate interactions: Current trends and future challenges. Adv Carbohydr Chem Biochem 2023;83:133-149. [PMID: 37968037 DOI: 10.1016/bs.accb.2023.10.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2023]

Ramya L, Helina Hilda S. Structural dynamics of moonlighting intrinsically disordered proteins - A black box in multiple sclerosis. J Mol Graph Model 2023;124:108572. [PMID: 37494873 DOI: 10.1016/j.jmgm.2023.108572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Revised: 07/19/2023] [Accepted: 07/20/2023] [Indexed: 07/28/2023]

Larrea-Sebal A, Jebari-Benslaiman S, Galicia-Garcia U, Jose-Urteaga AS, Uribe KB, Benito-Vicente A, Martín C. Predictive Modeling and Structure Analysis of Genetic Variants in Familial Hypercholesterolemia: Implications for Diagnosis and Protein Interaction Studies. Curr Atheroscler Rep 2023;25:839-859. [PMID: 37847331 PMCID: PMC10618353 DOI: 10.1007/s11883-023-01154-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/15/2023] [Indexed: 10/18/2023]

Abstract

PURPOSE OF REVIEW

Familial hypercholesterolemia (FH) is a hereditary condition characterized by elevated levels of low-density lipoprotein cholesterol (LDL-C), which increases the risk of cardiovascular disease if left untreated. This review aims to discuss the role of bioinformatics tools in evaluating the pathogenicity of missense variants associated with FH. Specifically, it highlights the use of predictive models based on protein sequence, structure, evolutionary conservation, and other relevant features in identifying genetic variants within LDLR, APOB, and PCSK9 genes that contribute to FH.

RECENT FINDINGS

In recent years, various bioinformatics tools have emerged as valuable resources for analyzing missense variants in FH-related genes. Tools such as REVEL, Varity, and CADD use diverse computational approaches to predict the impact of genetic variants on protein function. These tools consider factors such as sequence conservation, structural alterations, and receptor binding to aid in interpreting the pathogenicity of identified missense variants. While these predictive models offer valuable insights, the accuracy of predictions can vary, especially for proteins with unique characteristics that might not be well represented in the databases used for training. This review emphasizes the significance of utilizing bioinformatics tools for assessing the pathogenicity of FH-associated missense variants. Despite their contributions, a definitive diagnosis of a genetic variant necessitates functional validation through in vitro characterization or cascade screening. This step ensures the precise identification of FH-related variants, leading to more accurate diagnoses. Integrating genetic data with reliable bioinformatics predictions and functional validation can enhance our understanding of the genetic basis of FH, enabling improved diagnosis, risk stratification, and personalized treatment for affected individuals. The comprehensive approach outlined in this review promises to advance the management of this inherited disorder, potentially leading to better health outcomes for those affected by FH.

Collapse

Sawhney A, Li J, Liao L. Improving AlphaFold predicted contacts in alpha-helical transmembrane proteins structures using structural features. RESEARCH SQUARE 2023:rs.3.rs-3475769. [PMID: 37961476 PMCID: PMC10635369 DOI: 10.21203/rs.3.rs-3475769/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]

Fathollahi M, Motamedi H, Hossainpour H, Abiri R, Shahlaei M, Moradi S, Dashtbin S, Moradi J, Alvandi A. Designing a novel multi-epitopes pan-vaccine against SARS-CoV-2 and seasonal influenza: in silico and immunoinformatics approach. J Biomol Struct Dyn 2023:1-24. [PMID: 37723861 DOI: 10.1080/07391102.2023.2258420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 09/07/2023] [Indexed: 09/20/2023]

Cohen S, Schneidman-Duhovny D. A deep learning model for predicting optimal distance range in crosslinking mass spectrometry data. Proteomics 2023;23:e2200341. [PMID: 37070547 DOI: 10.1002/pmic.202200341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 04/02/2023] [Accepted: 04/03/2023] [Indexed: 04/19/2023]

Ho W, Huang H, Huang J. IFF: Identifying key residues in intrinsically disordered regions of proteins using machine learning. Protein Sci 2023;32:e4739. [PMID: 37498545 PMCID: PMC10443345 DOI: 10.1002/pro.4739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Revised: 06/21/2023] [Accepted: 07/25/2023] [Indexed: 07/28/2023]

Kandathil SM, Lau AM, Jones DT. Machine learning methods for predicting protein structure from single sequences. Curr Opin Struct Biol 2023;81:102627. [PMID: 37320955 DOI: 10.1016/j.sbi.2023.102627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 05/17/2023] [Accepted: 05/17/2023] [Indexed: 06/17/2023]

Darden C, Donkor J, Korolkova O, Khan Barozai MY, Chaudhuri M. Distinct structural motifs are necessary for targeting and import of Tim17 in Trypanosoma brucei mitochondrion. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.07.548172. [PMID: 37461662 PMCID: PMC10350046 DOI: 10.1101/2023.07.07.548172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/23/2024]

Raghavachari K, Maier S, Collins EM, Debnath S, Sengupta A. Approaching Coupled Cluster Accuracy with Density Functional Theory Using the Generalized Connectivity-Based Hierarchy. J Chem Theory Comput 2023. [PMID: 37338997 DOI: 10.1021/acs.jctc.3c00301] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/22/2023]

Cheng Y, Wang H, Xu H, Liu Y, Ma B, Chen X, Zeng X, Wang X, Wang B, Shiau C, Ovchinnikov S, Su XD, Wang C. Co-evolution-based prediction of metal-binding sites in proteomes by machine learning. Nat Chem Biol 2023;19:548-555. [PMID: 36593274 DOI: 10.1038/s41589-022-01223-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 11/08/2022] [Indexed: 01/03/2023]

Affiliation(s)

Yao Cheng Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
Haobo Wang Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
Hua Xu State Key Laboratory of Protein and Plant Gene Research, and Biomedical Pioneering Innovation Center (BIOPIC), Peking University, Beijing, China
Yuan Liu Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China. Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China.
Bin Ma Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
Xuemin Chen Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
Xin Zeng Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
Xianghe Wang Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
Bo Wang State Key Laboratory of Protein and Plant Gene Research, and Biomedical Pioneering Innovation Center (BIOPIC), Peking University, Beijing, China
Carina Shiau Cornell University, Ithaca, NY, USA
Sergey Ovchinnikov John Harvard Distinguished Science Fellow, Harvard University, Cambridge, MA, USA
Xiao-Dong Su State Key Laboratory of Protein and Plant Gene Research, and Biomedical Pioneering Innovation Center (BIOPIC), Peking University, Beijing, China.
Chu Wang Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China. Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China. Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China.

Collapse

Jafari Najaf Abadi MH, Abyaneh FA, Zare N, Zamani J, Abdoli A, Aslanbeigi F, Hamblin MR, Tarrahimofrad H, Rahimi M, Hashemian SM, Mirzaei H. In silico design and immunoinformatics analysis of a chimeric vaccine construct based on Salmonella pathogenesis factors. Microb Pathog 2023;180:106130. [PMID: 37121524 DOI: 10.1016/j.micpath.2023.106130] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 04/26/2023] [Accepted: 04/27/2023] [Indexed: 05/02/2023]

Choudhury A, Saha S, Maiti NC, Datta S. Exploring structural features and potential lipid interactions of Pseudomonas aeruginosa type three secretion effector PemB by spectroscopic and calorimetric experiments. Protein Sci 2023;32:e4627. [PMID: 36916835 PMCID: PMC10044109 DOI: 10.1002/pro.4627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 03/06/2023] [Accepted: 03/10/2023] [Indexed: 03/15/2023]

Leiva S, Bugnon Valdano M, Gardiol D. Unravelling the epidemiological diversity of Zika virus by analyzing key protein variations. Arch Virol 2023;168:115. [PMID: 36943525 DOI: 10.1007/s00705-023-05726-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 01/19/2023] [Indexed: 03/23/2023]

Li T, Li Y, Zhu X, He Y, Wu Y, Ying T, Xie Z. Artificial intelligence in cancer immunotherapy: Applications in neoantigen recognition, antibody design and immunotherapy response prediction. Semin Cancer Biol 2023;91:50-69. [PMID: 36870459 DOI: 10.1016/j.semcancer.2023.02.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 02/13/2023] [Accepted: 02/28/2023] [Indexed: 03/06/2023]

In silico design of a polypeptide as a vaccine candidate against ascariasis. Sci Rep 2023;13:3504. [PMID: 36864139 PMCID: PMC9981566 DOI: 10.1038/s41598-023-30445-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Accepted: 02/23/2023] [Indexed: 03/04/2023] Open

Huh E, Agosto MA, Wensel TG, Lichtarge O. Coevolutionary signals in metabotropic glutamate receptors capture residue contacts and long-range functional interactions. J Biol Chem 2023;299:103030. [PMID: 36806686 PMCID: PMC10060750 DOI: 10.1016/j.jbc.2023.103030] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 02/09/2023] [Accepted: 02/10/2023] [Indexed: 02/18/2023] Open

AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms. Commun Biol 2023;6:160. [PMID: 36755055 PMCID: PMC9908985 DOI: 10.1038/s42003-023-04488-9] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 01/16/2023] [Indexed: 02/10/2023] Open

Liu J, Zhao K, Zhang G. Improved model quality assessment using sequence and structural information by enhanced deep neural networks. Brief Bioinform 2023;24:6865134. [PMID: 36460624 DOI: 10.1093/bib/bbac507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 10/02/2022] [Accepted: 10/24/2022] [Indexed: 12/04/2022] Open

Velazquez MB, Busi MV, Gomez-Casati DF, Nag-Dasgupta C, Barchiesi J. Molecular insight into cellulose degradation by the phototrophic green alga Scenedesmus. Proteins 2023;91:750-770. [PMID: 36607613 DOI: 10.1002/prot.26464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 12/29/2022] [Accepted: 01/03/2023] [Indexed: 01/07/2023]

Bhattacharya S, Roche R, Shuvo MH, Moussad B, Bhattacharya D. Contact-Assisted Threading in Low-Homology Protein Modeling. Methods Mol Biol 2023;2627:41-59. [PMID: 36959441 DOI: 10.1007/978-1-0716-2974-1_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/25/2023]

Chen C, Chen X, Morehead A, Wu T, Cheng J. 3D-equivariant graph neural networks for protein model quality assessment. BIOINFORMATICS (OXFORD, ENGLAND) 2023;39:6986970. [PMID: 36637199 PMCID: PMC10089647 DOI: 10.1093/bioinformatics/btad030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 11/28/2022] [Accepted: 01/12/2023] [Indexed: 01/14/2023]

Abstract

MOTIVATION

Quality assessment (QA) of predicted protein tertiary structure models plays an important role in ranking and using them. With the recent development of deep learning end-to-end protein structure prediction techniques for generating highly confident tertiary structures for most proteins, it is important to explore corresponding QA strategies to evaluate and select the structural models predicted by them since these models have better quality and different properties than the models predicted by traditional tertiary structure prediction methods.

RESULTS

We develop EnQA, a novel graph-based 3D-equivariant neural network method that is equivariant to rotation and translation of 3D objects to estimate the accuracy of protein structural models by leveraging the structural features acquired from the state-of-the-art tertiary structure prediction method-AlphaFold2. We train and test the method on both traditional model datasets (e.g. the datasets of the Critical Assessment of Techniques for Protein Structure Prediction) and a new dataset of high-quality structural models predicted only by AlphaFold2 for the proteins whose experimental structures were released recently. Our approach achieves state-of-the-art performance on protein structural models predicted by both traditional protein structure prediction methods and the latest end-to-end deep learning method-AlphaFold2. It performs even better than the model QA scores provided by AlphaFold2 itself. The results illustrate that the 3D-equivariant graph neural network is a promising approach to the evaluation of protein structural models. Integrating AlphaFold2 features with other complementary sequence and structural features is important for improving protein model QA.

AVAILABILITY AND IMPLEMENTATION

The source code is available at https://github.com/BioinfoMachineLearning/EnQA.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Durairaj J, de Ridder D, van Dijk AD. Beyond sequence: Structure-based machine learning. Comput Struct Biotechnol J 2022;21:630-643. [PMID: 36659927 PMCID: PMC9826903 DOI: 10.1016/j.csbj.2022.12.039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 12/21/2022] [Accepted: 12/21/2022] [Indexed: 12/31/2022] Open

Chang Y, Hawkins BA, Du JJ, Groundwater PW, Hibbs DE, Lai F. A Guide to In Silico Drug Design. Pharmaceutics 2022;15:pharmaceutics15010049. [PMID: 36678678 PMCID: PMC9867171 DOI: 10.3390/pharmaceutics15010049] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Revised: 12/16/2022] [Accepted: 12/17/2022] [Indexed: 12/28/2022] Open

Miah MM, Tabassum N, Afroj Zinnia M, Islam ABMMK. Drug and Anti-Viral Peptide Design to Inhibit the Monkeypox Virus by Restricting A36R Protein. Bioinform Biol Insights 2022;16:11779322221141164. [PMID: 36570327 PMCID: PMC9772960 DOI: 10.1177/11779322221141164] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 11/06/2022] [Indexed: 12/24/2022] Open

Mufassirin MMM, Newton MAH, Sattar A. Artificial intelligence for template-free protein structure prediction: a comprehensive review. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10350-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Buehler MJ. Multiscale Modeling at the Interface of Molecular Mechanics and Natural Language through Attention Neural Networks. Acc Chem Res 2022;55:3387-3403. [PMID: 36378952 DOI: 10.1021/acs.accounts.2c00330] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract

Humans are continually bombarded with massive amounts of data. To deal with this influx of information, we use the concept of attention in order to perceive the most relevant input from vision, hearing, touch, and others. Thereby, the complex ensemble of signals is used to generate output by querying the processed data in appropriate ways. Attention is also the hallmark of the development of scientific theories, where we elucidate which parts of a problem are critical, often expressed through differential equations. In this Account we review the emergence of attention-based neural networks as a class of approaches that offer many opportunities to describe materials across scales and modalities, including how universal building blocks interact to yield a set of material properties. In fact, the self-assembly of hierarchical, structurally complex, and multifunctional biomaterials remains a grand challenge in modeling, theory, and experiment. Expanding from the process by which material building blocks physically interact to form a type of material, in this Account we view self-assembly as both the functional emergence of properties from interacting building blocks as well as the physical process by which elementary building blocks interact and yield structure and, thereby, functions. This perspective, integrated through the theory of materiomics, allows us to solve multiscale problems with a first-principles-based computational approach based on attention-based neural networks that transform information to feature to property while providing a flexible modeling approach that can integrate theory, simulation, and experiment. Since these models are based on a natural language framework, they offer various benefits including incorporation of general domain knowledge via general-purpose pretraining, which can be accomplished without labeled data or large amounts of lower-quality data. Pretrained models then offer a general-purpose platform that can be fine-tuned to adapt these models to make specific predictions, often with relatively little labeled data. The transferrable power of the language-based modeling approach realizes a neural olog description, where mathematical categorization is learned by multiheaded attention, without domain knowledge in its formulation. It can hence be applied to a range of complex modeling tasks─such as physical field predictions, molecular properties, or structure predictions, all using an identical formulation. This offers a complementary modeling approach that is already finding numerous applications, with great potential to solve complex assembly problems, enabling us to learn, build, and utilize functional categorization of how building blocks yield a range of material functions. In this Account, we demonstrate the approach in various application areas, including protein secondary structure prediction and prediction of normal-mode frequencies as well as predicting mechanical fields near cracks. Unifying these diverse problem areas is the building block approach, where the models are based on a universally applicable platform that offers benefits ranging from transferability, interpretability, and cross-domain pollination of knowledge as exemplified through a transformer model applied to predict how musical compositions infer de novo protein structures. We discuss future potentialities of this approach for a variety of material phenomena across scales, including the use in multiparadigm modeling schemes.

Collapse

Robert PA, Akbar R, Frank R, Pavlović M, Widrich M, Snapkov I, Slabodkin A, Chernigovskaya M, Scheffer L, Smorodina E, Rawat P, Mehta BB, Vu MH, Mathisen IF, Prósz A, Abram K, Olar A, Miho E, Haug DTT, Lund-Johansen F, Hochreiter S, Haff IH, Klambauer G, Sandve GK, Greiff V. Unconstrained generation of synthetic antibody-antigen structures to guide machine learning methodology for antibody specificity prediction. NATURE COMPUTATIONAL SCIENCE 2022;2:845-865. [PMID: 38177393 DOI: 10.1038/s43588-022-00372-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 11/09/2022] [Indexed: 01/06/2024]

Affiliation(s)

Philippe A Robert Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway.
Rahmad Akbar Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
Robert Frank Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
Milena Pavlović Department of Informatics, University of Oslo, Oslo, Norway
Michael Widrich ELLIS Unit Linz and LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria
Igor Snapkov Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
Andrei Slabodkin Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
Maria Chernigovskaya Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
Lonneke Scheffer Department of Informatics, University of Oslo, Oslo, Norway
Eva Smorodina Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
Puneet Rawat Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
Brij Bhushan Mehta Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
Mai Ha Vu Department of Linguistics and Scandinavian Studies, University of Oslo, Oslo, Norway
Ingvild Frøberg Mathisen Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
Aurél Prósz Danish Cancer Society Research Center, Translational Cancer Genomics, Copenhagen, Denmark
Krzysztof Abram The Novo Nordisk Foundation Center for Biosustainability, Autoflow, DTU Biosustain and IT University of Copenhagen, Copenhagen, Denmark
Alex Olar Department of Complex Systems in Physics, Eötvös Loránd University, Budapest, Hungary
Enkelejda Miho Institute of Medical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz, Switzerland aiNET GmbH, Basel, Switzerland Swiss Institute of Bioinformatics, Lausanne, Switzerland
Dag Trygve Tryslew Haug Department of Linguistics and Scandinavian Studies, University of Oslo, Oslo, Norway
Fridtjof Lund-Johansen Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
Sepp Hochreiter ELLIS Unit Linz and LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria Institute of Advanced Research in Artificial Intelligence (IARAI), Vienna, Austria
Ingrid Hobæk Haff Department of Mathematics, University of Oslo, Oslo, Norway
Günter Klambauer ELLIS Unit Linz and LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria
Geir Kjetil Sandve Department of Informatics, University of Oslo, Oslo, Norway
Victor Greiff Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway.

Collapse

Roche R, Bhattacharya S, Shuvo MH, Bhattacharya D. rrQNet: Protein contact map quality estimation by deep evolutionary reconciliation. Proteins 2022;90:2023-2034. [PMID: 35751651 PMCID: PMC9633355 DOI: 10.1002/prot.26394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 05/31/2022] [Accepted: 06/21/2022] [Indexed: 11/10/2022]

Protein structure prediction in the deep learning era. Curr Opin Struct Biol 2022;77:102495. [PMID: 36371845 DOI: 10.1016/j.sbi.2022.102495] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 10/03/2022] [Accepted: 10/04/2022] [Indexed: 11/11/2022]

Hasanzadeh A, Hamblin MR, Kiani J, Noori H, Hardie JM, Karimi M, Shafiee H. Could artificial intelligence revolutionize the development of nanovectors for gene therapy and mRNA vaccines? NANO TODAY 2022;47:101665. [PMID: 37034382 PMCID: PMC10081506 DOI: 10.1016/j.nantod.2022.101665] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]

Affiliation(s)

Akbar Hasanzadeh Cellular and Molecular Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran 1449614535, Iran
Michael R Hamblin Laser Research Centre, Faculty of Health Science, University of Johannesburg, Doornfontein 2028, South Africa Radiation Biology Research Center, Iran University of Medical Sciences, Tehran, Iran
Jafar Kiani Oncopathology Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Department of Molecular Medicine, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran, Iran
Hamid Noori Cellular and Molecular Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran 1449614535, Iran
Joseph M. Hardie Division of Engineering in Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 02139 USA
Mahdi Karimi Cellular and Molecular Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran 1449614535, Iran Oncopathology Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Research Center for Science and Technology in Medicine, Tehran University of Medical Sciences, Tehran 141556559, Iran Applied Biotechnology Research Centre, Tehran Medical Science, Islamic Azad University, Tehran 1584743311, Iran
Hadi Shafiee Division of Engineering in Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 02139 USA

Collapse

Katase N, Nishimatsu SI, Yamauchi A, Okano S, Fujita S. Establishment of anti-DKK3 peptide for the cancer control in head and neck squamous cell carcinoma (HNSCC). Cancer Cell Int 2022;22:352. [PMID: 36376957 PMCID: PMC9664703 DOI: 10.1186/s12935-022-02783-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 11/04/2022] [Indexed: 11/16/2022] Open

Abstract

Background

Head and neck squamous cell carcinoma (HNSCC) is the most common malignant tumor of the head and neck. We identified cancer-specific genes in HNSCC and focused on DKK3 expression. DKK3 gene codes two isoforms of proteins (secreted and non-secreted) with two distinct cysteine rich domains (CRDs). It is reported that DKK3 functions as a negative regulator of oncogenic Wnt signaling and, is therefore, considered to be a tumor suppressor gene. However, our series of studies have demonstrated that DKK3 expression is specifically high in HNSCC tissues and cells, and that DKK3 might determine the malignant potentials of HNSCC cells via the activation of Akt. Further analyses strongly suggested that both secreted DKK3 and non-secreted DKK3 could activate Akt signaling in discrete ways, and consequently exert tumor promoting effects. We hypothesized that DKK3 might be a specific druggable target, and it is necessary to establish a DKK3 inhibitor that can inhibit both secreted and non-secreted isoforms of DKK3.

Methods

Using inverse polymerase chain reaction, we generated mutant expression plasmids that express DKK3 without CRD1, CRD2, or both CRD1 and CRD2 (DKK3ΔC1, DKK3ΔC2, and DKK3ΔC1ΔC2, respectively). These plasmids were then transfected into HNSCC-derived cells to determine the domain responsible for DKK3-mediated Akt activation. We designed antisense peptides using the MIMETEC program, targeting DKK3-specific amino acid sequences within CRD1 and CRD2. The structural models for peptides and DKK3 were generated using Raptor X, and then a docking simulation was performed using CluPro2. Afterward, the best set of the peptides was applied into HNSCC-derived cells, and the effects on Akt phosphorylation, cellular proliferation, invasion, and migration were assessed. We also investigated the therapeutic effects of the peptides in the xenograft models.

Results

Transfection of mutant expression plasmids and subsequent functional analyses revealed that it is necessary to delete both CRD1 and CRD2 to inhibit Akt activation and inhibition of proliferation, migration, and invasion. The inhibitory peptides for CRD1 and CRD2 of DKK3 significantly reduced the phosphorylation of Akt, and consequently suppressed cellular proliferation, migration, invasion and in vivo tumor growth at very low doses.

Conclusions

This inhibitory peptide represents a promising new therapeutic strategy for HNSCC treatment.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12935-022-02783-9.

Collapse

AmyJ33, a truncated amylase with improved catalytic properties. Biotechnol Lett 2022;44:1447-1463. [DOI: 10.1007/s10529-022-03311-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Accepted: 10/10/2022] [Indexed: 11/06/2022]

Barger J, Adhikari B. New Labeling Methods for Deep Learning Real-Valued Inter-Residue Distance Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022;19:3586-3594. [PMID: 34559660 DOI: 10.1109/tcbb.2021.3115053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Abstract

BACKGROUND

Much of the recent success in protein structure prediction has been a result of accurate protein contact prediction-a binary classification problem. Dozens of methods, built from various types of machine learning and deep learning algorithms, have been published over the last two decades for predicting contacts. Recently, many groups, including Google DeepMind, have demonstrated that reformulating the problem as a multi-class classification problem is a more promising direction to pursue. As an alternative approach, we recently proposed real-valued distance predictions, formulating the problem as a regression problem. The nuances of protein 3D structures make this formulation appropriate, allowing predictions to reflect inter-residue distances in nature. Despite these promises, the accurate prediction of real-valued distances remains relatively unexplored; possibly due to classification being better suited to machine and deep learning algorithms.

METHODS

Can regression methods be designed to predict real-valued distances as precise as binary contacts? To investigate this, we propose multiple novel methods of input label engineering, which is different from feature engineering, with the goal of optimizing the distribution of distances to cater to the loss function of the deep-learning model. Since an important utility of predicted contacts or distances is to build three-dimensional models, we also tested if predicted distances can reconstruct more accurate models than contacts.

RESULTS

Our results demonstrate, for the first time, that deep learning methods for real-valued protein distance prediction can deliver distances as precise as binary classification methods. When using an optimal distance transformation function on the standard PSICOV dataset consisting of 150 representative proteins, the precision of 'top-all' long-range contacts improves from 60.9% to 61.4% when predicting real-valued distances instead of contacts. When building three-dimensional models we observed an average TM-score increase from 0.61 to 0.72, highlighting the advantage of predicting real-valued distances.

Collapse