Reference Citation Analysis

For: Oprea TI. Exploring the dark genome: implications for precision medicine. Mamm Genome 2019;30:192-200. [PMID: 31270560 DOI: 10.1007/s00335-019-09809-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Received: 04/23/2019] [Accepted: 06/15/2019] [Indexed: 01/08/2023]

Cited by Other Article(s)

Larmore M, Palomero OE, Kamat NP, DeCaen PG. A synthetic method to assay polycystin channel biophysics. bioRxiv 2024:2024.05.06.592666. [PMID: 38766162 PMCID: PMC11100589 DOI: 10.1101/2024.05.06.592666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/22/2024]

Muscò A, Martini D, Digregorio M, Broccoli V, Andreazzoli M. Shedding a Light on Dark Genes: A Comparative Expression Study of PRR12 Orthologues during Zebrafish Development. Genes (Basel) 2024;15:492. [PMID: 38674426 PMCID: PMC11050278 DOI: 10.3390/genes15040492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/16/2024] [Revised: 04/06/2024] [Accepted: 04/09/2024] [Indexed: 04/28/2024]

Brlek P, Bulić L, Bračić M, Projić P, Škaro V, Shah N, Shah P, Primorac D. Implementing Whole Genome Sequencing (WGS) in Clinical Practice: Advantages, Challenges, and Future Perspectives. Cells 2024;13:504. [PMID: 38534348 DOI: 10.3390/cells13060504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/06/2024] [Revised: 03/04/2024] [Accepted: 03/11/2024] [Indexed: 03/28/2024]

Oprea TI, Bologa C, Holmes J, Mathias S, Metzger VT, Waller A, Yang JJ, Leach AR, Jensen LJ, Kelleher KJ, Sheils TK, Mathé E, Avram S, Edwards JS. Overview of the Knowledge Management Center for Illuminating the Druggable Genome. Drug Discov Today 2024;29:103882. [PMID: 38218214 PMCID: PMC10939799 DOI: 10.1016/j.drudis.2024.103882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/18/2023] [Revised: 12/22/2023] [Accepted: 01/09/2024] [Indexed: 01/15/2024]

Koutrouli M, Nastou K, Piera Líndez P, Bouwmeester R, Rasmussen S, Martens L, Jensen LJ. FAVA: high-quality functional association networks inferred from scRNA-seq and proteomics data. Bioinformatics 2024;40:btae010. [PMID: 38192003 PMCID: PMC10868155 DOI: 10.1093/bioinformatics/btae010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/16/2023] [Revised: 12/07/2023] [Accepted: 01/05/2024] [Indexed: 01/10/2024]

Kafita D, Nkhoma P, Dzobo K, Sinkala M. Shedding light on the dark genome: Insights into the genetic, CRISPR-based, and pharmacological dependencies of human cancers and disease aggressiveness. PLoS One 2023;18:e0296029. [PMID: 38117798 PMCID: PMC10732413 DOI: 10.1371/journal.pone.0296029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/31/2023] [Accepted: 12/05/2023] [Indexed: 12/22/2023]

Cunningham M, Pins D, Dezső Z, Torrent M, Vasanthakumar A, Pandey A. PINNED: identifying characteristics of druggable human proteins using an interpretable neural network. J Cheminform 2023;15:64. [PMID: 37468968 PMCID: PMC10354961 DOI: 10.1186/s13321-023-00735-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/23/2023] [Accepted: 07/10/2023] [Indexed: 07/21/2023]

Lindovsky J, Nichtova Z, Dragano NRV, Pajuelo Reguera D, Prochazka J, Fuchs H, Marschall S, Gailus-Durner V, Sedlacek R, Hrabě de Angelis M, Rozman J, Spielmann N. A review of standardized high-throughput cardiovascular phenotyping with a link to metabolism in mice. Mamm Genome 2023;34:107-122. [PMID: 37326672 PMCID: PMC10290615 DOI: 10.1007/s00335-023-09997-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/20/2023] [Accepted: 05/03/2023] [Indexed: 06/17/2023]

Affiliation(s)

Jiri Lindovsky Czech Centre for Phenogenomics, Institute of Molecular Genetics, Czech Academy of Sciences, Prumyslova 595, 252 50 Vestec, Czech Republic
Zuzana Nichtova Czech Centre for Phenogenomics, Institute of Molecular Genetics, Czech Academy of Sciences, Prumyslova 595, 252 50 Vestec, Czech Republic
Nathalia R. V. Dragano Institute of Experimental Genetics, German Mouse Clinic, Helmholtz Center Munich, German Research Center for Environmental Health, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany
David Pajuelo Reguera Czech Centre for Phenogenomics, Institute of Molecular Genetics, Czech Academy of Sciences, Prumyslova 595, 252 50 Vestec, Czech Republic
Jan Prochazka Czech Centre for Phenogenomics, Institute of Molecular Genetics, Czech Academy of Sciences, Prumyslova 595, 252 50 Vestec, Czech Republic
Helmut Fuchs Institute of Experimental Genetics, German Mouse Clinic, Helmholtz Center Munich, German Research Center for Environmental Health, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany
Susan Marschall Institute of Experimental Genetics, German Mouse Clinic, Helmholtz Center Munich, German Research Center for Environmental Health, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany
Valerie Gailus-Durner Institute of Experimental Genetics, German Mouse Clinic, Helmholtz Center Munich, German Research Center for Environmental Health, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany
Radislav Sedlacek Czech Centre for Phenogenomics, Institute of Molecular Genetics, Czech Academy of Sciences, Prumyslova 595, 252 50 Vestec, Czech Republic
Martin Hrabě de Angelis Institute of Experimental Genetics, German Mouse Clinic, Helmholtz Center Munich, German Research Center for Environmental Health, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany
Jan Rozman Czech Centre for Phenogenomics, Institute of Molecular Genetics, Czech Academy of Sciences, Prumyslova 595, 252 50 Vestec, Czech Republic; Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
Nadine Spielmann Institute of Experimental Genetics, German Mouse Clinic, Helmholtz Center Munich, German Research Center for Environmental Health, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany

Zhang J, Wang T, Bi J, Ke M, Ren Y, Wang M, Du Z, Liu W, Hu L, Zhang X, Liu X, Wang B, Wu Z, Lv Y, Meng L, Wu R. Overexpression of HSF2 binding protein suppresses endoplasmic reticulum stress via regulating subcellular localization of CDC73 in hepatocytes. Cell Biosci 2023;13:64. [PMID: 36964632 PMCID: PMC10039577 DOI: 10.1186/s13578-023-01010-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/27/2022] [Accepted: 03/07/2023] [Indexed: 03/26/2023]

Affiliation(s)

Jia Zhang National Local Joint Engineering Research Center for Precision Surgery & Regenerative Medicine, Shaanxi Provincial Center for Regenerative Medicine and Surgical Engineering, Center for Regenerative and Reconstructive Medicine, Med-X Institute, First Affiliated Hospital of Xi'an Jiaotong University, 124, 76 West Yanta Road, Xi'an, Shaanxi, 710061, China; Department of Gastroenterology, The Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
Tao Wang National Local Joint Engineering Research Center for Precision Surgery & Regenerative Medicine, Shaanxi Provincial Center for Regenerative Medicine and Surgical Engineering, Center for Regenerative and Reconstructive Medicine, Med-X Institute, First Affiliated Hospital of Xi'an Jiaotong University, 124, 76 West Yanta Road, Xi'an, Shaanxi, 710061, China; Department of Hepatobiliary Surgery, First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
Jianbin Bi National Local Joint Engineering Research Center for Precision Surgery & Regenerative Medicine, Shaanxi Provincial Center for Regenerative Medicine and Surgical Engineering, Center for Regenerative and Reconstructive Medicine, Med-X Institute, First Affiliated Hospital of Xi'an Jiaotong University, 124, 76 West Yanta Road, Xi'an, Shaanxi, 710061, China; Department of Oncology, The Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
Mengyun Ke National Local Joint Engineering Research Center for Precision Surgery & Regenerative Medicine, Shaanxi Provincial Center for Regenerative Medicine and Surgical Engineering, Center for Regenerative and Reconstructive Medicine, Med-X Institute, First Affiliated Hospital of Xi'an Jiaotong University, 124, 76 West Yanta Road, Xi'an, Shaanxi, 710061, China
Yifan Ren National Local Joint Engineering Research Center for Precision Surgery & Regenerative Medicine, Shaanxi Provincial Center for Regenerative Medicine and Surgical Engineering, Center for Regenerative and Reconstructive Medicine, Med-X Institute, First Affiliated Hospital of Xi'an Jiaotong University, 124, 76 West Yanta Road, Xi'an, Shaanxi, 710061, China; Department of General Surgery, The Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
Mengzhou Wang National Local Joint Engineering Research Center for Precision Surgery & Regenerative Medicine, Shaanxi Provincial Center for Regenerative Medicine and Surgical Engineering, Center for Regenerative and Reconstructive Medicine, Med-X Institute, First Affiliated Hospital of Xi'an Jiaotong University, 124, 76 West Yanta Road, Xi'an, Shaanxi, 710061, China; Department of Hepatobiliary Surgery, First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
Zhaoqing Du National Local Joint Engineering Research Center for Precision Surgery & Regenerative Medicine, Shaanxi Provincial Center for Regenerative Medicine and Surgical Engineering, Center for Regenerative and Reconstructive Medicine, Med-X Institute, First Affiliated Hospital of Xi'an Jiaotong University, 124, 76 West Yanta Road, Xi'an, Shaanxi, 710061, China; Department of Hepatobiliary Surgery, Shaanxi Provincial People's Hospital, Xi'an, Shaanxi, China
Wuming Liu National Local Joint Engineering Research Center for Precision Surgery & Regenerative Medicine, Shaanxi Provincial Center for Regenerative Medicine and Surgical Engineering, Center for Regenerative and Reconstructive Medicine, Med-X Institute, First Affiliated Hospital of Xi'an Jiaotong University, 124, 76 West Yanta Road, Xi'an, Shaanxi, 710061, China; Department of Hepatobiliary Surgery, First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
Liangshuo Hu Department of Hepatobiliary Surgery, First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
Xiaogang Zhang Department of Hepatobiliary Surgery, First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
Xuemin Liu Department of Hepatobiliary Surgery, First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
Bo Wang Department of Hepatobiliary Surgery, First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
Zheng Wu Department of Hepatobiliary Surgery, First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
Yi Lv National Local Joint Engineering Research Center for Precision Surgery & Regenerative Medicine, Shaanxi Provincial Center for Regenerative Medicine and Surgical Engineering, Center for Regenerative and Reconstructive Medicine, Med-X Institute, First Affiliated Hospital of Xi'an Jiaotong University, 124, 76 West Yanta Road, Xi'an, Shaanxi, 710061, China; Department of Hepatobiliary Surgery, First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
Lingzhong Meng Anesthesiology and Perioperative Medicine, Mayo Clinic College of Medicine, Rochester, MN, USA
Rongqian Wu National Local Joint Engineering Research Center for Precision Surgery & Regenerative Medicine, Shaanxi Provincial Center for Regenerative Medicine and Surgical Engineering, Center for Regenerative and Reconstructive Medicine, Med-X Institute, First Affiliated Hospital of Xi'an Jiaotong University, 124, 76 West Yanta Road, Xi'an, Shaanxi, 710061, China

Lachmann A, Rizzo KA, Bartal A, Jeon M, Clarke DJB, Ma’ayan A. PrismEXP: gene annotation prediction from stratified gene-gene co-expression matrices. PeerJ 2023;11:e14927. [PMID: 36874981 PMCID: PMC9979837 DOI: 10.7717/peerj.14927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/06/2022] [Accepted: 01/30/2023] [Indexed: 03/03/2023]

Abstract

Background

Gene-gene co-expression correlations measured by mRNA-sequencing (RNA-seq) can be used to predict gene annotations based on the co-variance structure within these data. In our prior work, we showed that uniformly aligned RNA-seq co-expression data from thousands of diverse studies is highly predictive of both gene annotations and protein-protein interactions. However, the performance of the predictions varies depending on whether the gene annotations and interactions are cell type and tissue specific or agnostic. Tissue and cell type-specific gene-gene co-expression data can be useful for making more accurate predictions because many genes perform their functions in unique ways in different cellular contexts. However, identifying the optimal tissues and cell types to partition the global gene-gene co-expression matrix is challenging.

Results

Here we introduce and validate an approach called PRediction of gene Insights from Stratified Mammalian gene co-EXPression (PrismEXP) for improved gene annotation predictions based on RNA-seq gene-gene co-expression data. Using uniformly aligned data from ARCHS4, we apply PrismEXP to predict a wide variety of gene annotations including pathway membership, Gene Ontology terms, as well as human and mouse phenotypes. Predictions made with PrismEXP outperform predictions made with the global cross-tissue co-expression correlation matrix approach on all tested domains, and training using one annotation domain can be used to predict annotations in other domains.

Conclusions

By demonstrating the utility of PrismEXP predictions in multiple use cases we show how PrismEXP can be used to enhance unsupervised machine learning methods to better understand the roles of understudied genes and proteins. To make PrismEXP accessible, it is provided via a user-friendly web interface, a Python package, and an Appyter. AVAILABILITY. The PrismEXP web-based application, with pre-computed PrismEXP predictions, is available from: https://maayanlab.cloud/prismexp; PrismEXP is also available as an Appyter: https://appyters.maayanlab.cloud/PrismEXP/; and as Python package: https://github.com/maayanlab/prismexp.

Kumar L, Brenner N, Sledzieski S, Olaosebikan M, Roger LM, Lynn-Goin M, Klein-Seetharaman R, Berger B, Putnam H, Yang J, Lewinski NA, Singh R, Daniels NM, Cowen L, Klein-Seetharaman J. Transfer of knowledge from model organisms to evolutionarily distant non-model organisms: The coral Pocillopora damicornis membrane signaling receptome. PLoS One 2023;18:e0270965. [PMID: 36735673 PMCID: PMC9897584 DOI: 10.1371/journal.pone.0270965] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 10/28/2021] [Accepted: 06/21/2022] [Indexed: 02/04/2023]

Abstract

With the ease of gene sequencing and the technology available to study and manipulate non-model organisms, the extension of the methodological toolbox required to translate our understanding of model organisms to non-model organisms has become an urgent problem. For example, mining of large coral and their symbiont sequence data is a challenge, but also provides an opportunity for understanding functionality and evolution of these and other non-model organisms. Much more information than for any other eukaryotic species is available for humans, especially related to signal transduction and diseases. However, the coral cnidarian host and human have diverged over 700 million years ago and homologies between proteins in the two species are therefore often in the gray zone, or at least often undetectable with traditional BLAST searches. We introduce a two-stage approach to identifying putative coral homologues of human proteins. First, through remote homology detection using Hidden Markov Models, we identify candidate human homologues in the cnidarian genome. However, for many proteins, the human genome alone contains multiple family members with similar or even more divergence in sequence. In the second stage, therefore, we filter the remote homology results based on the functional and structural plausibility of each coral candidate, shortlisting the coral proteins likely to have conserved some of the functions of the human proteins. We demonstrate our approach with a pipeline for mapping membrane receptors in humans to membrane receptors in corals, with specific focus on the stony coral, P. damicornis. More than 1000 human membrane receptors mapped to 335 coral receptors, including 151 G protein coupled receptors (GPCRs). 
To validate specific sub-families, we chose opsin proteins, representative GPCRs that confer light sensitivity, and Toll-like receptors, representative non-GPCRs, which function in the immune response, and their ability to communicate with microorganisms. Through detailed structure-function analysis of their ligand-binding pockets and downstream signaling cascades, we selected those candidate remote homologues likely to carry out related functions in the corals. This pipeline may prove generally useful for other non-model organisms, such as to support the growing field of synthetic biology.

Cai T, Xie L, Zhang S, Chen M, He D, Badkul A, Liu Y, Namballa HK, Dorogan M, Harding WW, Mura C, Bourne PE, Xie L. End-to-end sequence-structure-function meta-learning predicts genome-wide chemical-protein interactions for dark proteins. PLoS Comput Biol 2023;19:e1010851. [PMID: 36652496 PMCID: PMC9886305 DOI: 10.1371/journal.pcbi.1010851] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 07/28/2022] [Revised: 01/30/2023] [Accepted: 01/05/2023] [Indexed: 01/19/2023]

Abstract

Systematically discovering protein-ligand interactions across the entire human and pathogen genomes is critical in chemical genomics, protein function prediction, drug discovery, and many other areas. However, more than 90% of gene families remain "dark"-i.e., their small-molecule ligands are undiscovered due to experimental limitations or human/historical biases. Existing computational approaches typically fail when the dark protein differs from those with known ligands. To address this challenge, we have developed a deep learning framework, called PortalCG, which consists of four novel components: (i) a 3-dimensional ligand binding site enhanced sequence pre-training strategy to encode the evolutionary links between ligand-binding sites across gene families; (ii) an end-to-end pretraining-fine-tuning strategy to reduce the impact of inaccuracy of predicted structures on function predictions by recognizing the sequence-structure-function paradigm; (iii) a new out-of-cluster meta-learning algorithm that extracts and accumulates information learned from predicting ligands of distinct gene families (meta-data) and applies the meta-data to a dark gene family; and (iv) a stress model selection step, using different gene families in the test data from those in the training and development data sets to facilitate model deployment in a real-world scenario. In extensive and rigorous benchmark experiments, PortalCG considerably outperformed state-of-the-art techniques of machine learning and protein-ligand docking when applied to dark gene families, and demonstrated its generalization power for target identifications and compound screenings under out-of-distribution (OOD) scenarios. Furthermore, in an external validation for the multi-target compound screening, the performance of PortalCG surpassed the rational design from medicinal chemists. 
Our results also suggest that a differentiable sequence-structure-function deep learning framework, where protein structural information serves as an intermediate layer, could be superior to conventional methodology where predicted protein structures were used for the compound screening. We applied PortalCG to two case studies to exemplify its potential in drug discovery: designing selective dual-antagonists of dopamine receptors for the treatment of opioid use disorder (OUD), and illuminating the understudied human genome for target diseases that do not yet have effective and safe therapeutics. Our results suggested that PortalCG is a viable solution to the OOD problem in exploring understudied regions of protein functional space.

Affiliation(s)

Tian Cai Ph.D. Program in Computer Science, The Graduate Center, The City University of New York, New York, New York, United States of America
Li Xie Department of Computer Science, Hunter College, The City University of New York, New York, New York, United States of America
Shuo Zhang Ph.D. Program in Computer Science, The Graduate Center, The City University of New York, New York, New York, United States of America
Muge Chen Master Program in Computer Science, Courant Institute of Mathematical Sciences, New York University, New York, New York, United States of America
Di He Ph.D. Program in Computer Science, The Graduate Center, The City University of New York, New York, New York, United States of America
Amitesh Badkul Department of Computer Science, Hunter College, The City University of New York, New York, New York, United States of America
Yang Liu Department of Computer Science, Hunter College, The City University of New York, New York, New York, United States of America
Hari Krishna Namballa Department of Chemistry, Hunter College, The City University of New York, New York, New York, United States of America
Michael Dorogan Department of Chemistry, Hunter College, The City University of New York, New York, New York, United States of America
Wayne W. Harding Department of Chemistry, Hunter College, The City University of New York, New York, New York, United States of America
Cameron Mura School of Data Science & Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
Philip E. Bourne School of Data Science & Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
Lei Xie Ph.D. Program in Computer Science, The Graduate Center, The City University of New York, New York, New York, United States of America; Department of Computer Science, Hunter College, The City University of New York, New York, New York, United States of America; Helen and Robert Appel Alzheimer’s Disease Research Institute, Feil Family Brain & Mind Research Institute, Weill Cornell Medicine, Cornell University, New York, New York, United States of America

Amaral MD. Using the genome to correct the ion transport defect in cystic fibrosis. J Physiol 2022;601:1573-1582. [PMID: 36068724 DOI: 10.1113/jp282308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/04/2022] [Accepted: 08/31/2022] [Indexed: 11/08/2022]

Gridley T, Murray SA. Mouse mutagenesis and phenotyping to generate models of development and disease. Curr Top Dev Biol 2022;148:1-12. [PMID: 35461561 PMCID: PMC11275630 DOI: 10.1016/bs.ctdb.2022.02.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 11/30/2022]

Cai T, Abbu KA, Liu Y, Xie L. DeepREAL: A Deep Learning Powered Multi-scale Modeling Framework for Predicting Out-of-distribution Ligand-induced GPCR Activity. Bioinformatics 2022;38:2561-2570. [PMID: 35274689 PMCID: PMC9048666 DOI: 10.1093/bioinformatics/btac154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/25/2021] [Revised: 02/18/2022] [Accepted: 03/10/2022] [Indexed: 11/20/2022]

Cai T, Xie L, Chen M, Liu Y, He D, Zhang S, Mura C, Bourne PE, Xie L. Exploration of Dark Chemical Genomics Space via Portal Learning: Applied to Targeting the Undruggable Genome and COVID-19 Anti-Infective Polypharmacology. Research Square 2021:rs.3.rs-1109318. [PMID: 34873596 PMCID: PMC8647653 DOI: 10.21203/rs.3.rs-1109318/v1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 12/15/2022]

Sheils T, Mathias SL, Siramshetty VB, Bocci G, Bologa CG, Yang JJ, Waller A, Southall N, Nguyen DT, Oprea TI. How to Illuminate the Druggable Genome Using Pharos. Curr Protoc Bioinformatics 2021;69:e92. [PMID: 31898878 DOI: 10.1002/cpbi.92] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Indexed: 12/16/2022]

Ferrari E, Naponelli V, Bettuzzi S. Lemur Tyrosine Kinases and Prostate Cancer: A Literature Review. Int J Mol Sci 2021;22:5453. [PMID: 34064250 PMCID: PMC8196904 DOI: 10.3390/ijms22115453] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 03/30/2021] [Revised: 05/06/2021] [Accepted: 05/18/2021] [Indexed: 12/16/2022]

Cai T, Lim H, Abbu KA, Qiu Y, Nussinov R, Xie L. MSA-Regularized Protein Sequence Transformer toward Predicting Genome-Wide Chemical-Protein Interactions: Application to GPCRome Deorphanization. J Chem Inf Model 2021;61:1570-1582. [PMID: 33757283 PMCID: PMC8154251 DOI: 10.1021/acs.jcim.0c01285] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 11/04/2020] [Indexed: 01/14/2023]

Abstract

Small molecules play a critical role in modulating biological systems. Knowledge of chemical-protein interactions helps address fundamental and practical questions in biology and medicine. However, with the rapid emergence of newly sequenced genes, the endogenous or surrogate ligands of a vast number of proteins remain unknown. Homology modeling and machine learning are two major methods for assigning new ligands to a protein but mostly fail when sequence homology between an unannotated protein and those with known functions or structures is low. In this study, we develop a new deep learning framework to predict chemical binding to evolutionary divergent unannotated proteins, whose ligand cannot be reliably predicted by existing methods. By incorporating evolutionary information into self-supervised learning of unlabeled protein sequences, we develop a novel method, distilled sequence alignment embedding (DISAE), for the protein sequence representation. DISAE can utilize all protein sequences and their multiple sequence alignment (MSA) to capture functional relationships between proteins without the knowledge of their structure and function. Followed by the DISAE pretraining, we devise a module-based fine-tuning strategy for the supervised learning of chemical-protein interactions. In the benchmark studies, DISAE significantly improves the generalizability of machine learning models and outperforms the state-of-the-art methods by a large margin. Comprehensive ablation studies suggest that the use of MSA, sequence distillation, and triplet pretraining critically contributes to the success of DISAE. The interpretability analysis of DISAE suggests that it learns biologically meaningful information. We further use DISAE to assign ligands to human orphan G-protein coupled receptors (GPCRs) and to cluster the human GPCRome by integrating their phylogenetic and ligand relationships. 
The promising results of DISAE open an avenue for exploring the chemical landscape of entire sequenced genomes.

Koulouras G, Frith MC. Significant non-existence of sequences in genomes and proteomes. Nucleic Acids Res 2021;49:3139-3155. [PMID: 33693858 PMCID: PMC8034619 DOI: 10.1093/nar/gkab139] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 01/04/2021] [Revised: 02/11/2021] [Accepted: 02/25/2021] [Indexed: 12/22/2022]

Horner NR, Venkataraman S, Armit C, Casero R, Brown JM, Wong MD, van Eede MC, Henkelman RM, Johnson S, Teboul L, Wells S, Brown SD, Westerberg H, Mallon AM. LAMA: automated image analysis for the developmental phenotyping of mouse embryos. Development 2021;148:dev192955. [PMID: 33574040 PMCID: PMC8015254 DOI: 10.1242/dev.192955] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 05/19/2020] [Accepted: 12/21/2020] [Indexed: 11/20/2022]

UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res 2021. [PMID: 33237286 DOI: 10.1093/nar/gkaa] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 12/03/2022]

Avram S, Bologa CG, Holmes J, Bocci G, Wilson TB, Nguyen DT, Curpan R, Halip L, Bora A, Yang JJ, Knockel J, Sirimulla S, Ursu O, Oprea TI. DrugCentral 2021 supports drug discovery and repositioning. Nucleic Acids Res 2021;49:D1160-D1169. [PMID: 33151287 PMCID: PMC7779058 DOI: 10.1093/nar/gkaa997] [Citation(s) in RCA: 94] [Impact Index Per Article: 31.3] [Received: 09/17/2020] [Revised: 10/09/2020] [Accepted: 10/14/2020] [Indexed: 12/18/2022]

Affiliation(s)

Sorin Avram Department of Computational Chemistry, “Coriolan Dragulescu” Institute of Chemistry, 24 Mihai Viteazu Blvd, Timişoara, Timiş, 300223, România
Cristian G Bologa Translational Informatics Division, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA UNM Comprehensive Cancer Center, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA
Jayme Holmes Translational Informatics Division, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA
Giovanni Bocci Translational Informatics Division, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA
Thomas B Wilson College of Pharmacy, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA
Dac-Trung Nguyen National Center for Advancing Translational Science, 9800 Medical Center Drive, Rockville, MD 20850, USA
Ramona Curpan Department of Computational Chemistry, “Coriolan Dragulescu” Institute of Chemistry, 24 Mihai Viteazu Blvd, Timişoara, Timiş, 300223, România
Liliana Halip Department of Computational Chemistry, “Coriolan Dragulescu” Institute of Chemistry, 24 Mihai Viteazu Blvd, Timişoara, Timiş, 300223, România
Alina Bora Department of Computational Chemistry, “Coriolan Dragulescu” Institute of Chemistry, 24 Mihai Viteazu Blvd, Timişoara, Timiş, 300223, România
Jeremy J Yang Translational Informatics Division, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA
Jeffrey Knockel Department of Computer Science, University of New Mexico, Albuquerque, NM 87131, USA
Suman Sirimulla Department of Pharmaceutical Sciences, School of Pharmacy, The University of Texas at El Paso, TX 79902, USA
Oleg Ursu Computational and Structural Chemistry, Merck & Co., Inc., 2000 Galloping Hill Road, Kenilworth, NJ 07033, USA
Tudor I Oprea Translational Informatics Division, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA Computational and Structural Chemistry, Merck & Co., Inc., 2000 Galloping Hill Road, Kenilworth, NJ 07033, USA Department of Rheumatology and Inflammation Research, Institute of Medicine, Sahlgrenska Academy at University of Gothenburg, 40530 Gothenburg, Sweden Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen, Denmark

Bateman A, Martin MJ, Orchard S, Magrane M, Agivetova R, Ahmad S, Alpi E, Bowler-Barnett EH, Britto R, Bursteinas B, Bye-A-Jee H, Coetzee R, Cukura A, Da Silva A, Denny P, Dogan T, Ebenezer T, Fan J, Castro LG, Garmiri P, Georghiou G, Gonzales L, Hatton-Ellis E, Hussein A, Ignatchenko A, Insana G, Ishtiaq R, Jokinen P, Joshi V, Jyothi D, Lock A, Lopez R, Luciani A, Luo J, Lussi Y, MacDougall A, Madeira F, Mahmoudy M, Menchi M, Mishra A, Moulang K, Nightingale A, Oliveira CS, Pundir S, Qi G, Raj S, Rice D, Lopez MR, Saidi R, Sampson J, Sawford T, Speretta E, Turner E, Tyagi N, Vasudev P, Volynkin V, Warner K, Watkins X, Zaru R, Zellner H, Bridge A, Poux S, Redaschi N, Aimo L, Argoud-Puy G, Auchincloss A, Axelsen K, Bansal P, Baratin D, Blatter MC, Bolleman J, Boutet E, Breuza L, Casals-Casas C, de Castro E, Echioukh KC, Coudert E, Cuche B, Doche M, Dornevil D, Estreicher A, Famiglietti ML, Feuermann M, Gasteiger E, Gehant S, Gerritsen V, Gos A, Gruaz-Gumowski N, Hinz U, Hulo C, Hyka-Nouspikel N, Jungo F, Keller G, Kerhornou A, Lara V, Le Mercier P, Lieberherr D, Lombardot T, Martin X, Masson P, Morgat A, Neto TB, Paesano S, Pedruzzi I, Pilbout S, Pourcel L, Pozzato M, Pruess M, Rivoire C, Sigrist C, Sonesson K, Stutz A, Sundaram S, Tognolli M, Verbregue L, Wu CH, Arighi CN, Arminski L, Chen C, Chen Y, Garavelli JS, Huang H, Laiho K, McGarvey P, Natale DA, Ross K, Vinayaka CR, Wang Q, Wang Y, Yeh LS, Zhang J, Ruch P, Teodoro D. UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res 2021;49:D480-D489. [PMID: 33237286 PMCID: PMC7778908 DOI: 10.1093/nar/gkaa1100] [Citation(s) in RCA: 3710] [Impact Index Per Article: 1236.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 10/21/2020] [Accepted: 11/02/2020] [Indexed: 02/07/2023] Open

Sheils TK, Mathias SL, Kelleher KJ, Siramshetty VB, Nguyen DT, Bologa CG, Jensen LJ, Vidović D, Koleti A, Schürer SC, Waller A, Yang JJ, Holmes J, Bocci G, Southall N, Dharkar P, Mathé E, Simeonov A, Oprea TI. TCRD and Pharos 2021: mining the human proteome for disease biology. Nucleic Acids Res 2021;49:D1334-D1346. [PMID: 33156327 PMCID: PMC7778974 DOI: 10.1093/nar/gkaa993] [Citation(s) in RCA: 91] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 10/09/2020] [Accepted: 10/14/2020] [Indexed: 12/13/2022] Open

Affiliation(s)

Timothy K Sheils National Center for Advancing Translational Science, 9800 Medical Center Drive, Rockville, MD 20850, USA
Stephen L Mathias Translational Informatics Division, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA
Keith J Kelleher National Center for Advancing Translational Science, 9800 Medical Center Drive, Rockville, MD 20850, USA
Vishal B Siramshetty National Center for Advancing Translational Science, 9800 Medical Center Drive, Rockville, MD 20850, USA
Dac-Trung Nguyen National Center for Advancing Translational Science, 9800 Medical Center Drive, Rockville, MD 20850, USA
Cristian G Bologa Translational Informatics Division, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA
Lars Juhl Jensen Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen, Denmark
Dušica Vidović Institute for Data Science and Computing, University of Miami, Coral Gables, FL 33146, USA Department of Molecular and Cellular Pharmacology, Miller School of Medicine, University of Miami, Miami, FL 33136, USA
Amar Koleti Institute for Data Science and Computing, University of Miami, Coral Gables, FL 33146, USA
Stephan C Schürer Institute for Data Science and Computing, University of Miami, Coral Gables, FL 33146, USA Department of Molecular and Cellular Pharmacology, Miller School of Medicine, University of Miami, Miami, FL 33136, USA Sylvester Comprehensive Cancer Center, Miller School of Medicine, University of Miami, Miami, FL 33136, USA
Anna Waller UNM Center for Molecular Discovery, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA
Jeremy J Yang Translational Informatics Division, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA
Jayme Holmes Translational Informatics Division, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA
Giovanni Bocci Translational Informatics Division, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA
Noel Southall National Center for Advancing Translational Science, 9800 Medical Center Drive, Rockville, MD 20850, USA
Poorva Dharkar National Center for Advancing Translational Science, 9800 Medical Center Drive, Rockville, MD 20850, USA
Ewy Mathé National Center for Advancing Translational Science, 9800 Medical Center Drive, Rockville, MD 20850, USA
Anton Simeonov National Center for Advancing Translational Science, 9800 Medical Center Drive, Rockville, MD 20850, USA
Tudor I Oprea Translational Informatics Division, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen, Denmark UNM Comprehensive Cancer Center, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA Department of Rheumatology and Inflammation Research, Institute of Medicine, Sahlgrenska Academy at University of Gothenburg, 40530 Gothenburg, Sweden

Preuss F, Chatterjee D, Mathea S, Shrestha S, St-Germain J, Saha M, Kannan N, Raught B, Rottapel R, Knapp S. Nucleotide Binding, Evolutionary Insights, and Interaction Partners of the Pseudokinase Unc-51-like Kinase 4. Structure 2020;28:1184-1196.e6. [PMID: 32814032 DOI: 10.1016/j.str.2020.07.016] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 06/17/2020] [Accepted: 07/29/2020] [Indexed: 01/11/2023]

Affiliation(s)

Franziska Preuss Institute for Pharmaceutical Chemistry, Johann Wolfgang Goethe-University, Max-von-Laue-Str. 9, 60438 Frankfurt am Main, Germany; Buchmann Institute for Molecular Life Sciences, Structural Genomics Consortium, Johann Wolfgang Goethe-University, Max-von-Laue-Str. 15, 60438 Frankfurt am Main, Germany
Deep Chatterjee Institute for Pharmaceutical Chemistry, Johann Wolfgang Goethe-University, Max-von-Laue-Str. 9, 60438 Frankfurt am Main, Germany; Buchmann Institute for Molecular Life Sciences, Structural Genomics Consortium, Johann Wolfgang Goethe-University, Max-von-Laue-Str. 15, 60438 Frankfurt am Main, Germany
Sebastian Mathea Institute for Pharmaceutical Chemistry, Johann Wolfgang Goethe-University, Max-von-Laue-Str. 9, 60438 Frankfurt am Main, Germany; Buchmann Institute for Molecular Life Sciences, Structural Genomics Consortium, Johann Wolfgang Goethe-University, Max-von-Laue-Str. 15, 60438 Frankfurt am Main, Germany
Safal Shrestha Institute of Bioinformatics & Department of Biochemistry and Molecular Biology, University of Georgia, 120 Green Street, Athens, GA 30602-7229, USA
Jonathan St-Germain Princess Margaret Cancer Centre, University Health Network, Toronto M5G 2C4, Canada
Manipa Saha Princess Margaret Cancer Centre, University Health Network, Toronto M5G 2C4, Canada
Natarajan Kannan Institute of Bioinformatics & Department of Biochemistry and Molecular Biology, University of Georgia, 120 Green Street, Athens, GA 30602-7229, USA
Brian Raught Princess Margaret Cancer Centre, University Health Network, Toronto M5G 2C4, Canada
Robert Rottapel Princess Margaret Cancer Centre, University Health Network, Toronto M5G 2C4, Canada; Departments of Medicine, Immunology and Medical Biophysics, University of Toronto, Toronto M5G 1L7, Canada; Division of Rheumatology, St. Michael's Hospital, Toronto M5B 1W8, Canada
Stefan Knapp Institute for Pharmaceutical Chemistry, Johann Wolfgang Goethe-University, Max-von-Laue-Str. 9, 60438 Frankfurt am Main, Germany; Buchmann Institute for Molecular Life Sciences, Structural Genomics Consortium, Johann Wolfgang Goethe-University, Max-von-Laue-Str. 15, 60438 Frankfurt am Main, Germany; German Cancer Consortium (DKTK) and Frankfurt Cancer Institute (FCI), 60596 Frankfurt am Main, Germany

Brown SDM, Lad HV. The dark genome and pleiotropy: challenges for precision medicine. Mamm Genome 2019;30:212-216. [PMID: 31444567 PMCID: PMC6759675 DOI: 10.1007/s00335-019-09813-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]