1
|
Fazal S, Danzi MC, Xu I, Kobren SN, Sunyaev S, Reuter C, Marwaha S, Wheeler M, Dolzhenko E, Lucas F, Wuchty S, Tekin M, Züchner S, Aguiar-Pulido V. RExPRT: a machine learning tool to predict pathogenicity of tandem repeat loci. Genome Biol 2024; 25:39. [PMID: 38297326 PMCID: PMC10832122 DOI: 10.1186/s13059-024-03171-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Accepted: 01/10/2024] [Indexed: 02/02/2024] Open
Abstract
Expansions of tandem repeats (TRs) cause approximately 60 monogenic diseases. We expect that the discovery of additional pathogenic repeat expansions will narrow the diagnostic gap in many diseases. A growing number of TR expansions are being identified, and interpreting them is a challenge. We present RExPRT (Repeat EXpansion Pathogenicity pRediction Tool), a machine learning tool for distinguishing pathogenic from benign TR expansions. Our results demonstrate that an ensemble approach classifies TRs with an average precision of 93% and recall of 83%. RExPRT's high precision will be valuable in large-scale discovery studies, which require prioritization of candidate loci for follow-up studies.
Collapse
Affiliation(s)
- Sarah Fazal
- Dr. John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genetics, University of Miami Miller School of Medicine, Biomedical Research Building (BRB), Miami, FL, 33136, USA
| | - Matt C Danzi
- Dr. John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genetics, University of Miami Miller School of Medicine, Biomedical Research Building (BRB), Miami, FL, 33136, USA
| | - Isaac Xu
- Dr. John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genetics, University of Miami Miller School of Medicine, Biomedical Research Building (BRB), Miami, FL, 33136, USA
| | | | - Shamil Sunyaev
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02155, USA
| | - Chloe Reuter
- Stanford Center for Undiagnosed Diseases, Stanford University, Stanford, CA, 94305, USA
- Division of Cardiovascular Medicine, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Shruti Marwaha
- Stanford Center for Undiagnosed Diseases, Stanford University, Stanford, CA, 94305, USA
- Division of Cardiovascular Medicine, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Matthew Wheeler
- Stanford Center for Undiagnosed Diseases, Stanford University, Stanford, CA, 94305, USA
- Division of Cardiovascular Medicine, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Francesca Lucas
- Department of Computer Science, Delft University of Technology, Delft, The Netherlands
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Miami, FL, USA
- Deptartment of Biology, University of Miami, Miami, FL, USA
- Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Mustafa Tekin
- Dr. John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genetics, University of Miami Miller School of Medicine, Biomedical Research Building (BRB), Miami, FL, 33136, USA
| | - Stephan Züchner
- Dr. John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genetics, University of Miami Miller School of Medicine, Biomedical Research Building (BRB), Miami, FL, 33136, USA.
| | | |
Collapse
|
2
|
Yang X, Wuchty S, Liang Z, Ji L, Wang B, Zhu J, Zhang Z, Dong Y. Multi-modal features-based human-herpesvirus protein-protein interaction prediction by using LightGBM. Brief Bioinform 2024; 25:bbae005. [PMID: 38279649 PMCID: PMC10818167 DOI: 10.1093/bib/bbae005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 12/25/2023] [Accepted: 01/01/2021] [Indexed: 01/28/2024] Open
Abstract
The identification of human-herpesvirus protein-protein interactions (PPIs) is an essential and important entry point to understand the mechanisms of viral infection, especially in malignant tumor patients with common herpesvirus infection. While natural language processing (NLP)-based embedding techniques have emerged as powerful approaches, the application of multi-modal embedding feature fusion to predict human-herpesvirus PPIs is still limited. Here, we established a multi-modal embedding feature fusion-based LightGBM method to predict human-herpesvirus PPIs. In particular, we applied document and graph embedding approaches to represent sequence, network and function modal features of human and herpesviral proteins. Training our LightGBM models through our compiled non-rigorous and rigorous benchmarking datasets, we obtained significantly better performance compared to individual-modal features. Furthermore, our model outperformed traditional feature encodings-based machine learning methods and state-of-the-art deep learning-based methods using various benchmarking datasets. In a transfer learning step, we show that our model that was trained on human-herpesvirus PPI dataset without cytomegalovirus data can reliably predict human-cytomegalovirus PPIs, indicating that our method can comprehensively capture multi-modal fusion features of protein interactions across various herpesvirus subtypes. The implementation of our method is available at https://github.com/XiaodiYangpku/MultimodalPPI/.
Collapse
Affiliation(s)
- Xiaodi Yang
- Department of Hematology, Peking University First Hospital, Beijing, China
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Miami FL, 33146, USA
- Department of Biology, University of Miami, Miami FL, 33146, USA
- Institute of Data Science and Computation, University of Miami, Miami, FL 33146, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL 33136, USA
| | - Zeyin Liang
- Department of Hematology, Peking University First Hospital, Beijing, China
| | - Li Ji
- Department of Hematology, Peking University First Hospital, Beijing, China
| | - Bingjie Wang
- Department of Hematology, Peking University First Hospital, Beijing, China
| | - Jialin Zhu
- Department of Hematology, Peking University First Hospital, Beijing, China
| | - Ziding Zhang
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Yujun Dong
- Department of Hematology, Peking University First Hospital, Beijing, China
| |
Collapse
|
3
|
Wuchty S, White AK, Olthof AM, Drake K, Hume AJ, Olejnik J, Aguiar-Pulido V, Mühlberger E, Kanadia RN. Minor intron-containing genes as an ancient backbone for viral infection? PNAS Nexus 2024; 3:pgad479. [PMID: 38274120 PMCID: PMC10810330 DOI: 10.1093/pnasnexus/pgad479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 12/15/2023] [Indexed: 01/27/2024]
Abstract
Minor intron-containing genes (MIGs) account for <2% of all human protein-coding genes and are uniquely dependent on the minor spliceosome for proper excision. Despite their low numbers, we surprisingly found a significant enrichment of MIG-encoded proteins (MIG-Ps) in protein-protein interactomes and host factors of positive-sense RNA viruses, including SARS-CoV-1, SARS-CoV-2, MERS coronavirus, and Zika virus. Similarly, we observed a significant enrichment of MIG-Ps in the interactomes and sets of host factors of negative-sense RNA viruses such as Ebola virus, influenza A virus, and the retrovirus HIV-1. We also found an enrichment of MIG-Ps in double-stranded DNA viruses such as Epstein-Barr virus, human papillomavirus, and herpes simplex viruses. In general, MIG-Ps were highly connected and placed in central positions in a network of human-host protein interactions. Moreover, MIG-Ps that interact with viral proteins were enriched with essential genes. We also provide evidence that viral proteins interact with ancestral MIGs that date back to unicellular organisms and are mainly involved in basic cellular functions such as cell cycle, cell division, and signal transduction. Our results suggest that MIG-Ps form a stable, evolutionarily conserved backbone that viruses putatively tap to invade and propagate in human host cells.
Collapse
Affiliation(s)
- Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL 33146, USA
- Department of Biology, University of Miami, Coral Gables, FL 33146, USA
- Institute of Data Science and Computing, University of Miami, Coral Gables, FL 33146, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL 33134, USA
| | - Alisa K White
- Physiology and Neurobiology Department, University of Connecticut, Storrs, CT 06269, USA
| | - Anouk M Olthof
- Physiology and Neurobiology Department, University of Connecticut, Storrs, CT 06269, USA
| | - Kyle Drake
- Physiology and Neurobiology Department, University of Connecticut, Storrs, CT 06269, USA
| | - Adam J Hume
- Department of Virology, Immunology and Microbiology, Boston University Chobanian and Avedisian School of Medicine, Boston, MA 02118, USA
- National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA 02118, USA
- Center for Emerging Infectious Diseases Policy and Research, Boston University, Boston, MA 02118, USA
| | - Judith Olejnik
- Department of Virology, Immunology and Microbiology, Boston University Chobanian and Avedisian School of Medicine, Boston, MA 02118, USA
- National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA 02118, USA
| | | | - Elke Mühlberger
- Department of Virology, Immunology and Microbiology, Boston University Chobanian and Avedisian School of Medicine, Boston, MA 02118, USA
- National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA 02118, USA
| | - Rahul N Kanadia
- Physiology and Neurobiology Department, University of Connecticut, Storrs, CT 06269, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269, USA
| |
Collapse
|
4
|
Wolcott KA, Stanley EL, Gutierrez OA, Wuchty S, Whitlock BA. 3D pollination biology using micro-computed tomography and geometric morphometrics in Theobroma cacao. Appl Plant Sci 2023; 11:e11549. [PMID: 37915432 PMCID: PMC10617321 DOI: 10.1002/aps3.11549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Revised: 06/05/2023] [Accepted: 06/13/2023] [Indexed: 11/03/2023]
Abstract
Premise Imaging technologies that capture three-dimensional (3D) variation in floral morphology at micro- and nano-resolutions are increasingly accessible. In herkogamous flowers, such as those of Theobroma cacao, structural barriers between anthers and stigmas represent bottlenecks that restrict pollinator size and access to reproductive organs. To study the unresolved pollination biology of cacao, we present a novel application of micro-computed tomography (micro-CT) using floral dimensions to quantify pollinator functional size limits. Methods We generated micro-CT data sets from field-collected flowers and museum specimens of potential pollinators. To compare floral variation, we used 3D Slicer to place landmarks on the surface models and performed a geometric morphometric (GMM) analysis using geomorph R. We identified the petal side door (an opening between the petal hoods and filament) as the main bottleneck for pollinator access. We compared its mean dimensions with proposed pollinators to identify viable candidates. Results We identified three levels of likelihood for putative pollinators based on the number of morphological (body) dimensions that fit through the petal side door. We also found floral reward microstructures whose presence and location were previously unclear. Discussion Using micro-CT and GMM to study the 3D pollination biology of cacao provides new evidence for predicting unknown pollinators. Incorporating geometry and floral rewards will strengthen plant-pollinator trait matching models for cacao and other species.
Collapse
Affiliation(s)
| | - Edward L. Stanley
- Department of Natural HistoryFlorida Museum of Natural HistoryGainesvilleFloridaUSA
| | - Osman A. Gutierrez
- Subtropical Horticultural Research StationUnited States Department of Agriculture–Agricultural Research Service (USDA‐ARS)MiamiFlorida33158USA
| | - Stefan Wuchty
- Department of BiologyUniversity of MiamiCoral GablesFlorida33124USA
- Department of Computer ScienceUniversity of MiamiCoral GablesFlorida33146USA
- Institute of Data Science and ComputingUniversity of MiamiCoral GablesFlorida33146USA
- Sylvester Comprehensive Cancer CenterUniversity of MiamiMiamiFlorida33136USA
| | | |
Collapse
|
5
|
Zhai R, Ruan K, Perez GF, Kubat M, Liu J, Hofacker I, Wuchty S. MicroRNA-Mediated Obstruction of Stem-loop Alternative Splicing (MIMOSAS): a global mechanism for the regulation of alternative splicing. Res Sq 2023:rs.3.rs-2977025. [PMID: 37546804 PMCID: PMC10402249 DOI: 10.21203/rs.3.rs-2977025/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
While RNA secondary structures are critical to regulate alternative splicing of long-range pre-mRNA, the factors that modulate RNA structure and interfere with the recognition of the splice sites are largely unknown. Previously, we identified a small, non-coding microRNA that sufficiently affects stable stem structure formation of Nmnat pre-mRNA to regulate the outcomes of alternative splicing. However, the fundamental question remains whether such microRNA-mediated interference with RNA secondary structures is a global molecular mechanism for regulating mRNA splicing. We designed and refined a bioinformatic pipeline to predict candidate microRNAs that potentially interfere with pre-mRNA stem-loop structures, and experimentally verified splicing predictions of three different long-range pre-mRNAs in the Drosophila model system. Specifically, we observed that microRNAs can either disrupt or stabilize stem-loop structures to influence splicing outcomes. Our study suggests that MicroRNA-Mediated Obstruction of Stem-loop Alternative Splicing (MIMOSAS) is a novel regulatory mechanism for the transcriptome-wide regulation of alternative splicing, increases the repertoire of microRNA function and further indicates cellular complexity of post-transcriptional regulation.
Collapse
Affiliation(s)
| | - Kai Ruan
- University of Miami, Miller School of Medicine
| | | | | | - Jiaqi Liu
- University of Miami Miller School of Medicine
| | | | | |
Collapse
|
6
|
Ruan K, Perez GF, Liu J, Kubat M, Hofacker I, Wuchty S, Zhai RG. MicroRNA-Mediated Obstruction of Stem-loop Alternative Splicing (MIMOSAS): a global mechanism for the regulation of alternative splicing. bioRxiv 2023:2023.04.14.536877. [PMID: 37425843 PMCID: PMC10327045 DOI: 10.1101/2023.04.14.536877] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
While RNA secondary structures are critical to regulate alternative splicing of long-range pre-mRNA, the factors that modulate RNA structure and interfere with the recognition of the splice sites are largely unknown. Previously, we identified a small, non-coding microRNA that sufficiently affects stable stem structure formation of Nmnat pre-mRNA to regulate the outcomes of alternative splicing. However, the fundamental question remains whether such microRNA-mediated interference with RNA secondary structures is a global molecular mechanism for regulating mRNA splicing. We designed and refined a bioinformatic pipeline to predict candidate microRNAs that potentially interfere with pre-mRNA stem-loop structures, and experimentally verified splicing predictions of three different long-range pre-mRNAs in the Drosophila model system. Specifically, we observed that microRNAs can either disrupt or stabilize stem-loop structures to influence splicing outcomes. Our study suggests that MicroRNA-Mediated Obstruction of Stem-loop Alternative Splicing (MIMOSAS) is a novel regulatory mechanism for the transcriptome-wide regulation of alternative splicing, increases the repertoire of microRNA function and further indicates cellular complexity of post-transcriptional regulation. One-Sentence Summary MicroRNA-Mediated Obstruction of Stem-loop Alternative Splicing (MIMOSAS) is a novel regulatory mechanism for the transcriptome-wide regulation of alternative splicing.
Collapse
|
7
|
Augspach A, Drake KD, Roma L, Qian E, Lee SR, Clarke D, Kumar S, Jaquet M, Gallon J, Bolis M, Triscott J, Galván JA, Chen Y, Thalmann GN, Kruithof-de Julio M, Theurillat JPP, Wuchty S, Gerstein M, Piscuoglio S, Kanadia RN, Rubin MA. Minor intron splicing is critical for survival of lethal prostate cancer. Mol Cell 2023; 83:1983-2002.e11. [PMID: 37295433 PMCID: PMC10637423 DOI: 10.1016/j.molcel.2023.05.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Revised: 03/29/2023] [Accepted: 05/12/2023] [Indexed: 06/12/2023]
Abstract
The evolutionarily conserved minor spliceosome (MiS) is required for protein expression of ∼714 minor intron-containing genes (MIGs) crucial for cell-cycle regulation, DNA repair, and MAP-kinase signaling. We explored the role of MIGs and MiS in cancer, taking prostate cancer (PCa) as an exemplar. Both androgen receptor signaling and elevated levels of U6atac, a MiS small nuclear RNA, regulate MiS activity, which is highest in advanced metastatic PCa. siU6atac-mediated MiS inhibition in PCa in vitro model systems resulted in aberrant minor intron splicing leading to cell-cycle G1 arrest. Small interfering RNA knocking down U6atac was ∼50% more efficient in lowering tumor burden in models of advanced therapy-resistant PCa compared with standard antiandrogen therapy. In lethal PCa, siU6atac disrupted the splicing of a crucial lineage dependency factor, the RE1-silencing factor (REST). Taken together, we have nominated MiS as a vulnerability for lethal PCa and potentially other cancers.
Collapse
Affiliation(s)
- Anke Augspach
- Department for BioMedical Research, University of Bern, 3008 Bern, Switzerland
| | - Kyle D Drake
- Physiology and Neurobiology Department, University of Connecticut, Storrs, CT 06269, USA
| | - Luca Roma
- Institute of Pathology and Medical Genetics, University Hospital Basel, 4056 Basel, Switzerland
| | - Ellen Qian
- Department of Computer Science, Yale University, New Haven, CT 06520, USA; Yale College, New Haven, CT 06520, USA
| | - Se Ri Lee
- Department of Computer Science, Yale University, New Haven, CT 06520, USA; Yale College, New Haven, CT 06520, USA
| | - Declan Clarke
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | - Sushant Kumar
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | - Muriel Jaquet
- Department for BioMedical Research, University of Bern, 3008 Bern, Switzerland
| | - John Gallon
- Institute of Pathology and Medical Genetics, University Hospital Basel, 4056 Basel, Switzerland
| | - Marco Bolis
- Institute of Oncology Research, 6500 Bellinzona, Switzerland; Computational Oncology Unit, Department of Oncology, Istituto di Ricerche Farmacologiche "Mario Negri" IRCCS, 20156 Milano, Italy
| | - Joanna Triscott
- Department for BioMedical Research, University of Bern, 3008 Bern, Switzerland
| | - José A Galván
- Institute of Pathology, University of Bern, Bern 3008, Switzerland
| | - Yu Chen
- Human Oncology and Pathogenesis Program (HOPP), Memorial Sloan Kettering, New York, NY 10065, USA; Department of Medicine, Memorial Sloan Kettering, New York, NY 10065, USA; Department of Medicine, Weill Cornell Medicine, New York, NY 10065, USA
| | - George N Thalmann
- Department for BioMedical Research, University of Bern, 3008 Bern, Switzerland; Department of Urology, Inselspital, Bern University Hospital, 3008 Bern, Switzerland
| | - Marianna Kruithof-de Julio
- Department for BioMedical Research, University of Bern, 3008 Bern, Switzerland; Department of Urology, Inselspital, Bern University Hospital, 3008 Bern, Switzerland; Bern Center for Precision Medicine, University of Bern and Inselspital, 3008 Bern, Switzerland
| | - Jean-Philippe P Theurillat
- Institute of Oncology Research, 6500 Bellinzona, Switzerland; Faculty of Biomedical Sciences, Università della Svizzera italiana, 6900 Lugano, Switzerland
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL 33146, USA; Sylvester Comprehensive Cancer Center, University of Miami, Coral Gables, FL 33136, USA; Department of Biology, University of Miami, Coral Gables, FL 33146, USA
| | - Mark Gerstein
- Department of Computer Science, Yale University, New Haven, CT 06520, USA; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | - Salvatore Piscuoglio
- Institute of Pathology and Medical Genetics, University Hospital Basel, 4056 Basel, Switzerland; Department of Biomedicine, University of Basel, 4031 Basel, Switzerland
| | - Rahul N Kanadia
- Physiology and Neurobiology Department, University of Connecticut, Storrs, CT 06269, USA; Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA.
| | - Mark A Rubin
- Department for BioMedical Research, University of Bern, 3008 Bern, Switzerland; Bern Center for Precision Medicine, University of Bern and Inselspital, 3008 Bern, Switzerland.
| |
Collapse
|
8
|
Enders AM, Diekman A, Klofstad C, Murthi M, Verdear D, Wuchty S, Uscinski J. On modeling the correlates of conspiracy thinking. Sci Rep 2023; 13:8325. [PMID: 37221359 DOI: 10.1038/s41598-023-34391-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 04/28/2023] [Indexed: 05/25/2023] Open
Abstract
While a robust literature on the psychology of conspiracy theories has identified dozens of characteristics correlated with conspiracy theory beliefs, much less attention has been paid to understanding the generalized predisposition towards interpreting events and circumstances as the product of supposed conspiracies. Using a unique national survey of 2015 U.S. adults from October 2020, we investigate the relationship between this predisposition-conspiracy thinking-and 34 different psychological, political, and social correlates. Using conditional inference tree modeling-a machine learning-based approach designed to facilitate prediction using a flexible modeling methodology-we identify the characteristics that are most useful for orienting individuals along the conspiracy thinking continuum, including (but not limited to): anomie, Manicheanism, support for political violence, a tendency to share false information online, populism, narcissism, and psychopathy. Altogether, psychological characteristics are much more useful in predicting conspiracy thinking than are political and social characteristics, though even our robust set of correlates only partially accounts for variance in conspiracy thinking.
Collapse
Affiliation(s)
- Adam M Enders
- Department of Political Science, University of Louisville, Louisville, KY, 40292, USA
| | - Amanda Diekman
- Department of Psychology, Indiana University, Bloomington, IN, 47405, USA
| | - Casey Klofstad
- Department of Political Science, University of Miami, Coral Gables, FL, 33146, USA
| | - Manohar Murthi
- Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, 33146, USA
| | - Daniel Verdear
- Department of Computer Science, University of Miami, Coral Gables, FL, 33146, USA
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL, 33146, USA
- Department of Biology, University of Miami, Coral Gables, FL, 33146, USA
- Institute of Data Science and Computing, University of Miami, Coral Gables, FL, 33146, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, 33136, USA
| | - Joseph Uscinski
- Department of Political Science, University of Miami, Coral Gables, FL, 33146, USA.
| |
Collapse
|
9
|
Zheng J, Yang X, Huang Y, Yang S, Wuchty S, Zhang Z. Deep learning-assisted prediction of protein-protein interactions in Arabidopsis thaliana. Plant J 2023; 114:984-994. [PMID: 36919205 DOI: 10.1111/tpj.16188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 02/20/2023] [Accepted: 03/09/2023] [Indexed: 05/27/2023]
Abstract
Currently, the experimentally identified interactome of Arabidopsis (Arabidopsis thaliana) is still far from complete, suggesting that computational prediction methods can complement experimental techniques. Motivated by the prosperity and success of deep learning algorithms and natural language processing techniques, we introduce an integrative deep learning framework, DeepAraPPI, allowing us to predict protein-protein interactions (PPIs) of Arabidopsis utilizing sequence, domain and Gene Ontology (GO) information. Our current DeepAraPPI comprises: (i) a word2vec encoding-based Siamese recurrent convolutional neural network (RCNN) model; (ii) a Domain2vec encoding-based multiple-layer perceptron (MLP) model; and (iii) a GO2vec encoding-based MLP model. Finally, DeepAraPPI combines the prediction results of the three individual predictors through a logistic regression model. Compiling high-quality positive and negative training and test samples by applying strict filtering strategies, DeepAraPPI shows superior performance compared with existing state-of-the-art Arabidopsis PPI prediction methods. DeepAraPPI also provides better cross-species predictive ability in rice (Oryza sativa) than traditional machine learning methods, although the overall performance in cross-species prediction remains to be improved. DeepAraPPI is freely accessible at http://zzdlab.com/deeparappi/. In the meantime, we have also made the source code and data sets of DeepAraPPI available at https://github.com/zjy1125/DeepAraPPI.
Collapse
Affiliation(s)
- Jingyan Zheng
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Xiaodi Yang
- Department of Hematology, Peking University First Hospital, Beijing, 100034, China
| | - Yan Huang
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Shiping Yang
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Miami, FL, 33146, USA
- Department of Biology, University of Miami, Miami, FL, 33146, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, 33136, USA
- Institute of Data Science and Computing, University of Miami, Miami, FL, 33146, USA
| | - Ziding Zhang
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| |
Collapse
|
10
|
Huang Y, Wuchty S, Zhou Y, Zhang Z. SGPPI: structure-aware prediction of protein-protein interactions in rigorous conditions with graph convolutional network. Brief Bioinform 2023; 24:6995378. [PMID: 36682013 DOI: 10.1093/bib/bbad020] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 11/17/2022] [Accepted: 01/05/2023] [Indexed: 01/23/2023] Open
Abstract
While deep learning (DL)-based models have emerged as powerful approaches to predict protein-protein interactions (PPIs), the reliance on explicit similarity measures (e.g. sequence similarity and network neighborhood) to known interacting proteins makes these methods ineffective in dealing with novel proteins. The advent of AlphaFold2 presents a significant opportunity and also a challenge to predict PPIs in a straightforward way based on monomer structures while controlling bias from protein sequences. In this work, we established Structure and Graph-based Predictions of Protein Interactions (SGPPI), a structure-based DL framework for predicting PPIs, using the graph convolutional network. In particular, SGPPI focused on protein patches on the protein-protein binding interfaces and extracted the structural, geometric and evolutionary features from the residue contact map to predict PPIs. We demonstrated that our model outperforms traditional machine learning methods and state-of-the-art DL-based methods using non-representation-bias benchmark datasets. Moreover, our model trained on human dataset can be reliably transferred to predict yeast PPIs, indicating that SGPPI can capture converging structural features of protein interactions across various species. The implementation of SGPPI is available at https://github.com/emerson106/SGPPI.
Collapse
Affiliation(s)
- Yan Huang
- State Key Laboratory of Livestock and Poultry Biotechnology Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China
- Department of Biomedical Informatics, Ministry of Education Key Laboratory of Molecular Cardiovascular Sciences, Center for Non-Coding RNA Medicine, School of Basic Medical Sciences, Peking University, Beijing 100191, China
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL 33146, USA
- Department of Biology, University of Miami, Coral Gables, FL 33146, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL 33136, USA
- Institute of Data Science and Computing, University of Miami, Coral Gables, FL 33146, USA
| | - Yuan Zhou
- Department of Biomedical Informatics, Ministry of Education Key Laboratory of Molecular Cardiovascular Sciences, Center for Non-Coding RNA Medicine, School of Basic Medical Sciences, Peking University, Beijing 100191, China
| | - Ziding Zhang
- State Key Laboratory of Livestock and Poultry Biotechnology Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| |
Collapse
|
11
|
Paul I, Bolzan D, Youssef A, Gagnon KA, Hook H, Karemore G, Oliphant MUJ, Lin W, Liu Q, Phanse S, White C, Padhorny D, Kotelnikov S, Chen CS, Hu P, Denis GV, Kozakov D, Raught B, Siggers T, Wuchty S, Muthuswamy SK, Emili A. Parallelized multidimensional analytic framework applied to mammary epithelial cells uncovers regulatory principles in EMT. Nat Commun 2023; 14:688. [PMID: 36755019 PMCID: PMC9908882 DOI: 10.1038/s41467-023-36122-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 01/17/2023] [Indexed: 02/10/2023] Open
Abstract
A proper understanding of disease etiology will require longitudinal systems-scale reconstruction of the multitiered architecture of eukaryotic signaling. Here we combine state-of-the-art data acquisition platforms and bioinformatics tools to devise PAMAF, a workflow that simultaneously examines twelve omics modalities, i.e., protein abundance from whole-cells, nucleus, exosomes, secretome and membrane; N-glycosylation, phosphorylation; metabolites; mRNA, miRNA; and, in parallel, single-cell transcriptomes. We apply PAMAF in an established in vitro model of TGFβ-induced epithelial to mesenchymal transition (EMT) to quantify >61,000 molecules from 12 omics and 10 timepoints over 12 days. Bioinformatics analysis of this EMT-ExMap resource allowed us to identify; -topological coupling between omics, -four distinct cell states during EMT, -omics-specific kinetic paths, -stage-specific multi-omics characteristics, -distinct regulatory classes of genes, -ligand-receptor mediated intercellular crosstalk by integrating scRNAseq and subcellular proteomics, and -combinatorial drug targets (e.g., Hedgehog signaling and CAMK-II) to inhibit EMT, which we validate using a 3D mammary duct-on-a-chip platform. Overall, this study provides a resource on TGFβ signaling and EMT.
Collapse
Affiliation(s)
- Indranil Paul
- Department of Biochemistry, Boston University School of Medicine, Boston University, 71 East Concord Street, Boston, MA, 02118, USA
| | - Dante Bolzan
- Department of Computer Science, University of Miami, 1356 Memorial Drive, Coral Gables, FL, 33146, USA
| | - Ahmed Youssef
- Graduate Program in Bioinformatics, Boston University, 24 Cummington Mall, Boston, MA, 02215, USA
| | - Keith A Gagnon
- Department of Biomedical Engineering, Boston University, 44 Cummington Mall, Boston, MA, 02215, USA
| | - Heather Hook
- Department of Biology, Boston University, 24 Cummington Mall, Boston, MA, 02115, USA
- Biological Design Center, Boston University, 610 Commonwealth Avenue, Boston, MA, 02215, USA
| | - Gopal Karemore
- Advanced Analytics, Novo Nordisk A/S, 2760, Måløv, Denmark
| | - Michael U J Oliphant
- Cancer Research Institute, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, MA, 02115, USA
| | - Weiwei Lin
- Department of Biochemistry, Boston University School of Medicine, Boston University, 71 East Concord Street, Boston, MA, 02118, USA
| | - Qian Liu
- Department of Biochemistry and Medical Genetics, University of Manitoba, Winnipeg, Manitoba, R3E 0J9, Canada
| | - Sadhna Phanse
- Department of Biochemistry, Boston University School of Medicine, Boston University, 71 East Concord Street, Boston, MA, 02118, USA
| | - Carl White
- Department of Biochemistry, Boston University School of Medicine, Boston University, 71 East Concord Street, Boston, MA, 02118, USA
| | - Dzmitry Padhorny
- Department of Applied Mathematics and Statistics, Stony Brook University, 11794, Stony Brook, NY, USA
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, 11794, USA
| | - Sergei Kotelnikov
- Department of Applied Mathematics and Statistics, Stony Brook University, 11794, Stony Brook, NY, USA
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, 11794, USA
| | - Christopher S Chen
- Department of Biomedical Engineering, Boston University, 44 Cummington Mall, Boston, MA, 02215, USA
- Wyss Institute for Biologically Inspired Engineering, Harvard University, 3 Blackfan Circle, Boston, MA, 02115, USA
| | - Pingzhao Hu
- Department of Biochemistry, Western University, London, ON, N6A 5C1, Canada
| | - Gerald V Denis
- Boston Medical Center Cancer Center, Boston University, Boston University, 72 East Concord Street, Boston, MA, 02118, USA
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook University, 11794, Stony Brook, NY, USA
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, 11794, USA
| | - Brian Raught
- Discovery Tower (TMDT), 101 College St, Rm. 9-701A, University of Toronto, Toronto, ON, M5G 1L7, Canada
| | - Trevor Siggers
- Department of Biology, Boston University, 24 Cummington Mall, Boston, MA, 02115, USA
- Biological Design Center, Boston University, 610 Commonwealth Avenue, Boston, MA, 02215, USA
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, 1356 Memorial Drive, Coral Gables, FL, 33146, USA
| | - Senthil K Muthuswamy
- Laboratory of Cancer Biology and Genetics, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD, USA
| | - Andrew Emili
- Department of Biochemistry, Boston University School of Medicine, Boston University, 71 East Concord Street, Boston, MA, 02118, USA.
- Department of Biology, Charles River Campus, Boston University, Life Science & Engineering (LSEB-602), 24 Cummington Mall, Boston, MA, 02215, USA.
- Division of Oncological Sciences, Knight Cancer Institute, Oregon Health and Science University, Portland, USA.
| |
Collapse
|
12
|
Uscinski J, Enders A, Diekman A, Funchion J, Klofstad C, Kuebler S, Murthi M, Premaratne K, Seelig M, Verdear D, Wuchty S. The psychological and political correlates of conspiracy theory beliefs. Sci Rep 2022; 12:21672. [PMID: 36522383 PMCID: PMC9751515 DOI: 10.1038/s41598-022-25617-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Accepted: 12/01/2022] [Indexed: 12/23/2022] Open
Abstract
Understanding the individual-level characteristics associated with conspiracy theory beliefs is vital to addressing and combatting those beliefs. While researchers have identified numerous psychological and political characteristics associated with conspiracy theory beliefs, the generalizability of those findings is uncertain because they are typically drawn from studies of only a few conspiracy theories. Here, we employ a national survey of 2021 U.S. adults that asks about 15 psychological and political characteristics as well as beliefs in 39 different conspiracy theories. Across 585 relationships examined within both bivariate (correlations) and multivariate (regression) frameworks, we find that psychological traits (e.g., dark triad) and non-partisan/ideological political worldviews (e.g., populism, support for violence) are most strongly related to individual conspiracy theory beliefs, regardless of the belief under consideration, while other previously identified correlates (e.g., partisanship, ideological extremity) are inconsistently related. We also find that the correlates of specific conspiracy theory beliefs mirror those of conspiracy thinking (the predisposition), indicating that this predisposition operates like an 'average' of individual conspiracy theory beliefs. Overall, our findings detail the psychological and political traits of the individuals most drawn to conspiracy theories and have important implications for scholars and practitioners seeking to prevent or reduce the impact of conspiracy theories.
Collapse
Affiliation(s)
- Joseph Uscinski
- Department of Political Science, University of Miami, 1300 Campo Sano Blvd., Coral Gables, FL, 33146, USA.
| | - Adam Enders
- Department of Political Science, University of Louisville, Louisville, KY, 40292, USA
| | - Amanda Diekman
- Department of Psychology, Indiana University, Bloomington, IN, 47405, USA
| | - John Funchion
- Department of English, University of Miami, Coral Gables, FL, 33146, USA
| | - Casey Klofstad
- Department of Political Science, University of Miami, 1300 Campo Sano Blvd., Coral Gables, FL, 33146, USA
| | - Sandra Kuebler
- Department of Linguistics, Indiana University, Bloomington, IN, 47405, USA
| | - Manohar Murthi
- Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, 33146, USA
| | - Kamal Premaratne
- Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, 33146, USA
| | - Michelle Seelig
- Department of Cinema and Interactive Media, University of Miami, Coral Gables, FL, 33146, USA
| | - Daniel Verdear
- Department of Computer Science, University of Miami, Coral Gables, FL, 33146, USA
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL, 33146, USA
| |
Collapse
|
13
|
Yang X, Yang S, Ren P, Wuchty S, Zhang Z. Deep Learning-Powered Prediction of Human-Virus Protein-Protein Interactions. Front Microbiol 2022; 13:842976. [PMID: 35495666 PMCID: PMC9051481 DOI: 10.3389/fmicb.2022.842976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Accepted: 03/25/2022] [Indexed: 11/13/2022] Open
Abstract
Identifying human-virus protein-protein interactions (PPIs) is an essential step for understanding viral infection mechanisms and antiviral response of the human host. Recent advances in high-throughput experimental techniques enable the significant accumulation of human-virus PPI data, which have further fueled the development of machine learning-based human-virus PPI prediction methods. Emerging as a very promising method to predict human-virus PPIs, deep learning shows the powerful ability to integrate large-scale datasets, learn complex sequence-structure relationships of proteins and convert the learned patterns into final prediction models with high accuracy. Focusing on the recent progresses of deep learning-powered human-virus PPI predictions, we review technical details of these newly developed methods, including dataset preparation, deep learning architectures, feature engineering, and performance assessment. Moreover, we discuss the current challenges and potential solutions and provide future perspectives of human-virus PPI prediction in the coming post-AlphaFold2 era.
Collapse
Affiliation(s)
- Xiaodi Yang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Shiping Yang
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Panyu Ren
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Miami, FL, United States
- Department of Biology, University of Miami, Miami, FL, United States
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, United States
| | - Ziding Zhang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
- *Correspondence: Ziding Zhang,
| |
Collapse
|
14
|
Devkota P, Wuchty S. Promoter/enhancer-based controllability of regulatory networks. Sci Rep 2022; 12:3528. [PMID: 35241702 PMCID: PMC8894475 DOI: 10.1038/s41598-022-07035-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Accepted: 01/19/2022] [Indexed: 11/21/2022] Open
Abstract
Understanding the mechanisms of tissue-specific transcriptional regulation is crucial as mis-regulation can cause a broad range of diseases. Here, we investigated transcription factors (TF) that are indispensable for the topological control of tissue specific and cell-type specific regulatory networks as a function of their binding to regulatory elements on promoters and enhancers of corresponding target genes. In particular, we found that promoter-binding TFs that were indispensable for regulatory network control regulate genes that are tissue-specifically expressed and overexpressed in corresponding cancer types. In turn, indispensable, enhancer-binding TFs were enriched with disease and signaling genes as they control an increasing number of cell-type specific regulatory networks. Their target genes were cell-type specific for blood and immune-related cell-types and over-expressed in blood-related cancers. Notably, target genes of indispensable enhancer-binding TFs in cell-type specific regulatory networks were enriched with cancer drug targets, while target genes of indispensable promoter-binding TFs were bona-fide targets of cancer drugs in corresponding tissues. Our results emphasize the significant role control analysis of regulatory networks plays in our understanding of transcriptional regulation, demonstrating potential therapeutic implications in tissue-specific drug discovery research.
Collapse
Affiliation(s)
- Prajwal Devkota
- Department of Computer Science, University of Miami, Miami, FL, 33146, USA.,Scipher Medicine Inc, Waltham, MA, 02453, USA
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Miami, FL, 33146, USA. .,Department of Biology, University of Miami, Miami, FL, 33146, USA. .,Sylvester Comprehensive Cancer Center, Univ. of Miami, Miami, FL, 33136, USA.
| |
Collapse
|
15
|
Lori A, Schultebraucks K, Galatzer-Levy I, Daskalakis NP, Katrinli S, Smith AK, Myers AJ, Richholt R, Huentelman M, Guffanti G, Wuchty S, Gould F, Harvey PD, Nemeroff CB, Jovanovic T, Gerasimov ES, Maples-Keller JL, Stevens JS, Michopoulos V, Rothbaum BO, Wingo AP, Ressler KJ. Transcriptome-wide association study of post-trauma symptom trajectories identified GRIN3B as a potential biomarker for PTSD development. Neuropsychopharmacology 2021; 46:1811-1820. [PMID: 34188182 PMCID: PMC8357796 DOI: 10.1038/s41386-021-01073-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Revised: 04/26/2021] [Accepted: 05/24/2021] [Indexed: 12/11/2022]
Abstract
Biomarkers that predict symptom trajectories after trauma can facilitate early detection or intervention for posttraumatic stress disorder (PTSD) and may also advance our understanding of its biology. Here, we aimed to identify trajectory-based biomarkers using blood transcriptomes collected in the immediate aftermath of trauma exposure. Participants were recruited from an Emergency Department in the immediate aftermath of trauma exposure and assessed for PTSD symptoms at baseline, 1, 3, 6, and 12 months. Three empirical symptom trajectories (chronic-PTSD, remitting, and resilient) were identified in 377 individuals based on longitudinal symptoms across four data points (1, 3, 6, and 12 months), using latent growth mixture modeling. Blood transcriptomes were examined for association with longitudinal symptom trajectories, followed by expression quantitative trait locus analysis. GRIN3B and AMOTL1 blood mRNA levels were associated with chronic vs. resilient post-trauma symptom trajectories at a transcriptome-wide significant level (N = 153, FDR-corrected p value = 0.0063 and 0.0253, respectively). We identified four genetic variants that regulate mRNA blood expression levels of GRIN3B. Among these, GRIN3B rs10401454 was associated with PTSD in an independent dataset (N = 3521, p = 0.04). Examination of the BrainCloud and GTEx databases revealed that rs10401454 was associated with brain mRNA expression levels of GRIN3B. While further replication and validation studies are needed, our data suggest that GRIN3B, a glutamate ionotropic receptor NMDA type subunit-3B, may be involved in the manifestation of PTSD. In addition, the blood mRNA level of GRIN3B may be a promising early biomarker for the PTSD manifestation and development.
Collapse
Affiliation(s)
- Adriana Lori
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, USA
| | - Katharina Schultebraucks
- Department of Emergency Medicine, Columbia University Medical Center, New York, NY, USA
- Data Science Institute, Columbia University, New York, NY, USA
| | - Isaac Galatzer-Levy
- Department of Psychiatry, New York University School of Medicine, New York, NY, USA
| | - Nikolaos P Daskalakis
- Department of Psychiatry, Harvard Medical School and McLean Hospital, Belmont, MA, USA
| | - Seyma Katrinli
- Department of Gynecology and Obstetrics, Emory University, Atlanta, GA, USA
| | - Alicia K Smith
- Department of Gynecology and Obstetrics, Emory University, Atlanta, GA, USA
| | - Amanda J Myers
- Department of Psychiatry and Behavioral Sciences, University of Miami, Miller School of Medicine, Miami, FL, USA
| | - Ryan Richholt
- Neurogenomics Division and Center for Rare Childhood Disorders, Translational Genomics Research Institute, Phoenix, AZ, USA
| | - Matthew Huentelman
- Neurogenomics Division and Center for Rare Childhood Disorders, Translational Genomics Research Institute, Phoenix, AZ, USA
| | - Guia Guffanti
- Department of Psychiatry, Harvard Medical School and McLean Hospital, Belmont, MA, USA
| | - Stefan Wuchty
- Department of Biology, University of Miami, Coral Gables, FL, USA
- Department of Computer Science, University of Miami, Coral Gables, FL, USA
- Institute of Data Science and Computing, University of Miami, Coral Gables, FL, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| | - Felicia Gould
- Department of Psychiatry and Behavioral Sciences, University of Miami, Miller School of Medicine, Miami, FL, USA
| | - Philip D Harvey
- Department of Psychiatry and Behavioral Sciences, University of Miami, Miller School of Medicine, Miami, FL, USA
| | | | - Tanja Jovanovic
- Department of Psychiatry and Behavioral Neurosciences, Wayne State University, Detroit, MI, USA
| | | | | | - Jennifer S Stevens
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, USA
| | - Vasiliki Michopoulos
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, USA
| | - Barbara O Rothbaum
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, USA
| | - Aliza P Wingo
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, USA.
- Division of Mental Health, Atlanta VA Medical Center, Decatur, GA, USA.
| | - Kerry J Ressler
- Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, USA.
- Department of Psychiatry, Harvard Medical School and McLean Hospital, Belmont, MA, USA.
| |
Collapse
|
16
|
Yang X, Yang S, Lian X, Wuchty S, Zhang Z. Transfer learning via multi-scale convolutional neural layers for human-virus protein-protein interaction prediction. Bioinformatics 2021; 37:4771-4778. [PMID: 34273146 PMCID: PMC8406877 DOI: 10.1093/bioinformatics/btab533] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 06/03/2021] [Accepted: 07/16/2021] [Indexed: 11/20/2022] Open
Abstract
Motivation To complement experimental efforts, machine learning-based computational methods are playing an increasingly important role to predict human–virus protein–protein interactions (PPIs). Furthermore, transfer learning can effectively apply prior knowledge obtained from a large source dataset/task to a small target dataset/task, improving prediction performance. Results To predict interactions between human and viral proteins, we combine evolutionary sequence profile features with a Siamese convolutional neural network (CNN) architecture and a multi-layer perceptron. Our architecture outperforms various feature encodings-based machine learning and state-of-the-art prediction methods. As our main contribution, we introduce two transfer learning methods (i.e. ‘frozen’ type and ‘fine-tuning’ type) that reliably predict interactions in a target human–virus domain based on training in a source human–virus domain, by retraining CNN layers. Finally, we utilize the ‘frozen’ type transfer learning approach to predict human–SARS-CoV-2 PPIs, indicating that our predictions are topologically and functionally similar to experimentally known interactions. Availability and implementation: The source codes and datasets are available at https://github.com/XiaodiYangCAU/TransPPI/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Xiaodi Yang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Shiping Yang
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Xianyi Lian
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Stefan Wuchty
- Dept. of Computer Science, University of Miami, Miami, FL 33146, USA.,Dept. of Biology, University of Miami, Miami, FL 33146, USA.,Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL 33136, USA
| | - Ziding Zhang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| |
Collapse
|
17
|
Enders AM, Uscinski JE, Seelig MI, Klofstad CA, Wuchty S, Funchion JR, Murthi MN, Premaratne K, Stoler J. The Relationship Between Social Media Use and Beliefs in Conspiracy Theories and Misinformation. Polit Behav 2021; 45:781-804. [PMID: 34248238 PMCID: PMC8262430 DOI: 10.1007/s11109-021-09734-6] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 06/30/2021] [Indexed: 05/29/2023]
Abstract
Numerous studies find associations between social media use and beliefs in conspiracy theories and misinformation. While such findings are often interpreted as evidence that social media causally promotes conspiracy beliefs, we theorize that this relationship is conditional on other individual-level predispositions. Across two studies, we examine the relationship between beliefs in conspiracy theories and media use, finding that individuals who get their news from social media and use social media frequently express more beliefs in some types of conspiracy theories and misinformation. However, we also find that these relationships are conditional on conspiracy thinking--the predisposition to interpret salient events as products of conspiracies--such that social media use becomes more strongly associated with conspiracy beliefs as conspiracy thinking intensifies. This pattern, which we observe across many beliefs from two studies, clarifies the relationship between social media use and beliefs in dubious ideas. Supplementary Information The online version contains supplementary material available at 10.1007/s11109-021-09734-6.
Collapse
Affiliation(s)
- Adam M. Enders
- Dept. of Political Science, Univ. of Louisville, Louisville, KY 40292 USA
| | | | - Michelle I. Seelig
- Dept. of Cinema and Interactive Media, Univ. of Miami, Coral Gables, FL 33146 USA
| | - Casey A. Klofstad
- Dept. of Political Science, Univ. of Miami, Coral Gables, FL 33146 USA
| | - Stefan Wuchty
- Dept. of Computer Science and Miami Institute of Data Science and Computing, Univ. of Miami, Coral Gables, FL 33146 USA
| | | | - Manohar N. Murthi
- Dept. of Electrical and Computer Engineering, Univ. of Miami, Coral Gables, FL 33146 USA
| | - Kamal Premaratne
- Dept. of Electrical and Computer Engineering, Univ. of Miami, Coral Gables, FL 33146 USA
| | - Justin Stoler
- Dept. of Geography, Univ. of Miami, Coral Gables, FL 33146 USA
| |
Collapse
|
18
|
Enders AM, Uscinski JE, Klofstad CA, Seelig MI, Wuchty S, Murthi MN, Premaratne K, Funchion JR. Do conspiracy beliefs form a belief system? Examining the structure and organization of conspiracy beliefs. J Soc Polit Psych 2021. [DOI: 10.5964/jspp.5649] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Despite regular reference to conspiracy theories as a “belief system,” few studies have attempted to explore the structure and organization of conspiracy beliefs beyond an examination of correlations between those beliefs. Employing unique data from two national surveys that includes respondent beliefs in 27 conspiracy theories, we decipher the substantive dimensions along which conspiracy beliefs are organized, as well as subgroupings within those dimensions. We find that variation in these conspiracy beliefs can be accounted for with two dimensions: the first regards partisan and ideological identities, while the other is composed of anti-social orientations, such as narcissism, Machiavellianism, psychopathy, and acceptance of political violence. Importantly, these two dimensions are uncorrelated. We also find that conspiracy beliefs group together by substantive content, such as those regarding partisan actors or science/medicine. Our findings also demonstrate that inferences about the correlates of conspiracy beliefs are highly contingent on the specific conspiracy theories employed by researchers. We provide suggestions for future research in this vein.
Collapse
|
19
|
Yang X, Lian X, Fu C, Wuchty S, Yang S, Zhang Z. HVIDB: a comprehensive database for human-virus protein-protein interactions. Brief Bioinform 2021; 22:832-844. [PMID: 33515030 DOI: 10.1093/bib/bbaa425] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2020] [Revised: 11/12/2020] [Accepted: 12/19/2020] [Indexed: 12/22/2022] Open
Abstract
While leading to millions of people's deaths every year the treatment of viral infectious diseases remains a huge public health challenge.Therefore, an in-depth understanding of human-virus protein-protein interactions (PPIs) as the molecular interface between a virus and its host cell is of paramount importance to obtain new insights into the pathogenesis of viral infections and development of antiviral therapeutic treatments. However, current human-virus PPI database resources are incomplete, lack annotation and usually do not provide the opportunity to computationally predict human-virus PPIs. Here, we present the Human-Virus Interaction DataBase (HVIDB, http://zzdlab.com/hvidb/) that provides comprehensively annotated human-virus PPI data as well as seamlessly integrates online PPI prediction tools. Currently, HVIDB highlights 48 643 experimentally verified human-virus PPIs covering 35 virus families, 6633 virally targeted host complexes, 3572 host dependency/restriction factors as well as 911 experimentally verified/predicted 3D complex structures of human-virus PPIs. Furthermore, our database resource provides tissue-specific expression profiles of 6790 human genes that are targeted by viruses and 129 Gene Expression Omnibus series of differentially expressed genes post-viral infections. Based on these multifaceted and annotated data, our database allows the users to easily obtain reliable information about PPIs of various human viruses and conduct an in-depth analysis of their inherent biological significance. In particular, HVIDB also integrates well-performing machine learning models to predict interactions between the human host and viral proteins that are based on (i) sequence embedding techniques, (ii) interolog mapping and (iii) domain-domain interaction inference. We anticipate that HVIDB will serve as a one-stop knowledge base to further guide hypothesis-driven experimental efforts to investigate human-virus relationships.
Collapse
Affiliation(s)
- Xiaodi Yang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Xianyi Lian
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Chen Fu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Stefan Wuchty
- Institute of Data Science and Sylvester Comprehensive Cancer Center at the University of Miami, Beijing 100193, China
| | - Shiping Yang
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Ziding Zhang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| |
Collapse
|
20
|
Hekman RM, Hume AJ, Goel RK, Abo KM, Huang J, Blum BC, Werder RB, Suder EL, Paul I, Phanse S, Youssef A, Alysandratos KD, Padhorny D, Ojha S, Mora-Martin A, Kretov D, Ash PEA, Verma M, Zhao J, Patten JJ, Villacorta-Martin C, Bolzan D, Perea-Resa C, Bullitt E, Hinds A, Tilston-Lunel A, Varelas X, Farhangmehr S, Braunschweig U, Kwan JH, McComb M, Basu A, Saeed M, Perissi V, Burks EJ, Layne MD, Connor JH, Davey R, Cheng JX, Wolozin BL, Blencowe BJ, Wuchty S, Lyons SM, Kozakov D, Cifuentes D, Blower M, Kotton DN, Wilson AA, Mühlberger E, Emili A. Actionable Cytopathogenic Host Responses of Human Alveolar Type 2 Cells to SARS-CoV-2. Mol Cell 2021; 81:212. [PMID: 33417854 PMCID: PMC7831449 DOI: 10.1016/j.molcel.2020.12.028] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
21
|
Hekman RM, Hume AJ, Goel RK, Abo KM, Huang J, Blum BC, Werder RB, Suder EL, Paul I, Phanse S, Youssef A, Alysandratos KD, Padhorny D, Ojha S, Mora-Martin A, Kretov D, Ash PEA, Verma M, Zhao J, Patten JJ, Villacorta-Martin C, Bolzan D, Perea-Resa C, Bullitt E, Hinds A, Tilston-Lunel A, Varelas X, Farhangmehr S, Braunschweig U, Kwan JH, McComb M, Basu A, Saeed M, Perissi V, Burks EJ, Layne MD, Connor JH, Davey R, Cheng JX, Wolozin BL, Blencowe BJ, Wuchty S, Lyons SM, Kozakov D, Cifuentes D, Blower M, Kotton DN, Wilson AA, Mühlberger E, Emili A. Actionable Cytopathogenic Host Responses of Human Alveolar Type 2 Cells to SARS-CoV-2. Mol Cell 2020; 80:1104-1122.e9. [PMID: 33259812 PMCID: PMC7674017 DOI: 10.1016/j.molcel.2020.11.028] [Citation(s) in RCA: 63] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 10/16/2020] [Accepted: 11/11/2020] [Indexed: 12/11/2022]
Abstract
Human transmission of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), causative pathogen of the COVID-19 pandemic, exerts a massive health and socioeconomic crisis. The virus infects alveolar epithelial type 2 cells (AT2s), leading to lung injury and impaired gas exchange, but the mechanisms driving infection and pathology are unclear. We performed a quantitative phosphoproteomic survey of induced pluripotent stem cell-derived AT2s (iAT2s) infected with SARS-CoV-2 at air-liquid interface (ALI). Time course analysis revealed rapid remodeling of diverse host systems, including signaling, RNA processing, translation, metabolism, nuclear integrity, protein trafficking, and cytoskeletal-microtubule organization, leading to cell cycle arrest, genotoxic stress, and innate immunity. Comparison to analogous data from transformed cell lines revealed respiratory-specific processes hijacked by SARS-CoV-2, highlighting potential novel therapeutic avenues that were validated by a high hit rate in a targeted small molecule screen in our iAT2 ALI system.
Collapse
Affiliation(s)
- Ryan M Hekman
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Adam J Hume
- Department of Microbiology, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA
| | - Raghuveera Kumar Goel
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Kristine M Abo
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA; The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Jessie Huang
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA; The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Benjamin C Blum
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Rhiannon B Werder
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA; The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Ellen L Suder
- Department of Microbiology, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA
| | - Indranil Paul
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Sadhna Phanse
- Center for Network Systems Biology, Boston University, Boston, MA, USA
| | - Ahmed Youssef
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA; Bioinformatics Program, Boston University, Boston, MA, USA
| | - Konstantinos D Alysandratos
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA; The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Dzmitry Padhorny
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, USA; Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA
| | - Sandeep Ojha
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | | | - Dmitry Kretov
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Peter E A Ash
- Department of Pharmacology, Boston University School of Medicine, Boston, MA, USA
| | - Mamta Verma
- Department of Pharmacology, Boston University School of Medicine, Boston, MA, USA
| | - Jian Zhao
- Department of Electrical and Computer Engineering, Boston University, Boston, MA, USA
| | - J J Patten
- Department of Microbiology, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA
| | - Carlos Villacorta-Martin
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA
| | - Dante Bolzan
- Department of Computer Science, University of Miami, Miami, FL, USA
| | - Carlos Perea-Resa
- Department of Molecular Biology, Harvard Medical School, Boston, MA, USA
| | - Esther Bullitt
- Department of Physiology and Biophysics, Boston University, Boston, MA, USA
| | - Anne Hinds
- The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Andrew Tilston-Lunel
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Xaralabos Varelas
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Shaghayegh Farhangmehr
- Donnelly Centre, University of Toronto, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | | | - Julian H Kwan
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Mark McComb
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA; Center for Biomedical Mass Spectrometry, Boston University School of Medicine, Boston, MA, USA
| | - Avik Basu
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Mohsan Saeed
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA
| | - Valentina Perissi
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Eric J Burks
- Department of Pathology and Laboratory Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Matthew D Layne
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - John H Connor
- Department of Microbiology, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA
| | - Robert Davey
- Department of Microbiology, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA
| | - Ji-Xin Cheng
- Department of Biomedical Engineering, Boston University, Boston, MA, USA
| | - Benjamin L Wolozin
- Department of Pharmacology, Boston University School of Medicine, Boston, MA, USA
| | - Benjamin J Blencowe
- Donnelly Centre, University of Toronto, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Miami, FL, USA; Department of Biology, University of Miami, Miami, FL, USA; Miami Institute of Data Science and Computing, Miami, FL, USA
| | - Shawn M Lyons
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, USA; Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA
| | - Daniel Cifuentes
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA
| | - Michael Blower
- Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA; Department of Molecular Biology, Harvard Medical School, Boston, MA, USA
| | - Darrell N Kotton
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA; The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA.
| | - Andrew A Wilson
- Center for Regenerative Medicine of Boston University and Boston Medical Center, Boston, MA, USA; The Pulmonary Center, Department of Medicine, Boston University School of Medicine, Boston, MA, USA.
| | - Elke Mühlberger
- Department of Microbiology, Boston University School of Medicine, Boston, MA, USA; National Emerging Infectious Diseases Laboratories, Boston University, Boston, MA, USA.
| | - Andrew Emili
- Center for Network Systems Biology, Boston University, Boston, MA, USA; Department of Biochemistry, Boston University School of Medicine, Boston, MA, USA; Department of Biology, Boston University, Boston, MA, USA.
| |
Collapse
|
22
|
Abstract
The determination of signaling pathways and transcriptional networks that control various biological processes is a major challenge from both basic science and translational medicine perspectives. Because such analysis can point to critical disease driver nodes to target for therapeutic purposes, we combined data from phenotypic screening experiments and gene expression studies of mouse neurons to determine information flow through a molecular interaction network using a network propagation approach. We hypothesized that differences in information flow between control and injured conditions prioritize relevant driver nodes that cause this state change. Identifying paths likely taken from potential source nodes to a set of transcription factors (TFs), called sinks, we found that kinases are enriched among source genes sending significantly different amounts of information to TFs in an axonal injury model. Additionally, TFs found to be differentially active during axon growth were enriched in the set of sink genes that received significantly altered amounts of information from source genes. Notably, such enrichment levels hold even when restricting the set of source genes to only those kinases observed to support or hamper neurite growth. That way, we found a set of 71 source genes that send significantly different levels of information to axon growth-relevant TFs. We analyzed their information flow changes in response to axonal injury and their influences on TFs predicted to facilitate or antagonize axon growth. Finally, we drew a network diagram of the interactions and changes in information flow between these source genes and their axon growth-relevant sink TFs.
Collapse
Affiliation(s)
- Prajwal Devkota
- Department of Computer Science, University of Miami, Miami, FL, USA
| | - Matt C Danzi
- Miami Project to Cure Paralysis, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Vance P Lemmon
- Miami Project to Cure Paralysis, University of Miami Miller School of Medicine, Miami, FL, USA.,Miami Institute of Data Science and Computing, University of Miami, Miami, FL, USA.,Department of Neurological Surgery, University of Miami Miller School of Medicine, Miami, FL, USA.,Department of Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL, USA.,Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, FL, USA
| | - John L Bixby
- Miami Project to Cure Paralysis, University of Miami Miller School of Medicine, Miami, FL, USA.,Miami Institute of Data Science and Computing, University of Miami, Miami, FL, USA.,Department of Neurological Surgery, University of Miami Miller School of Medicine, Miami, FL, USA.,Department of Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL, USA.,Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Miami, FL, USA.,Miami Institute of Data Science and Computing, University of Miami, Miami, FL, USA.,Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, FL, USA.,Department of Biology, University of Miami, Miami, FL, USA
| |
Collapse
|
23
|
Yang X, Yang S, Li Q, Wuchty S, Zhang Z. Prediction of human-virus protein-protein interactions through a sequence embedding-based machine learning method. Comput Struct Biotechnol J 2019; 18:153-161. [PMID: 31969974 PMCID: PMC6961065 DOI: 10.1016/j.csbj.2019.12.005] [Citation(s) in RCA: 67] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Revised: 11/29/2019] [Accepted: 12/10/2019] [Indexed: 12/11/2022] Open
Abstract
The identification of human-virus protein-protein interactions (PPIs) is an essential and challenging research topic, potentially providing a mechanistic understanding of viral infection. Given that the experimental determination of human-virus PPIs is time-consuming and labor-intensive, computational methods are playing an important role in providing testable hypotheses, complementing the determination of large-scale interactome between species. In this work, we applied an unsupervised sequence embedding technique (doc2vec) to represent protein sequences as rich feature vectors of low dimensionality. Training a Random Forest (RF) classifier through a training dataset that covers known PPIs between human and all viruses, we obtained excellent predictive accuracy outperforming various combinations of machine learning algorithms and commonly-used sequence encoding schemes. Rigorous comparison with three existing human-virus PPI prediction methods, our proposed computational framework further provided very competitive and promising performance, suggesting that the doc2vec encoding scheme effectively captures context information of protein sequences, pertaining to corresponding protein-protein interactions. Our approach is freely accessible through our web server as part of our host-pathogen PPI prediction platform (http://zzdlab.com/InterSPPI/). Taken together, we hope the current work not only contributes a useful predictor to accelerate the exploration of human-virus PPIs, but also provides some meaningful insights into human-virus relationships.
Collapse
Key Words
- AC, Auto Covariance
- ACC, Accuracy
- AUC, area under the ROC curve
- AUPRC, area under the PR curve
- Adaboost, Adaptive Boosting
- CT, Conjoint Triad
- Doc2vec
- Embedding
- Human-virus interaction
- LD, Local Descriptor
- MCC, Matthews correlation coefficient
- ML, machine learning
- MLP, Multiple Layer Perceptron
- MS, mass spectroscopy
- Machine learning
- PPIs, protein-protein interactions
- PR, Precision-Recall
- Prediction
- Protein-protein interaction
- RBF, radial basis function
- RF, Random Forest
- ROC, Receiver Operating Characteristic
- SGD, stochastic gradient descent
- SVM, Support Vector Machine
- Y2H, yeast two-hybrid
Collapse
Affiliation(s)
- Xiaodi Yang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Shiping Yang
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Qinmengge Li
- National Demonstration Center for Experimental Biological Sciences Education, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Stefan Wuchty
- Dept. of Computer Science, University of Miami, Miami, FL 33146, USA
- Dept. of Biology, University of Miami, Miami, FL 33146, USA
- Center of Computational Science, University of Miami, Miami, FL 33146, USA
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL 33136, USA
| | - Ziding Zhang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| |
Collapse
|
24
|
Park J, Zhu Y, Tao X, Brazill JM, Li C, Wuchty S, Zhai RG. MicroRNA miR-1002 Enhances NMNAT-Mediated Stress Response by Modulating Alternative Splicing. iScience 2019; 19:1048-1064. [PMID: 31522116 PMCID: PMC6745518 DOI: 10.1016/j.isci.2019.08.052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2018] [Revised: 05/07/2019] [Accepted: 08/27/2019] [Indexed: 11/30/2022] Open
Abstract
Understanding endogenous regulation of stress resistance and homeostasis maintenance is critical to developing neuroprotective therapies. Nicotinamide mononucleotide adenylyltransferase (NMNAT) is a conserved essential enzyme that confers extraordinary protection and stress resistance in many neurodegenerative disease models. Drosophila Nmnat is alternatively spliced to two mRNA variants, RA and RB. RB translates to protein isoform PD with robust protective activity and is upregulated upon stress to confer enhanced neuroprotection. The mechanisms regulating the alternative splicing and stress response of NMNAT remain unclear. We have discovered a Drosophila microRNA, dme-miR-1002, which promotes the splicing of NMNAT pre-mRNA to RB by disrupting a pre-mRNA stem-loop structure. NMNAT pre-mRNA is preferentially spliced to RA in basal conditions, whereas miR-1002 enhances NMNAT PD-mediated stress protection by binding via RISC component Argonaute1 to the pre-mRNA, facilitating the splicing switch to RB. These results outline a new process for microRNAs in regulating alternative splicing and modulating stress resistance.
Collapse
Affiliation(s)
- Joun Park
- Department of Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Program in Neuroscience, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Yi Zhu
- Department of Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Program in Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Xianzun Tao
- Department of Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Jennifer M Brazill
- Department of Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Program in Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Chong Li
- Department of Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Program in Human Genetics and Genomics, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL 33146, USA
| | - R Grace Zhai
- Department of Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Program in Neuroscience, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Program in Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Program in Human Genetics and Genomics, University of Miami Miller School of Medicine, Miami, FL 33136, USA.
| |
Collapse
|
25
|
Johnson NF, Leahy R, Restrepo NJ, Velasquez N, Zheng M, Manrique P, Devkota P, Wuchty S. Hidden resilience and adaptive dynamics of the global online hate ecology. Nature 2019; 573:261-265. [PMID: 31435010 DOI: 10.1038/s41586-019-1494-7] [Citation(s) in RCA: 74] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Accepted: 07/24/2019] [Indexed: 11/09/2022]
Abstract
Online hate and extremist narratives have been linked to abhorrent real-world events, including a current surge in hate crimes1-6 and an alarming increase in youth suicides that result from social media vitriol7; inciting mass shootings such as the 2019 attack in Christchurch, stabbings and bombings8-11; recruitment of extremists12-16, including entrapment and sex-trafficking of girls as fighter brides17; threats against public figures, including the 2019 verbal attack against an anti-Brexit politician, and hybrid (racist-anti-women-anti-immigrant) hate threats against a US member of the British royal family18; and renewed anti-western hate in the 2019 post-ISIS landscape associated with support for Osama Bin Laden's son and Al Qaeda. Social media platforms seem to be losing the battle against online hate19,20 and urgently need new insights. Here we show that the key to understanding the resilience of online hate lies in its global network-of-network dynamics. Interconnected hate clusters form global 'hate highways' that-assisted by collective online adaptations-cross social media platforms, sometimes using 'back doors' even after being banned, as well as jumping between countries, continents and languages. Our mathematical model predicts that policing within a single platform (such as Facebook) can make matters worse, and will eventually generate global 'dark pools' in which online hate will flourish. We observe the current hate network rapidly rewiring and self-repairing at the micro level when attacked, in a way that mimics the formation of covalent bonds in chemistry. This understanding enables us to propose a policy matrix that can help to defeat online hate, classified by the preferred (or legally allowed) granularity of the intervention and top-down versus bottom-up nature. We provide quantitative assessments for the effects of each intervention. This policy matrix also offers a tool for tackling a broader class of illicit online behaviours21,22 such as financial fraud.
Collapse
Affiliation(s)
- N F Johnson
- Physics Department, George Washington University, Washington, DC, USA.
| | - R Leahy
- Physics Department, George Washington University, Washington, DC, USA
| | | | - N Velasquez
- Elliot School of International Affairs, George Washington University, Washington, DC, USA
| | - M Zheng
- Physics Department, University of Miami, Coral Gables, FL, USA
| | - P Manrique
- Physics Department, University of Miami, Coral Gables, FL, USA
| | - P Devkota
- Computer Science Department, University of Miami, Coral Gables, FL, USA
| | - S Wuchty
- Computer Science Department, University of Miami, Coral Gables, FL, USA
| |
Collapse
|
26
|
Abstract
Background Histone deacetylases (HDACs) are the proteins responsible for removing the acetyl group from lysine residues of core histones in chromosomes, a crucial component of gene regulation. Eleven known HDACs exist in humans and most other vertebrates. While the basic function of HDACs has been well characterized and new discoveries are still being made, the transcriptional regulation of their corresponding genes is still poorly understood. Results Here, we conducted a computational analysis of the eleven HDAC promoter sequences in 25 vertebrate species to determine whether transcription factor binding sites (TFBSs) are conserved in HDAC evolution, and if so, whether they provide useful information about HDAC expression and function. Furthermore, we used tissue-specific information of transcription factors to investigate the potential expression patterns of HDACs in different human tissues based on their transcription factor binding sites. We found that the TFBS profiles of most of the HDACs were well conserved in closely related species for all HDAC promoters except HDAC7 and HDAC10. HDAC5 had particularly strong conservation across over half of the species studied, with nearly identical profiles in the primate species. Our comparisons of TFBSs with the tissue specific gene expression profiles of their corresponding TFs showed that most HDACs had the ability to be ubiquitously expressed. A few HDAC promoters exhibited the potential for preferential expression in certain tissues, most notably HDAC11 in gall bladder, while HDAC9 seemed to have less propensity for expression in the nervous system. Conclusions In general, we found evolutionary conservation in HDAC promoters that seems to be more prominent for the ubiquitously expressed HDACs. In turn, when conservation did not follow usual phylogeny, human TFBS patterns indicated possible functional relevance. While we found that HDACs appear to uniformly expressed, we confirm that the functional differences in HDACs may be less a matter of location of activity than a question of which proteins and which acetyl groups they may be acting on. Electronic supplementary material The online version of this article (10.1186/s12864-019-5973-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Toni A Boltz
- Department of Computer Science, University of Miami, Coral Gables, FL, USA.,Present address: University of California, Los Angeles, Los Angeles, CA, USA
| | - Sawsan Khuri
- University of Exeter College of Medicine and Health, Exeter, UK
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL, USA. .,Department of Biology, University of Miami, Coral Gables, FL, USA. .,Center of Computational Science, University of Miami, Coral Gables, FL, USA. .,Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA.
| |
Collapse
|
27
|
Goodacre N, Devkota P, Bae E, Wuchty S, Uetz P. Protein-protein interactions of human viruses. Semin Cell Dev Biol 2018; 99:31-39. [PMID: 30031213 PMCID: PMC7102568 DOI: 10.1016/j.semcdb.2018.07.018] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2017] [Revised: 04/02/2018] [Accepted: 07/17/2018] [Indexed: 12/16/2022]
Abstract
Viruses infect their human hosts by a series of interactions between viral and host proteins, indicating that detailed knowledge of such virus-host interaction interfaces are critical for our understanding of viral infection mechanisms, disease etiology and the development of new drugs. In this review, we primarily survey human host-virus interaction data that are available from public databases following the standardized PSI-MS format. Notably, available host-virus protein interaction information is strongly biased toward a small number of virus families including herpesviridae, papillomaviridae, orthomyxoviridae and retroviridae. While we explore the reliability and relevance of these protein interactions we also survey the current knowledge about viruses functional and topological targets. Furthermore, we assess emerging frontiers of host-virus protein interaction research, focusing on protein interaction interfaces of hosts that are infected by different viruses and viruses that infect multiple hosts. Finally, we cover the current status of research that investigates the relationships of virus-targeted host proteins to other comorbidities as well as the influence of host-virus protein interactions on human metabolism.
Collapse
Affiliation(s)
- Norman Goodacre
- Division of Viral Products, Office of Vaccines Research and Review, Center for Biologics Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD, USA
| | - Prajwal Devkota
- Dept. of Computer Science, Univ. of Miami, Coral Gables, FL, 33146, USA
| | - Eunhae Bae
- Division of Viral Products, Office of Vaccines Research and Review, Center for Biologics Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD, USA
| | - Stefan Wuchty
- Dept. of Computer Science, Univ. of Miami, Coral Gables, FL, 33146, USA; Center for Computational Science, Univ. of Miami, Coral Gables, FL, 33146, USA; Dept. of Biology, Univ. of Miami, Coral Gables, FL, 33146, USA; Sylvester Comprehensive Cancer Center, Miller School of Medicine, University of Miami, Miami, FL, 33136, USA.
| | - Peter Uetz
- Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, VA, 23284, USA.
| |
Collapse
|
28
|
Devkota P, Danzi MC, Wuchty S. Beyond degree and betweenness centrality: Alternative topological measures to predict viral targets. PLoS One 2018; 13:e0197595. [PMID: 29795705 PMCID: PMC5967884 DOI: 10.1371/journal.pone.0197595] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2017] [Accepted: 05/04/2018] [Indexed: 11/18/2022] Open
Abstract
The availability of large-scale screens of host-virus interaction interfaces enabled the topological analysis of viral protein targets of the host. In particular, host proteins that bind viral proteins are generally hubs and proteins with high betweenness centrality. Recently, other topological measures were introduced that a virus may tap to infect a host cell. Utilizing experimentally determined sets of human protein targets from Herpes, Hepatitis, HIV and Influenza, we pooled molecular interactions between proteins from different pathway databases. Apart from a protein's degree and betweenness centrality, we considered a protein's pathway participation, ability to topologically control a network and protein PageRank index. In particular, we found that proteins with increasing values of such measures tend to accumulate viral targets and distinguish viral targets from non-targets. Furthermore, all such topological measures strongly correlate with the occurrence of a given protein in different pathways. Building a random forest classifier that is based on such topological measures, we found that protein PageRank index had the highest impact on the classification of viral (non-)targets while proteins' ability to topologically control an interaction network played the least important role.
Collapse
Affiliation(s)
- Prajwal Devkota
- Dept. of Computer Science, Univ. of Miami, Coral Gables, FL, United States of America
| | - Matt C. Danzi
- The Miami Project to Cure Paralysis, Miller School of Medicine, University of Miami, Miami, FL, United States of America
- Center for Computational Science, Univ. of Miami, Coral Gables, FL, United States of America
| | - Stefan Wuchty
- Dept. of Computer Science, Univ. of Miami, Coral Gables, FL, United States of America
- Center for Computational Science, Univ. of Miami, Coral Gables, FL, United States of America
- Dept. of Biology, Univ. of Miami, Coral Gables, FL, United States of America
- Sylvester Comprehensive Cancer Center, Miller School of Medicine, University of Miami, Miami, FL, United States of America
- * E-mail:
| |
Collapse
|
29
|
Wuchty S, Müller SA, Caufield JH, Häuser R, Aloy P, Kalkhof S, Uetz P. Proteome Data Improves Protein Function Prediction in the Interactome of Helicobacter pylori. Mol Cell Proteomics 2018; 17:961-973. [PMID: 29414760 DOI: 10.1074/mcp.ra117.000474] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Revised: 01/25/2018] [Indexed: 01/17/2023] Open
Abstract
Helicobacter pylori is a common pathogen that is estimated to infect half of the human population, causing several diseases such as duodenal ulcer. Despite one of the first pathogens to be sequenced, its proteome remains poorly characterized as about one-third of its proteins have no functional annotation. Here, we integrate and analyze known protein interactions with proteomic and genomic data from different sources. We find that proteins with similar abundances tend to interact. Such an observation is accompanied by a trend of interactions to appear between proteins of similar functions, although some show marked cross-talk to others. Protein function prediction with protein interactions is significantly improved when interactions from other bacteria are included in our network, allowing us to obtain putative functions of more than 300 poorly or previously uncharacterized proteins. Proteins that are critical for the topological controllability of the underlying network are significantly enriched with genes that are up-regulated in the spiral compared with the coccoid form of H. pylori Determining their evolutionary conservation, we present evidence that 80 protein complexes are identical in composition with their counterparts in Escherichia coli, while 85 are partially conserved and 120 complexes are completely absent. Furthermore, we determine network clusters that coincide with related functions, gene essentiality, genetic context, cellular localization, and gene expression in different cellular states.
Collapse
Affiliation(s)
- Stefan Wuchty
- From the ‡Dept. of Computer Science.,§Center for Computational Science.,¶Dept. of Biology.,‖Sylvester Comprehensive Cancer Center, Univ. of Miami, Miami, FL 33156
| | - Stefan A Müller
- **German Center for Neurodegenerative Diseases (DZNE), 81377 Munich, Germany
| | - J Harry Caufield
- ‡‡Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, VI 23284
| | - Roman Häuser
- §§German Cancer Research Center, 69120 Heidelberg, Germany
| | - Patrick Aloy
- ¶¶Joint IRB-BSC-CRG Program in Computational Biology, Institute for Research in Biomedicine (IRB Barcelona) and the Barcelona Institute of Science and Technology. Barcelona, Catalonia, Spain.,‖‖Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Catalonia, Spain
| | - Stefan Kalkhof
- Department of Molecular Systems Biology, UFZ, Helmholtz-Centre for Environmental Research Leipzig, 04318 Leipzig, Germany.,Institute of Bioanalysis, University of Applied Sciences and Arts of Coburg, Friedrich-Streib-Str. 2, 96450 Coburg, Germany.,Fraunhofer Institute for Cell Therapy and Immunology, Department of Therapy Validation, 04103 Leipzig, Germany
| | - Peter Uetz
- ‡‡Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, VI 23284
| |
Collapse
|
30
|
Feng H, Wang L, Wuchty S, Wilson ACC. microRNA regulation in an ancient obligate endosymbiosis. Mol Ecol 2018; 27:1777-1793. [PMID: 29271121 DOI: 10.1111/mec.14464] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2017] [Revised: 11/20/2017] [Accepted: 11/28/2017] [Indexed: 01/03/2023]
Abstract
Although many insects are associated with obligate bacterial endosymbionts, the mechanisms by which these host/endosymbiont associations are regulated remain mysterious. While microRNAs (miRNAs) have been recently identified as regulators of host/microbe interactions, including host/pathogen and host/facultative endosymbiont interactions, the role miRNAs may play in mediating host/obligate endosymbiont interactions is virtually unknown. Here, we identified conserved miRNAs that potentially mediate symbiotic interactions between aphids and their obligate endosymbiont, Buchnera aphidicola. Using small RNA sequence data from Myzus persicae and Acyrthosiphon pisum, we annotated 93 M. persicae and 89 A. pisum miRNAs, among which 69 were shared. We found 14 miRNAs that were either highly expressed in aphid bacteriome, the Buchnera-housing tissue, or differentially expressed in bacteriome vs. gut, a non-Buchnera-housing tissue. Strikingly, 10 of these 14 miRNAs have been implicated previously in other host/microbe interaction studies. Investigating the interaction networks of these miRNAs using a custom computational pipeline, we identified 103 miRNA::mRNA interactions shared between M. persicae and A. pisum. Functional annotation of the shared mRNA targets revealed only two over-represented cluster of orthologous group categories: amino acid transport and metabolism, and signal transduction mechanisms. Our work supports a role for miRNAs in mediating host/symbiont interactions between aphids and their obligate endosymbiont Buchnera. In addition, our results highlight the probable importance of signal transduction mechanisms to host/endosymbiont coevolution.
Collapse
Affiliation(s)
- Honglin Feng
- Department of Biology, University of Miami, Coral Gables, FL, USA
| | - Lingyu Wang
- Department of Biology, University of Miami, Coral Gables, FL, USA
| | - Stefan Wuchty
- Department of Biology, University of Miami, Coral Gables, FL, USA.,Department of Computer Science, University of Miami, Coral Gables, FL, USA.,Center for Computational Science, University of Miami, Coral Gables, FL, USA.,Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| | - Alex C C Wilson
- Department of Biology, University of Miami, Coral Gables, FL, USA
| |
Collapse
|
31
|
Mariano R, Wuchty S. Structure-based prediction of host–pathogen protein interactions. Curr Opin Struct Biol 2017; 44:119-124. [DOI: 10.1016/j.sbi.2017.02.007] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2016] [Accepted: 02/28/2017] [Indexed: 11/25/2022]
|
32
|
Wuchty S, Boltz T, Küçük-McGinty H. Links between critical proteins drive the controllability of protein interaction networks. Proteomics 2017; 17:e1700056. [DOI: 10.1002/pmic.201700056] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2017] [Revised: 03/23/2017] [Accepted: 04/07/2017] [Indexed: 01/03/2023]
Affiliation(s)
- Stefan Wuchty
- Department of Computer Science; University of Miami; Coral Gables FL USA
- Center of Computational Sciences; University of Miami; Coral Gables FL USA
- Sylvester Comprehensive Cancer Center; University of Miami; Miami FL USA
| | - Toni Boltz
- Department of Computer Science; University of Miami; Coral Gables FL USA
| | | |
Collapse
|
33
|
Caufield JH, Wimble C, Shary S, Wuchty S, Uetz P. Bacterial protein meta-interactomes predict cross-species interactions and protein function. BMC Bioinformatics 2017; 18:171. [PMID: 28298180 PMCID: PMC5353844 DOI: 10.1186/s12859-017-1585-0] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 03/04/2017] [Indexed: 11/24/2022] Open
Abstract
Background Protein-protein interactions (PPIs) can offer compelling evidence for protein function, especially when viewed in the context of proteome-wide interactomes. Bacteria have been popular subjects of interactome studies: more than six different bacterial species have been the subjects of comprehensive interactome studies while several more have had substantial segments of their proteomes screened for interactions. The protein interactomes of several bacterial species have been completed, including several from prominent human pathogens. The availability of interactome data has brought challenges, as these large data sets are difficult to compare across species, limiting their usefulness for broad studies of microbial genetics and evolution. Results In this study, we use more than 52,000 unique protein-protein interactions (PPIs) across 349 different bacterial species and strains to determine their conservation across data sets and taxonomic groups. When proteins are collapsed into orthologous groups (OGs) the resulting meta-interactome still includes more than 43,000 interactions, about 14,000 of which involve proteins of unknown function. While conserved interactions provide support for protein function in their respective species data, we found only 429 PPIs (~1% of the available data) conserved in two or more species, rendering any cross-species interactome comparison immediately useful. The meta-interactome serves as a model for predicting interactions, protein functions, and even full interactome sizes for species with limited to no experimentally observed PPI, including Bacillus subtilis and Salmonella enterica which are predicted to have up to 18,000 and 31,000 PPIs, respectively. Conclusions In the course of this work, we have assembled cross-species interactome comparisons that will allow interactomics researchers to anticipate the structures of yet-unexplored microbial interactomes and to focus on well-conserved yet uncharacterized interactors for further study. Such conserved interactions should provide evidence for important but yet-uncharacterized aspects of bacterial physiology and may provide targets for anti-microbial therapies. Electronic supplementary material The online version of this article (doi:10.1186/s12859-017-1585-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- J Harry Caufield
- Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, Virginia, USA
| | - Christopher Wimble
- Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, Virginia, USA
| | - Semarjit Shary
- Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, Virginia, USA
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, Florida, USA.,Center for Computational Science, University of Miami, Coral Gables, Florida, USA.,Sylvester Comprehensive Cancer Center, Miller School of Medicine, University of Miami, Miami, Florida, USA
| | - Peter Uetz
- Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, Virginia, USA.
| |
Collapse
|
34
|
Keasey SL, Natesan M, Pugh C, Kamata T, Wuchty S, Ulrich RG. Cell-free Determination of Binary Complexes That Comprise Extended Protein-Protein Interaction Networks of Yersinia pestis. Mol Cell Proteomics 2016; 15:3220-3232. [PMID: 27489291 DOI: 10.1074/mcp.m116.059337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2016] [Indexed: 11/06/2022] Open
Abstract
Binary protein interactions form the basic building blocks of molecular networks and dynamic assemblies that control all cellular functions of bacteria. Although these protein interactions are a potential source of targets for the development of new antibiotics, few high-confidence data sets are available for the large proteomes of most pathogenic bacteria. We used a library of recombinant proteins from the plague bacterium Yersinia pestis to probe planar microarrays of immobilized proteins that represented ∼85% (3552 proteins) of the bacterial proteome, resulting in >77,000 experimentally determined binary interactions. Moderate (KD ∼μm) to high-affinity (KD ∼nm) interactions were characterized for >1600 binary complexes by surface plasmon resonance imaging of microarrayed proteins. Core binary interactions that were in common with other gram-negative bacteria were identified from the results of both microarray methods. Clustering of proteins within the interaction network by function revealed statistically enriched complexes and pathways involved in replication, biosynthesis, virulence, metabolism, and other diverse biological processes. The interaction pathways included many proteins with no previously known function. Further, a large assembly of proteins linked to transcription and translation were contained within highly interconnected subregions of the network. The two-tiered microarray approach used here is an innovative method for detecting binary interactions, and the resulting data will serve as a critical resource for the analysis of protein interaction networks that function within an important human pathogen.
Collapse
Affiliation(s)
- Sarah L Keasey
- From the ‡Molecular and Translational Sciences Division, U.S. Army Medical Research Institute of Infectious Diseases, Frederick, Maryland 21702; §Biological Sciences Department, University of Maryland Baltimore County, Baltimore, Maryland 21250
| | - Mohan Natesan
- From the ‡Molecular and Translational Sciences Division, U.S. Army Medical Research Institute of Infectious Diseases, Frederick, Maryland 21702
| | - Christine Pugh
- From the ‡Molecular and Translational Sciences Division, U.S. Army Medical Research Institute of Infectious Diseases, Frederick, Maryland 21702
| | - Teddy Kamata
- From the ‡Molecular and Translational Sciences Division, U.S. Army Medical Research Institute of Infectious Diseases, Frederick, Maryland 21702
| | - Stefan Wuchty
- ¶National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland 20892
| | - Robert G Ulrich
- From the ‡Molecular and Translational Sciences Division, U.S. Army Medical Research Institute of Infectious Diseases, Frederick, Maryland 21702;
| |
Collapse
|
35
|
Johnson NF, Zheng M, Vorobyeva Y, Gabriel A, Qi H, Velasquez N, Manrique P, Johnson D, Restrepo E, Song C, Wuchty S. New online ecology of adversarial aggregates: ISIS and beyond. Science 2016; 352:1459-63. [DOI: 10.1126/science.aaf0675] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2015] [Accepted: 05/12/2016] [Indexed: 11/02/2022]
Affiliation(s)
- N. F. Johnson
- Department of Physics, University of Miami, Coral Gables, FL 33126, USA
| | - M. Zheng
- Department of Physics, University of Miami, Coral Gables, FL 33126, USA
| | - Y. Vorobyeva
- Department of International Studies, University of Miami, Coral Gables, FL 33126, USA
| | - A. Gabriel
- Department of Physics, University of Miami, Coral Gables, FL 33126, USA
| | - H. Qi
- Department of Physics, University of Miami, Coral Gables, FL 33126, USA
| | - N. Velasquez
- Department of International Studies, University of Miami, Coral Gables, FL 33126, USA
| | - P. Manrique
- Department of Physics, University of Miami, Coral Gables, FL 33126, USA
| | - D. Johnson
- Department of Government, Harvard University, Cambridge, MA 02138, USA
| | - E. Restrepo
- Department of Geography and Regional Studies, University of Miami, Coral Gables, FL 33126, USA
| | - C. Song
- Department of Physics, University of Miami, Coral Gables, FL 33126, USA
| | - S. Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL 33126, USA
- Center for Computational Science, University of Miami, Coral Gables, FL 33126, USA
| |
Collapse
|
36
|
Manrique P, Cao Z, Gabriel A, Horgan J, Gill P, Qi H, Restrepo EM, Johnson D, Wuchty S, Song C, Johnson N. Women's connectivity in extreme networks. Sci Adv 2016; 2:e1501742. [PMID: 27386564 PMCID: PMC4928915 DOI: 10.1126/sciadv.1501742] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Accepted: 04/27/2016] [Indexed: 06/02/2023]
Abstract
A popular stereotype is that women will play more minor roles than men as environments become more dangerous and aggressive. Our analysis of new longitudinal data sets from offline and online operational networks [for example, ISIS (Islamic State)] shows that although men dominate numerically, women emerge with superior network connectivity that can benefit the underlying system's robustness and survival. Our observations suggest new female-centric approaches that could be used to affect such networks. They also raise questions about how individual contributions in high-pressure systems are evaluated.
Collapse
Affiliation(s)
- Pedro Manrique
- Department of Physics, University of Miami, Coral Gables, FL 33124, USA
| | - Zhenfeng Cao
- Department of Physics, University of Miami, Coral Gables, FL 33124, USA
| | - Andrew Gabriel
- Department of Computer Science, University of Miami, Coral Gables, FL 33124, USA
| | - John Horgan
- Global Studies Institute and Department of Psychology, Georgia State University, Atlanta, GA 30302, USA
| | - Paul Gill
- Department of Security and Crime Science, University College London, London WC1H 9EZ, UK
| | - Hong Qi
- Department of Physics, University of Miami, Coral Gables, FL 33124, USA
| | - Elvira M. Restrepo
- Department of Geography and Global Studies, University of Miami, Coral Gables, FL 33126, USA
| | - Daniela Johnson
- Department of Government, Harvard University, Cambridge, MA 02138, USA
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL 33124, USA
- Center for Computational Science, University of Miami, Coral Gables, FL 33124, USA
| | - Chaoming Song
- Department of Physics, University of Miami, Coral Gables, FL 33124, USA
| | - Neil Johnson
- Department of Physics, University of Miami, Coral Gables, FL 33124, USA
| |
Collapse
|
37
|
Mariano R, Wuchty S, Vizoso-Pinto MG, Häuser R, Uetz P. The interactome of Streptococcus pneumoniae and its bacteriophages show highly specific patterns of interactions among bacteria and their phages. Sci Rep 2016; 6:24597. [PMID: 27103053 PMCID: PMC4840434 DOI: 10.1038/srep24597] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2015] [Accepted: 03/22/2016] [Indexed: 12/11/2022] Open
Abstract
Although an abundance of bacteriophages exists, little is known about interactions between their proteins and those of their bacterial hosts. Here, we experimentally determined the phage-host interactomes of the phages Dp-1 and Cp-1 and their underlying protein interaction network in the host Streptococcus pneumoniae. We compared our results to the interaction patterns of E. coli phages lambda and T7. Dp-1 and Cp-1 target highly connected host proteins, occupy central network positions, and reach many protein clusters through the interactions of their targets. In turn, lambda and T7 targets cluster to conserved and essential proteins in E. coli, while such patterns were largely absent in S. pneumoniae. Furthermore, targets in E. coli were mutually strongly intertwined, while targets of Dp-1 and Cp-1 were strongly connected through essential and orthologous proteins in their immediate network vicinity. In both phage-host systems, the impact of phages on their protein targets appears to extend from their network neighbors, since proteins that interact with phage targets were located in central network positions, have a strong topologically disruptive effect and touch complexes with high functional heterogeneity. Such observations suggest that the phages, biological impact is accomplished through a surprisingly limited topological reach of their targets.
Collapse
Affiliation(s)
- Rachelle Mariano
- Dept. of Computer Science, University of Miami, Coral Gables, FL 33146, USA
| | - Stefan Wuchty
- Dept. of Computer Science, University of Miami, Coral Gables, FL 33146, USA.,Center for Computational Science, University of Miami, Coral Gables, FL 33146, USA
| | - Maria G Vizoso-Pinto
- Max von Pettenkofer-Institute, Department of Virology, Ludwig-Maximilians-University, Munich, Germany.,Instituto Superior de Investigaciones Biológicas (INSIBIO), CONICET-UNT, and Instituto de Fisiología, Facultad de Medicina, UNT. San Miguel de Tucumán, Argentina
| | - Roman Häuser
- Genomics and Proteomics Core Facilities, German Cancer Research Center, Im Neuenheimer Feld 580, 69120 Heidelberg, Germany
| | - Peter Uetz
- Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, VA 23284, USA
| |
Collapse
|
38
|
Keasey SL, Natesan M, Pugh C, Kamata T, Wuchty S, Ulrich RG. The road to linking genomics and proteomics of pathogenic bacteria: from binary protein complexes to interaction pathways. BMC Bioinformatics 2015. [PMCID: PMC4331799 DOI: 10.1186/1471-2105-16-s2-a9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
39
|
Abstract
Background Minimum dominating sets (MDSet) of protein interaction networks allow the control of underlying protein interaction networks through their topological placement. While essential proteins are enriched in MDSets, we hypothesize that the statistical properties of biological functions of essential genes are enhanced when we focus on essential MDSet proteins (e-MDSet). Results Here, we determined minimum dominating sets of proteins (MDSet) in interaction networks of E. coli, S. cerevisiae and H. sapiens, defined as subsets of proteins whereby each remaining protein can be reached by a single interaction. We compared several topological and functional parameters of essential, MDSet, and essential MDSet (e-MDSet) proteins. In particular, we observed that their topological placement allowed e-MDSet proteins to provide a positive correlation between degree and lethality, connect more protein complexes, and have a stronger impact on network resilience than essential proteins alone. In comparison to essential proteins we further found that interactions between e-MDSet proteins appeared more frequently within complexes, while interactions of e-MDSet proteins between complexes were depleted. Finally, these e-MDSet proteins classified into functional groupings that play a central role in survival and adaptability. Conclusions The determination of e-MDSet of an organism highlights a set of proteins that enhances the enrichment signals of biological functions of essential proteins. As a consequence, we surmise that e-MDSets may provide a new method of evaluating the core proteins of an organism.
Collapse
Affiliation(s)
- Sawsan Khuri
- Department of Computer Science, University of Miami, Coral Gables, FL, 33146, USA. .,Center for Computational Science, University of Miami, Coral Gables, FL, 33146, USA.
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, Coral Gables, FL, 33146, USA. .,Center for Computational Science, University of Miami, Coral Gables, FL, 33146, USA.
| |
Collapse
|
40
|
Blasche S, Arens S, Ceol A, Siszler G, Schmidt MA, Häuser R, Schwarz F, Wuchty S, Aloy P, Uetz P, Stradal T, Koegl M. The EHEC-host interactome reveals novel targets for the translocated intimin receptor. Sci Rep 2014; 4:7531. [PMID: 25519916 PMCID: PMC4269881 DOI: 10.1038/srep07531] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2014] [Accepted: 11/21/2014] [Indexed: 12/20/2022] Open
Abstract
Enterohemorrhagic E. coli (EHEC) manipulate their human host through at least 39 effector proteins which hijack host processes through direct protein-protein interactions (PPIs). To identify their protein targets in the host cells, we performed yeast two-hybrid screens, allowing us to find 48 high-confidence protein-protein interactions between 15 EHEC effectors and 47 human host proteins. In comparison to other bacteria and viruses we found that EHEC effectors bind more frequently to hub proteins as well as to proteins that participate in a higher number of protein complexes. The data set includes six new interactions that involve the translocated intimin receptor (TIR), namely HPCAL1, HPCAL4, NCALD, ARRB1, PDE6D, and STK16. We compared these TIR interactions in EHEC and enteropathogenic E. coli (EPEC) and found that five interactions were conserved. Notably, the conserved interactions included those of serine/threonine kinase 16 (STK16), hippocalcin-like 1 (HPCAL1) as well as neurocalcin-delta (NCALD). These proteins co-localize with the infection sites of EPEC. Furthermore, our results suggest putative functions of poorly characterized effectors (EspJ, EspY1). In particular, we observed that EspJ is connected to the microtubule system while EspY1 appears to be involved in apoptosis/cell cycle regulation.
Collapse
Affiliation(s)
- Sonja Blasche
- Genomics and Proteomics Core Facilities, German Cancer Research Center, Im Neuenheimer Feld 580, 69120 Heidelberg, Germany
| | - Stefan Arens
- Institute of Molecular Cell Biology, University of Münster, Schlossplatz 5, D-48149 Münster
| | - Arnaud Ceol
- 1] Joint IRB-BSC Program in Computational Biology, Institute for Research in Biomedicine (IRB Barcelona), Barcelona, Spain [2] Center for Genomic Science of IIT@SEMM, Fondazione Istituto Italiano di Tecnologia (IIT), Via Adamello 16, 20139 Milan - Italy
| | - Gabriella Siszler
- Genomics and Proteomics Core Facilities, German Cancer Research Center, Im Neuenheimer Feld 580, 69120 Heidelberg, Germany
| | - M Alexander Schmidt
- Institute of Infectiology, ZMBE, University of Münster, Von-Esmarch-Str. 56, D-48149 Münster
| | - Roman Häuser
- Genomics and Proteomics Core Facilities, German Cancer Research Center, Im Neuenheimer Feld 580, 69120 Heidelberg, Germany
| | - Frank Schwarz
- Genomics and Proteomics Core Facilities, German Cancer Research Center, Im Neuenheimer Feld 580, 69120 Heidelberg, Germany
| | - Stefan Wuchty
- 1] Dept. of Computer Science, Univ. of Miami, 1365 Memorial Drive, Coral Gables, FL 33146, USA [2] Center for Computational Science, Univ. of Miami, 1365 Memorial Drive, Coral Gables, FL 33146, USA
| | - Patrick Aloy
- 1] Joint IRB-BSC Program in Computational Biology, Institute for Research in Biomedicine (IRB Barcelona), Barcelona, Spain [2] Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| | - Peter Uetz
- Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Theresia Stradal
- 1] Institute of Molecular Cell Biology, University of Münster, Schlossplatz 5, D-48149 Münster [2] Helmholtz Centre for Infection Research, Inhoffenstrasse 7, D-38124 Braunschweig
| | - Manfred Koegl
- Genomics and Proteomics Core Facilities, German Cancer Research Center, Im Neuenheimer Feld 580, 69120 Heidelberg, Germany
| |
Collapse
|
41
|
Butler CL, Lucas O, Wuchty S, Xue B, Uversky VN, White M. Identifying novel cell cycle proteins in Apicomplexa parasites through co-expression decision analysis. PLoS One 2014; 9:e97625. [PMID: 24841368 PMCID: PMC4026381 DOI: 10.1371/journal.pone.0097625] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2013] [Accepted: 04/22/2014] [Indexed: 11/26/2022] Open
Abstract
Hypothetical proteins comprise roughly half of the predicted gene complement of Toxoplasma gondii and Plasmodium falciparum and represent the largest class of uniquely functioning proteins in these parasites. Following the idea that functional relationships can be informed by the timing of gene expression, we devised a strategy to identify the core set of apicomplexan cell division cycling genes with important roles in parasite division, which includes many uncharacterized proteins. We assembled an expanded list of orthologs from the T. gondii and P. falciparum genome sequences (2781 putative orthologs), compared their mRNA profiles during synchronous replication, and sorted the resulting set of dual cell cycle regulated orthologs (744 total) into protein pairs conserved across many eukaryotic families versus those unique to the Apicomplexa. The analysis identified more than 100 ortholog gene pairs with unknown function in T. gondii and P. falciparum that displayed co-conserved mRNA abundance, dynamics of cyclical expression and similar peak timing that spanned the complete division cycle in each parasite. The unknown cyclical mRNAs encoded a diverse set of proteins with a wide range of mass and showed a remarkable conservation in the internal organization of ordered versus disordered structural domains. A representative sample of cyclical unknown genes (16 total) was epitope tagged in T. gondii tachyzoites yielding the discovery of new protein constituents of the parasite inner membrane complex, key mitotic structures and invasion organelles. These results demonstrate the utility of using gene expression timing and dynamic profile to identify proteins with unique roles in Apicomplexa biology.
Collapse
Affiliation(s)
- Carrie L. Butler
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, Florida, United States of America
| | - Olivier Lucas
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, Florida, United States of America
| | - Stefan Wuchty
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Bin Xue
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, Florida, United States of America
| | - Vladimir N. Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, Florida, United States of America
| | - Michael White
- Department of Global Health, College of Public Health, University of South Florida, Tampa, Florida, United States of America
- Florida Center for Drug Discovery and Innovation, University of South Florida, Tampa, Florida, United States of America
- * E-mail:
| |
Collapse
|
42
|
Häuser R, Ceol A, Rajagopala SV, Mosca R, Siszler G, Wermke N, Sikorski P, Schwarz F, Schick M, Wuchty S, Aloy P, Uetz P. A second-generation protein-protein interaction network of Helicobacter pylori. Mol Cell Proteomics 2014; 13:1318-29. [PMID: 24627523 DOI: 10.1074/mcp.o113.033571] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Helicobacter pylori infections cause gastric ulcers and play a major role in the development of gastric cancer. In 2001, the first protein interactome was published for this species, revealing over 1500 binary protein interactions resulting from 261 yeast two-hybrid screens. Here we roughly double the number of previously published interactions using an ORFeome-based, proteome-wide yeast two-hybrid screening strategy. We identified a total of 1515 protein-protein interactions, of which 1461 are new. The integration of all the interactions reported in H. pylori results in 3004 unique interactions that connect about 70% of its proteome. Excluding interactions of promiscuous proteins we derived from our new data a core network consisting of 908 interactions. We compared our data set to several other bacterial interactomes and experimentally benchmarked the conservation of interactions using 365 protein pairs (interologs) of E. coli of which one third turned out to be conserved in both species.
Collapse
Affiliation(s)
- Roman Häuser
- German Cancer Research Center (Deutsches Krebsforschungszentrum), Technologiepark 3, Im Neuenheimer Feld 580, 69120 Heidelberg, Germany
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Vogel AL, Hall KL, Fiore SM, Klein JT, Bennett LM, Gadlin H, Stokols D, Nebeling LC, Wuchty S, Patrick K, Spotts EL, Pohl C, Riley WT, Falk-Krzesinski HJ. The Team Science Toolkit: enhancing research collaboration through online knowledge sharing. Am J Prev Med 2013; 45:787-9. [PMID: 24237924 DOI: 10.1016/j.amepre.2013.09.001] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/03/2013] [Revised: 09/03/2013] [Accepted: 09/03/2013] [Indexed: 11/19/2022]
Affiliation(s)
- Amanda L Vogel
- Clinical Research Directorate/CMRP Leidos Biomedical Research, Inc. (formerly SAIC-Frederick, Inc.), Frederick National Laboratory for Cancer Research, Frederick, Maryland.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Blasche S, Wuchty S, Rajagopala SV, Uetz P. The protein interaction network of bacteriophage lambda with its host, Escherichia coli. J Virol 2013; 87:12745-55. [PMID: 24049175 PMCID: PMC3838138 DOI: 10.1128/jvi.02495-13] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2013] [Accepted: 09/10/2013] [Indexed: 11/20/2022] Open
Abstract
Although most of the 73 open reading frames (ORFs) in bacteriophage λ have been investigated intensively, the function of many genes in host-phage interactions remains poorly understood. Using yeast two-hybrid screens of all lambda ORFs for interactions with its host Escherichia coli, we determined a raw data set of 631 host-phage interactions resulting in a set of 62 high-confidence interactions after multiple rounds of retesting. These links suggest novel regulatory interactions between the E. coli transcriptional network and lambda proteins. Targeted host proteins and genes required for lambda infection are enriched among highly connected proteins, suggesting that bacteriophages resemble interaction patterns of human viruses. Lambda tail proteins interact with both bacterial fimbrial proteins and E. coli proteins homologous to other phage proteins. Lambda appears to dramatically differ from other phages, such as T7, because of its unusually large number of modified and processed proteins, which reduces the number of host-virus interactions detectable by yeast two-hybrid screens.
Collapse
Affiliation(s)
- Sonja Blasche
- Genomics and Proteomics Core Facilities, German Cancer Research Center, Heidelberg, Germany
| | - Stefan Wuchty
- National Center of Biotechnology Information, National Institutes of Health, Bethesda, Maryland, USA
| | | | - Peter Uetz
- Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, Virginia, USA
| |
Collapse
|
45
|
Abstract
Although the identification of protein interactions by high-throughput methods progresses at a fast pace, "interactome" datasets still suffer from high rates of false positives and low coverage. To map the interactome of any organism, this unit presents a computational framework to predict protein-protein or gene-gene interactions utilizing experimentally determined evidence of structural complexes, atomic details of binding interfaces and evolutionary conservation.
Collapse
Affiliation(s)
- Benjamin Shoemaker
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland
| | - Stefan Wuchty
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland
| | - Anna R Panchenko
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland
| |
Collapse
|
46
|
Nishi H, Tyagi M, Teng S, Shoemaker BA, Hashimoto K, Alexov E, Wuchty S, Panchenko AR. Cancer missense mutations alter binding properties of proteins and their interaction networks. PLoS One 2013; 8:e66273. [PMID: 23799087 PMCID: PMC3682950 DOI: 10.1371/journal.pone.0066273] [Citation(s) in RCA: 81] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2012] [Accepted: 05/02/2013] [Indexed: 11/18/2022] Open
Abstract
Many studies have shown that missense mutations might play an important role in carcinogenesis. However, the extent to which cancer mutations might affect biomolecular interactions remains unclear. Here, we map glioblastoma missense mutations on the human protein interactome, model the structures of affected protein complexes and decipher the effect of mutations on protein-protein, protein-nucleic acid and protein-ion binding interfaces. Although some missense mutations over-stabilize protein complexes, we found that the overall effect of mutations is destabilizing, mostly affecting the electrostatic component of binding energy. We also showed that mutations on interfaces resulted in more drastic changes of amino acid physico-chemical properties than mutations occurring outside the interfaces. Analysis of glioblastoma mutations on interfaces allowed us to stratify cancer-related interactions, identify potential driver genes, and propose two dozen additional cancer biomarkers, including those specific to functions of the nervous system. Such an analysis also offered insight into the molecular mechanism of the phenotypic outcomes of mutations, including effects on complex stability, activity, binding and turnover rate. As a result of mutated protein and gene network analysis, we observed that interactions of proteins with mutations mapped on interfaces had higher bottleneck properties compared to interactions with mutations elsewhere on the protein or unaffected interactions. Such observations suggest that genes with mutations directly affecting protein binding properties are preferably located in central network positions and may influence critical nodes and edges in signal transduction networks.
Collapse
Affiliation(s)
- Hafumi Nishi
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Manoj Tyagi
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - Shaolei Teng
- Computational Biophysics and Bioinformatics, Department of Physics, Clemson University, Clemson, South Carolina, United States of America
| | - Benjamin A. Shoemaker
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | | | - Emil Alexov
- Computational Biophysics and Bioinformatics, Department of Physics, Clemson University, Clemson, South Carolina, United States of America
| | - Stefan Wuchty
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Anna R. Panchenko
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
- * E-mail:
| |
Collapse
|
47
|
Abstract
The determination of expression quantitative trait loci (eQTL) epistasis – a form of functional interaction between genetic loci that affect gene expression – is an important step toward the thorough understanding of gene regulation. Since gene expression has emerged as an “intermediate” molecular phenotype eQTL epistasis might help to explain the relationship between genotype and higher level organismal phenotypes such as diseases. A characteristic feature of eQTL analysis is the big number of tests required to identify associations between gene expression and genetic loci variability. This problem is aggravated, when epistatic effects between eQTLs are analyzed. In this review, we discuss recent algorithmic approaches for the detection of eQTL epistasis and highlight lessons that can be learned from current methods.
Collapse
Affiliation(s)
- Yang Huang
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health Bethesda, MD, USA
| | | | | |
Collapse
|
48
|
Wuchty S, Vazquez A, Bozdag S, Bauer PO. Genome-wide associations of signaling pathways in glioblastoma multiforme. BMC Med Genomics 2013; 6:11. [PMID: 23537212 PMCID: PMC3616958 DOI: 10.1186/1755-8794-6-11] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2012] [Accepted: 03/12/2013] [Indexed: 01/02/2023] Open
Abstract
BACKGROUND eQTL analysis is a powerful method that allows the identification of causal genomic alterations, providing an explanation of expression changes of single genes. However, genes mediate their biological roles in groups rather than in isolation, prompting us to extend the concept of eQTLs to whole gene pathways. METHODS We combined matched genomic alteration and gene expression data of glioblastoma patients and determined associations between the expression of signaling pathways and genomic copy number alterations with a non-linear machine learning approach. RESULTS Expectedly, over-expressed pathways were largely associated to tag-loci on chromosomes with signature alterations. Surprisingly, tag-loci that were associated to under-expressed pathways were largely placed on other chromosomes, an observation that held for composite effects between chromosomes as well. Indicating their biological relevance, identified genomic regions were highly enriched with genes having a reported driving role in gliomas. Furthermore, we found pathways that were significantly enriched with such driver genes. CONCLUSIONS Driver genes and their associated pathways may represent a functional core that drive the tumor emergence and govern the signaling apparatus in GBMs. In addition, such associations may be indicative of drug combinations for the treatment of brain tumors that follow similar patterns of common and diverging alterations.
Collapse
Affiliation(s)
- Stefan Wuchty
- National Center of Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | | | | | |
Collapse
|
49
|
Abstract
We computationally determined miRs that are significantly connected to molecular pathways by utilizing gene expression profiles in different cancer types such as glioblastomas, ovarian and breast cancers. Specifically, we assumed that the knowledge of physical interactions between miRs and genes indicated subsets of important miRs (IM) that significantly contributed to the regression of pathway-specific enrichment scores. Despite the different nature of the considered cancer types, we found strongly overlapping sets of IMs. Furthermore, IMs that were important for many pathways were enriched with literature-curated cancer and differentially expressed miRs. Such sets of IMs also coincided well with clusters of miRs that were experimentally indicated in numerous other cancer types. In particular, we focused on an overlapping set of 99 overall important miRs (OIM) that were found in glioblastomas, ovarian and breast cancers simultaneously. Notably, we observed that interactions between OIMs and leading edge genes of differentially expressed pathways were characterized by considerable changes in their expression correlations. Such gains/losses of miR and gene expression correlation indicated miR/gene pairs that may play a causal role in the underlying cancers. We assume that a network of physical interactions between miRs and genes allows us to determine miRs that influence the expression of whole pathways in different tumor types. Specifically, we represented each pathway by an enrichment score and an array of miRs counting the number of genes in the pathway a given miR can bind. Despite the different nature of the considered tumor types, we obtained a large set of overlapping miRs using a machine-learning algorithm. Such associated miRs were enriched with literature-curated cancer and differentially expressed miRs and also coincided well with clusters of miRs that were experimentally indicated in numerous other cancer types. Focusing on such sets of miRs we observed that interactions with genes in differentially expressed pathways were characterized by massive gains/losses of expression correlations. Such drastic changes of miR and gene expression correlation indicate miR/gene pairs that may play a causal role in the underlying cancers.
Collapse
Affiliation(s)
- Stefan Wuchty
- National Center of Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America.
| | | | | |
Collapse
|
50
|
Abstract
Collecting representative sets of cancer microRNAs (miRs) from the literature we show that their corresponding families are enriched in sets of highly interacting miR families. Targeting cancer genes on a statistically significant level, such cancer miR families strongly intervene with signaling pathways that harbor numerous cancer genes. Clustering miR family-specific profiles of pathway intervention, we found that different miR families share similar interaction patterns. Resembling corresponding patterns of cancer miRs families, such interaction patterns may indicate a miR family’s potential role in cancer. As we find that the number of targeted cancer genes is a naïve proxy for a cancer miR family, we design a simple method to predict candidate miR families based on gene-specific interaction profiles. Assessing the impact of miR families to distinguish between (non-)cancer genes, we predict a set of 84 potential candidate families, including 75% of initially collected cancer miR families. Further confirming their relevance, predicted cancer miR families are significantly indicated in increasing, non-random numbers of tumor types.
Collapse
Affiliation(s)
- Stefan Wuchty
- National Center of Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | | | | | |
Collapse
|