1
|
Baryshev A, La Fleur A, Groves B, Michel C, Baker D, Ljubetič A, Seelig G. Massively parallel measurement of protein-protein interactions by sequencing using MP3-seq. Nat Chem Biol 2024; 20:1514-1523. [PMID: 39192093 PMCID: PMC11511666 DOI: 10.1038/s41589-024-01718-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Accepted: 08/01/2024] [Indexed: 08/29/2024]
Abstract
Protein-protein interactions (PPIs) regulate many cellular processes and engineered PPIs have cell and gene therapy applications. Here, we introduce massively parallel PPI measurement by sequencing (MP3-seq), an easy-to-use and highly scalable yeast two-hybrid approach for measuring PPIs. In MP3-seq, DNA barcodes are associated with specific protein pairs and barcode enrichment can be read by sequencing to provide a direct measure of interaction strength. We show that MP3-seq is highly quantitative and scales to over 100,000 interactions. We apply MP3-seq to characterize interactions between families of rationally designed heterodimers and to investigate elements conferring specificity to coiled-coil interactions. Lastly, we predict coiled heterodimer structures using AlphaFold-Multimer (AF-M) and train linear models on physics-based energy terms to predict MP3-seq values. We find that AF-M-based models could be valuable for prescreening interactions but experimentally measuring interactions remains necessary to rank their strengths quantitatively.
Collapse
Affiliation(s)
- Alexandr Baryshev
- Department of Electrical & Computer Engineering, University of Washington, Seattle, WA, USA
| | - Alyssa La Fleur
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA
| | - Benjamin Groves
- Department of Electrical & Computer Engineering, University of Washington, Seattle, WA, USA
| | - Cirstyn Michel
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - David Baker
- Department of Bioengineering, University of Washington, Seattle, WA, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Ajasja Ljubetič
- Department of Biochemistry, University of Washington, Seattle, WA, USA.
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
- Department for Synthetic Biology and Immunology, National Institute of Chemistry, Ljubljana, Slovenia.
| | - Georg Seelig
- Department of Electrical & Computer Engineering, University of Washington, Seattle, WA, USA.
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA.
| |
Collapse
|
2
|
Llano-Suárez P, Sánchez-Visedo A, Ortiz-Gómez I, Fernández-Argüelles MT, Prado M, Costa-Fernández JM, Soldado A. Sesame Detection in Food Using DNA-Functionalized Gold Nanoparticles: A Sensitive, Rapid, and Cost-Effective Colorimetric Approach. BIOSENSORS 2024; 14:377. [PMID: 39194606 DOI: 10.3390/bios14080377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/17/2024] [Revised: 07/31/2024] [Accepted: 08/01/2024] [Indexed: 08/29/2024]
Abstract
Food safety control is a key issue in the food and agriculture industries. For such purposes, developing miniaturized analytical methods is critical for enabling the rapid and sensitive detection of food supplements, allergens, and pollutants. Here, a novel bioanalytical methodology based on DNA-functionalized gold nanoparticles (AuNPs) and colorimetric detection was developed to detect the presence of sesame (a major allergen) through sesame seed DNA as a target, in food samples. The presence of sesame DNA induces controlled nanoparticle aggregation/desegregation, resulting in a color change (from blue to red) proportional to sesame DNA concentration. The incorporation of multicomponent nucleic acid enzymes (MNAzymes) in this strategy has been carried out to perform an isothermal signal amplification strategy to improve the sensitivity of detection. Also, open-source software for color analysis was used to ensure an unbiased visual color-change detection, enhancing detection accuracy and sensitivity and opening the possibility of performing a simple and decentralized analyte detection. The method successfully detected the presence of sesame DNA in sesame seed, sesame oil, olive oil, and sunflower oil. In brief, the developed approach constitutes a simple and affordable alternative to perform a highly sensitive detection of DNA in food without complex methodologies or the requirement of expensive instrumentation.
Collapse
Affiliation(s)
- Pablo Llano-Suárez
- Department of Physical and Analytical Chemistry, University of Oviedo, c/Julián Clavería, 8, 33006 Oviedo, Spain
| | - Adrián Sánchez-Visedo
- Department of Physical and Analytical Chemistry, University of Oviedo, c/Julián Clavería, 8, 33006 Oviedo, Spain
| | - Inmaculada Ortiz-Gómez
- Department of Physical and Analytical Chemistry, University of Oviedo, c/Julián Clavería, 8, 33006 Oviedo, Spain
| | | | - Marta Prado
- International Iberian Nanotechnology Laboratory, Av. Mestre José Veiga Sthis n, 4715-330 Braga, Portugal
| | - José Manuel Costa-Fernández
- Department of Physical and Analytical Chemistry, University of Oviedo, c/Julián Clavería, 8, 33006 Oviedo, Spain
| | - Ana Soldado
- Department of Physical and Analytical Chemistry, University of Oviedo, c/Julián Clavería, 8, 33006 Oviedo, Spain
| |
Collapse
|
3
|
Tan Y, Li M, Zhou Z, Tan P, Yu H, Fan G, Hong L. PETA: evaluating the impact of protein transfer learning with sub-word tokenization on downstream applications. J Cheminform 2024; 16:92. [PMID: 39095917 PMCID: PMC11297785 DOI: 10.1186/s13321-024-00884-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Accepted: 07/13/2024] [Indexed: 08/04/2024] Open
Abstract
Protein language models (PLMs) play a dominant role in protein representation learning. Most existing PLMs regard proteins as sequences of 20 natural amino acids. The problem with this representation method is that it simply divides the protein sequence into sequences of individual amino acids, ignoring the fact that certain residues often occur together. Therefore, it is inappropriate to view amino acids as isolated tokens. Instead, the PLMs should recognize the frequently occurring combinations of amino acids as a single token. In this study, we use the byte-pair-encoding algorithm and unigram to construct advanced residue vocabularies for protein sequence tokenization, and we have shown that PLMs pre-trained using these advanced vocabularies exhibit superior performance on downstream tasks when compared to those trained with simple vocabularies. Furthermore, we introduce PETA, a comprehensive benchmark for systematically evaluating PLMs. We find that vocabularies comprising 50 and 200 elements achieve optimal performance. Our code, model weights, and datasets are available at https://github.com/ginnm/ProteinPretraining . SCIENTIFIC CONTRIBUTION: This study introduces advanced protein sequence tokenization analysis, leveraging the byte-pair-encoding algorithm and unigram. By recognizing frequently occurring combinations of amino acids as single tokens, our proposed method enhances the performance of PLMs on downstream tasks. Additionally, we present PETA, a new comprehensive benchmark for the systematic evaluation of PLMs, demonstrating that vocabularies of 50 and 200 elements offer optimal performance.
Collapse
Affiliation(s)
- Yang Tan
- School of Information Science and Engineering, East China University of Science and Technology, Shanghai, 200237, China
- Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Science, Shanghai Jiao Tong University, Shanghai, 200240, China
- Shanghai Artificial Intelligence Laboratory, Shanghai, 200240, China
- Chongqing Artificial Intelligence Research Institute of Shanghai Jiao Tong University, Chongqing, 200240, China
| | - Mingchen Li
- School of Information Science and Engineering, East China University of Science and Technology, Shanghai, 200237, China
- Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Science, Shanghai Jiao Tong University, Shanghai, 200240, China
- Shanghai Artificial Intelligence Laboratory, Shanghai, 200240, China
- Chongqing Artificial Intelligence Research Institute of Shanghai Jiao Tong University, Chongqing, 200240, China
| | - Ziyi Zhou
- Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Science, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Pan Tan
- Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Science, Shanghai Jiao Tong University, Shanghai, 200240, China
- Shanghai Artificial Intelligence Laboratory, Shanghai, 200240, China
| | - Huiqun Yu
- School of Information Science and Engineering, East China University of Science and Technology, Shanghai, 200237, China.
| | - Guisheng Fan
- School of Information Science and Engineering, East China University of Science and Technology, Shanghai, 200237, China.
| | - Liang Hong
- Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Science, Shanghai Jiao Tong University, Shanghai, 200240, China.
- Shanghai Artificial Intelligence Laboratory, Shanghai, 200240, China.
- Chongqing Artificial Intelligence Research Institute of Shanghai Jiao Tong University, Chongqing, 200240, China.
| |
Collapse
|
4
|
Acs-Szabo L, Papp LA, Miklos I. Understanding the molecular mechanisms of human diseases: the benefits of fission yeasts. MICROBIAL CELL (GRAZ, AUSTRIA) 2024; 11:288-311. [PMID: 39104724 PMCID: PMC11299203 DOI: 10.15698/mic2024.08.833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 07/04/2024] [Accepted: 07/10/2024] [Indexed: 08/07/2024]
Abstract
The role of model organisms such as yeasts in life science research is crucial. Although the baker's yeast (Saccharomyces cerevisiae) is the most popular model among yeasts, the contribution of the fission yeasts (Schizosaccharomyces) to life science is also indisputable. Since both types of yeasts share several thousands of common orthologous genes with humans, they provide a simple research platform to investigate many fundamental molecular mechanisms and functions, thereby contributing to the understanding of the background of human diseases. In this review, we would like to highlight the many advantages of fission yeasts over budding yeasts. The usefulness of fission yeasts in virus research is shown as an example, presenting the most important research results related to the Human Immunodeficiency Virus Type 1 (HIV-1) Vpr protein. Besides, the potential role of fission yeasts in the study of prion biology is also discussed. Furthermore, we are keen to promote the uprising model yeast Schizosaccharomyces japonicus, which is a dimorphic species in the fission yeast genus. We propose the hyphal growth of S. japonicus as an unusual opportunity as a model to study the invadopodia of human cancer cells since the two seemingly different cell types can be compared along fundamental features. Here we also collect the latest laboratory protocols and bioinformatics tools for the fission yeasts to highlight the many possibilities available to the research community. In addition, we present several limiting factors that everyone should be aware of when working with yeast models.
Collapse
Affiliation(s)
- Lajos Acs-Szabo
- Department of Genetics and Applied Microbiology, Faculty of Science and Technology, University of DebrecenDebrecen, 4032Hungary
| | - Laszlo Attila Papp
- Department of Genetics and Applied Microbiology, Faculty of Science and Technology, University of DebrecenDebrecen, 4032Hungary
| | - Ida Miklos
- Department of Genetics and Applied Microbiology, Faculty of Science and Technology, University of DebrecenDebrecen, 4032Hungary
| |
Collapse
|
5
|
Shen Y, Duan H, Yuan L, Asikaer A, Liu Y, Zhang R, Liu Y, Wang Y, Lin Z. Computational biology-based study of the molecular mechanism of spermidine amelioration of acute pancreatitis. Mol Divers 2024; 28:2583-2601. [PMID: 37523101 DOI: 10.1007/s11030-023-10698-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2023] [Accepted: 07/10/2023] [Indexed: 08/01/2023]
Abstract
Acute pancreatitis (AP) is an acute inflammatory gastrointestinal disease, the mortality and morbility of which has been on the increase in the past years. Spermidine, a natural polyamine, has a wide range of pharmacological effects including anti-inflammation, antioxidation, anti-aging, and anti-tumorigenic. This study aimed to investigate the reliable targets and molecular mechanisms of spermidine in treating AP. By employing computational biology methods including network pharmacology, molecular docking, and molecular dynamics (MD) simulations, we explored the potential targets of spermidine in improving AP with dietary supplementation. The computational biology results revealed that spermidine had high degrees (degree: 18, betweenness: 38.91; degree: 18, betweenness: 206.41) and stable binding free energy (ΔGbind: - 12.81 ± 0.55 kcal/mol, - 15.00 ± 1.00 kcal/mol) with acetylcholinesterase (AchE) and serotonin transporter (5-HTT). Experimental validation demonstrates that spermidine treatment could reduce the necrosis and AchE activity in pancreatic acinar cells. Cellular thermal shift assay (CETSA) results revealed that spermidine could bind to and stabilize the 5-HTT protein in acinar cells. Moreover, spermidine treatment impeded the rise of the expression of 5-HTT in pancreatic tissues of caerulein induced acute pancreatitis mice. In conclusion, serotonin transporter might be a reliable target of spermidine in treating AP. This study provides new idea for the exploration of potential targets of natural compounds.
Collapse
Affiliation(s)
- Yan Shen
- Department of Pharmacy and Bioengineering, Chongqing University of Technology, Chongqing, 405400, China
- Chongqing Key Laboratory of Medicinal Chemistry and Molecular Pharmacology, Department of Pharmacy and Bioengineering, Chongqing University of Technology, Chongqing, 405400, People's Republic of China
| | - Hongtao Duan
- Department of Pharmacy and Bioengineering, Chongqing University of Technology, Chongqing, 405400, China
| | - Lu Yuan
- Department of Pharmacy and Bioengineering, Chongqing University of Technology, Chongqing, 405400, China
| | - Aiminuer Asikaer
- Department of Pharmacy and Bioengineering, Chongqing University of Technology, Chongqing, 405400, China
| | - Yiyuan Liu
- Department of Pharmacy and Bioengineering, Chongqing University of Technology, Chongqing, 405400, China
| | - Rui Zhang
- Department of Pharmacy, Guizhou Provincial People's Hospital, Guiyang, 550002, China
| | - Yang Liu
- Department of Hepatobiliary Surgery II, Guizhou Provincial People's Hospital, Guiyang, 550002, China
| | - Yuanqiang Wang
- Department of Pharmacy and Bioengineering, Chongqing University of Technology, Chongqing, 405400, China
- Chongqing Key Laboratory of Medicinal Chemistry and Molecular Pharmacology, Department of Pharmacy and Bioengineering, Chongqing University of Technology, Chongqing, 405400, People's Republic of China
| | - Zhihua Lin
- Department of Pharmacy and Bioengineering, Chongqing University of Technology, Chongqing, 405400, China.
- Chongqing Key Laboratory of Medicinal Chemistry and Molecular Pharmacology, Department of Pharmacy and Bioengineering, Chongqing University of Technology, Chongqing, 405400, People's Republic of China.
| |
Collapse
|
6
|
Han S, Lee JE, Kang S, So M, Jin H, Lee JH, Baek S, Jun H, Kim TY, Lee YS. Standigm ASK™: knowledge graph and artificial intelligence platform applied to target discovery in idiopathic pulmonary fibrosis. Brief Bioinform 2024; 25:bbae035. [PMID: 38349059 PMCID: PMC10862655 DOI: 10.1093/bib/bbae035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 12/28/2023] [Indexed: 02/15/2024] Open
Abstract
Standigm ASK™ revolutionizes healthcare by addressing the critical challenge of identifying pivotal target genes in disease mechanisms-a fundamental aspect of drug development success. Standigm ASK™ integrates a unique combination of a heterogeneous knowledge graph (KG) database and an attention-based neural network model, providing interpretable subgraph evidence. Empowering users through an interactive interface, Standigm ASK™ facilitates the exploration of predicted results. Applying Standigm ASK™ to idiopathic pulmonary fibrosis (IPF), a complex lung disease, we focused on genes (AMFR, MDFIC and NR5A2) identified through KG evidence. In vitro experiments demonstrated their relevance, as TGFβ treatment induced gene expression changes associated with epithelial-mesenchymal transition characteristics. Gene knockdown reversed these changes, identifying AMFR, MDFIC and NR5A2 as potential therapeutic targets for IPF. In summary, Standigm ASK™ emerges as an innovative KG and artificial intelligence platform driving insights in drug target discovery, exemplified by the identification and validation of therapeutic targets for IPF.
Collapse
Affiliation(s)
- Seokjin Han
- Standigm Inc., Nonhyeon-ro 85-gil, 06234, Seoul, Republic of Korea
| | - Ji Eun Lee
- College of Pharmacy, Ewha Womans University, Ewhayeodae-gil, 03760, Seoul, Republic of Korea
| | - Seolhee Kang
- Standigm Inc., Nonhyeon-ro 85-gil, 06234, Seoul, Republic of Korea
| | - Minyoung So
- Standigm Inc., Nonhyeon-ro 85-gil, 06234, Seoul, Republic of Korea
| | - Hee Jin
- College of Pharmacy, Ewha Womans University, Ewhayeodae-gil, 03760, Seoul, Republic of Korea
| | - Jang Ho Lee
- Standigm Inc., Nonhyeon-ro 85-gil, 06234, Seoul, Republic of Korea
| | - Sunghyeob Baek
- Standigm Inc., Nonhyeon-ro 85-gil, 06234, Seoul, Republic of Korea
| | - Hyungjin Jun
- Standigm Inc., Nonhyeon-ro 85-gil, 06234, Seoul, Republic of Korea
| | - Tae Yong Kim
- Standigm Inc., Nonhyeon-ro 85-gil, 06234, Seoul, Republic of Korea
| | - Yun-Sil Lee
- College of Pharmacy, Ewha Womans University, Ewhayeodae-gil, 03760, Seoul, Republic of Korea
| |
Collapse
|
7
|
Wu MY, Li ZW, Lu JH. Molecular Modulators and Receptors of Selective Autophagy: Disease Implication and Identification Strategies. Int J Biol Sci 2024; 20:751-764. [PMID: 38169614 PMCID: PMC10758101 DOI: 10.7150/ijbs.83205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Accepted: 08/31/2023] [Indexed: 01/05/2024] Open
Abstract
Autophagy is a highly conserved physiological process that maintains cellular homeostasis by recycling cellular contents. Selective autophagy is based on the specificity of cargo recognition and has been implicated in various human diseases, including neurodegenerative diseases and cancer. Selective autophagy receptors and modulators play key roles in this process. Identifying these receptors and modulators and their roles is critical for understanding the machinery and physiological function of selective autophagy and providing therapeutic value for diseases. Using modern researching tools and novel screening technologies, an increasing number of selective autophagy receptors and modulators have been identified. A variety of Strategies and approaches, including protein-protein interactions (PPIs)-based identification and genome-wide screening, have been used to identify selective autophagy receptors and modulators. Understanding the strengths and challenges of these approaches not only promotes the discovery of even more such receptors and modulators but also provides a useful reference for the identification of regulatory proteins or genes involved in other cellular mechanisms. In this review, we summarize the functions, disease association, and identification strategies of selective autophagy receptors and modulators.
Collapse
Affiliation(s)
| | | | - Jia-Hong Lu
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Macau, China
| |
Collapse
|
8
|
Baryshev A, La Fleur A, Groves B, Michel C, Baker D, Ljubetič A, Seelig G. Massively parallel protein-protein interaction measurement by sequencing (MP3-seq) enables rapid screening of protein heterodimers. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.08.527770. [PMID: 36798377 PMCID: PMC9934699 DOI: 10.1101/2023.02.08.527770] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
Abstract
Protein-protein interactions (PPIs) regulate many cellular processes, and engineered PPIs have cell and gene therapy applications. Here we introduce massively parallel protein-protein interaction measurement by sequencing (MP3-seq), an easy-to-use and highly scalable yeast-two-hybrid approach for measuring PPIs. In MP3-seq, DNA barcodes are associated with specific protein pairs, and barcode enrichment can be read by sequencing to provide a direct measure of interaction strength. We show that MP3-seq is highly quantitative and scales to over 100,000 interactions. We apply MP3-seq to characterize interactions between families of rationally designed heterodimers and to investigate elements conferring specificity to coiled-coil interactions. Finally, we predict coiled heterodimer structures using AlphaFold-Multimer (AF-M) and train linear models on physics simulation energy terms to predict MP3-seq values. We find that AF-M and AF-M complex prediction-based models could be valuable for pre-screening interactions, but that measuring interactions experimentally remains necessary to rank their strengths quantitatively.
Collapse
Affiliation(s)
- Alexander Baryshev
- Department of Electrical & Computer Engineering, University of Washington, Seattle, WA 98195, USA
| | - Alyssa La Fleur
- Paul G. Allen School of Computer Science & Engineering, University of Washington, Seattle, WA 98195, USA
| | - Benjamin Groves
- Department of Electrical & Computer Engineering, University of Washington, Seattle, WA 98195, USA
| | - Cirstyn Michel
- Department of Bioengineering, University of Washington, Seattle, WA 98195, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| | - Ajasja Ljubetič
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
- Department for Synthetic Biology and Immunology, National Institute of Chemistry, Ljubljana SI-1000, Slovenia
| | - Georg Seelig
- Department of Electrical & Computer Engineering, University of Washington, Seattle, WA 98195, USA
- Paul G. Allen School of Computer Science & Engineering, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
9
|
Ding X, Singh P, Schimenti K, Tran TN, Fragoza R, Hardy J, Orwig KE, Olszewska M, Kurpisz MK, Yatsenko AN, Conrad DF, Yu H, Schimenti JC. In vivo versus in silico assessment of potentially pathogenic missense variants in human reproductive genes. Proc Natl Acad Sci U S A 2023; 120:e2219925120. [PMID: 37459509 PMCID: PMC10372637 DOI: 10.1073/pnas.2219925120] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 05/25/2023] [Indexed: 07/20/2023] Open
Abstract
Infertility is a heterogeneous condition, with genetic causes thought to underlie a substantial fraction of cases. Genome sequencing is becoming increasingly important for genetic diagnosis of diseases including idiopathic infertility; however, most rare or minor alleles identified in patients are variants of uncertain significance (VUS). Interpreting the functional impacts of VUS is challenging but profoundly important for clinical management and genetic counseling. To determine the consequences of these variants in key fertility genes, we functionally evaluated 11 missense variants in the genes ANKRD31, BRDT, DMC1, EXO1, FKBP6, MCM9, M1AP, MEI1, MSH4 and SEPT12 by generating genome-edited mouse models. Nine variants were classified as deleterious by most functional prediction algorithms, and two disrupted a protein-protein interaction (PPI) in the yeast two hybrid (Y2H) assay. Though these genes are essential for normal meiosis or spermiogenesis in mice, only one variant, observed in the MCM9 gene of a male infertility patient, compromised fertility or gametogenesis in the mouse models. To explore the disconnect between predictions and outcomes, we compared pathogenicity calls of missense variants made by ten widely used algorithms to 1) those annotated in ClinVar and 2) those evaluated in mice. All the algorithms performed poorly in terms of predicting the effects of human missense variants modeled in mice. These studies emphasize caution in the genetic diagnoses of infertile patients based primarily on pathogenicity prediction algorithms and emphasize the need for alternative and efficient in vitro or in vivo functional validation models for more effective and accurate VUS description to either pathogenic or benign categories.
Collapse
Affiliation(s)
- Xinbao Ding
- College of Veterinary Medicine, Department of Biomedical Sciences, Cornell University, Ithaca, NY14853
| | - Priti Singh
- College of Veterinary Medicine, Department of Biomedical Sciences, Cornell University, Ithaca, NY14853
| | - Kerry Schimenti
- College of Veterinary Medicine, Department of Biomedical Sciences, Cornell University, Ithaca, NY14853
| | - Tina N. Tran
- College of Veterinary Medicine, Department of Biomedical Sciences, Cornell University, Ithaca, NY14853
| | - Robert Fragoza
- Department of Computational Biology, Cornell University, Ithaca, NY14853
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY14853
| | - Jimmaline Hardy
- School of Medicine, Department of Obstetrics, Gynecology, and Reproductive Sciences, Magee-Womens Research Institute, University of Pittsburgh, Pittsburgh, PA15213
| | - Kyle E. Orwig
- School of Medicine, Department of Obstetrics, Gynecology, and Reproductive Sciences, Magee-Womens Research Institute, University of Pittsburgh, Pittsburgh, PA15213
| | - Marta Olszewska
- Institute of Human Genetics, Polish Academy of Sciences, Poznan60-479, Poland
| | - Maciej K. Kurpisz
- Institute of Human Genetics, Polish Academy of Sciences, Poznan60-479, Poland
| | - Alexander N. Yatsenko
- School of Medicine, Department of Obstetrics, Gynecology, and Reproductive Sciences, Magee-Womens Research Institute, University of Pittsburgh, Pittsburgh, PA15213
| | - Donald F. Conrad
- Oregon Health & Science University, Division of Genetics, Oregon National Primate Research Center, Beaverton, OR97006
| | - Haiyuan Yu
- Department of Computational Biology, Cornell University, Ithaca, NY14853
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY14853
| | - John C. Schimenti
- College of Veterinary Medicine, Department of Biomedical Sciences, Cornell University, Ithaca, NY14853
| |
Collapse
|
10
|
Milito A, Aschern M, McQuillan JL, Yang JS. Challenges and advances towards the rational design of microalgal synthetic promoters in Chlamydomonas reinhardtii. JOURNAL OF EXPERIMENTAL BOTANY 2023; 74:3833-3850. [PMID: 37025006 DOI: 10.1093/jxb/erad100] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Accepted: 03/24/2023] [Indexed: 06/19/2023]
Abstract
Microalgae hold enormous potential to provide a safe and sustainable source of high-value compounds, acting as carbon-fixing biofactories that could help to mitigate rapidly progressing climate change. Bioengineering microalgal strains will be key to optimizing and modifying their metabolic outputs, and to render them competitive with established industrial biotechnology hosts, such as bacteria or yeast. To achieve this, precise and tuneable control over transgene expression will be essential, which would require the development and rational design of synthetic promoters as a key strategy. Among green microalgae, Chlamydomonas reinhardtii represents the reference species for bioengineering and synthetic biology; however, the repertoire of functional synthetic promoters for this species, and for microalgae generally, is limited in comparison to other commercial chassis, emphasizing the need to expand the current microalgal gene expression toolbox. Here, we discuss state-of-the-art promoter analyses, and highlight areas of research required to advance synthetic promoter development in C. reinhardtii. In particular, we exemplify high-throughput studies performed in other model systems that could be applicable to microalgae, and propose novel approaches to interrogating algal promoters. We lastly outline the major limitations hindering microalgal promoter development, while providing novel suggestions and perspectives for how to overcome them.
Collapse
Affiliation(s)
- Alfonsina Milito
- Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra, Barcelona, Spain
| | - Moritz Aschern
- Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra, Barcelona, Spain
| | - Josie L McQuillan
- Department of Chemical and Biological Engineering, University of Sheffield, Mappin Street, Sheffield, S1 3JD, UK
| | - Jae-Seong Yang
- Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra, Barcelona, Spain
| |
Collapse
|
11
|
Huang T, Lin KH, Machado-Vieira R, Soares JC, Jiang X, Kim Y. Explainable drug side effect prediction via biologically informed graph neural network. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.05.26.23290615. [PMID: 37333107 PMCID: PMC10275013 DOI: 10.1101/2023.05.26.23290615] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/20/2023]
Abstract
Early detection of potential side effects (SE) is a critical and challenging task for drug discovery and patient care. In-vitro or in-vivo approach to detect potential SEs is not scalable for many drug candidates during the preclinical stage. Recent advances in explainable machine learning may facilitate detecting potential SEs of new drugs before market release and elucidating the critical mechanism of biological actions. Here, we leverage multi-modal interactions among molecules to develop a biologically informed graph-based SE prediction model, called HHAN-DSI. HHAN-DSI predicted frequent and even uncommon SEs of the unseen drug with higher or comparable accuracy against benchmark methods. When applying HHAN-DSI to the central nervous system, the organs with the largest number of SEs, the model revealed diverse psychiatric medications' previously unknown but probable SEs, together with the potential mechanisms of actions through a network of genes, biological functions, drugs, and SEs.
Collapse
Affiliation(s)
- Tongtong Huang
- School of Biomedical Informatics, UTHealth, Houston, TX, United States
| | - Ko-Hong Lin
- School of Biomedical Informatics, UTHealth, Houston, TX, United States
| | - Rodrigo Machado-Vieira
- Department of Psychiatry and Behavioral Sciences, McGovern Medical School, UTHealth, Houston, TX, United States
| | - Jair C Soares
- Department of Psychiatry and Behavioral Sciences, McGovern Medical School, UTHealth, Houston, TX, United States
| | - Xiaoqian Jiang
- School of Biomedical Informatics, UTHealth, Houston, TX, United States
| | - Yejin Kim
- School of Biomedical Informatics, UTHealth, Houston, TX, United States
| |
Collapse
|
12
|
Hao B, Kovács IA. A positive statistical benchmark to assess network agreement. Nat Commun 2023; 14:2988. [PMID: 37225699 DOI: 10.1038/s41467-023-38625-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Accepted: 05/09/2023] [Indexed: 05/26/2023] Open
Abstract
Current computational methods for validating experimental network datasets compare overlap, i.e., shared links, with a reference network using a negative benchmark. However, this fails to quantify the level of agreement between the two networks. To address this, we propose a positive statistical benchmark to determine the maximum possible overlap between networks. Our approach can efficiently generate this benchmark in a maximum entropy framework and provides a way to assess whether the observed overlap is significantly different from the best-case scenario. We introduce a normalized overlap score, Normlap, to enhance comparisons between experimental networks. As an application, we compare molecular and functional networks, resulting in an agreement network of human as well as yeast network datasets. The Normlap score can improve the comparison between experimental networks by providing a computational alternative to network thresholding and validation.
Collapse
Affiliation(s)
- Bingjie Hao
- Department of Physics and Astronomy, Northwestern University, Evanston, IL, 60208, USA
| | - István A Kovács
- Department of Physics and Astronomy, Northwestern University, Evanston, IL, 60208, USA.
- Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL, 60208, USA.
| |
Collapse
|
13
|
Li S, Dohlman HG. Evolutionary conservation of sequence motifs at sites of protein modification. J Biol Chem 2023; 299:104617. [PMID: 36933807 PMCID: PMC10139944 DOI: 10.1016/j.jbc.2023.104617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 02/20/2023] [Accepted: 03/11/2023] [Indexed: 03/18/2023] Open
Abstract
Gene duplications are common in biology and are likely to be an important source of functional diversification and specialization. The yeast Saccharomyces cerevisiae underwent a whole-genome duplication event early in evolution, and a substantial number of duplicated genes have been retained. We identified more than 3500 instances where only one of two paralogous proteins undergoes posttranslational modification despite having retained the same amino acid residue in both. We also developed a web-based search algorithm (CoSMoS.c.) that scores conservation of amino acid sequences based on 1011 wild and domesticated yeast isolates and used it to compare differentially modified pairs of paralogous proteins. We found that the most common modifications-phosphorylation, ubiquitylation, and acylation but not N-glycosylation-occur in regions of high sequence conservation. Such conservation is evident even for ubiquitylation and succinylation, where there is no established 'consensus site' for modification. Differences in phosphorylation were not associated with predicted secondary structure or solvent accessibility but did mirror known differences in kinase-substrate interactions. Thus, differences in posttranslational modification likely result from differences in adjoining amino acids and their interactions with modifying enzymes. By integrating data from large-scale proteomics and genomics analysis, in a system with such substantial genetic diversity, we obtained a more comprehensive understanding of the functional basis for genetic redundancies that have persisted for 100 million years.
Collapse
Affiliation(s)
- Shuang Li
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Henrik G Dohlman
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA.
| |
Collapse
|
14
|
Elmore JM, Velásquez-Zapata V, Wise RP. Next-Generation Yeast Two-Hybrid Screening to Discover Protein-Protein Interactions. Methods Mol Biol 2023; 2690:205-222. [PMID: 37450150 DOI: 10.1007/978-1-0716-3327-4_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/18/2023]
Abstract
Yeast two-hybrid is a powerful approach to discover new protein-protein interactions. Traditional methods involve screening a target protein against a cDNA expression library and assaying individual positive colonies to identify interacting partners. Here we describe a simple approach to perform yeast two-hybrid screens of a cDNA expression library in batch liquid culture. Positive yeast cell populations are enriched under selection and then harvested en masse. Prey cDNAs are amplified and used as input for next-generation sequencing libraries for identification, quantification, and ranking.
Collapse
Affiliation(s)
- J Mitch Elmore
- USDA-Agricultural Research Service, Cereal Disease Laboratory, St. Paul, MN, USA.
- USDA-Agricultural Research Service, Corn Insects and Crop Genetics Research, Ames, IA, USA.
- Department of Plant Pathology, Entomology and Microbiology, Iowa State University, Ames, IA, USA.
| | - Valeria Velásquez-Zapata
- Department of Plant Pathology, Entomology and Microbiology, Iowa State University, Ames, IA, USA
- Program in Bioinformatics & Computational Biology, Iowa State University, Ames, IA, USA
| | - Roger P Wise
- USDA-Agricultural Research Service, Corn Insects and Crop Genetics Research, Ames, IA, USA
- Department of Plant Pathology, Entomology and Microbiology, Iowa State University, Ames, IA, USA
- Program in Bioinformatics & Computational Biology, Iowa State University, Ames, IA, USA
| |
Collapse
|
15
|
Ray S, Lall S, Mukhopadhyay A, Bandyopadhyay S, Schönhuth A. Deep variational graph autoencoders for novel host-directed therapy options against COVID-19. Artif Intell Med 2022; 134:102418. [PMID: 36462892 PMCID: PMC9556806 DOI: 10.1016/j.artmed.2022.102418] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2021] [Revised: 03/22/2022] [Accepted: 10/02/2022] [Indexed: 12/14/2022]
Abstract
The COVID-19 pandemic has been keeping asking urgent questions with respect to therapeutic options. Existing drugs that can be repurposed promise rapid implementation in practice because of their prior approval. Conceivably, there is still room for substantial improvement, because most advanced artificial intelligence techniques for screening drug repositories have not been exploited so far. We construct a comprehensive network by combining year-long curated drug-protein/protein-protein interaction data on the one hand, and most recent SARS-CoV-2 protein interaction data on the other hand. We learn the structure of the resulting encompassing molecular interaction network and predict missing links using variational graph autoencoders (VGAEs), as a most advanced deep learning technique that has not been explored so far. We focus on hitherto unknown links between drugs and human proteins that play key roles in the replication cycle of SARS-CoV-2. Thereby, we establish novel host-directed therapy (HDT) options whose utmost plausibility is confirmed by realistic simulations. As a consequence, many of the predicted links are likely to be crucial for the virus to thrive on the one hand, and can be targeted with existing drugs on the other hand.
Collapse
Affiliation(s)
- Sumanta Ray
- Department of Computer Science and Engineering, Aliah University, New Town, Kolkata, India; Health Analytics Network, PA, USA.
| | - Snehalika Lall
- Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India
| | - Anirban Mukhopadhyay
- Department of Computer Science and Engineering, University of Kalyani, Kalyani, India
| | | | | |
Collapse
|
16
|
He YY, Zhou HF, Chen L, Wang YT, Xie WL, Xu ZZ, Xiong Y, Feng YQ, Liu GY, Li X, Liu J, Wu QP. The Fra-1: Novel role in regulating extensive immune cell states and affecting inflammatory diseases. Front Immunol 2022; 13:954744. [PMID: 36032067 PMCID: PMC9404335 DOI: 10.3389/fimmu.2022.954744] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 07/22/2022] [Indexed: 11/13/2022] Open
Abstract
Fra-1(Fos-related antigen1), a member of transcription factor activator protein (AP-1), plays an important role in cell proliferation, apoptosis, differentiation, inflammation, oncogenesis and tumor metastasis. Accumulating evidence suggest that the malignancy and invasive ability of tumors can be significantly changed by directly targeting Fra-1. Besides, the effects of Fra-1 are gradually revealed in immune and inflammatory settings, such as arthritis, pneumonia, psoriasis and cardiovascular disease. These regulatory mechanisms that orchestrate immune and non-immune cells underlie Fra-1 as a potential therapeutic target for a variety of human diseases. In this review, we focus on the current knowledge of Fra-1 in immune system, highlighting its unique importance in regulating tissue homeostasis. In addition, we also discuss the possible critical intervention strategy in diseases, which also outline future research and development avenues.
Collapse
|
17
|
Li P, Song R, Du Y, Liu H, Li X. Adtrp regulates thermogenic activity of adipose tissue via mediating the secretion of S100b. Cell Mol Life Sci 2022; 79:407. [PMID: 35804197 PMCID: PMC11072551 DOI: 10.1007/s00018-022-04441-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 06/14/2022] [Accepted: 06/19/2022] [Indexed: 11/03/2022]
Abstract
Brown and beige adipose tissues dissipate chemical energy in the form of heat to maintain your body temperature in cold conditions. The impaired function of these tissues results in various metabolic diseases in humans and mice. By bioinformatical analyses, we identified a functional thermogenic regulator of adipose tissue, Androgen-dependent tissue factor pathway inhibitor [TFPI]-regulating protein (Adtrp), which was significantly overexpressed in and functionally activated the mature brown/beige adipocytes. Hereby, we knocked out Adtrp in mice which led to multiple abnormalities in thermogenesis, metabolism, and maturation of brown/beige adipocytes causing excess lipid accumulation in brown adipose tissue (BAT) and cold intolerance. The capability of thermogenesis in brown/beige adipose tissues could be recovered in Adtrp KO mice upon direct β3-adrenergic receptor (β3-AR) stimulation by CL316,243 treatment. Our mechanistic studies revealed that Adtrp by binding to S100 calcium-binding protein b (S100b) indirectly mediated the secretion of S100b, which in turn promoted the β3-AR mediated thermogenesis via sympathetic innervation. These results may provide a novel insight into Adtrp in metabolism via regulating the differentiation and thermogenesis of adipose tissues in mice.
Collapse
Affiliation(s)
- Peng Li
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Runjie Song
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Yaqi Du
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Huijiao Liu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Xiangdong Li
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China.
- Department of Reproduction and Gynecological Endocrinology, Medical University of Bialystok, Białystok, Poland.
- Department of Nutrition and Health, China Agricultural University, Beijing, 100193, China.
| |
Collapse
|
18
|
Li X, Xiang J, Wu FX, Li M. A Dual Ranking Algorithm Based on the Multiplex Network for Heterogeneous Complex Disease Analysis. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:1993-2002. [PMID: 33577455 DOI: 10.1109/tcbb.2021.3059046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Identifying biomarkers of heterogeneous complex diseases has always been one of the focuses in medical research. In previous studies, the powerful network propagation methods have been applied to finding marker genes related to specific diseases, but existing methods are mostly based on a single network, which may be greatly affected by the incompleteness of the network and the ignorance of a large amount of information about physical and functional interactions between biological components. Other methods that directly integrate multiple types of interactions into an aggregate network have the risks that different types of data may conflict with each other and the characteristics and topologies of each individual network are lost. Meanwhile, biomarkers used in clinical trials should have the characteristics of small quantity and strong discriminate ability. In this study, we developed a multiplex network-based dual ranking framework (DualRank) for heterogeneous complex disease analysis. We applied the proposed method to heterogeneous complex diseases for diagnosis, prognosis, and classification. The results showed that DualRank outperformed competing methods and could identify biomarkers with the small quantity, great prediction performance (average AUC = 0.818) and biological interpretability.
Collapse
|
19
|
Wu J, Wu C, Li G. Identifying common driver modules by equilibrating coverage and mutual exclusivity across pan-cancer data. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.04.050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
20
|
Poleksic A. Overcoming Sparseness of Biomedical Networks to Identify Drug Repositioning Candidates. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2377-2384. [PMID: 33591920 DOI: 10.1109/tcbb.2021.3059807] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Modeling complex biological systems is necessary to understand biochemical interactions behind pharmacological effects of drugs. Successful in silico drug repurposing relies on exploration of diverse biochemical concepts and their relationships, including drug's adverse reactions, drug targets, disease symptoms, as well as disease associated genes and their pathways, to name a few. We present a computational method for inferring drug-disease associations from complex but incomplete and biased biological networks. Our method employs matrix completion to overcome the sparseness of biomedical data and to enrich the set of relationships between different biomedical entities. We present a strategy for identifying network paths supportive of drug efficacy as well as a computational procedure capable of combining different network patterns to better distinguish treatments from non-treatments. The algorithms is available at http://bioinfo.cs.uni.edu/AEONET.html.
Collapse
|
21
|
Comprehensive characterization of posttranscriptional impairment-related 3'-UTR mutations in 2413 whole genomes of cancer patients. NPJ Genom Med 2022; 7:34. [PMID: 35654793 PMCID: PMC9163142 DOI: 10.1038/s41525-022-00305-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 05/05/2022] [Indexed: 11/09/2022] Open
Abstract
The 3' untranslated region (3'-UTR) is the vital element regulating gene expression, but most studies have focused on variations in RNA-binding proteins (RBPs), miRNAs, alternative polyadenylation (APA) and RNA modifications. To explore the posttranscriptional function of 3'-UTR somatic mutations in tumorigenesis, we collected whole-genome data from 2413 patients across 18 cancer types. Our updated algorithm, PIVar, revealed 25,216 3'-UTR posttranscriptional impairment-related SNVs (3'-UTR piSNVs) spanning 2930 genes; 24 related RBPs were significantly enriched. The somatic 3'-UTR piSNV ratio was markedly increased across all 18 cancer types, which was associated with worse survival for four cancer types. Several cancer-related genes appeared to facilitate tumorigenesis at the protein and posttranscriptional regulation levels, whereas some 3'-UTR piSNV-affected genes functioned mainly via posttranscriptional mechanisms. Moreover, we assessed immune cell and checkpoint characteristics between the high/low 3'-UTR piSNV ratio groups and predicted 80 compounds associated with the 3'-UTR piSNV-affected gene expression signature. In summary, our study revealed the prevalence and clinical relevance of 3'-UTR piSNVs in cancers, and also demonstrates that in addition to affecting miRNAs, 3'-UTR piSNVs perturb RBPs binding, APA and m6A RNA modification, which emphasized the importance of considering 3'-UTR piSNVs in cancer biology.
Collapse
|
22
|
Evans-Yamamoto D, Rouleau FD, Nanda P, Makanae K, Liu Y, Després P, Matsuo H, Seki M, Dubé AK, Ascencio D, Yachie N, Landry C. OUP accepted manuscript. Nucleic Acids Res 2022; 50:e54. [PMID: 35137167 PMCID: PMC9122585 DOI: 10.1093/nar/gkac045] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Revised: 12/22/2021] [Accepted: 01/14/2022] [Indexed: 11/13/2022] Open
Abstract
Barcode fusion genetics (BFG) utilizes deep sequencing to improve the throughput of protein–protein interaction (PPI) screening in pools. BFG has been implemented in Yeast two-hybrid (Y2H) screens (BFG-Y2H). While Y2H requires test protein pairs to localize in the nucleus for reporter reconstruction, dihydrofolate reductase protein-fragment complementation assay (DHFR-PCA) allows proteins to localize in broader subcellular contexts and proves to be largely orthogonal to Y2H. Here, we implemented BFG to DHFR-PCA (BFG-PCA). This plasmid-based system can leverage ORF collections across model organisms to perform comparative analysis, unlike the original DHFR-PCA that requires yeast genomic integration. The scalability and quality of BFG-PCA were demonstrated by screening human and yeast interactions for >11 000 bait-prey pairs. BFG-PCA showed high-sensitivity and high-specificity for capturing known interactions for both species. BFG-Y2H and BFG-PCA capture distinct sets of PPIs, which can partially be explained based on the domain orientation of the reporter tags. BFG-PCA is a high-throughput protein interaction technology to interrogate binary PPIs that exploits clone collections from any species of interest, expanding the scope of PPI assays.
Collapse
Affiliation(s)
- Daniel Evans-Yamamoto
- Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec, QC, G1V 0A6, Canada
- Synthetic Biology Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
- Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, 252-0882, Japan
- Institute for Advanced Biosciences, Keio University, Fujisawa, 252-0882, Japan
| | - François D Rouleau
- Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec, QC, G1V 0A6, Canada
- Synthetic Biology Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
- Regroupement Québécois de Recherche sur la Fonction, l’Ingénierie et les Applications des Protéines, (PROTEO), Université Laval, Québec, QC, G1V 0A6, Canada
- Département de biochimie, microbiologie et bio-informatique, Université Laval, Québec, QC, G1V 0A6, Canada
| | - Piyush Nanda
- Synthetic Biology Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
| | - Koji Makanae
- Synthetic Biology Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
| | - Yin Liu
- Synthetic Biology Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
| | - Philippe C Després
- Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec, QC, G1V 0A6, Canada
- Synthetic Biology Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
- Regroupement Québécois de Recherche sur la Fonction, l’Ingénierie et les Applications des Protéines, (PROTEO), Université Laval, Québec, QC, G1V 0A6, Canada
- Département de biochimie, microbiologie et bio-informatique, Université Laval, Québec, QC, G1V 0A6, Canada
| | - Hitoshi Matsuo
- Synthetic Biology Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
| | - Motoaki Seki
- Synthetic Biology Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
| | - Alexandre K Dubé
- Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec, QC, G1V 0A6, Canada
- Regroupement Québécois de Recherche sur la Fonction, l’Ingénierie et les Applications des Protéines, (PROTEO), Université Laval, Québec, QC, G1V 0A6, Canada
- Département de biochimie, microbiologie et bio-informatique, Université Laval, Québec, QC, G1V 0A6, Canada
- Département de biologie, Université Laval, Québec, QC, G1V 0A6, Canada
| | - Diana Ascencio
- Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec, QC, G1V 0A6, Canada
- Regroupement Québécois de Recherche sur la Fonction, l’Ingénierie et les Applications des Protéines, (PROTEO), Université Laval, Québec, QC, G1V 0A6, Canada
- Département de biochimie, microbiologie et bio-informatique, Université Laval, Québec, QC, G1V 0A6, Canada
- Département de biologie, Université Laval, Québec, QC, G1V 0A6, Canada
| | - Nozomu Yachie
- Correspondence may also be addressed to Nozomu Yachie. Tel: +1 604 822 9512;
| | - Christian R Landry
- To whom correspondence should be addressed. Tel: +1 418 656 3954; Fax: +1 418 656 7176;
| |
Collapse
|
23
|
Matondo M, Dumas G, Maronde E. Analysis of the Human Pineal Proteome by Mass Spectrometry. Methods Mol Biol 2022; 2550:123-132. [PMID: 36180685 DOI: 10.1007/978-1-0716-2593-4_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
The human pineal gland regulates the day-night dynamics of multiple physiological processes, especially through the secretion of melatonin. Recently, using mass spectrometry-based proteomics and dedicated analysis tools, we have identified regulated proteins and signaling pathways that differ between day and night and/or between control and autistic pineal glands. This large-scale proteomic approach is the method of choice to study proteins in a biological system globally. This chapter proposes a protocol for large-scale analysis of the pineal gland proteome.
Collapse
Affiliation(s)
- Mariette Matondo
- Institut Pasteur, Université de Paris, CNRS USR2000, Proteomics Platform, Mass Spectrometry for Biology Unit, Paris, France
| | - Guillaume Dumas
- Institut Pasteur, UMR 3571 CNRS, University Paris Diderot, Paris France, Human Genetics and Cognitive Functions, Paris, France
- Computational Psychiatry, Department of Psychiatry and Addiction, University of Montreal, Montreal, Canada
| | - Erik Maronde
- Institute for Anatomy II, Faculty of Medicine, Goethe University, Frankfurt, Germany.
| |
Collapse
|
24
|
Cain B, Gebelein B. Mechanisms Underlying Hox-Mediated Transcriptional Outcomes. Front Cell Dev Biol 2021; 9:787339. [PMID: 34869389 PMCID: PMC8635045 DOI: 10.3389/fcell.2021.787339] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 10/28/2021] [Indexed: 11/13/2022] Open
Abstract
Metazoans differentially express multiple Hox transcription factors to specify diverse cell fates along the developing anterior-posterior axis. Two challenges arise when trying to understand how the Hox transcription factors regulate the required target genes for morphogenesis: First, how does each Hox factor differ from one another to accurately activate and repress target genes required for the formation of distinct segment and regional identities? Second, how can a Hox factor that is broadly expressed in many tissues within a segment impact the development of specific organs by regulating target genes in a cell type-specific manner? In this review, we highlight how recent genomic, interactome, and cis-regulatory studies are providing new insights into answering these two questions. Collectively, these studies suggest that Hox factors may differentially modify the chromatin of gene targets as well as utilize numerous interactions with additional co-activators, co-repressors, and sequence-specific transcription factors to achieve accurate segment and cell type-specific transcriptional outcomes.
Collapse
Affiliation(s)
- Brittany Cain
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, United States
| | - Brian Gebelein
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, United States.,Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, United States
| |
Collapse
|
25
|
Gu Y, Li G, Wang P, Guo Y, Li J. A simple and precise method (Y2H-in-frame-seq) improves yeast two-hybrid screening with cDNA libraries. J Genet Genomics 2021; 49:595-598. [PMID: 34864215 DOI: 10.1016/j.jgg.2021.11.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 11/16/2021] [Accepted: 11/22/2021] [Indexed: 10/19/2022]
Affiliation(s)
- Yinghui Gu
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Guannan Li
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Ping Wang
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Yan Guo
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Jingrui Li
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China.
| |
Collapse
|
26
|
Sugiyama MG, Cui H, Redka DS, Karimzadeh M, Rujas E, Maan H, Hayat S, Cheung K, Misra R, McPhee JB, Viirre RD, Haller A, Botelho RJ, Karshafian R, Sabatinos SA, Fairn GD, Madani Tonekaboni SA, Windemuth A, Julien JP, Shahani V, MacKinnon SS, Wang B, Antonescu CN. Multiscale interactome analysis coupled with off-target drug predictions reveals drug repurposing candidates for human coronavirus disease. Sci Rep 2021; 11:23315. [PMID: 34857794 PMCID: PMC8640055 DOI: 10.1038/s41598-021-02432-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 11/03/2021] [Indexed: 12/20/2022] Open
Abstract
The COVID-19 pandemic has highlighted the urgent need for the identification of new antiviral drug therapies for a variety of diseases. COVID-19 is caused by infection with the human coronavirus SARS-CoV-2, while other related human coronaviruses cause diseases ranging from severe respiratory infections to the common cold. We developed a computational approach to identify new antiviral drug targets and repurpose clinically-relevant drug compounds for the treatment of a range of human coronavirus diseases. Our approach is based on graph convolutional networks (GCN) and involves multiscale host-virus interactome analysis coupled to off-target drug predictions. Cell-based experimental assessment reveals several clinically-relevant drug repurposing candidates predicted by the in silico analyses to have antiviral activity against human coronavirus infection. In particular, we identify the MET inhibitor capmatinib as having potent and broad antiviral activity against several coronaviruses in a MET-independent manner, as well as novel roles for host cell proteins such as IRAK1/4 in supporting human coronavirus infection, which can inform further drug discovery studies.
Collapse
Affiliation(s)
- Michael G Sugiyama
- Department of Chemistry and Biology, Ryerson University, Toronto, ON, Canada
| | - Haotian Cui
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Vector Institute, Toronto, ON, Canada
| | | | | | - Edurne Rujas
- Program in Molecular Medicine, The Hospital for Sick Children Research Institute, Toronto, ON, Canada
- Department of Biochemistry, University of Toronto, Toronto, ON, Canada
- Biofisika Institute (CSIC, UPV/EHU) and Department of Biochemistry and Molecular Biology, University of the Basque Country (UPV/EHU), Bilbao, Spain
| | - Hassaan Maan
- Vector Institute, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Peter Munk Cardiac Centre, University Health Centre, Toronto, ON, Canada
| | - Sikander Hayat
- Precision Cardiology Laboratory, Bayer US LLC, Cambridge, MA, USA
- Institute of Experimental Medicine and Systems Biology, RWTH Aachen University, Aachen, Germany
| | - Kyle Cheung
- Department of Chemistry and Biology, Ryerson University, Toronto, ON, Canada
- Graduate Program in Molecular Science, Ryerson University, Toronto, ON, Canada
| | - Rahul Misra
- Department of Chemistry and Biology, Ryerson University, Toronto, ON, Canada
| | - Joseph B McPhee
- Department of Chemistry and Biology, Ryerson University, Toronto, ON, Canada
- Graduate Program in Molecular Science, Ryerson University, Toronto, ON, Canada
| | - Russell D Viirre
- Department of Chemistry and Biology, Ryerson University, Toronto, ON, Canada
- Graduate Program in Molecular Science, Ryerson University, Toronto, ON, Canada
| | - Andrew Haller
- Phoenox Pharma, Toronto, ON, Canada
- Department of Pharmacology and Toxicology, University of Toronto, Toronto, ON, Canada
| | - Roberto J Botelho
- Department of Chemistry and Biology, Ryerson University, Toronto, ON, Canada
- Graduate Program in Molecular Science, Ryerson University, Toronto, ON, Canada
| | - Raffi Karshafian
- Graduate Program in Molecular Science, Ryerson University, Toronto, ON, Canada
- Keenan Research Centre for Biomedical Science, St. Michael's Hospital, Toronto, ON, Canada
- Institute for Biomedical Engineering, Science and Technology (iBEST), a partnership between Ryerson University and St. Michael's Hospital, Toronto, ON, Canada
- Department of Physics, Ryerson University, Toronto, ON, Canada
| | - Sarah A Sabatinos
- Department of Chemistry and Biology, Ryerson University, Toronto, ON, Canada
- Graduate Program in Molecular Science, Ryerson University, Toronto, ON, Canada
| | - Gregory D Fairn
- Department of Biochemistry, University of Toronto, Toronto, ON, Canada
- Keenan Research Centre for Biomedical Science, St. Michael's Hospital, Toronto, ON, Canada
- Department of Surgery, University of Toronto, Toronto, ON, Canada
| | | | | | - Jean-Philippe Julien
- Program in Molecular Medicine, The Hospital for Sick Children Research Institute, Toronto, ON, Canada
- Department of Biochemistry, University of Toronto, Toronto, ON, Canada
- Department of Immunology, Toronto, ON, Canada
| | | | | | - Bo Wang
- Department of Computer Science, University of Toronto, Toronto, ON, Canada.
- Vector Institute, Toronto, ON, Canada.
- Peter Munk Cardiac Centre, University Health Centre, Toronto, ON, Canada.
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada.
| | - Costin N Antonescu
- Department of Chemistry and Biology, Ryerson University, Toronto, ON, Canada.
- Graduate Program in Molecular Science, Ryerson University, Toronto, ON, Canada.
- Keenan Research Centre for Biomedical Science, St. Michael's Hospital, Toronto, ON, Canada.
| |
Collapse
|
27
|
Narykov O, Johnson NT, Korkin D. Predicting protein interaction network perturbation by alternative splicing with semi-supervised learning. Cell Rep 2021; 37:110045. [PMID: 34818539 DOI: 10.1016/j.celrep.2021.110045] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 07/21/2021] [Accepted: 11/02/2021] [Indexed: 10/19/2022] Open
Abstract
Alternative splicing introduces an additional layer of protein diversity and complexity in regulating cellular functions that can be specific to the tissue and cell type, physiological state of a cell, or disease phenotype. Recent high-throughput experimental studies have illuminated the functional role of splicing events through rewiring protein-protein interactions; however, the extent to which the macromolecular interactions are affected by alternative splicing has yet to be fully understood. In silico methods provide a fast and cheap alternative to interrogating functional characteristics of thousands of alternatively spliced isoforms. Here, we develop an accurate feature-based machine learning approach that predicts whether a protein-protein interaction carried out by a reference isoform is perturbed by an alternatively spliced isoform. Our method, called the alternatively spliced interactions prediction (ALT-IN) tool, is compared with the state-of-the-art PPI prediction tools and shows superior performance, achieving 0.92 in precision and recall values.
Collapse
Affiliation(s)
- Oleksandr Narykov
- Department of Computer Science, and Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA
| | - Nathan T Johnson
- Department of Computer Science, and Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA; Harvard Program in Therapeutic Sciences, Harvard Medical School, and Breast Tumor Immunology Laboratory, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Dmitry Korkin
- Department of Computer Science, and Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA.
| |
Collapse
|
28
|
Khan R, Palo A, Dixit M. Role of FRG1 in predicting the overall survivability in cancers using multivariate based optimal model. Sci Rep 2021; 11:22505. [PMID: 34795329 PMCID: PMC8602605 DOI: 10.1038/s41598-021-01665-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Accepted: 11/02/2021] [Indexed: 12/20/2022] Open
Abstract
FRG1 has a role in tumorigenesis and angiogenesis. Our preliminary analysis showed that FRG1 mRNA expression is associated with overall survival (OS) in certain cancers, but the effect varies. In cervix and gastric cancers, we found a clear difference in the OS between the low and high FRG1 mRNA expression groups, but the difference was not prominent in breast, lung, and liver cancers. We hypothesized that FRG1 expression level could affect the functionality of the correlated genes or vice versa, which might mask the effect of a single gene on the OS analysis in cancer patients. We used the multivariate Cox regression, risk score, and Kaplan Meier analyses to determine OS in a multigene model. STRING, Cytoscape, HIPPIE, Gene Ontology, and DAVID (KEGG) were used to deduce FRG1 associated pathways. In breast, lung, and liver cancers, we found a distinct difference in the OS between the low and high FRG1 mRNA expression groups in the multigene model, suggesting an independent role of FRG1 in survival. Risk scores were calculated based upon regression coefficients in the multigene model. Low and high-risk score groups showed a significant difference in the FRG1 mRNA expression level and OS. HPF1, RPL34, and EXOSC9 were the most common genes present in FRG1 associated pathways across the cancer types. Validation of the effect of FRG1 mRNA expression level on these genes by qRT-PCR supports that FRG1 might be an upstream regulator of their expression. These genes may have multiple regulators, which also affect their expression, leading to the masking effect in the survival analysis. In conclusion, our study highlights the role of FRG1 in the survivability of cancer patients in tissue-specific manner and the use of multigene models in prognosis.
Collapse
Affiliation(s)
- Rehan Khan
- grid.419643.d0000 0004 1764 227XSchool of Biological Sciences, National Institute of Science Education and Research, Bhubaneswar, HBNI, P.O. Jatni, Khurda, 752050 Odisha India
| | - Ananya Palo
- grid.419643.d0000 0004 1764 227XSchool of Biological Sciences, National Institute of Science Education and Research, Bhubaneswar, HBNI, P.O. Jatni, Khurda, 752050 Odisha India
| | - Manjusha Dixit
- School of Biological Sciences, National Institute of Science Education and Research, Bhubaneswar, HBNI, P.O. Jatni, Khurda, 752050, Odisha, India. .,School of Biological Sciences, NISER, Room No.- 203, P.O. Jatni, Khurda, Odisha, 752050, India.
| |
Collapse
|
29
|
Kan Y, Jiang L, Guo Y, Tang J, Guo F. Two-stage-vote ensemble framework based on integration of mutation data and gene interaction network for uncovering driver genes. Brief Bioinform 2021; 23:6426028. [PMID: 34791034 DOI: 10.1093/bib/bbab429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2021] [Revised: 08/30/2021] [Accepted: 09/18/2021] [Indexed: 11/14/2022] Open
Abstract
Identifying driver genes, exactly from massive genes with mutations, promotes accurate diagnosis and treatment of cancer. In recent years, a lot of works about uncovering driver genes based on integration of mutation data and gene interaction networks is gaining more attention. However, it is in suspense if it is more effective for prioritizing driver genes when integrating various types of mutation information (frequency and functional impact) and gene networks. Hence, we build a two-stage-vote ensemble framework based on somatic mutations and mutual interactions. Specifically, we first represent and combine various kinds of mutation information, which are propagated through networks by an improved iterative framework. The first vote is conducted on iteration results by voting methods, and the second vote is performed to get ensemble results of the first poll for the final driver gene list. Compared with four excellent previous approaches, our method has better performance in identifying driver genes on $33$ types of cancer from The Cancer Genome Atlas. Meanwhile, we also conduct a comparative analysis about two kinds of mutation information, five gene interaction networks and four voting strategies. Our framework offers a new view for data integration and promotes more latent cancer genes to be admitted.
Collapse
Affiliation(s)
- Yingxin Kan
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Limin Jiang
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin, China.,Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Yan Guo
- Comprehensive cancer center, Department of Internal Medicine, University of New Mexico, Albuquerque, U.S
| | - Jijun Tang
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.,School of Computational Science and Engineering, University of South Carolina, Columbia, U.S
| | - Fei Guo
- School of Computer Science and Engineering, Central South University, Changsha, China
| |
Collapse
|
30
|
Li X, Xiang J, Wang J, Li J, Wu FX, Li M. FUNMarker: Fusion Network-Based Method to Identify Prognostic and Heterogeneous Breast Cancer Biomarkers. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2483-2491. [PMID: 32070993 DOI: 10.1109/tcbb.2020.2973148] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Breast cancer is a heterogeneous disease with many clinically distinguishable molecular subtypes each corresponding to a cluster of patients. Identification of prognostic and heterogeneous biomarkers for breast cancer is to detect cluster-specific gene biomarkers which can be used for accurate survival prediction of breast cancer outcomes. In this study, we proposed a FUsion Network-based method (FUNMarker) to identify prognostic and heterogeneous breast cancer biomarkers by considering the heterogeneity of patient samples and biological information from multiple sources. To reduce the affect of heterogeneity of patients, samples were first clustered using the K-means algorithm based on the principal components of gene expression. For each cluster, to comprehensively evaluate the influence of genes on breast cancer, genes were weighted from three aspects: biological function, prognostic ability and correlation with known disease genes. Then they were ranked via a label propagation model on a fusion network that combined physical protein interactions from seven types of networks and thus could reduce the impact of incompleteness of interactome. We compared FUNMarker with three state-of-the-art methods and the results showed that biomarkers identified by FUNMarker were biological interpretable and had stronger discriminative power than the existing methods in differentiating patients with different prognostic outcomes.
Collapse
|
31
|
Zhang W, Wang SL, Liu Y. Identification of Cancer Driver Modules Based on Graph Clustering from Multiomics Data. J Comput Biol 2021; 28:1007-1020. [PMID: 34529511 DOI: 10.1089/cmb.2021.0052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
A major challenge in cancer genomics is to identify cancer driver genes and modules. Most existing methods to identify cancer driver modules (iCDM) identify groups of genes whose somatic mutational patterns exhibit either mutual exclusivity or high coverage of patient samples, without considering other biological information from multiomics data sets. Here we integrate mutual exclusivity, coverage, and protein-protein interaction information to construct an edge-weighted network, and present a graph clustering approach based on symmetric non-negative matrix factorization to iCDM. iCDM was tested on pan-cancer data and the results were compared with those from several advanced computational methods. Our approach outperformed other methods in recovering known cancer driver modules, and the identified driver modules showed high accuracy in classifying normal and tumor samples.
Collapse
Affiliation(s)
- Wei Zhang
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China.,Hunan Province Key Laboratory of Industrial Internet Technology and Security, Changsha University, Changsha, China
| | - Shu-Lin Wang
- College of Computer Science and Electronics Engineering, Hunan University, Changsha, China
| | - Yue Liu
- College of Computer Science and Electronics Engineering, Hunan University, Changsha, China
| |
Collapse
|
32
|
Wang R, Loscalzo J. Network module-based drug repositioning for pulmonary arterial hypertension. CPT Pharmacometrics Syst Pharmacol 2021; 10:994-1005. [PMID: 34132494 PMCID: PMC8452304 DOI: 10.1002/psp4.12670] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Revised: 04/12/2021] [Accepted: 05/10/2021] [Indexed: 01/05/2023] Open
Abstract
Pulmonary arterial hypertension (PAH) is a progressive disorder characterized by pulmonary vascular remodeling leading to increased pulmonary vascular resistance and pulmonary arterial pressure. PAH is a highly morbid cardiopulmonary disease adversely affecting lifespan and quality of life. Despite increased awareness and advances of medical therapies in recent decades, long-term prognosis and survival remain poor for patients with PAH. Novel therapies that can target the underlying pathobiology of PAH and reverse pulmonary vascular remodeling are clearly needed. In this study, we develop a network module-based framework to examine potential drug repositioning for PAH. The rationale for this approach is that in order to have therapeutic effects, the targets of potential drugs must be significantly proximate to the disease module of interest in the human protein-protein interactome. Based on 15 existing drugs for treating PAH, our framework integrates drug-drug interactions, drug-drug chemical similarity, drug targets, and PAH disease proteins into the human interactome, and prioritizes candidate drugs for PAH. We identified 53 drugs that could potentially be repurposed for PAH. Many of these candidates have strong literature support. Compared to black-box-like machine learning models, network module-based drug repositioning can provide mechanistic insights into how repositioned drugs can target the underlying pathobiological mechanisms of PAH.
Collapse
Affiliation(s)
- Rui‐Sheng Wang
- Department of Medicine, Cardiovascular DivisionBrigham and Women’s HospitalHarvard Medical SchoolBostonMassachusettsUSA
| | - Joseph Loscalzo
- Department of Medicine, Cardiovascular DivisionBrigham and Women’s HospitalHarvard Medical SchoolBostonMassachusettsUSA
| |
Collapse
|
33
|
Fukunaga K, Yokobayashi Y. Directed evolution of orthogonal RNA-RBP pairs through library-vs-library in vitro selection. Nucleic Acids Res 2021; 50:601-616. [PMID: 34219162 PMCID: PMC8789040 DOI: 10.1093/nar/gkab527] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Revised: 06/03/2021] [Accepted: 06/08/2021] [Indexed: 12/30/2022] Open
Abstract
RNA-binding proteins (RBPs) and their RNA ligands play many critical roles in gene regulation and RNA processing in cells. They are also useful for various applications in cell biology and synthetic biology. However, re-engineering novel and orthogonal RNA-RBP pairs from natural components remains challenging while such synthetic RNA-RBP pairs could significantly expand the RNA-RBP toolbox for various applications. Here, we report a novel library-vs-library in vitro selection strategy based on Phage Display coupled with Systematic Evolution of Ligands by EXponential enrichment (PD-SELEX). Starting with pools of 1.1 × 1012 unique RNA sequences and 4.0 × 108 unique phage-displayed L7Ae-scaffold (LS) proteins, we selected RNA-RBP complexes through a two-step affinity purification process. After six rounds of library-vs-library selection, the selected RNAs and LS proteins were analyzed by next-generation sequencing (NGS). Further deconvolution of the enriched RNA and LS protein sequences revealed two synthetic and orthogonal RNA-RBP pairs that exhibit picomolar affinity and >4000-fold selectivity.
Collapse
Affiliation(s)
- Keisuke Fukunaga
- Nucleic Acid Chemistry and Engineering Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa 904 0495, Japan
| | - Yohei Yokobayashi
- Nucleic Acid Chemistry and Engineering Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Okinawa 904 0495, Japan
| |
Collapse
|
34
|
Zhou Y, Chen H, Li S, Chen M. mPPI: a database extension to visualize structural interactome in a one-to-many manner. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2021:6307707. [PMID: 34156447 DOI: 10.1093/database/baab036] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Revised: 05/10/2021] [Accepted: 05/28/2021] [Indexed: 01/02/2023]
Abstract
Protein-protein interaction (PPI) databases with structural information are useful to investigate biological functions at both systematic and atomic levels. However, most existing PPI databases only curate binary interactome. From the perspective of the display and function of PPI, as well as the structural binding interface, the related database and resources are summarized. We developed a database extension, named mPPI, for PPI structural visualization. Comparing with the existing structural interactomes that curate resolved PPI conformation in pairs, mPPI can visualize target protein and its multiple interactors simultaneously, which facilitates multi-target drug discovery and structure prediction of protein macro-complexes. By employing a protein-protein docking algorithm, mPPI largely extends the coverage of structural interactome from experimentally resolved complexes. mPPI is designed to be a customizable and convenient plugin for PPI databases. It possesses wide potential applications for various PPI databases, and it has been used for a neurodegenerative disease-related PPI database as demonstration. Scripts and implementation guidelines of mPPI are documented at the database tool website. Database URL http://bis.zju.edu.cn/mppi/.
Collapse
Affiliation(s)
- Yekai Zhou
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China.,Department of Computer Science, The University of Hong Kong, Hong Kong 999077, China
| | - Hongjun Chen
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | - Sida Li
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | - Ming Chen
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China.,Bioinformatics Center, The First Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
35
|
Rufenach B, Van Petegem F. Structure and function of STAC proteins: Calcium channel modulators and critical components of muscle excitation-contraction coupling. J Biol Chem 2021; 297:100874. [PMID: 34129875 PMCID: PMC8258685 DOI: 10.1016/j.jbc.2021.100874] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Revised: 06/02/2021] [Accepted: 06/11/2021] [Indexed: 12/26/2022] Open
Abstract
In skeletal muscle tissue, an intriguing mechanical coupling exists between two ion channels from different membranes: the L-type voltage-gated calcium channel (CaV1.1), located in the plasma membrane, and ryanodine receptor 1 (RyR1) located in the sarcoplasmic reticulum membrane. Excitable cells rely on Cavs to initiate Ca2+ entry in response to action potentials. RyRs can amplify this signal by releasing Ca2+ from internal stores. Although this process can be mediated through Ca2+ as a messenger, an overwhelming amount of evidence suggests that RyR1 has recruited CaV1.1 directly as its voltage sensor. The exact mechanisms that underlie this coupling have been enigmatic, but a recent wave of reports have illuminated the coupling protein STAC3 as a critical player. Without STAC3, the mechanical coupling between Cav1.1 and RyR1 is lost, and muscles fail to contract. Various sequence variants of this protein have been linked to congenital myopathy. Other STAC isoforms are expressed in the brain and may serve as regulators of L-type CaVs. Despite the short length of STACs, several points of contacts have been proposed between them and CaVs. However, it is currently unclear whether STAC3 also forms direct interactions with RyR1, and whether this modulates RyR1 function. In this review, we discuss the 3D architecture of STAC proteins, the biochemical evidence for their interactions, the relevance of these connections for functional modulation, and their involvement in myopathy.
Collapse
Affiliation(s)
- Britany Rufenach
- Department of Biochemistry and Molecular Biology, Life Sciences Institute, University of British Columbia, Vancouver, Canada
| | - Filip Van Petegem
- Department of Biochemistry and Molecular Biology, Life Sciences Institute, University of British Columbia, Vancouver, Canada.
| |
Collapse
|
36
|
Pan C, Zhu Y, Yu M, Zhao Y, Zhang C, Zhang X, Yao Y. Control Analysis of Protein-Protein Interaction Network Reveals Potential Regulatory Targets for MYCN. Front Oncol 2021; 11:633579. [PMID: 33968733 PMCID: PMC8096904 DOI: 10.3389/fonc.2021.633579] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 03/04/2021] [Indexed: 12/05/2022] Open
Abstract
BACKGROUND MYCN is an oncogenic transcription factor of the MYC family and plays an important role in the formation of tissues and organs during development before birth. Due to the difficulty in drugging MYCN directly, revealing the molecules in MYCN regulatory networks will help to identify effective therapeutic targets. METHODS We utilized network controllability theory, a recent developed powerful tool, to identify the potential drug target around MYCN based on Protein-Protein interaction network of MYCN. First, we constructed a Protein-Protein interaction network of MYCN based on public databases. Second, network control analysis was applied on network to identify driver genes and indispensable genes of the MYCN regulatory network. Finally, we developed a novel integrated approach to identify potential drug targets for regulating the function of the MYCN regulatory network. RESULTS We constructed an MYCN regulatory network that has 79 genes and 129 interactions. Based on network controllability theory, we analyzed driver genes which capable to fully control the network. We found 10 indispensable genes whose alternation will significantly change the regulatory pathways of the MYCN network. We evaluated the stability and correlation analysis of these genes and found EGFR may be the potential drug target which closely associated with MYCN. CONCLUSION Together, our findings indicate that EGFR plays an important role in the regulatory network and pathways of MYCN and therefore may represent an attractive therapeutic target for cancer treatment.
Collapse
Affiliation(s)
- Chunyu Pan
- Northeastern University, Shenyang, China
- Joint Laboratory of Artificial Intelligence and Precision Medicine of China Medical University and Northeastern University, Shenyang, China
- Early Intervention Unit, Department of Psychiatry, Affiliated Nanjing Brain Hospital, Nanjing Medical University, Nanjing, China
| | - Yuyan Zhu
- Joint Laboratory of Artificial Intelligence and Precision Medicine of China Medical University and Northeastern University, Shenyang, China
- Department of Urology, The First Hospital of China Medical University, Shenyang, China
| | - Meng Yu
- Department of Reproductive Biology and Transgenic Animal, China Medical University, Shenyang, China
| | - Yongkang Zhao
- National Institute of Health and Medical Big Data, China Medical University, Shenyang, China
| | | | - Xizhe Zhang
- Joint Laboratory of Artificial Intelligence and Precision Medicine of China Medical University and Northeastern University, Shenyang, China
- Early Intervention Unit, Department of Psychiatry, Affiliated Nanjing Brain Hospital, Nanjing Medical University, Nanjing, China
- School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China
| | - Yang Yao
- Department of Physiology, Shenyang Medical College, Shenyang, China
| |
Collapse
|
37
|
Xu L, Wang XR, Sun K, Yu T, Xu JH, Ding PX, Tang LM, Zhang DX, Guan WB. The complete chloroplast genome of Acanthus ilicifolius, an excellent mangrove plant. Mitochondrial DNA B Resour 2021; 6:1283-1284. [PMID: 33898744 PMCID: PMC8023608 DOI: 10.1080/23802359.2021.1884022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
Acanthus ilicifolius is an excellent mangrove plant. In this study, the complete chloroplast genome of A. ilicifolius, a salt tolerant plant of Acanthaceae, was generated. The length of chloroplast genome is 150,758 bp, in which the large-single copy region (LSC) is 82,963 bp, the small-single copy (SSC) region is 17,191 bp, and a pair of inverted repeat (IRa and IRb) regions is 25,302 bp. The chloroplast genome contains 128 genes, including 84 protein-coding genes, eight rRNA genes, and 36 tRNAs genes, with a total GC content of 38%. Phylogenetic analysis showed that A. ilicifolius was closely related to A. ebracteatus, both species belonged to Acanthus genus.
Collapse
Affiliation(s)
- Li Xu
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | - Xin-Rui Wang
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | - Kuo Sun
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | - Ting Yu
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | - Jiu-Heng Xu
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | - Ping-Xing Ding
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | | | - Dong-Xu Zhang
- Protected Agricultural Technology Development Center, Shanxi Datong University, Datong, China
| | - Wen-Bin Guan
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| |
Collapse
|
38
|
Dumas G, Goubran‐Botros H, Matondo M, Pagan C, Boulègue C, Chaze T, Chamot‐Rooke J, Maronde E, Bourgeron T. Mass-spectrometry analysis of the human pineal proteome during night and day and in autism. J Pineal Res 2021; 70:e12713. [PMID: 33368564 PMCID: PMC8047921 DOI: 10.1111/jpi.12713] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/16/2020] [Revised: 12/16/2020] [Accepted: 12/18/2020] [Indexed: 12/15/2022]
Abstract
The human pineal gland regulates day-night dynamics of multiple physiological processes, especially through the secretion of melatonin. Using mass-spectrometry-based proteomics and dedicated analysis tools, we identify proteins in the human pineal gland and analyze systematically their variation throughout the day and compare these changes in the pineal proteome between control specimens and donors diagnosed with autism. Results reveal diverse regulated clusters of proteins with, among others, catabolic carbohydrate process and cytoplasmic membrane-bounded vesicle-related proteins differing between day and night and/or control versus autism pineal glands. These data show novel and unexpected processes happening in the human pineal gland during the day/night rhythm as well as specific differences between autism donor pineal glands and those from controls.
Collapse
Affiliation(s)
- Guillaume Dumas
- Human Genetics and Cognitive FunctionsInstitut PasteurUMR 3571 CNRSUniversity Paris DiderotParisFrance
- Precision Psychiatry and Social Physiology laboratoryCHU Ste‐Justine Research CenterDepartment of PsychiatryUniversity of MontrealQuebecQCCanada
| | - Hany Goubran‐Botros
- Human Genetics and Cognitive FunctionsInstitut PasteurUMR 3571 CNRSUniversity Paris DiderotParisFrance
| | - Mariette Matondo
- Institut PasteurUnité de Spectrométrie de Masse pour la Biologie (MSBio)Centre de Ressources et Recherches Technologiques (C2RT)USR 2000 CNRSParisFrance
| | - Cécile Pagan
- Paris Descartes UniversityParisFrance
- Service de Biochimie et Biologie MoléculaireINSERM U942Hôpital LariboisièreAPHPParisFrance
| | - Cyril Boulègue
- Institut PasteurUnité de Spectrométrie de Masse pour la Biologie (MSBio)Centre de Ressources et Recherches Technologiques (C2RT)USR 2000 CNRSParisFrance
| | - Thibault Chaze
- Institut PasteurUnité de Spectrométrie de Masse pour la Biologie (MSBio)Centre de Ressources et Recherches Technologiques (C2RT)USR 2000 CNRSParisFrance
| | - Julia Chamot‐Rooke
- Institut PasteurUnité de Spectrométrie de Masse pour la Biologie (MSBio)Centre de Ressources et Recherches Technologiques (C2RT)USR 2000 CNRSParisFrance
| | - Erik Maronde
- Institute for Anatomy IIFaculty of MedicineGoethe UniversityFrankfurtGermany
| | - Thomas Bourgeron
- Human Genetics and Cognitive FunctionsInstitut PasteurUMR 3571 CNRSUniversity Paris DiderotParisFrance
| |
Collapse
|
39
|
Ruiz C, Zitnik M, Leskovec J. Identification of disease treatment mechanisms through the multiscale interactome. Nat Commun 2021; 12:1796. [PMID: 33741907 PMCID: PMC7979814 DOI: 10.1038/s41467-021-21770-8] [Citation(s) in RCA: 61] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Accepted: 02/04/2021] [Indexed: 12/12/2022] Open
Abstract
Most diseases disrupt multiple proteins, and drugs treat such diseases by restoring the functions of the disrupted proteins. How drugs restore these functions, however, is often unknown as a drug's therapeutic effects are not limited to the proteins that the drug directly targets. Here, we develop the multiscale interactome, a powerful approach to explain disease treatment. We integrate disease-perturbed proteins, drug targets, and biological functions into a multiscale interactome network. We then develop a random walk-based method that captures how drug effects propagate through a hierarchy of biological functions and physical protein-protein interactions. On three key pharmacological tasks, the multiscale interactome predicts drug-disease treatment, identifies proteins and biological functions related to treatment, and predicts genes that alter a treatment's efficacy and adverse reactions. Our results indicate that physical interactions between proteins alone cannot explain treatment since many drugs treat diseases by affecting the biological functions disrupted by the disease rather than directly targeting disease proteins or their regulators. We provide a general framework for explaining treatment, even when drugs seem unrelated to the diseases they are recommended for.
Collapse
Affiliation(s)
- Camilo Ruiz
- Computer Science Department, Stanford University, Stanford, CA, USA
- Bioengineering Department, Stanford University, Stanford, CA, USA
| | - Marinka Zitnik
- Biomedical Informatics Department, Harvard University, Boston, MA, USA
| | - Jure Leskovec
- Computer Science Department, Stanford University, Stanford, CA, USA.
- Chan Zuckerberg Biohub, San Francisco, CA, USA.
| |
Collapse
|
40
|
Shivhare D, Musialak-Lange M, Julca I, Gluza P, Mutwil M. Removing auto-activators from yeast-two-hybrid assays by conditional negative selection. Sci Rep 2021; 11:5477. [PMID: 33750818 PMCID: PMC7943551 DOI: 10.1038/s41598-021-84608-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Accepted: 01/19/2021] [Indexed: 11/17/2022] Open
Abstract
Yeast-two-hybrid (Y2H) is widely used as a strategy to detect protein–protein interactions (PPIs). Recent advancements have made it possible to generate and analyse genome-wide PPI networks en masse by coupling Y2H with next-generation sequencing technology. However, one of the major challenges of yeast two-hybrid assay is the large amount of false-positive hits caused by auto-activators (AAs), which are proteins that activate the reporter genes without the presence of an interacting protein partner. Here, we have developed a negative selection to minimize these auto-activators by integrating the pGAL2-URA3 fragment into the yeast genome. Upon activation of the pGAL2 promoter by an AA, yeast cells expressing URA3 cannot grow in media supplemented with 5-Fluoroorotic acid (5-FOA). Hence, we selectively inhibit the growth of yeast cells expressing auto-activators and thus minimizing the amount of false-positive hits. Here, we have demonstrated that auto-activators can be successfully removed from a Marchantia polymorpha cDNA library using pGAL2-URA3 and 5-FOA treatment, in liquid and solid-grown cultures. Furthermore, since URA3 can also serve as a marker for uracil autotrophy, we propose that our approach is a valuable addition to any large-scale Y2H screen.
Collapse
Affiliation(s)
- Devendra Shivhare
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore, 637551, Singapore
| | | | - Irene Julca
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore, 637551, Singapore
| | - Pawel Gluza
- Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476, Potsdam, Germany.,School of Biosciences, University of Melbourne, Parkville, VIC, 3010, Australia
| | - Marek Mutwil
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore, 637551, Singapore. .,Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476, Potsdam, Germany.
| |
Collapse
|
41
|
Yu T, Ren SY, Wang XR, Xu L, Xu JH, Ding GT, Tang LM, Zhang DX, Guan WB. The complete chloroplast genome sequences of Dolichandrone spathacea (L. F.) K. Schum., a semi-mangrove plant. Mitochondrial DNA B Resour 2021; 6:1164-1165. [PMID: 33829081 PMCID: PMC8008931 DOI: 10.1080/23802359.2021.1901619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022] Open
Affiliation(s)
- Ting Yu
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, PR China
| | - Si-Yu Ren
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, PR China
| | - Xin-Rui Wang
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, PR China
| | - Li Xu
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, PR China
| | - Jiu-Heng Xu
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, PR China
| | - Gong-Tao Ding
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, PR China
| | - Li-Ming Tang
- Forestry Department of Guangxi, Nanning, PR China;
| | - Dong-Xu Zhang
- Protected Agricultural Technology Development Center, Shanxi Datong University, Shanxi, PR China
| | - Wen-Bin Guan
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, PR China
| |
Collapse
|
42
|
Integrative analysis of transcriptomic data for identification of T-cell activation-related mRNA signatures indicative of preterm birth. Sci Rep 2021; 11:2392. [PMID: 33504832 PMCID: PMC7841165 DOI: 10.1038/s41598-021-81834-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Accepted: 12/21/2020] [Indexed: 12/19/2022] Open
Abstract
Preterm birth (PTB), defined as birth at less than 37 weeks of gestation, is a major determinant of neonatal mortality and morbidity. Early diagnosis of PTB risk followed by protective interventions are essential to reduce adverse neonatal outcomes. However, due to the redundant nature of the clinical conditions with other diseases, PTB-associated clinical parameters are poor predictors of PTB. To identify molecular signatures predictive of PTB with high accuracy, we performed mRNA sequencing analysis of PTB patients and full-term birth (FTB) controls in Korean population and identified differentially expressed genes (DEGs) as well as cellular pathways represented by the DEGs between PTB and FTB. By integrating the gene expression profiles of different ethnic groups from previous studies, we identified the core T-cell activation pathway associated with PTB, which was shared among all previous datasets, and selected three representative DEGs (CYLD, TFRC, and RIPK2) from the core pathway as mRNA signatures predictive of PTB. We confirmed the dysregulation of the candidate predictors and the core T-cell activation pathway in an independent cohort. Our results suggest that CYLD, TFRC, and RIPK2 are potentially reliable predictors for PTB.
Collapse
|
43
|
Paci P, Fiscon G, Conte F, Wang RS, Farina L, Loscalzo J. Gene co-expression in the interactome: moving from correlation toward causation via an integrated approach to disease module discovery. NPJ Syst Biol Appl 2021; 7:3. [PMID: 33479222 PMCID: PMC7819998 DOI: 10.1038/s41540-020-00168-0] [Citation(s) in RCA: 53] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Accepted: 10/19/2020] [Indexed: 01/29/2023] Open
Abstract
In this study, we integrate the outcomes of co-expression network analysis with the human interactome network to predict novel putative disease genes and modules. We first apply the SWItch Miner (SWIM) methodology, which predicts important (switch) genes within the co-expression network that regulate disease state transitions, then map them to the human protein-protein interaction network (PPI, or interactome) to predict novel disease-disease relationships (i.e., a SWIM-informed diseasome). Although the relevance of switch genes to an observed phenotype has been recently assessed, their performance at the system or network level constitutes a new, potentially fascinating territory yet to be explored. Quantifying the interplay between switch genes and human diseases in the interactome network, we found that switch genes associated with specific disorders are closer to each other than to other nodes in the network, and tend to form localized connected subnetworks. These subnetworks overlap between similar diseases and are situated in different neighborhoods for pathologically distinct phenotypes, consistent with the well-known topological proximity property of disease genes. These findings allow us to demonstrate how SWIM-based correlation network analysis can serve as a useful tool for efficient screening of potentially new disease gene associations. When integrated with an interactome-based network analysis, it not only identifies novel candidate disease genes, but also may offer testable hypotheses by which to elucidate the molecular underpinnings of human disease and reveal commonalities between seemingly unrelated diseases.
Collapse
Affiliation(s)
- Paola Paci
- Department of Computer, Control and Management Engineering, Sapienza University of Rome, Rome, Italy.
| | - Giulia Fiscon
- Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy
- Fondazione per la Medicina Personalizzata, Via Goffredo Mameli, 3/1 Genova, Italy
| | - Federica Conte
- Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy
| | - Rui-Sheng Wang
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Lorenzo Farina
- Department of Computer, Control and Management Engineering, Sapienza University of Rome, Rome, Italy
| | - Joseph Loscalzo
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| |
Collapse
|
44
|
Castel P, Holtz-Morris A, Kwon Y, Suter BP, McCormick F. DoMY-Seq: A yeast two-hybrid-based technique for precision mapping of protein-protein interaction motifs. J Biol Chem 2021; 296:100023. [PMID: 33410398 PMCID: PMC7949039 DOI: 10.1074/jbc.ra120.014284] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Revised: 10/23/2020] [Accepted: 11/03/2020] [Indexed: 12/14/2022] Open
Abstract
Interactions between proteins are fundamental for every biological process and especially important in cell signaling pathways. Biochemical techniques that evaluate these protein-protein interactions (PPIs), such as in vitro pull downs and coimmunoprecipitations, have become popular in most laboratories and are essential to identify and validate novel protein binding partners. Most PPIs occur through small domains or motifs, which are challenging and laborious to map by using standard biochemical approaches because they generally require the cloning of several truncation mutants. Moreover, these classical methodologies provide limited resolution of the interacting interface. Here, we describe the development of an alternative technique to overcome these limitations termed "Protein Domain mapping using Yeast 2 Hybrid-Next Generation Sequencing" (DoMY-Seq), which leverages both yeast two-hybrid and next-generation sequencing techniques. In brief, our approach involves creating a library of fragments derived from an open reading frame of interest and enriching for the interacting fragments using a yeast two-hybrid reporter system. Next-generation sequencing is then subsequently employed to read and map the sequence of the interacting fragment, yielding a high-resolution plot of the binding interface. We optimized DoMY-Seq by taking advantage of the well-described and high-affinity interaction between KRAS and CRAF, and we provide high-resolution domain mapping on this and other protein-interacting pairs, including CRAF-MEK1, RIT1-RGL3, and p53-MDM2. Thus, DoMY-Seq provides an unbiased alternative method to rapidly identify the domains involved in PPIs by advancing the use of yeast two-hybrid technology.
Collapse
Affiliation(s)
- Pau Castel
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, California, USA.
| | | | | | | | - Frank McCormick
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, California, USA
| |
Collapse
|
45
|
Banerjee S, Velásquez-Zapata V, Fuerst G, Elmore JM, Wise RP. NGPINT: a next-generation protein-protein interaction software. Brief Bioinform 2020; 22:6046042. [PMID: 33367498 DOI: 10.1093/bib/bbaa351] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 10/23/2020] [Accepted: 11/02/2020] [Indexed: 12/27/2022] Open
Abstract
Mapping protein-protein interactions at a proteome scale is critical to understanding how cellular signaling networks respond to stimuli. Since eukaryotic genomes encode thousands of proteins, testing their interactions one-by-one is a challenging prospect. High-throughput yeast-two hybrid (Y2H) assays that employ next-generation sequencing to interrogate complementary DNA (cDNA) libraries represent an alternative approach that optimizes scale, cost and effort. We present NGPINT, a robust and scalable software to identify all putative interactors of a protein using Y2H in batch culture. NGPINT combines diverse tools to align sequence reads to target genomes, reconstruct prey fragments and compute gene enrichment under reporter selection. Central to this pipeline is the identification of fusion reads containing sequences derived from both the Y2H expression plasmid and the cDNA of interest. To reduce false positives, these fusion reads are evaluated as to whether the cDNA fragment forms an in-frame translational fusion with the Y2H transcription factor. NGPINT successfully recognized 95% of interactions in simulated test runs. As proof of concept, NGPINT was tested using published data sets and it recognized all validated interactions. NGPINT can process interaction data from any biosystem with an available genome or transcriptome reference, thus facilitating the discovery of protein-protein interactions in model and non-model organisms.
Collapse
Affiliation(s)
- Sagnik Banerjee
- Program in Bioinformatics & Computational Biology, Iowa State University, Ames, IA, 50011, USA.,Department of Statistics, Iowa State University, Ames, IA, 50011, USA
| | - Valeria Velásquez-Zapata
- Program in Bioinformatics & Computational Biology, Iowa State University, Ames, IA, 50011, USA.,Department of Plant Pathology & Microbiology, Iowa State University, Ames, IA, 50011, USA
| | - Gregory Fuerst
- Department of Plant Pathology & Microbiology, Iowa State University, Ames, IA, 50011, USA.,Corn Insects and Crop Genetics Research, USDA-Agricultural Research Service, Ames, IA, 50011, USA
| | - J Mitch Elmore
- Department of Plant Pathology & Microbiology, Iowa State University, Ames, IA, 50011, USA.,Corn Insects and Crop Genetics Research, USDA-Agricultural Research Service, Ames, IA, 50011, USA
| | - Roger P Wise
- Program in Bioinformatics & Computational Biology, Iowa State University, Ames, IA, 50011, USA.,Department of Plant Pathology & Microbiology, Iowa State University, Ames, IA, 50011, USA.,Corn Insects and Crop Genetics Research, USDA-Agricultural Research Service, Ames, IA, 50011, USA
| |
Collapse
|
46
|
Baali I, Erten C, Kazan H. DriveWays: a method for identifying possibly overlapping driver pathways in cancer. Sci Rep 2020; 10:21971. [PMID: 33319839 PMCID: PMC7738685 DOI: 10.1038/s41598-020-78852-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Accepted: 11/19/2020] [Indexed: 11/22/2022] Open
Abstract
The majority of the previous methods for identifying cancer driver modules output nonoverlapping modules. This assumption is biologically inaccurate as genes can participate in multiple molecular pathways. This is particularly true for cancer-associated genes as many of them are network hubs connecting functionally distinct set of genes. It is important to provide combinatorial optimization problem definitions modeling this biological phenomenon and to suggest efficient algorithms for its solution. We provide a formal definition of the Overlapping Driver Module Identification in Cancer (ODMIC) problem. We show that the problem is NP-hard. We propose a seed-and-extend based heuristic named DriveWays that identifies overlapping cancer driver modules from the graph built from the IntAct PPI network. DriveWays incorporates mutual exclusivity, coverage, and the network connectivity information of the genes. We show that DriveWays outperforms the state-of-the-art methods in recovering well-known cancer driver genes performed on TCGA pan-cancer data. Additionally, DriveWay’s output modules show a stronger enrichment for the reference pathways in almost all cases. Overall, we show that enabling modules to overlap improves the recovery of functional pathways filtered with known cancer drivers, which essentially constitute the reference set of cancer-related pathways.
Collapse
Affiliation(s)
- Ilyes Baali
- Electrical and Computer Engineering Graduate Program, Antalya Bilim University, 07190, Antalya, Turkey
| | - Cesim Erten
- Department of Computer Engineering, Antalya Bilim University, 07190, Antalya, Turkey.
| | - Hilal Kazan
- Department of Computer Engineering, Antalya Bilim University, 07190, Antalya, Turkey.
| |
Collapse
|
47
|
Ahmed R, Baali I, Erten C, Hoxha E, Kazan H. MEXCOwalk: mutual exclusion and coverage based random walk to identify cancer modules. Bioinformatics 2020; 36:872-879. [PMID: 31432076 DOI: 10.1093/bioinformatics/btz655] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2019] [Revised: 07/03/2019] [Accepted: 08/18/2019] [Indexed: 12/25/2022] Open
Abstract
MOTIVATION Genomic analyses from large cancer cohorts have revealed the mutational heterogeneity problem which hinders the identification of driver genes based only on mutation profiles. One way to tackle this problem is to incorporate the fact that genes act together in functional modules. The connectivity knowledge present in existing protein-protein interaction (PPI) networks together with mutation frequencies of genes and the mutual exclusivity of cancer mutations can be utilized to increase the accuracy of identifying cancer driver modules. RESULTS We present a novel edge-weighted random walk-based approach that incorporates connectivity information in the form of protein-protein interactions (PPIs), mutual exclusivity and coverage to identify cancer driver modules. MEXCOwalk outperforms several state-of-the-art computational methods on TCGA pan-cancer data in terms of recovering known cancer genes, providing modules that are capable of classifying normal and tumor samples and that are enriched for mutations in specific cancer types. Furthermore, the risk scores determined with output modules can stratify patients into low-risk and high-risk groups in multiple cancer types. MEXCOwalk identifies modules containing both well-known cancer genes and putative cancer genes that are rarely mutated in the pan-cancer data. The data, the source code and useful scripts are available at: https://github.com/abu-compbio/MEXCOwalk. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Rafsan Ahmed
- Electrical and Computer Engineering Graduate Program, Department of Computer Engineering, Antalya Bilim University, Antalya 07190, Turkey
| | - Ilyes Baali
- Electrical and Computer Engineering Graduate Program, Department of Computer Engineering, Antalya Bilim University, Antalya 07190, Turkey
| | - Cesim Erten
- Department of Computer Engineering, Antalya Bilim University, Antalya 07190, Turkey
| | - Evis Hoxha
- Department of Computer Engineering, Antalya Bilim University, Antalya 07190, Turkey
| | - Hilal Kazan
- Department of Computer Engineering, Antalya Bilim University, Antalya 07190, Turkey
| |
Collapse
|
48
|
Peng J, Guan J, Hui W, Shang X. A novel subnetwork representation learning method for uncovering disease-disease relationships. Methods 2020; 192:77-84. [PMID: 32946974 DOI: 10.1016/j.ymeth.2020.09.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Revised: 08/20/2020] [Accepted: 09/07/2020] [Indexed: 12/12/2022] Open
Abstract
Analyzing disease-disease relationships plays an important role for understanding disease mechanisms and finding alternative uses for a drug. A disease is usually the result of abnormal state of multiple molecular process. Since biological networks can model the interplay of multiple molecular processes, network-based methods have been proposed to uncover the disease-disease relationships recently. Given a disease and a network, the disease could be represented as a subnetwork constructed by the disease genes involved in the given network, named disease subnetwork. Because it is difficult to learn the feature representation of disease subnetworks, most existing methods are unsupervised ones without using labeled information. To fill this gap, we propose a novel method named SubNet2vec to learn the feature vectors of diseases from their corresponding subnetwork in the biological network. By utilizing the feature representation of disease subnetwork, we can analyze disease-disease relationships in a supervised fashion. The evaluation results show that the proposed framework outperforms some state-of-the-art approaches in a large margin on disease-disease/disease-drug association prediction. The source code and data are available athttps://github.com/MedicineBiology-AI/SubNet2vec.git.
Collapse
Affiliation(s)
- Jiajie Peng
- School of Computer Science, Northwestern Polytechnical University, Xi'an 710129, China.
| | - Jiaojiao Guan
- School of Computer Science, Northwestern Polytechnical University, Xi'an 710129, China.
| | - Weiwei Hui
- Vivo mobile communications (Hang Zhou) co. LTD, China.
| | - Xuequn Shang
- School of Computer Science, Northwestern Polytechnical University, Xi'an 710129, China.
| |
Collapse
|
49
|
Lu Y, Zhou SK, Chen R, Jiang LX, Yang LL, Bi TN. Knockdown of SAR1B suppresses proliferation and induces apoptosis of RKO colorectal cancer cells. Oncol Lett 2020; 20:186. [PMID: 32952655 PMCID: PMC7479511 DOI: 10.3892/ol.2020.12048] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Accepted: 03/06/2020] [Indexed: 01/13/2023] Open
Abstract
Colorectal cancer (CRC) is the third most commonly diagnosed cancer worldwide. SAR1 gene homolog B (SAR1B) is a GTPase that has been reported to have a central role in the regulation of lipid homeostasis and is associated with numerous diseases. However, its role in cancer, particularly in CRC, remains unclear. The present study revealed that SAR1B was overexpressed in CRC samples and this was associated with shorter overall survival time in patients with CRC. Colony formation, cell proliferation and flow cytometry assays were conducted to evaluate the functions of SAR1B in CRC. It was reported that SAR1B may be associated with tumorigenesis of CRC. Knockdown of SAR1B suppressed cell proliferation and induced significant apoptosis of RKO cells. Furthermore, microarray analysis was performed to identify the potential targets of SAR1B in CRC. Bioinformatics analysis revealed that SAR1B was significantly involved in regulating ‘TGF-β signaling’, ‘paxillin signaling’, ‘cell cycle regulation by BTG family proteins’ and ‘IGF-1 signaling’. These results suggested that SAR1B may be considered a potential prognostic biomarker and therapeutic target for CRC.
Collapse
Affiliation(s)
- Yong Lu
- Department of Gastrointestinal Surgery, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical University, Taizhou, Zhejiang 318000, P.R. China
| | - Shen-Kang Zhou
- Department of Gastrointestinal Surgery, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical University, Taizhou, Zhejiang 318000, P.R. China
| | - Rui Chen
- Department of Gastrointestinal Surgery, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical University, Taizhou, Zhejiang 318000, P.R. China
| | - Liang-Xian Jiang
- Department of Gastrointestinal Surgery, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical University, Taizhou, Zhejiang 318000, P.R. China
| | - Lei-Lei Yang
- Department of Gastrointestinal Surgery, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical University, Taizhou, Zhejiang 318000, P.R. China
| | - Tie-Nan Bi
- Department of Gastrointestinal Surgery, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical University, Taizhou, Zhejiang 318000, P.R. China
| |
Collapse
|
50
|
Song E, Wang R, Leopold JA, Loscalzo J. Network determinants of cardiovascular calcification and repositioned drug treatments. FASEB J 2020; 34:11087-11100. [PMID: 32638415 PMCID: PMC7497212 DOI: 10.1096/fj.202001062r] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 06/03/2020] [Accepted: 06/15/2020] [Indexed: 01/31/2023]
Abstract
Ectopic cardiovascular calcification is a highly prevalent pathology for which there are no effective novel or repurposed pharmacotherapeutics to prevent disease progression. We created a human calcification endophenotype module (ie, the "calcificasome") by mapping vascular calcification genes (proteins) to the human vascular smooth muscle-specific protein-protein interactome (218 nodes and 632 edges, P < 10-5 ). Network proximity analysis was used to demonstrate that the calcificasome overlapped significantly with endophenotype modules governing inflammation, thrombosis, and fibrosis in the human interactome (P < 0.001). A network-based drug repurposing analysis further revealed that everolimus, temsirolimus, and pomalidomide are predicted to target the calcificasome. The efficacy of these agents in limiting calcification was confirmed experimentally by treating human coronary artery smooth muscle cells in an in vitro calcification assay. Each of the drugs affected expression or activity of their predicted target in the network, and decreased calcification significantly (P < 0.009). An integrated network analytical approach identified novel mediators of ectopic cardiovascular calcification and biologically plausible candidate drugs that could be repurposed to target calcification. This methodological framework for drug repurposing has broad applicability to other diseases.
Collapse
Affiliation(s)
- Euijun Song
- Department of MedicineBrigham and Women's HospitalHarvard Medical SchoolBostonMAUSA
| | - Rui‐Sheng Wang
- Department of MedicineBrigham and Women's HospitalHarvard Medical SchoolBostonMAUSA
| | - Jane A. Leopold
- Division of Cardiovascular MedicineBrigham and Women's HospitalHarvard Medical SchoolBostonMAUSA
| | - Joseph Loscalzo
- Department of MedicineBrigham and Women's HospitalHarvard Medical SchoolBostonMAUSA
| |
Collapse
|