1
|
Seddon AR, Damiano OM, Hampton MB, Stevens AJ. Widespread genomic de novo DNA methylation occurs following CD8 + T cell activation and proliferation. Epigenetics 2024; 19:2367385. [PMID: 38899429 PMCID: PMC11195465 DOI: 10.1080/15592294.2024.2367385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Accepted: 06/05/2024] [Indexed: 06/21/2024] Open
Abstract
This research investigates the intricate dynamics of DNA methylation in the hours following CD8+ T cell activation, during a critical yet understudied temporal window. DNA methylation is an epigenetic modification central to regulation of gene expression and directing immune responses. Our investigation spanned 96-h post-activation and unveils a nuanced tapestry of global and site-specific methylation changes. We identified 15,626 significant differentially methylated CpGs spread across the genome, with the most significant changes occurring within the genes ADAM10, ICA1, and LAPTM5. While many changes had modest effect sizes, approximately 120 CpGs exhibited a log2FC above 1.5, with cell activation and proliferation pathways the most affected. Relatively few of the differentially methylated CpGs occurred along adjacent gene regions. The exceptions were seven differentially methylated gene regions, with the Human T cell Receptor Alpha Joining Genes demonstrating consistent methylation change over a 3kb window. We also investigated whether an inflammatory environment could alter DNA methylation during activation, with proliferating cells exposed to the oxidant glycine chloramine. No substantial differential methylation was observed in this context. The temporal perspective of early activation adds depth to the evolving field of epigenetic immunology, offering insights with implications for therapeutic innovation and expanding our understanding of epigenetic modulation in immune function.
Collapse
Affiliation(s)
- Annika R. Seddon
- Department of Pathology and Biomedical Science, Mātai Hāora - Centre for Redox Biology and Medicine, University of Otago, Christchurch, New Zealand
| | - Olivia M. Damiano
- Department of Pathology and Molecular Medicine, Genetics and Epigenetics Research Group, University of Otago, Wellington, New Zealand
| | - Mark B. Hampton
- Department of Pathology and Biomedical Science, Mātai Hāora - Centre for Redox Biology and Medicine, University of Otago, Christchurch, New Zealand
| | - Aaron J. Stevens
- Department of Pathology and Molecular Medicine, Genetics and Epigenetics Research Group, University of Otago, Wellington, New Zealand
| |
Collapse
|
2
|
Kim K, Kim MM, Skoufos G, Diffenderfer ES, Motlagh SAO, Kokkorakis M, Koliaki I, Morcos G, Shoniyozov K, Griffin J, Hatzigeorgiou AG, Metz JM, Lin A, Feigenberg SJ, Cengel KA, Ky B, Koumenis C, Verginadis II. FLASH Proton Radiation Therapy Mitigates Inflammatory and Fibrotic Pathways and Preserves Cardiac Function in a Preclinical Mouse Model of Radiation-Induced Heart Disease. Int J Radiat Oncol Biol Phys 2024; 119:1234-1247. [PMID: 38364948 PMCID: PMC11209795 DOI: 10.1016/j.ijrobp.2024.01.224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 01/12/2024] [Accepted: 01/28/2024] [Indexed: 02/18/2024]
Abstract
PURPOSE Studies during the past 9 years suggest that delivering radiation at dose rates exceeding 40 Gy/s, known as "FLASH" radiation therapy, enhances the therapeutic index of radiation therapy (RT) by decreasing normal tissue damage while maintaining tumor response compared with conventional (or standard) RT. This study demonstrates the cardioprotective benefits of FLASH proton RT (F-PRT) compared with standard (conventional) proton RT (S-PRT), as evidenced by reduced acute and chronic cardiac toxicities. METHODS AND MATERIALS Mice were imaged using cone beam computed tomography to precisely determine the heart's apex as the beam isocenter. Irradiation was conducted using a shoot-through technique with a 5-mm diameter circular collimator. Bulk RNA-sequencing was performed on nonirradiated samples, as well as apexes treated with F-PRT or S-PRT, at 2 weeks after a single 40 Gy dose. Inflammatory responses were assessed through multiplex cytokine/chemokine microbead assay and immunofluorescence analyses. Levels of perivascular fibrosis were quantified using Masson's Trichrome and Picrosirius red staining. Additionally, cardiac tissue functionality was evaluated by 2-dimensional echocardiograms at 8- and 30-weeks post-PRT. RESULTS Radiation damage was specifically localized to the heart's apex. RNA profiling of cardiac tissues treated with PRT revealed that S-PRT uniquely upregulated pathways associated with DNA damage response, induction of tumor necrosis factor superfamily, and inflammatory response, and F-PRT primarily affected cytoplasmic translation, mitochondrion organization, and adenosine triphosphate synthesis. Notably, F-PRT led to a milder inflammatory response, accompanied by significantly attenuated changes in transforming growth factor β1 and α smooth muscle actin levels. Critically, F-PRT decreased collagen deposition and better preserved cardiac functionality compared with S-PRT. CONCLUSIONS This study demonstrated that F-PRT reduces the induction of an inflammatory environment with lower expression of inflammatory cytokines and profibrotic factors. Importantly, the results indicate that F-PRT better preserves cardiac functionality, as confirmed by echocardiography analysis, while also mitigating the development of long-term fibrosis.
Collapse
Affiliation(s)
- Kyle Kim
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Michele M Kim
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Giorgos Skoufos
- Department of Electrical & Computer Engineering, University of Thessaly, Greece; Hellenic Pasteur Institute, Athens, Greece
| | - Eric S Diffenderfer
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Seyyedeh Azar Oliaei Motlagh
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Michail Kokkorakis
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania; Department of Clinical Pharmacy and Pharmacology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Ilektra Koliaki
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania; Department of Clinical Pharmacy and Pharmacology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - George Morcos
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Khayrullo Shoniyozov
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Joanna Griffin
- Department of Medicine, Cardiovascular Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Artemis G Hatzigeorgiou
- Department of Electrical & Computer Engineering, University of Thessaly, Greece; Hellenic Pasteur Institute, Athens, Greece; DIANA-Laboratory, Department of Computer Science and Biomedical Informatics, University of Thessaly, Thessaly, Greece
| | - James M Metz
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Alexander Lin
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Steven J Feigenberg
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Keith A Cengel
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Bonnie Ky
- Department of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Constantinos Koumenis
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania.
| | - Ioannis I Verginadis
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania.
| |
Collapse
|
3
|
Giri SJ, Ibtehaz N, Kihara D. GO2Sum: generating human-readable functional summary of proteins from GO terms. NPJ Syst Biol Appl 2024; 10:29. [PMID: 38491038 PMCID: PMC10943200 DOI: 10.1038/s41540-024-00358-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 03/05/2024] [Indexed: 03/18/2024] Open
Abstract
Understanding the biological functions of proteins is of fundamental importance in modern biology. To represent a function of proteins, Gene Ontology (GO), a controlled vocabulary, is frequently used, because it is easy to handle by computer programs avoiding open-ended text interpretation. Particularly, the majority of current protein function prediction methods rely on GO terms. However, the extensive list of GO terms that describe a protein function can pose challenges for biologists when it comes to interpretation. In response to this issue, we developed GO2Sum (Gene Ontology terms Summarizer), a model that takes a set of GO terms as input and generates a human-readable summary using the T5 large language model. GO2Sum was developed by fine-tuning T5 on GO term assignments and free-text function descriptions for UniProt entries, enabling it to recreate function descriptions by concatenating GO term descriptions. Our results demonstrated that GO2Sum significantly outperforms the original T5 model that was trained on the entire web corpus in generating Function, Subunit Structure, and Pathway paragraphs for UniProt entries.
Collapse
Affiliation(s)
| | - Nabil Ibtehaz
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, USA.
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|
4
|
Solano LE, D’Sa NM, Nikolaidis N. PRRGO: A Tool for Visualizing and Mapping Globally Expressed Genes in Public Gene Expression Omnibus RNA-Sequencing Studies to PageRank-scored Gene Ontology Terms. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.21.576540. [PMID: 38328158 PMCID: PMC10849496 DOI: 10.1101/2024.01.21.576540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/09/2024]
Abstract
We herein report PageRankeR Gene Ontology (PRRGO), a downloadable web application that can integrate differentially expressed gene (DEG) data from the gene expression omnibus (GEO) GEO2R web tool with the gene ontology (GO) database [1]. Unlike existing tools, PRRGO computes the PageRank for the entire GO network and can generate both interactive GO networks on the web interface and comma-separated values (CSV) files containing the DEG statistics categorized by GO term. These hierarchical and tabular GO-DEG data are especially conducive to hypothesis generation and overlap studies with the use of PageRank data, which can provide a metric of GO term centrality. We verified the tool for accuracy and reliability across nine independent heat shock (HS) studies for which the RNA-seq data was publicly available on GEO and found that the tool produced increasing concordance between study DEGs, GO terms, and select HS-specific GO terms.
Collapse
Affiliation(s)
- Luis E. Solano
- Department of Biological Science, Center for Applied Biotechnology Studies, and Center for Computational and Applied Mathematics, College of Natural Sciences and Mathematics, California State University Fullerton, Fullerton, CA 92834-6850
- Center for Complex Biological Systems, University of California, Irvine, Irvine, CA
| | - Nicholas M. D’Sa
- Department of Biological Science, Center for Applied Biotechnology Studies, and Center for Computational and Applied Mathematics, College of Natural Sciences and Mathematics, California State University Fullerton, Fullerton, CA 92834-6850
- University of California, Irvine, Irvine, CA
| | - Nikolas Nikolaidis
- Department of Biological Science, Center for Applied Biotechnology Studies, and Center for Computational and Applied Mathematics, College of Natural Sciences and Mathematics, California State University Fullerton, Fullerton, CA 92834-6850
| |
Collapse
|
5
|
Giri SJ, Ibtehaz N, Kihara D. GO2Sum: Generating Human Readable Functional Summary of Proteins from GO Terms. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.10.566665. [PMID: 38014080 PMCID: PMC10680659 DOI: 10.1101/2023.11.10.566665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Understanding the biological functions of proteins is of fundamental importance in modern biology. To represent function of proteins, Gene Ontology (GO), a controlled vocabulary, is frequently used, because it is easy to handle by computer programs avoiding open-ended text interpretation. Particularly, the majority of current protein function prediction methods rely on GO terms. However, the extensive list of GO terms that describe a protein function can pose challenges for biologists when it comes to interpretation. In response to this issue, we developed GO2Sum (Gene Ontology terms Summarizer), a model that takes a set of GO terms as input and generates a human-readable summary using the T5 large language model. GO2Sum was developed by fine-tuning T5 on GO term assignments and free-text function descriptions for UniProt entries, enabling it to recreate function descriptions by concatenating GO term descriptions. Our results demonstrated that GO2Sum significantly outperforms the original T5 model that was trained on the entire web corpus in generating Function, Subunit Structure, and Pathway paragraphs for UniProt entries.
Collapse
Affiliation(s)
| | - Nabil Ibtehaz
- Department of Computer Science, Purdue University, West Lafayette, IN, United States
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, United States
- Department of Biological Sciences, Purdue University, West Lafayette, IN, United States
| |
Collapse
|
6
|
Ibtehaz N, Kagaya Y, Kihara D. Domain-PFP allows protein function prediction using function-aware domain embedding representations. Commun Biol 2023; 6:1103. [PMID: 37907681 PMCID: PMC10618451 DOI: 10.1038/s42003-023-05476-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Accepted: 10/17/2023] [Indexed: 11/02/2023] Open
Abstract
Domains are functional and structural units of proteins that govern various biological functions performed by the proteins. Therefore, the characterization of domains in a protein can serve as a proper functional representation of proteins. Here, we employ a self-supervised protocol to derive functionally consistent representations for domains by learning domain-Gene Ontology (GO) co-occurrences and associations. The domain embeddings we constructed turned out to be effective in performing actual function prediction tasks. Extensive evaluations showed that protein representations using the domain embeddings are superior to those of large-scale protein language models in GO prediction tasks. Moreover, the new function prediction method built on the domain embeddings, named Domain-PFP, substantially outperformed the state-of-the-art function predictors. Additionally, Domain-PFP demonstrated competitive performance in the CAFA3 evaluation, achieving overall the best performance among the top teams that participated in the assessment.
Collapse
Affiliation(s)
- Nabil Ibtehaz
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Yuki Kagaya
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, USA.
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|
7
|
Ibtehaz N, Kagaya Y, Kihara D. Domain-PFP: Protein Function Prediction Using Function-Aware Domain Embedding Representations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.23.554486. [PMID: 37662252 PMCID: PMC10473699 DOI: 10.1101/2023.08.23.554486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
Domains are functional and structural units of proteins that govern various biological functions performed by the proteins. Therefore, the characterization of domains in a protein can serve as a proper functional representation of proteins. Here, we employ a self-supervised protocol to derive functionally consistent representations for domains by learning domain-Gene Ontology (GO) co-occurrences and associations. The domain embeddings we constructed turned out to be effective in performing actual function prediction tasks. Extensive evaluations showed that protein representations using the domain embeddings are superior to those of large-scale protein language models in GO prediction tasks. Moreover, the new function prediction method built on the domain embeddings, named Domain-PFP, significantly outperformed the state-of-the-art function predictors. Additionally, Domain-PFP demonstrated competitive performance in the CAFA3 evaluation, achieving overall the best performance among the top teams that participated in the assessment.
Collapse
Affiliation(s)
- Nabil Ibtehaz
- Department of Computer Science, Purdue University, West Lafayette, IN, United States
| | - Yuki Kagaya
- Department of Biological Sciences, Purdue University, West Lafayette, IN, United States
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, United States
- Department of Biological Sciences, Purdue University, West Lafayette, IN, United States
| |
Collapse
|
8
|
Piemonte KM, Webb BM, Bobbitt JR, Majmudar PR, Cuellar-Vite L, Bryson BL, Latina NC, Seachrist DD, Keri RA. Disruption of CDK7 signaling leads to catastrophic chromosomal instability coupled with a loss of condensin-mediated chromatin compaction. J Biol Chem 2023; 299:104834. [PMID: 37201585 PMCID: PMC10300262 DOI: 10.1016/j.jbc.2023.104834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 05/11/2023] [Accepted: 05/12/2023] [Indexed: 05/20/2023] Open
Abstract
Chromatin organization is highly dynamic and modulates DNA replication, transcription, and chromosome segregation. Condensin is essential for chromosome assembly during mitosis and meiosis, as well as maintenance of chromosome structure during interphase. While it is well established that sustained condensin expression is necessary to ensure chromosome stability, the mechanisms that control its expression are not yet known. Herein, we report that disruption of cyclin-dependent kinase 7 (CDK7), the core catalytic subunit of CDK-activating kinase, leads to reduced transcription of several condensin subunits, including structural maintenance of chromosomes 2 (SMC2). Live and static microscopy revealed that inhibiting CDK7 signaling prolongs mitosis and induces chromatin bridge formation, DNA double-strand breaks, and abnormal nuclear features, all of which are indicative of mitotic catastrophe and chromosome instability. Affirming the importance of condensin regulation by CDK7, genetic suppression of the expression of SMC2, a core subunit of this complex, phenocopies CDK7 inhibition. Moreover, analysis of genome-wide chromatin conformation using Hi-C revealed that sustained activity of CDK7 is necessary to maintain chromatin sublooping, a function that is ascribed to condensin. Notably, the regulation of condensin subunit gene expression is independent of superenhancers. Together, these studies reveal a new role for CDK7 in sustaining chromatin configuration by ensuring the expression of condensin genes, including SMC2.
Collapse
Affiliation(s)
- Katrina M Piemonte
- Department of Cancer Biology, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA; Department of Pharmacology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Bryan M Webb
- Department of Cancer Biology, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA; Department of Pharmacology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Jessica R Bobbitt
- Department of Cancer Biology, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA; Department of Pathology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Parth R Majmudar
- Department of Cancer Biology, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA; Department of Pharmacology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Leslie Cuellar-Vite
- Department of Cancer Biology, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA; Department of Pharmacology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Benjamin L Bryson
- Department of Cancer Biology, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA
| | - Nicholas C Latina
- Department of Genetics and Genome Sciences, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Darcie D Seachrist
- Department of Cancer Biology, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA
| | - Ruth A Keri
- Department of Cancer Biology, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, USA; Department of Pathology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA; Department of Genetics and Genome Sciences, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA; Department of General Medical Sciences-Oncology, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA.
| |
Collapse
|
9
|
Bozic D, Živančević K, Baralić K, Miljaković EA, Djordjević AB, Ćurčić M, Bulat Z, Antonijević B, Đukić-Ćosić D. Conducting bioinformatics analysis to predict sulforaphane-triggered adverse outcome pathways in healthy human cells. Biomed Pharmacother 2023; 160:114316. [PMID: 36731342 DOI: 10.1016/j.biopha.2023.114316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 01/17/2023] [Accepted: 01/26/2023] [Indexed: 02/04/2023] Open
Abstract
Sulforaphane (SFN) is a naturally occurring molecule present in plants from Brassica family. It becomes bioactive after hydrolytic reaction mediated by myrosinase or human gastrointestinal microbiota. Sulforaphane gained scientific popularity due to its antioxidant and anti-cancer properties. However, its toxicity profile and potential to cause adverse effects remain largely unidentified. Thus, this study aimed to generate SFN-triggered adverse outcome pathway (AOP) by looking at the relationship between SFN-chemical structure and its toxicity, as well as SFN-gene interactions. Quantitative structure-activity relationship (QSAR) analysis identified 2 toxophores (Derek Nexus software) that have the potential to cause chromosomal damage and skin sensitization in mammals or mutagenicity in bacteria. Data extracted from Comparative Toxicogenomics Database (CTD) linked SFN with previously proposed outcomes via gene interactions. The total of 11 and 146 genes connected SFN with chromosomal damage and skin diseases, respectively. However, network analysis (NetworkAnalyst tool) revealed that these genes function in wider networks containing 490 and 1986 nodes, respectively. The over-representation analysis (ExpressAnalyst tool) pointed out crucial biological pathways regulated by SFN-interfering genes. These pathways are uploaded to AOP-helpFinder tool which found the 2321 connections between 19 enriched pathways and SFN which were further considered as key events. Two major, interconnected AOPs were generated: first starting from disruption of biological pathways involved in cell cycle and cell proliferation leading to increased apoptosis, and the second one connecting activated immune system signaling pathways to inflammation and apoptosis. In both cases, chromosomal damage and/or skin diseases such as dermatitis or psoriasis appear as adverse outcomes.
Collapse
Affiliation(s)
- Dragica Bozic
- Department of Toxicology "Akademik Danilo Soldatović", Toxicological Risk Assessment Center, University of Belgrade - Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia.
| | - Katarina Živančević
- Department of Toxicology "Akademik Danilo Soldatović", Toxicological Risk Assessment Center, University of Belgrade - Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia; University of Belgrade - Faculty of Biology, Institute of Physiology and Biochemistry "Ivan Djaja", Center for Laser Microscopy, Studentski trg 16, 11158 Belgrade, Serbia
| | - Katarina Baralić
- Department of Toxicology "Akademik Danilo Soldatović", Toxicological Risk Assessment Center, University of Belgrade - Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia
| | - Evica Antonijević Miljaković
- Department of Toxicology "Akademik Danilo Soldatović", Toxicological Risk Assessment Center, University of Belgrade - Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia
| | - Aleksandra Buha Djordjević
- Department of Toxicology "Akademik Danilo Soldatović", Toxicological Risk Assessment Center, University of Belgrade - Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia
| | - Marijana Ćurčić
- Department of Toxicology "Akademik Danilo Soldatović", Toxicological Risk Assessment Center, University of Belgrade - Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia
| | - Zorica Bulat
- Department of Toxicology "Akademik Danilo Soldatović", Toxicological Risk Assessment Center, University of Belgrade - Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia
| | - Biljana Antonijević
- Department of Toxicology "Akademik Danilo Soldatović", Toxicological Risk Assessment Center, University of Belgrade - Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia
| | - Danijela Đukić-Ćosić
- Department of Toxicology "Akademik Danilo Soldatović", Toxicological Risk Assessment Center, University of Belgrade - Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia
| |
Collapse
|
10
|
Khazaal A, Zandavi SM, Smolnikov A, Fatima S, Vafaee F. Pan-Cancer Analysis Reveals Functional Similarity of Three lncRNAs across Multiple Tumors. Int J Mol Sci 2023; 24:ijms24054796. [PMID: 36902227 PMCID: PMC10003012 DOI: 10.3390/ijms24054796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Revised: 02/24/2023] [Accepted: 02/28/2023] [Indexed: 03/06/2023] Open
Abstract
Long non-coding RNAs (lncRNAs) are emerging as key regulators in many biological processes. The dysregulation of lncRNA expression has been associated with many diseases, including cancer. Mounting evidence suggests lncRNAs to be involved in cancer initiation, progression, and metastasis. Thus, understanding the functional implications of lncRNAs in tumorigenesis can aid in developing novel biomarkers and therapeutic targets. Rich cancer datasets, documenting genomic and transcriptomic alterations together with advancement in bioinformatics tools, have presented an opportunity to perform pan-cancer analyses across different cancer types. This study is aimed at conducting a pan-cancer analysis of lncRNAs by performing differential expression and functional analyses between tumor and non-neoplastic adjacent samples across eight cancer types. Among dysregulated lncRNAs, seven were shared across all cancer types. We focused on three lncRNAs, found to be consistently dysregulated among tumors. It has been observed that these three lncRNAs of interest are interacting with a wide range of genes across different tissues, yet enriching substantially similar biological processes, found to be implicated in cancer progression and proliferation.
Collapse
Affiliation(s)
- Abir Khazaal
- School of Biotechnology and Biomolecular Sciences, Faculty of Science, University of New South Wales, Sydney, NSW 2052, Australia
- UNSW Data Science Hub, University of New South Wales, Sydney, NSW 2052, Australia
| | - Seid Miad Zandavi
- School of Biotechnology and Biomolecular Sciences, Faculty of Science, University of New South Wales, Sydney, NSW 2052, Australia
- Harvard Medical School, Harvard University, Boston, MA 02115, USA
| | - Andrei Smolnikov
- School of Biotechnology and Biomolecular Sciences, Faculty of Science, University of New South Wales, Sydney, NSW 2052, Australia
| | - Shadma Fatima
- School of Biotechnology and Biomolecular Sciences, Faculty of Science, University of New South Wales, Sydney, NSW 2052, Australia
- Ingham Institute of Applied Medical Research, Sydney, NSW 2170, Australia
| | - Fatemeh Vafaee
- School of Biotechnology and Biomolecular Sciences, Faculty of Science, University of New South Wales, Sydney, NSW 2052, Australia
- UNSW Data Science Hub, University of New South Wales, Sydney, NSW 2052, Australia
- Correspondence:
| |
Collapse
|
11
|
Salihoglu R, Srivastava M, Liang C, Schilling K, Szalay A, Bencurova E, Dandekar T. PRO-Simat: Protein network simulation and design tool. Comput Struct Biotechnol J 2023; 21:2767-2779. [PMID: 37181657 PMCID: PMC10172639 DOI: 10.1016/j.csbj.2023.04.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 04/20/2023] [Accepted: 04/20/2023] [Indexed: 05/16/2023] Open
Abstract
PRO-Simat is a simulation tool for analysing protein interaction networks, their dynamic change and pathway engineering. It provides GO enrichment, KEGG pathway analyses, and network visualisation from an integrated database of more than 8 million protein-protein interactions across 32 model organisms and the human proteome. We integrated dynamical network simulation using the Jimena framework, which quickly and efficiently simulates Boolean genetic regulatory networks. It enables simulation outputs with in-depth analysis of the type, strength, duration and pathway of the protein interactions on the website. Furthermore, the user can efficiently edit and analyse the effect of network modifications and engineering experiments. In case studies, applications of PRO-Simat are demonstrated: (i) understanding mutually exclusive differentiation pathways in Bacillus subtilis, (ii) making Vaccinia virus oncolytic by switching on its viral replication mainly in cancer cells and triggering cancer cell apoptosis and (iii) optogenetic control of nucleotide processing protein networks to operate DNA storage. Multilevel communication between components is critical for efficient network switching, as demonstrated by a general census on prokaryotic and eukaryotic networks and comparing design with synthetic networks using PRO-Simat. The tool is available at https://prosimat.heinzelab.de/ as a web-based query server.
Collapse
Affiliation(s)
- Rana Salihoglu
- Department of Bioinformatics, University of Würzburg, Würzburg, Germany
| | - Mugdha Srivastava
- Department of Bioinformatics, University of Würzburg, Würzburg, Germany
- Core Unit Systems Medicine, University of Würzburg, 97080 Würzburg, Germany
| | - Chunguang Liang
- Department of Bioinformatics, University of Würzburg, Würzburg, Germany
| | - Klaus Schilling
- Informatics VII, Robotics and Telematics, Department of Mathematics and Informatics, Am Hubland, University of Würzburg, D-97074 Würzburg, Germany
| | - Aladar Szalay
- Dept. of Biochemistry, Biocenter, Am Hubland, University of Würzburg, D-97074 Würzburg, Germany
- Department of Radiation Medicine and Applied Sciences, Rebecca & John Moores Comprehensive Cancer Center, University of California, San Diego, CA, USA
- Dept. of Pathology, Center of Immune technologies, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Elena Bencurova
- Department of Bioinformatics, University of Würzburg, Würzburg, Germany
- Corresponding author.
| | - Thomas Dandekar
- Department of Bioinformatics, University of Würzburg, Würzburg, Germany
- Structural and Computational Biology, European Molecular Biology Laboratory, Heidelberg, Germany
- Corresponding author at: Structural and Computational Biology, European Molecular Biology Laboratory, Heidelberg, Germany.
| |
Collapse
|
12
|
Zafeiropoulos H, Beracochea M, Ninidakis S, Exter K, Potirakis A, De Moro G, Richardson L, Corre E, Machado J, Pafilis E, Kotoulas G, Santi I, Finn RD, Cox CJ, Pavloudi C. metaGOflow: a workflow for the analysis of marine Genomic Observatories shotgun metagenomics data. Gigascience 2022; 12:giad078. [PMID: 37850871 PMCID: PMC10583283 DOI: 10.1093/gigascience/giad078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 06/30/2023] [Accepted: 09/11/2023] [Indexed: 10/19/2023] Open
Abstract
BACKGROUND Genomic Observatories (GOs) are sites of long-term scientific study that undertake regular assessments of the genomic biodiversity. The European Marine Omics Biodiversity Observation Network (EMO BON) is a network of GOs that conduct regular biological community samplings to generate environmental and metagenomic data of microbial communities from designated marine stations around Europe. The development of an effective workflow is essential for the analysis of the EMO BON metagenomic data in a timely and reproducible manner. FINDINGS Based on the established MGnify resource, we developed metaGOflow. metaGOflow supports the fast inference of taxonomic profiles from GO-derived data based on ribosomal RNA genes and their functional annotation using the raw reads. Thanks to the Research Object Crate packaging, relevant metadata about the sample under study, and the details of the bioinformatics analysis it has been subjected to, are inherited to the data product while its modular implementation allows running the workflow partially. The analysis of 2 EMO BON samples and 1 Tara Oceans sample was performed as a use case. CONCLUSIONS metaGOflow is an efficient and robust workflow that scales to the needs of projects producing big metagenomic data such as EMO BON. It highlights how containerization technologies along with modern workflow languages and metadata package approaches can support the needs of researchers when dealing with ever-increasing volumes of biological data. Despite being initially oriented to address the needs of EMO BON, metaGOflow is a flexible and easy-to-use workflow that can be broadly used for one-sample-at-a-time analysis of shotgun metagenomics data.
Collapse
Affiliation(s)
- Haris Zafeiropoulos
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Former U.S. Base of Gournes, 71003 Heraklion, Crete, Greece
- KU Leuven, Department of Microbiology, Immunology and Transplantation, Rega Institute for Medical Research, Laboratory of Molecular Bacteriology, 3000 Leuven, Belgium
| | - Martin Beracochea
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Stelios Ninidakis
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Former U.S. Base of Gournes, 71003 Heraklion, Crete, Greece
| | - Katrina Exter
- Flanders Marine Institute (VLIZ), 8400 Oostende, Belgium
| | - Antonis Potirakis
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Former U.S. Base of Gournes, 71003 Heraklion, Crete, Greece
| | - Gianluca De Moro
- Centro de Ciências do Mar (CCMAR), Universidade do Algarve, Campus de Gambelas, 8005-139 Faro, Portugal
| | - Lorna Richardson
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Erwan Corre
- CNRS, FR 2424, ABiMS Platform, Station Biologique de Roscoff (SBR), 29680 Roscoff, France
| | - João Machado
- Centro de Ciências do Mar (CCMAR), Universidade do Algarve, Campus de Gambelas, 8005-139 Faro, Portugal
| | - Evangelos Pafilis
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Former U.S. Base of Gournes, 71003 Heraklion, Crete, Greece
| | - Georgios Kotoulas
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Former U.S. Base of Gournes, 71003 Heraklion, Crete, Greece
| | - Ioulia Santi
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Former U.S. Base of Gournes, 71003 Heraklion, Crete, Greece
- European Marine Biological Resource Centre (EMBRC-ERIC), 75005 Paris, France
| | - Robert D Finn
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Cymon J Cox
- Centro de Ciências do Mar (CCMAR), Universidade do Algarve, Campus de Gambelas, 8005-139 Faro, Portugal
| | - Christina Pavloudi
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Former U.S. Base of Gournes, 71003 Heraklion, Crete, Greece
- Department of Biological Sciences, The George Washington University, 20052 Washington, DC, USA
| |
Collapse
|
13
|
Banerjee A, Biswas D, Barpanda A, Halder A, Sibal S, Kattimani R, Shah A, Mahadevan A, Goel A, Srivastava S. The First Pituitary Proteome Landscape From Matched Anterior and Posterior Lobes for a Better Understanding of the Pituitary Gland. Mol Cell Proteomics 2022; 22:100478. [PMID: 36470533 PMCID: PMC9877467 DOI: 10.1016/j.mcpro.2022.100478] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 11/29/2022] [Accepted: 11/30/2022] [Indexed: 12/12/2022] Open
Abstract
To date, very few mass spectrometry (MS)-based proteomics studies are available on the anterior and posterior lobes of the pituitary. In the past, MS-based investigations have focused exclusively on the whole pituitary gland or anterior pituitary lobe. In this study, for the first time, we performed a deep MS-based analysis of five anterior and five posterior matched lobes to build the first lobe-specific pituitary proteome map, which documented 4090 proteins with isoforms, mostly mapped into chromosomes 1, 2, and 11. About 1446 differentially expressed significant proteins were identified, which were studied for lobe specificity, biological pathway enrichment, protein-protein interaction, regions specific to comparison of human brain and other neuroendocrine glands from Human Protein Atlas to identify pituitary-enriched proteins. Hormones specific to each lobe were also identified and validated with parallel reaction monitoring-based target verification. The study identified and validated hormones, growth hormone and thyroid-stimulating hormone subunit beta, exclusively to the anterior lobe whereas oxytocin-neurophysin 1 and arginine vasopressin to the posterior lobe. The study also identified proteins POU1F1 (pituitary-specific positive transcription factor 1), POMC (pro-opiomelanocortin), PCOLCE2 (procollagen C-endopeptidase enhancer 2), and NPTX2 (neuronal pentraxin-2) as pituitary-enriched proteins and was validated for their lobe specificity using parallel reaction monitoring. In addition, three uPE1 proteins, namely THEM6 (mesenchymal stem cell protein DSCD75), FSD1L (coiled-coil domain-containing protein 10), and METTL26 (methyltransferase-like 26), were identified using the NeXtProt database, and depicted tumor markers S100 proteins having high expression in the posterior lobe. In summary, the study documents the first matched anterior and posterior pituitary proteome map acting as a reference control for a better understanding of functional and nonfunctional pituitary adenomas and extrapolating the aim of the Human Proteome Project towards the investigation of the proteome of life.
Collapse
Affiliation(s)
- Arghya Banerjee
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Mumbai, India
| | - Deepatarup Biswas
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Mumbai, India
| | - Abhilash Barpanda
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Mumbai, India
| | - Ankit Halder
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Mumbai, India
| | - Shamira Sibal
- Lokmanya Tilak Municipal Medical College, Mumbai, India
| | | | - Abhidha Shah
- Department of Neurosurgery at King Edward Memorial Hospital and Seth G. S. Medical College, Mumbai, India
| | - Anita Mahadevan
- Human Brain Bank, National Institute of Mental Health and Neuro Sciences (NIMHANS), Bangalore, India
| | - Atul Goel
- Department of Neurosurgery at King Edward Memorial Hospital and Seth G. S. Medical College, Mumbai, India
| | - Sanjeeva Srivastava
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Mumbai, India.
| |
Collapse
|
14
|
Bartolek Z, Creveld SGV, Coesel S, Cain KR, Schatz M, Morales R, Virginia Armbrust E. Flavobacterial exudates disrupt cell cycle progression and metabolism of the diatom Thalassiosira pseudonana. THE ISME JOURNAL 2022; 16:2741-2751. [PMID: 36104452 PMCID: PMC9666458 DOI: 10.1038/s41396-022-01313-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 08/22/2022] [Accepted: 08/24/2022] [Indexed: 12/15/2022]
Abstract
Phytoplankton and bacteria form the base of marine ecosystems and their interactions drive global biogeochemical cycles. The effects of bacteria and bacteria-produced compounds on diatoms range from synergistic to pathogenic and can affect the physiology and transcriptional patterns of the interacting diatom. Here, we investigate physiological and transcriptional changes in the marine diatom Thalassiosira pseudonana induced by extracellular metabolites of a known antagonistic bacterium Croceibacter atlanticus. Mono-cultures of C. atlanticus released compounds that inhibited diatom cell division and elicited a distinctive morphology of enlarged cells with increased chloroplast content and enlarged nuclei, similar to what was previously observed when the diatom was co-cultured with live bacteria. The extracellular C. atlanticus metabolites induced transcriptional changes in diatom pathways that include recognition and signaling pathways, cell cycle regulation, carbohydrate and amino acid production, as well as cell wall stability. Phenotypic analysis showed a disruption in the diatom cell cycle progression and an increase in both intra- and extracellular carbohydrates in diatom cultures after bacterial exudate treatment. The transcriptional changes and corresponding phenotypes suggest that extracellular bacterial metabolites, produced independently of direct bacterial-diatom interaction, may modulate diatom metabolism in ways that support bacterial growth.
Collapse
Affiliation(s)
- Zinka Bartolek
- grid.34477.330000000122986657School of Oceanography, University of Washington, Seattle, WA 98195 USA
| | - Shiri Graff van Creveld
- grid.34477.330000000122986657School of Oceanography, University of Washington, Seattle, WA 98195 USA
| | - Sacha Coesel
- grid.34477.330000000122986657School of Oceanography, University of Washington, Seattle, WA 98195 USA
| | - Kelsy R. Cain
- grid.34477.330000000122986657School of Oceanography, University of Washington, Seattle, WA 98195 USA
| | - Megan Schatz
- grid.34477.330000000122986657School of Oceanography, University of Washington, Seattle, WA 98195 USA
| | - Rhonda Morales
- grid.34477.330000000122986657School of Oceanography, University of Washington, Seattle, WA 98195 USA
| | - E. Virginia Armbrust
- grid.34477.330000000122986657School of Oceanography, University of Washington, Seattle, WA 98195 USA
| |
Collapse
|
15
|
Toh H, Yang C, Formenti G, Raja K, Yan L, Tracey A, Chow W, Howe K, Bergeron LA, Zhang G, Haase B, Mountcastle J, Fedrigo O, Fogg J, Kirilenko B, Munegowda C, Hiller M, Jain A, Kihara D, Rhie A, Phillippy AM, Swanson SA, Jiang P, Clegg DO, Jarvis ED, Thomson JA, Stewart R, Chaisson MJP, Bukhman YV. A haplotype-resolved genome assembly of the Nile rat facilitates exploration of the genetic basis of diabetes. BMC Biol 2022; 20:245. [DOI: 10.1186/s12915-022-01427-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 09/29/2022] [Indexed: 11/09/2022] Open
Abstract
Abstract
Background
The Nile rat (Avicanthis niloticus) is an important animal model because of its robust diurnal rhythm, a cone-rich retina, and a propensity to develop diet-induced diabetes without chemical or genetic modifications. A closer similarity to humans in these aspects, compared to the widely used Mus musculus and Rattus norvegicus models, holds the promise of better translation of research findings to the clinic.
Results
We report a 2.5 Gb, chromosome-level reference genome assembly with fully resolved parental haplotypes, generated with the Vertebrate Genomes Project (VGP). The assembly is highly contiguous, with contig N50 of 11.1 Mb, scaffold N50 of 83 Mb, and 95.2% of the sequence assigned to chromosomes. We used a novel workflow to identify 3613 segmental duplications and quantify duplicated genes. Comparative analyses revealed unique genomic features of the Nile rat, including some that affect genes associated with type 2 diabetes and metabolic dysfunctions. We discuss 14 genes that are heterozygous in the Nile rat or highly diverged from the house mouse.
Conclusions
Our findings reflect the exceptional level of genomic resolution present in this assembly, which will greatly expand the potential of the Nile rat as a model organism.
Collapse
|
16
|
Ayub U, Naveed H. BioAlign: An Accurate Global PPI Network Alignment Algorithm. Evol Bioinform Online 2022; 18:11769343221110658. [PMID: 35898232 PMCID: PMC9309777 DOI: 10.1177/11769343221110658] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 06/02/2022] [Indexed: 11/15/2022] Open
Abstract
Motivation The advancement of high-throughput PPI profiling techniques results in generating a large amount of PPI data. The alignment of the PPI networks uncovers the relationship between the species that can help understand the biological systems. The comparative study reveals the conserved biological interactions of the proteins across the species. It can also help study the biological pathways and signal networks of the cells. Although several network alignment algorithms are developed to study and compare the PPI data, the development of the aligner that aligns the PPI networks with high biological similarity and coverage is still challenging. Results This paper presents a novel global network alignment algorithm, BioAlign, that incorporates a significant amount of biological information. Existing studies use global sequence and/or 3D-structure similarity to align the PPI networks. In contrast, BioAlign uses the local sequence similarity, predicted secondary structure motifs, and remote homology in addition to global sequence and 3D-structure similarity. The extra sources of biological information help BioAlign to align the proteins with high biological similarity. BioAlign produces significantly better results in terms of AFS and Coverage (6-32 and 7-34 with respect to MF and BP, respectively) than the existing algorithms. BioAlign aligns a much larger number of proteins that have high biological similarities as compared to the existing aligners. BioAlign helps in studying the functionally similar protein pairs across the species.
Collapse
Affiliation(s)
- Umair Ayub
- FAST School of Computing, National
University of Computer and Emerging Sciences, Lahore, Pakistan
- Computational Biology Research Lab,
Department of Computing, National University of Computer and Emerging Sciences,
Islamabad, Pakistan
| | - Hammad Naveed
- FAST School of Computing, National
University of Computer and Emerging Sciences, Lahore, Pakistan
- Computational Biology Research Lab,
Department of Computing, National University of Computer and Emerging Sciences,
Islamabad, Pakistan
- Hammad Naveed, Computational Biology
Research Lab, Department of Computing, National University of Computer and
Emerging Sciences, 852 Milaad Street, Block B, Faisal Town, Lahore, Pakistan.
| |
Collapse
|
17
|
Comparative Transcriptome Analysis of Organ-Specific Adaptive Responses to Hypoxia Provides Insights to Human Diseases. Genes (Basel) 2022; 13:genes13061096. [PMID: 35741857 PMCID: PMC9222487 DOI: 10.3390/genes13061096] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Revised: 06/15/2022] [Accepted: 06/17/2022] [Indexed: 02/01/2023] Open
Abstract
The common carp is a hypoxia-tolerant fish, and the understanding of its ability to live in low-oxygen environments has been applied to human health issues such as cancer and neuron degeneration. Here, we investigated differential gene expression changes during hypoxia in five common carp organs including the brain, the gill, the head kidney, the liver, and the intestine. Based on RNA sequencing, gene expression changes under hypoxic conditions were detected in over 1800 genes in common carp. The analysis of these genes further revealed that all five organs had high expression-specific properties. According to the results of the GO and KEGG, the pathways involved in the adaptation to hypoxia provided information on responses specific to each organ in low oxygen, such as glucose metabolism and energy usage, cholesterol synthesis, cell cycle, circadian rhythm, and dopamine activation. DisGeNET analysis showed that some human diseases such as cancer, diabetes, epilepsy, metabolism diseases, and social ability disorders were related to hypoxia-regulated genes. Our results suggested that common carp undergo various gene regulations in different organs under hypoxic conditions, and integrative bioinformatics may provide some potential targets for advancing disease research.
Collapse
|
18
|
Saxena R, Bishnoi R, Singla D. Gene Ontology: application and importance in functional annotation of the genomic data. Bioinformatics 2022. [DOI: 10.1016/b978-0-323-89775-4.00015-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
|
19
|
Marini F, Ludt A, Linke J, Strauch K. GeneTonic: an R/Bioconductor package for streamlining the interpretation of RNA-seq data. BMC Bioinformatics 2021; 22:610. [PMID: 34949163 PMCID: PMC8697502 DOI: 10.1186/s12859-021-04461-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Accepted: 10/26/2021] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND The interpretation of results from transcriptome profiling experiments via RNA sequencing (RNA-seq) can be a complex task, where the essential information is distributed among different tabular and list formats-normalized expression values, results from differential expression analysis, and results from functional enrichment analyses. A number of tools and databases are widely used for the purpose of identification of relevant functional patterns, yet often their contextualization within the data and results at hand is not straightforward, especially if these analytic components are not combined together efficiently. RESULTS We developed the GeneTonic software package, which serves as a comprehensive toolkit for streamlining the interpretation of functional enrichment analyses, by fully leveraging the information of expression values in a differential expression context. GeneTonic is implemented in R and Shiny, leveraging packages that enable HTML-based interactive visualizations for executing drilldown tasks seamlessly, viewing the data at a level of increased detail. GeneTonic is integrated with the core classes of existing Bioconductor workflows, and can accept the output of many widely used tools for pathway analysis, making this approach applicable to a wide range of use cases. Users can effectively navigate interlinked components (otherwise available as flat text or spreadsheet tables), bookmark features of interest during the exploration sessions, and obtain at the end a tailored HTML report, thus combining the benefits of both interactivity and reproducibility. CONCLUSION GeneTonic is distributed as an R package in the Bioconductor project ( https://bioconductor.org/packages/GeneTonic/ ) under the MIT license. Offering both bird's-eye views of the components of transcriptome data analysis and the detailed inspection of single genes, individual signatures, and their relationships, GeneTonic aims at simplifying the process of interpretation of complex and compelling RNA-seq datasets for many researchers with different expertise profiles.
Collapse
Affiliation(s)
- Federico Marini
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center of the Johannes Gutenberg University Mainz, Obere Zahlbacher Str. 69, 55131 Mainz, Germany
- Center for Thrombosis and Hemostasis (CTH), University Medical Center of the Johannes Gutenberg University Mainz, Langenbeckstr. 1, 55131 Mainz, Germany
| | - Annekathrin Ludt
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center of the Johannes Gutenberg University Mainz, Obere Zahlbacher Str. 69, 55131 Mainz, Germany
| | - Jan Linke
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center of the Johannes Gutenberg University Mainz, Obere Zahlbacher Str. 69, 55131 Mainz, Germany
- Center for Thrombosis and Hemostasis (CTH), University Medical Center of the Johannes Gutenberg University Mainz, Langenbeckstr. 1, 55131 Mainz, Germany
| | - Konstantin Strauch
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center of the Johannes Gutenberg University Mainz, Obere Zahlbacher Str. 69, 55131 Mainz, Germany
| |
Collapse
|
20
|
Jung HJ, Coleman R, Woodward OM, Welling PA. Doxycycline Changes the Transcriptome Profile of mIMCD3 Renal Epithelial Cells. Front Physiol 2021; 12:771691. [PMID: 34803745 PMCID: PMC8602682 DOI: 10.3389/fphys.2021.771691] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Accepted: 10/18/2021] [Indexed: 12/12/2022] Open
Abstract
Tetracycline-inducible gene expression systems have been used successfully to study gene function in vivo and in vitro renal epithelial models but the effects of the common inducing agent, doxycycline (DOX), on gene expression are not well appreciated. Here, we evaluated the DOX effects on the transcriptome of a widely used renal epithelial cell model, mIMCD3 cells, to establish a reference. Cells were grown on permeable filter supports in the absence and presence of DOX (3 or 6 days), and genome-wide transcriptome profiles were assessed using RNA-Seq. We found DOX significantly altered the transcriptome profile, changing the abundance of 1,549 transcripts at 3 days and 2,643 transcripts at 6 days. Within 3 days of treatment, DOX significantly decreased the expression of multiple signaling pathways (ERK, cAMP, and Notch) that are associated with cell proliferation and differentiation. Genes associated with cell cycle progression were subsequently downregulated in cells treated with DOX for 6 days, as were genes involved in cellular immune response processes and several cytokines and chemokines, correlating with a remarkable repression of genes encoding cell proliferation markers. The results provide new insight into responses of renal epithelial cells to DOX and a establish a resource for DOX-mediated gene expression systems.
Collapse
Affiliation(s)
- Hyun Jun Jung
- Division of Nephrology, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, United States
| | - Richard Coleman
- Division of Nephrology, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, United States
| | - Owen M Woodward
- Department of Physiology, University of Maryland School of Medicine, Baltimore, MD, United States
| | - Paul A Welling
- Division of Nephrology, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, United States.,Department of Physiology, Johns Hopkins University School of Medicine, Baltimore, MD, United States
| |
Collapse
|
21
|
Disturbance of phylogenetic layer-specific adaptation of human brain gene expression in Alzheimer's disease. Sci Rep 2021; 11:20200. [PMID: 34642398 PMCID: PMC8511061 DOI: 10.1038/s41598-021-99760-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 09/28/2021] [Indexed: 11/08/2022] Open
Abstract
Alzheimer's disease (AD) is a progressive neurodegenerative disorder with typical neuropathological hallmarks, such as neuritic plaques and neurofibrillary tangles, preferentially found at layers III and V. The distribution of both hallmarks provides the basis for the staging of AD, following a hierarchical pattern throughout the cerebral cortex. To unravel the background of this layer-specific vulnerability, we evaluated differential gene expression of supragranular and infragranular layers and subcortical white matter in both healthy controls and AD patients. We identified AD-associated layer-specific differences involving protein-coding and non-coding sequences, most of those present in the subcortical white matter, thus indicating a critical role for long axons and oligodendrocytes in AD pathomechanism. In addition, GO analysis identified networks containing synaptic vesicle transport, vesicle exocytosis and regulation of neurotransmitter levels. Numerous AD-associated layer-specifically expressed genes were previously reported to undergo layer-specific switches in recent hominid brain evolution between layers V and III, i.e., those layers that are most vulnerable to AD pathology. Against the background of our previous finding of accelerated evolution of AD-specific gene expression, here we suggest a critical role in AD pathomechanism for this phylogenetic layer-specific adaptation of gene expression, which is most prominently seen in the white matter compartment.
Collapse
|
22
|
Animesh S, Choudhary R, Wong BJH, Koh CTJ, Ng XY, Tay JKX, Chong WQ, Jian H, Chen L, Goh BC, Fullwood MJ. Profiling of 3D Genome Organization in Nasopharyngeal Cancer Needle Biopsy Patient Samples by a Modified Hi-C Approach. Front Genet 2021; 12:673530. [PMID: 34539729 PMCID: PMC8446523 DOI: 10.3389/fgene.2021.673530] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2021] [Accepted: 07/31/2021] [Indexed: 11/16/2022] Open
Abstract
Nasopharyngeal cancer (NPC), a cancer derived from epithelial cells in the nasopharynx, is a cancer common in China, Southeast Asia, and Africa. The three-dimensional (3D) genome organization of nasopharyngeal cancer is poorly understood. A major challenge in understanding the 3D genome organization of cancer samples is the lack of a method for the characterization of chromatin interactions in solid cancer needle biopsy samples. Here, we developed Biop-C, a modified in situ Hi-C method using solid cancer needle biopsy samples. We applied Biop-C to characterize three nasopharyngeal cancer solid cancer needle biopsy patient samples. We identified topologically associated domains (TADs), chromatin interaction loops, and frequently interacting regions (FIREs) at key oncogenes in nasopharyngeal cancer from the Biop-C heatmaps. We observed that the genomic features are shared at some important oncogenes, but the patients also display extensive heterogeneity at certain genomic loci. On analyzing the super enhancer landscape in nasopharyngeal cancer cell lines, we found that the super enhancers are associated with FIREs and can be linked to distal genes via chromatin loops in NPC. Taken together, our results demonstrate the utility of our Biop-C method in investigating 3D genome organization in solid cancers.
Collapse
Affiliation(s)
- Sambhavi Animesh
- Cancer Science Institute of Singapore, Centre for Translational Medicine, National University of Singapore, Singapore, Singapore
| | - Ruchi Choudhary
- Cancer Science Institute of Singapore, Centre for Translational Medicine, National University of Singapore, Singapore, Singapore.,School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | | | - Charlotte Tze Jia Koh
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Xin Yi Ng
- Department of Haematology-Oncology, National University Cancer Institute, National University Health System, Singapore, Singapore
| | - Joshua Kai Xun Tay
- Department of Otolaryngology - Head and Neck Surgery, National University of Singapore, Singapore, Singapore
| | - Wan-Qin Chong
- Department of Haematology-Oncology, National University Cancer Institute, National University Health System, Singapore, Singapore
| | - Han Jian
- Cancer Science Institute of Singapore, Centre for Translational Medicine, National University of Singapore, Singapore, Singapore
| | - Leilei Chen
- Cancer Science Institute of Singapore, Centre for Translational Medicine, National University of Singapore, Singapore, Singapore.,Department of Anatomy, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Boon Cher Goh
- Cancer Science Institute of Singapore, Centre for Translational Medicine, National University of Singapore, Singapore, Singapore.,Department of Haematology-Oncology, National University Cancer Institute, National University Health System, Singapore, Singapore.,Department of Pharmacology, Yong Loo Lin School of Medicine, National University Health System, Singapore, Singapore
| | - Melissa Jane Fullwood
- Cancer Science Institute of Singapore, Centre for Translational Medicine, National University of Singapore, Singapore, Singapore.,School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| |
Collapse
|
23
|
TGF-β/activin signaling promotes CDK7 inhibitor resistance in triple-negative breast cancer cells through upregulation of multidrug transporters. J Biol Chem 2021; 297:101162. [PMID: 34481843 PMCID: PMC8498470 DOI: 10.1016/j.jbc.2021.101162] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2021] [Revised: 08/23/2021] [Accepted: 08/31/2021] [Indexed: 01/10/2023] Open
Abstract
Cyclin-dependent kinase 7 (CDK7) is a master regulatory kinase that drives cell cycle progression and stimulates expression of oncogenes in a myriad of cancers. Inhibitors of CDK7 (CDK7i) are currently in clinical trials; however, as with many cancer therapies, patients will most likely experience recurrent disease due to acquired resistance. Identifying targets underlying CDK7i resistance will facilitate prospective development of new therapies that can circumvent such resistance. Here we utilized triple-negative breast cancer as a model to discern mechanisms of resistance as it has been previously shown to be highly responsive to CDK7 inhibitors. After generating cell lines with acquired resistance, high-throughput RNA sequencing revealed significant upregulation of genes associated with efflux pumps and transforming growth factor-beta (TGF-β) signaling pathways. Genetic silencing or pharmacological inhibition of ABCG2, an efflux pump associated with multidrug resistance, resensitized resistant cells to CDK7i, indicating a reliance on these transporters. Expression of activin A (INHBA), a member of the TGF-β family of ligands, was also induced, whereas its intrinsic inhibitor, follistatin (FST), was repressed. In resistant cells, increased phosphorylation of SMAD3, a downstream mediator, confirmed an increase in activin signaling, and phosphorylated SMAD3 directly bound the ABCG2 promoter regulatory region. Finally, pharmacological inhibition of TGF-β/activin receptors or genetic silencing of SMAD4, a transcriptional partner of SMAD3, reversed the upregulation of ABCG2 in resistant cells and phenocopied ABCG2 inhibition. This study reveals that inhibiting the TGF-β/Activin-ABCG2 pathway is a potential avenue for preventing or overcoming resistance to CDK7 inhibitors.
Collapse
|
24
|
Wang S, Zhong Y, Cheng J, Yang H. EnrichVisBox: A Versatile and Powerful Web Toolbox for Visualizing Complex Functional Enrichment Results of Omics Data. J Comput Biol 2021; 28:922-930. [PMID: 34271847 DOI: 10.1089/cmb.2020.0564] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Efficient visualization helps researchers obtain valuable mechanistic insights and present interesting results with regard to the functional enrichment analysis of omics data. However, the functions of existing published tools used to implement relevant visualization are neither sufficiently comprehensive nor easily accessible. Most of these tools require users to have professional programming skills. This study alleviates this issue by proposing EnrichVisBox, a web application developed for integrative and versatile data visualization, including bubble plots, UpSet plots, polar bar plots, rectangle plots, ridgeline plots, network plots, and variant chord plots. Specifically, scientists can use these insightful plots to conveniently present functional enrichment analysis results of omics data with a simple mouse click through a user-friendly interface.
Collapse
Affiliation(s)
- Shisheng Wang
- Frontiers Science Center for Disease-Related Molecular Network, Institutes for Systems Genetics, Key Lab of Transplant Engineering and Immunology, MOH, West China Hospital, Sichuan University, Chengdu, China
| | - Yi Zhong
- Frontiers Science Center for Disease-Related Molecular Network, Institutes for Systems Genetics, Key Lab of Transplant Engineering and Immunology, MOH, West China Hospital, Sichuan University, Chengdu, China
| | - Jingqiu Cheng
- Frontiers Science Center for Disease-Related Molecular Network, Institutes for Systems Genetics, Key Lab of Transplant Engineering and Immunology, MOH, West China Hospital, Sichuan University, Chengdu, China
| | - Hao Yang
- Frontiers Science Center for Disease-Related Molecular Network, Institutes for Systems Genetics, Key Lab of Transplant Engineering and Immunology, MOH, West China Hospital, Sichuan University, Chengdu, China
| |
Collapse
|
25
|
Habib N, Rahman MM. Diagnosis of corona diseases from associated genes and X-ray images using machine learning algorithms and deep CNN. INFORMATICS IN MEDICINE UNLOCKED 2021; 24:100621. [PMID: 34075341 PMCID: PMC8159714 DOI: 10.1016/j.imu.2021.100621] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 05/18/2021] [Accepted: 05/24/2021] [Indexed: 01/15/2023] Open
Abstract
Novel Coronavirus with its highly transmittable characteristics is rapidly spreading, endangering millions of human lives and the global economy. To expel the chain of alteration and subversive expansion, early and effective diagnosis of infected patients is immensely important. Unfortunately, there is a lack of testing equipment in many countries as compared with the number of infected patients. It would be desirable to have a swift diagnosis with identification of COVID-19 from disease genes or from CT or X-Ray images. COVID-19 causes flus, cough, pneumonia, and lung infection in patients, wherein massive alveolar damage and progressive respiratory failure can lead to death. This paper proposes two different detection methods - the first is a Gene-based screening method to detect Corona diseases (Middle East respiratory syndrome-related coronavirus, Severe acute respiratory syndrome coronavirus 2, and Human coronavirus HKU1) and differentiate it from Pneumonia. This novel approach to healthcare utilizes disease genes to build functional semantic similarity among genes. Different machine learning algorithms - eXtreme Gradient Boosting, Naïve Bayes, Regularized Random Forest, Random Forest Rule-Based Model, Random Ferns, C5.0 and Multi-Layer Perceptron, are trained and tested on the semantic similarities to classify Corona and Pneumonia diseases. The best performing models are then ensembled, yielding an accuracy of nearly 93%. The second diagnosis technique proposed herein is an automated COVID-19 diagnostic method which uses chest X-ray images to classify Normal versus COVID-19 and Pneumonia versus COVID-19 images using the deep-CNN technique, achieving 99.87% and 99.48% test accuracy. Thus, this research can be an assistance for providing better treatment against COVID-19.
Collapse
Affiliation(s)
- Nahida Habib
- Department of Computer Science and Engineering (CSE), Mawlana Bhashani Science and Technology University (MBSTU), Santosh, Tangail, 1902, Bangladesh
- Department of Computer Science and Engineering (CSE), Ranada Prasad Shaha University (RPSU), Narayanganj, 1400, Bangladesh
| | - Mohammad Motiur Rahman
- Department of Computer Science and Engineering (CSE), Mawlana Bhashani Science and Technology University (MBSTU), Santosh, Tangail, 1902, Bangladesh
| |
Collapse
|
26
|
Askland KD, Strong D, Wright MN, Moore JH. The Translational Machine: A novel machine-learning approach to illuminate complex genetic architectures. Genet Epidemiol 2021; 45:485-536. [PMID: 33942369 DOI: 10.1002/gepi.22383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Revised: 03/05/2021] [Accepted: 03/23/2021] [Indexed: 11/08/2022]
Abstract
The Translational Machine (TM) is a machine learning (ML)-based analytic pipeline that translates genotypic/variant call data into biologically contextualized features that richly characterize complex variant architectures and permit greater interpretability and biological replication. It also reduces potentially confounding effects of population substructure on outcome prediction. The TM consists of three main components. First, replicable but flexible feature engineering procedures translate genome-scale data into biologically informative features that appropriately contextualize simple variant calls/genotypes within biological and functional contexts. Second, model-free, nonparametric ML-based feature filtering procedures empirically reduce dimensionality and noise of both original genotype calls and engineered features. Third, a powerful ML algorithm for feature selection is used to differentiate risk variant contributions across variant frequency and functional prediction spectra. The TM simultaneously evaluates potential contributions of variants operative under polygenic and heterogeneous models of genetic architecture. Our TM enables integration of biological information (e.g., genomic annotations) within conceptual frameworks akin to geneset-/pathways-based and collapsing methods, but overcomes some of these methods' limitations. The full TM pipeline is executed in R. Our approach and initial findings from its application to a whole-exome schizophrenia case-control data set are presented. These TM procedures extend the findings of the primary investigation and yield novel results.
Collapse
Affiliation(s)
- Kathleen D Askland
- Waypoint Centre for Mental Health Care Penetanguishene, University of Toronto, Toronto, Ontario, Canada
| | - David Strong
- Department of Family Medicine and Public Health, University of California San Diego, San Diego, California, USA
| | - Marvin N Wright
- Department Biometry and Data Management, Leibniz Institute for Prevention Research and Epidemiology - BIPS GmbH, Germany
| | - Jason H Moore
- Department of Biostatistics, Epidemiology, & Informatics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
27
|
Song HR, Kim HK, Kim SG, Lim HJ, Kim HY, Han MK. Changes in the phosphorylation of nucleotide metabolism‑associated proteins by leukemia inhibitory factor in mouse embryonic stem cells. Mol Med Rep 2021; 23:431. [PMID: 33846773 PMCID: PMC8060798 DOI: 10.3892/mmr.2021.12070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2020] [Accepted: 09/22/2020] [Indexed: 11/05/2022] Open
Abstract
Leukemia inhibitory factor (LIF) is a stem cell growth factor that maintains self‑renewal of mouse embryonic stem cells (mESCs). LIF is a cytokine in the interleukin‑6 family and signals via the common receptor subunit gp130 and ligand‑specific LIF receptor. LIF causes heterodimerization of the LIF receptor and gp130, activating the Janus kinase/STAT and MAPK pathways, resulting in changes in protein phosphorylation. The present study profiled LIF‑mediated protein phosphorylation changes in mESCs via proteomic analysis. mESCs treated in the presence or absence of LIF were analyzed via two‑dimensional differential in‑gel electrophoresis and protein and phosphoprotein staining. Protein identification was performed by matrix‑assisted laser desorption/ionization‑time of flight mass spectrophotometry. Increased phosphorylation of 16 proteins and decreased phosphorylation of 34 proteins in response to LIF treatment was detected. Gene Ontology terms enriched in these proteins included 'organonitrogen compound metabolic process', 'regulation of mRNA splicing via spliceosome' and 'nucleotide metabolic process'. The present results revealed that LIF modulated phosphorylation levels of nucleotide metabolism‑associated proteins, thus providing insight into the mechanism underlying LIF action in mESCs.
Collapse
Affiliation(s)
- Hwa-Ryung Song
- Department of Microbiology, Jeonbuk National University Medical School, Jeonju, Jeollabuk 54896, Republic of Korea
| | - Han-Kyu Kim
- Department of Microbiology, Jeonbuk National University Medical School, Jeonju, Jeollabuk 54896, Republic of Korea
| | - Seung-Gook Kim
- Department of Microbiology, Jeonbuk National University Medical School, Jeonju, Jeollabuk 54896, Republic of Korea
| | - Hyung-Jin Lim
- Department of Microbiology, Jeonbuk National University Medical School, Jeonju, Jeollabuk 54896, Republic of Korea
| | - Hyun-Yi Kim
- Division of Anatomy and Developmental Biology, Department of Oral Biology, Yonsei University College of Dentistry, Seoul 03722, Republic of Korea
| | - Myung-Kwan Han
- Department of Microbiology, Jeonbuk National University Medical School, Jeonju, Jeollabuk 54896, Republic of Korea
| |
Collapse
|
28
|
Vogrinc D, Goričar K, Dolžan V. Genetic Variability in Molecular Pathways Implicated in Alzheimer's Disease: A Comprehensive Review. Front Aging Neurosci 2021; 13:646901. [PMID: 33815092 PMCID: PMC8012500 DOI: 10.3389/fnagi.2021.646901] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Accepted: 02/16/2021] [Indexed: 12/14/2022] Open
Abstract
Alzheimer's disease (AD) is a complex neurodegenerative disease, affecting a significant part of the population. The majority of AD cases occur in the elderly with a typical age of onset of the disease above 65 years. AD presents a major burden for the healthcare system and since population is rapidly aging, the burden of the disease will increase in the future. However, no effective drug treatment for a full-blown disease has been developed to date. The genetic background of AD is extensively studied; numerous genome-wide association studies (GWAS) identified significant genes associated with increased risk of AD development. This review summarizes more than 100 risk loci. Many of them may serve as biomarkers of AD progression, even in the preclinical stage of the disease. Furthermore, we used GWAS data to identify key pathways of AD pathogenesis: cellular processes, metabolic processes, biological regulation, localization, transport, regulation of cellular processes, and neurological system processes. Gene clustering into molecular pathways can provide background for identification of novel molecular targets and may support the development of tailored and personalized treatment of AD.
Collapse
Affiliation(s)
| | | | - Vita Dolžan
- Pharmacogenetics Laboratory, Institute of Biochemistry and Molecular Genetics, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| |
Collapse
|
29
|
Tahira A, Marques F, Lisboa B, Feltrin A, Barbosa A, de Oliveira KC, de Bragança Pereira CA, Leite R, Grinberg L, Suemoto C, de Lucena Ferretti-Rebustini RE, Pasqualucci CA, Jacob-Filho W, Brentani H, Palha JA. Are the 50's, the transition decade, in choroid plexus aging? GeroScience 2021; 43:225-237. [PMID: 33576945 PMCID: PMC8050122 DOI: 10.1007/s11357-021-00329-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Accepted: 01/25/2021] [Indexed: 12/13/2022] Open
Abstract
The choroid plexus (CP) is an important structure for the brain. Besides its major role in the production of cerebrospinal fluid (CSF), it conveys signals originating from the brain, and from the circulatory system, shaping brain function in health and in pathology. Previous studies in rodents have revealed altered transcriptome both during aging and in various diseases of the central nervous system, including Alzheimer's disease. In the present study, a high-throughput sequencing of the CP transcriptome was performed in postmortem samples of clinically healthy individuals aged 50's through 80's. The data shows an age-related profile, with the main changes occurring in the transition from the 50's to the 60's, stabilizing thereafter. Specifically, neuronal and membrane functions distinguish the transcriptome between the 50's and the 60's, while neuronal and axon development and extracellular structure organization differentiate the 50's from the 70's. These findings suggest that changes in the CP transcriptome occur early in the aging process. Future studies will unravel whether these relate with processes occurring in late- onset brain diseases.
Collapse
Affiliation(s)
- Ana Tahira
- LIM23, Instituto de Psiquiatria, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Fernanda Marques
- Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, Braga, Portugal
- ICVS/3B's-PT Government Associate Laboratory, Braga/Guimarães, Portugal
| | - Bianca Lisboa
- LIM23, Instituto de Psiquiatria, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Arthur Feltrin
- LIM23, Instituto de Psiquiatria, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
- Center of Mathematics, Computing and Cognition, Federal University of ABC, Santo André, SP, Brazil
| | - André Barbosa
- LIM23, Instituto de Psiquiatria, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
- Inter-institutional Grad Program on Bioinformatics, University of São Paulo, São Paulo, SP, Brazil
| | - Kátia Cristina de Oliveira
- LIM23, Instituto de Psiquiatria, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
- Center of Mathematics, Computing and Cognition, Federal University of ABC, Santo André, SP, Brazil
| | | | - Renata Leite
- Biobank for Aging Studies Group, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Lea Grinberg
- Biobank for Aging Studies Group, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Claudia Suemoto
- Biobank for Aging Studies Group, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| | | | - Carlos Augusto Pasqualucci
- Biobank for Aging Studies Group, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Wilson Jacob-Filho
- Biobank for Aging Studies Group, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Helena Brentani
- LIM23, Instituto de Psiquiatria, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
- Departamento de Psiquiatria, Faculdade de Medicina, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Joana Almeida Palha
- Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, Braga, Portugal.
- ICVS/3B's-PT Government Associate Laboratory, Braga/Guimarães, Portugal.
- Clinical Academic Center, Braga, Portugal.
| |
Collapse
|
30
|
Littmann M, Heinzinger M, Dallago C, Olenyi T, Rost B. Embeddings from deep learning transfer GO annotations beyond homology. Sci Rep 2021; 11:1160. [PMID: 33441905 PMCID: PMC7806674 DOI: 10.1038/s41598-020-80786-0] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Accepted: 12/24/2020] [Indexed: 11/09/2022] Open
Abstract
Knowing protein function is crucial to advance molecular and medical biology, yet experimental function annotations through the Gene Ontology (GO) exist for fewer than 0.5% of all known proteins. Computational methods bridge this sequence-annotation gap typically through homology-based annotation transfer by identifying sequence-similar proteins with known function or through prediction methods using evolutionary information. Here, we propose predicting GO terms through annotation transfer based on proximity of proteins in the SeqVec embedding rather than in sequence space. These embeddings originate from deep learned language models (LMs) for protein sequences (SeqVec) transferring the knowledge gained from predicting the next amino acid in 33 million protein sequences. Replicating the conditions of CAFA3, our method reaches an Fmax of 37 ± 2%, 50 ± 3%, and 57 ± 2% for BPO, MFO, and CCO, respectively. Numerically, this appears close to the top ten CAFA3 methods. When restricting the annotation transfer to proteins with < 20% pairwise sequence identity to the query, performance drops (Fmax BPO 33 ± 2%, MFO 43 ± 3%, CCO 53 ± 2%); this still outperforms naïve sequence-based transfer. Preliminary results from CAFA4 appear to confirm these findings. Overall, this new concept is likely to change the annotation of proteins, in particular for proteins from smaller families or proteins with intrinsically disordered regions.
Collapse
Affiliation(s)
- Maria Littmann
- Department of Informatics, Bioinformatics and Computational Biology, i12, TUM (Technical University of Munich), Boltzmannstr. 3, Garching, 85748, Munich, Germany.
- TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany.
| | - Michael Heinzinger
- Department of Informatics, Bioinformatics and Computational Biology, i12, TUM (Technical University of Munich), Boltzmannstr. 3, Garching, 85748, Munich, Germany
- TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
| | - Christian Dallago
- Department of Informatics, Bioinformatics and Computational Biology, i12, TUM (Technical University of Munich), Boltzmannstr. 3, Garching, 85748, Munich, Germany
- TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
| | - Tobias Olenyi
- Department of Informatics, Bioinformatics and Computational Biology, i12, TUM (Technical University of Munich), Boltzmannstr. 3, Garching, 85748, Munich, Germany
| | - Burkhard Rost
- Department of Informatics, Bioinformatics and Computational Biology, i12, TUM (Technical University of Munich), Boltzmannstr. 3, Garching, 85748, Munich, Germany
- Institute for Advanced Study (TUM-IAS), Lichtenbergstr. 2a, Garching, 85748, Munich, Germany
- School of Life Sciences Weihenstephan (TUM-WZW), TUM (Technical University of Munich), Alte Akademie 8, Freising, Germany
- Department of Biochemistry and Molecular Biophysics, Columbia University, 701 West, 168th Street, New York, NY, 10032, USA
| |
Collapse
|
31
|
Mahjoub M, Ezer D. PAFway: pairwise associations between functional annotations in biological networks and pathways. Bioinformatics 2020; 36:4963-4964. [PMID: 32678900 PMCID: PMC7750965 DOI: 10.1093/bioinformatics/btaa639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Revised: 04/18/2020] [Accepted: 07/10/2020] [Indexed: 11/12/2022] Open
Abstract
Motivation Large gene networks can be dense and difficult to interpret in a biologically meaningful way. Results Here, we introduce PAFway, which estimates pairwise associations between functional annotations in biological networks and pathways. It answers the biological question: do genes that have a specific function tend to regulate genes that have a different specific function? The results can be visualized as a heatmap or a network of biological functions. We apply this package to reveal associations between functional annotations in an Arabidopsis thaliana gene network. Availability and implementation PAFway is submitted to CRAN. Currently available here: https://github.com/ezer/PAFway. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mahiar Mahjoub
- Department of Mathematics, University of Cambridge, Cambridge CB3 0WA, UK.,The Alan Turing Institute, London NW1 2DB, UK.,Royal Prince Alfred Hospital, Central Clinical School, University of Sydney, Sydney, NSW 2050, Australia
| | - Daphne Ezer
- The Alan Turing Institute, London NW1 2DB, UK.,Department of Statistics, University of Warwick, Coventry CV4 7AL, UK.,Department of Biology, University of York, York, YO10 5NG, UK
| |
Collapse
|
32
|
Ayub U, Haider I, Naveed H. SAlign-a structure aware method for global PPI network alignment. BMC Bioinformatics 2020; 21:500. [PMID: 33148180 PMCID: PMC7640460 DOI: 10.1186/s12859-020-03827-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2020] [Accepted: 10/20/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND High throughput experiments have generated a significantly large amount of protein interaction data, which is being used to study protein networks. Studying complete protein networks can reveal more insight about healthy/disease states than studying proteins in isolation. Similarly, a comparative study of protein-protein interaction (PPI) networks of different species reveals important insights which may help in disease analysis and drug design. The study of PPI network alignment can also helps in understanding the different biological systems of different species. It can also be used in transfer of knowledge across different species. Different aligners have been introduced in the last decade but developing an accurate and scalable global alignment algorithm that can ensures the biological significance alignment is still challenging. RESULTS This paper presents a novel global pairwise network alignment algorithm, SAlign, which uses topological and biological information in the alignment process. The proposed algorithm incorporates sequence and structural information for computing biological scores, whereas previous algorithms only use sequence information. The alignment based on the proposed technique shows that the combined effect of structure and sequence results in significantly better pairwise alignments. We have compared SAlign with state-of-art algorithms on the basis of semantic similarity of alignment and the number of aligned nodes on multiple PPI network pairs. The results of SAlign on the network pairs which have high percentage of proteins with available structure are 3-63% semantically better than all existing techniques. Furthermore, it also aligns 5-14% more nodes of these network pairs as compared to existing aligners. The results of SAlign on other PPI network pairs are comparable or better than all existing techniques. We also introduce [Formula: see text], a Monte Carlo based alignment algorithm, that produces multiple network alignments with similar semantic similarity. This helps the user to pick biologically meaningful alignments. CONCLUSION The proposed algorithm has the ability to find the alignments that are more biologically significant/relevant as compared to the alignments of existing aligners. Furthermore, the proposed method is able to generate alternate alignments that help in studying different genes/proteins of the specie.
Collapse
Affiliation(s)
- Umair Ayub
- Department of Computing, National University of Computer and Emerging Sciences, Islamabad, 40100, Pakistan.,Computational Biology Research Lab, Islamabad, 40100, Pakistan
| | - Imran Haider
- Department of Computing, National University of Computer and Emerging Sciences, Islamabad, 40100, Pakistan.,Computational Biology Research Lab, Islamabad, 40100, Pakistan
| | - Hammad Naveed
- Department of Computing, National University of Computer and Emerging Sciences, Islamabad, 40100, Pakistan. .,Computational Biology Research Lab, Islamabad, 40100, Pakistan.
| |
Collapse
|
33
|
Makrodimitris S, van Ham RCHJ, Reinders MJT. Automatic Gene Function Prediction in the 2020's. Genes (Basel) 2020; 11:E1264. [PMID: 33120976 PMCID: PMC7692357 DOI: 10.3390/genes11111264] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Revised: 10/19/2020] [Accepted: 10/21/2020] [Indexed: 02/06/2023] Open
Abstract
The current rate at which new DNA and protein sequences are being generated is too fast to experimentally discover the functions of those sequences, emphasizing the need for accurate Automatic Function Prediction (AFP) methods. AFP has been an active and growing research field for decades and has made considerable progress in that time. However, it is certainly not solved. In this paper, we describe challenges that the AFP field still has to overcome in the future to increase its applicability. The challenges we consider are how to: (1) include condition-specific functional annotation, (2) predict functions for non-model species, (3) include new informative data sources, (4) deal with the biases of Gene Ontology (GO) annotations, and (5) maximally exploit the GO to obtain performance gains. We also provide recommendations for addressing those challenges, by adapting (1) the way we represent proteins and genes, (2) the way we represent gene functions, and (3) the algorithms that perform the prediction from gene to function. Together, we show that AFP is still a vibrant research area that can benefit from continuing advances in machine learning with which AFP in the 2020s can again take a large step forward reinforcing the power of computational biology.
Collapse
Affiliation(s)
- Stavros Makrodimitris
- Delft Bioinformatics Lab, Delft University of Technology, 2628XE Delft, The Netherlands; (R.C.H.J.v.H.); (M.J.T.R.)
- Keygene N.V., 6708PW Wageningen, The Netherlands
| | - Roeland C. H. J. van Ham
- Delft Bioinformatics Lab, Delft University of Technology, 2628XE Delft, The Netherlands; (R.C.H.J.v.H.); (M.J.T.R.)
- Keygene N.V., 6708PW Wageningen, The Netherlands
| | - Marcel J. T. Reinders
- Delft Bioinformatics Lab, Delft University of Technology, 2628XE Delft, The Netherlands; (R.C.H.J.v.H.); (M.J.T.R.)
- Leiden Computational Biology Center, Leiden University Medical Center, 2333ZC Leiden, The Netherlands
| |
Collapse
|
34
|
Gnanadesikan GE, Hare B, Snyder-Mackler N, Call J, Kaminski J, Miklósi Á, MacLean EL. Breed Differences in Dog Cognition Associated with Brain-Expressed Genes and Neurological Functions. Integr Comp Biol 2020; 60:976-990. [PMID: 32726413 DOI: 10.1093/icb/icaa112] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Given their remarkable phenotypic diversity, dogs present a unique opportunity for investigating the genetic bases of cognitive and behavioral traits. Our previous work demonstrated that genetic relatedness among breeds accounts for a substantial portion of variation in dog cognition. Here, we investigated the genetic architecture of breed differences in cognition, seeking to identify genes that contribute to variation in cognitive phenotypes. To do so, we combined cognitive data from the citizen science project Dognition.com with published breed-average genetic polymorphism data, resulting in a dataset of 1654 individuals with cognitive phenotypes representing 49 breeds. We conducted a breed-average genome-wide association study to identify specific polymorphisms associated with breed differences in inhibitory control, communication, memory, and physical reasoning. We found five single nucleotide polymorphisms (SNPs) that reached genome-wide significance after Bonferroni correction, located in EML1, OR52E2, HS3ST5, a U6 spliceosomal RNA, and a long noncoding RNA. When we combined results across multiple SNPs within the same gene, we identified 188 genes implicated in breed differences in cognition. This gene set included more genes than expected by chance that were (1) differentially expressed in brain tissue and (2) involved in nervous system functions including peripheral nervous system development, Wnt signaling, presynapse assembly, and synaptic vesicle exocytosis. These results advance our understanding of the genetic underpinnings of complex cognitive phenotypes and identify specific genetic variants for further research.
Collapse
Affiliation(s)
- Gitanjali E Gnanadesikan
- School of Anthropology, University of Arizona, Tucson, AZ, USA.,Cognitive Science Program, University of Arizona, Tucson, AZ, USA
| | - Brian Hare
- Department of Evolutionary Anthropology, Duke University, Durham, NC, USA.,Center for Cognitive Neuroscience, Duke University, Durham, NC, USA
| | - Noah Snyder-Mackler
- Department of Psychology, University of Washington, Seattle, WA, USA.,Center for Evolution and Medicine, Arizona State University, Tempe, AZ, USA.,School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Josep Call
- School of Psychology and Neuroscience, University of St Andrews, St Andrews, UK
| | - Juliane Kaminski
- Department of Psychology, University of Portsmouth, Portsmouth, UK
| | - Ádám Miklósi
- Department of Ethology, Eötvös Loránd University, Budapest, Hungary.,MTA-ELTE Comparative Ethology Research Group, Budapest, Hungary
| | - Evan L MacLean
- School of Anthropology, University of Arizona, Tucson, AZ, USA.,Cognitive Science Program, University of Arizona, Tucson, AZ, USA.,Psychology Department, University of Arizona, Tucson, AZ, USA.,College of Veterinary Medicine, University of Arizona, Tucson, AZ, USA
| |
Collapse
|
35
|
Lambo S, Gröbner SN, Rausch T, Waszak SM, Schmidt C, Gorthi A, Romero JC, Mauermann M, Brabetz S, Krausert S, Buchhalter I, Koster J, Zwijnenburg DA, Sill M, Hübner JM, Mack N, Schwalm B, Ryzhova M, Hovestadt V, Papillon-Cavanagh S, Chan JA, Landgraf P, Ho B, Milde T, Witt O, Ecker J, Sahm F, Sumerauer D, Ellison DW, Orr BA, Darabi A, Haberler C, Figarella-Branger D, Wesseling P, Schittenhelm J, Remke M, Taylor MD, Gil-da-Costa MJ, Łastowska M, Grajkowska W, Hasselblatt M, Hauser P, Pietsch T, Uro-Coste E, Bourdeaut F, Masliah-Planchon J, Rigau V, Alexandrescu S, Wolf S, Li XN, Schüller U, Snuderl M, Karajannis MA, Giangaspero F, Jabado N, von Deimling A, Jones DTW, Korbel JO, von Hoff K, Lichter P, Huang A, Bishop AJR, Pfister SM, Korshunov A, Kool M. The molecular landscape of ETMR at diagnosis and relapse. Nature 2019; 576:274-280. [PMID: 31802000 PMCID: PMC6908757 DOI: 10.1038/s41586-019-1815-x] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2018] [Accepted: 10/16/2019] [Indexed: 12/18/2022]
Abstract
Embryonal tumours with multilayered rosettes (ETMRs) are aggressive paediatric embryonal brain tumours with a universally poor prognosis1. Here we collected 193 primary ETMRs and 23 matched relapse samples to investigate the genomic landscape of this distinct tumour type. We found that patients with tumours in which the proposed driver C19MC2-4 was not amplified frequently had germline mutations in DICER1 or other microRNA-related aberrations such as somatic amplification of miR-17-92 (also known as MIR17HG). Whole-genome sequencing revealed that tumours had an overall low recurrence of single-nucleotide variants (SNVs), but showed prevalent genomic instability caused by widespread occurrence of R-loop structures. We show that R-loop-associated chromosomal instability can be induced by the loss of DICER1 function. Comparison of primary tumours and matched relapse samples showed a strong conservation of structural variants, but low conservation of SNVs. Moreover, many newly acquired SNVs are associated with a mutational signature related to cisplatin treatment. Finally, we show that targeting R-loops with topoisomerase and PARP inhibitors might be an effective treatment strategy for this deadly disease.
Collapse
Affiliation(s)
- Sander Lambo
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Susanne N Gröbner
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Tobias Rausch
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Sebastian M Waszak
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Christin Schmidt
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Aparna Gorthi
- Department of Cell Systems and Anatomy, University of Texas Health at San Antonio, San Antonio, TX, USA
- Greehey Children's Cancer Research Institute, University of Texas Health at San Antonio, San Antonio, TX, USA
| | - July Carolina Romero
- Department of Cell Systems and Anatomy, University of Texas Health at San Antonio, San Antonio, TX, USA
- Greehey Children's Cancer Research Institute, University of Texas Health at San Antonio, San Antonio, TX, USA
| | - Monika Mauermann
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Sebastian Brabetz
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Sonja Krausert
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Ivo Buchhalter
- Omics IT and Data Management Core Facility, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Jan Koster
- Department of Oncogenomics, Academic Medical Center, Amsterdam, The Netherlands
| | - Danny A Zwijnenburg
- Department of Oncogenomics, Academic Medical Center, Amsterdam, The Netherlands
| | - Martin Sill
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Jens-Martin Hübner
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Norman Mack
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Benjamin Schwalm
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Marina Ryzhova
- Department of Neuropathology, NN Burdenko Neurosurgical Institute, Moscow, Russia
| | - Volker Hovestadt
- Division of Molecular Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Simon Papillon-Cavanagh
- Department of Pediatrics, McGill University Health Center, McGill University, Montreal, Quebec, Canada
| | - Jennifer A Chan
- Department of Pathology and Laboratory Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Pablo Landgraf
- Department of Pediatrics, Pediatric Oncology and Hematology, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Ben Ho
- Division of Hematology/Oncology Arthur and Sonia Labatt Brain Tumour Research Centre, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Till Milde
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
- Clinical Cooperation Unit Pediatric Oncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Pediatric Oncology, Hematology and Immunology, Heidelberg University Hospital, Heidelberg, Germany
| | - Olaf Witt
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
- Clinical Cooperation Unit Pediatric Oncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Pediatric Oncology, Hematology and Immunology, Heidelberg University Hospital, Heidelberg, Germany
| | - Jonas Ecker
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Clinical Cooperation Unit Pediatric Oncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Pediatric Oncology, Hematology and Immunology, Heidelberg University Hospital, Heidelberg, Germany
| | - Felix Sahm
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Neuropathology, Heidelberg University Hospital, Heidelberg, Germany
- Clinical Cooperation Unit Neuropathology, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - David Sumerauer
- Department of Pediatric Hematology and Oncology, University Hospital Motol, Prague, Czech Republic
| | - David W Ellison
- Department of Pathology, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Brent A Orr
- Department of Pathology, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Anna Darabi
- Department of Clinical Sciences Lund, Section of Neurosurgery, Faculty of Medicine, Lund University, Lund, Sweden
| | | | - Dominique Figarella-Branger
- Aix-Marseille University, Neurophysiopathology Institute (INP), CNRS, Marseille, France
- Department of Pathology, APHM, Marseille, France
| | - Pieter Wesseling
- Princess Máxima Center for Pediatric Oncology, Utrecht, The Netherlands
- Department of Pathology, Amsterdam University Medical Centers/location VUmc, Amsterdam, The Netherlands
| | - Jens Schittenhelm
- Department of Neuropathology, Institute of Pathology and Neuropathology, University Hospital of Tübingen, Tübingen, Germany
- Center for CNS Tumors, Comprehensive Cancer Center Tübingen-Stuttgart, University Hospital of Tübingen, Tübingen, Germany
| | - Marc Remke
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Pediatric Oncology, Hematology and Clinical Immunology, Medical Faculty, University Hospital Düsseldorf, Düsseldorf, Germany
- Division of Neurosurgery, Arthur and Sonia Labatt Brain Tumor Research Center, Hospital for Sick Children, University of Toronto, Toronto, Ontario, Canada
| | - Michael D Taylor
- Division of Neurosurgery, Arthur and Sonia Labatt Brain Tumor Research Center, Hospital for Sick Children, University of Toronto, Toronto, Ontario, Canada
| | - Maria J Gil-da-Costa
- Pediatric Hematology and Oncology Division, University Hospital São João Alameda Hernani Monteiro, Porto, Portugal
| | - Maria Łastowska
- Department of Pathology, Children's Memorial Health Institute, Warsaw, Poland
| | - Wiesława Grajkowska
- Department of Pathology, Children's Memorial Health Institute, Warsaw, Poland
| | - Martin Hasselblatt
- Institute of Neuropathology, University Hospital Münster, Münster, Germany
| | - Peter Hauser
- 2nd Department of Pediatrics, Semmelweis University, Budapest, Hungary
| | - Torsten Pietsch
- Institute of Neuropathology, Brain Tumor Reference Center of the German Society of Neuropathology and Neuroanatomy, University of Bonn Medical Center, Bonn, Germany
| | - Emmanuelle Uro-Coste
- Department of Pathology, Toulouse University Hospital, Toulouse, France
- INSERM U1037, Cancer Research Center of Toulouse (CRCT), Toulouse, France
| | - Franck Bourdeaut
- INSERM U830, Laboratory of Translational Research in Pediatric Oncology, SIREDO Pediatric Oncology Center, Paris Sciences Lettres Research University, Curie Institute, Paris, France
| | - Julien Masliah-Planchon
- Pediatric Oncology Department, SIREDO Pediatric Oncology Centre, Curie Institute, Paris, France
- Paris Sciences et Lettres Research University, Institut Curie Hospital, Laboratory of Somatic Genetics, Paris, France
| | - Valérie Rigau
- Department of Pathology, Montpellier University Medical Center, Montpellier, France
- Institute for Neuroscience of Montpellier (INM), INSERM U1051, Montpellier University Hospital, Montpellier, France
| | - Sanda Alexandrescu
- Department of Pathology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
| | - Stephan Wolf
- Genomics and Proteomics Core Facility, High Throughput Sequencing Unit, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Xiao-Nan Li
- Brain Tumor Program, Texas Children's Cancer Center, Department of Pediatrics, Baylor College of Medicine, Houston, TX, USA
- Ann & Robert H. Lurie Children's Hospital of Chicago, Chicago, IL, USA
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Ulrich Schüller
- Institute of Neuropathology, University Medical Center, Hamburg-Eppendorf, Hamburg, Germany
- Research Institute Children's Cancer Center, Hamburg, Germany
- Department of Pediatric Hematology and Oncology, University Medical Center, Hamburg-Eppendorf, Hamburg, Germany
| | - Matija Snuderl
- Department of Pathology, NYU Langone Health, New York, NY, USA
| | - Matthias A Karajannis
- Division of Pediatric Hematology/Oncology, NYU Langone Medical Center, The Stephen D. Hassenfeld Children's Center for Cancer and Blood Disorders, New York, NY, USA
| | - Felice Giangaspero
- Department of Radiological, Oncological and Anatomopathological Sciences, Sapienza University of Rome, Rome, Italy
- IRCCS Neuromed-Mediterranean Neurological Institute, Pozzilli, Italy
| | - Nada Jabado
- Department of Pediatrics, McGill University Health Center, McGill University, Montreal, Quebec, Canada
| | - Andreas von Deimling
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Department of Neuropathology, Heidelberg University Hospital, Heidelberg, Germany
- Clinical Cooperation Unit Neuropathology, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - David T W Jones
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
- Pediatric Glioma Research Group, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Jan O Korbel
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Katja von Hoff
- Department of Pediatric Oncology/Hematology, Charité University Medicine, Berlin, Germany
- Department for Pediatric Hematology and Oncology, University Hospital Hamburg-Eppendorf, Hamburg, Germany
| | - Peter Lichter
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
- Division of Molecular Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Annie Huang
- Division of Hematology/Oncology Arthur and Sonia Labatt Brain Tumour Research Centre, The Hospital for Sick Children, Toronto, Ontario, Canada
- Department of Pediatrics, Medical Biophysics, Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada
| | - Alexander J R Bishop
- Department of Cell Systems and Anatomy, University of Texas Health at San Antonio, San Antonio, TX, USA
- Greehey Children's Cancer Research Institute, University of Texas Health at San Antonio, San Antonio, TX, USA
- Mays Cancer Center, University of Texas Health at San Antonio, San Antonio, TX, USA
| | - Stefan M Pfister
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Pediatric Oncology, Hematology and Immunology, Heidelberg University Hospital, Heidelberg, Germany
| | - Andrey Korshunov
- Department of Neuropathology, Heidelberg University Hospital, Heidelberg, Germany
- Clinical Cooperation Unit Neuropathology, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Marcel Kool
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany.
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), Heidelberg, Germany.
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany.
| |
Collapse
|
36
|
MacLean EL, Snyder-Mackler N, vonHoldt BM, Serpell JA. Highly heritable and functionally relevant breed differences in dog behaviour. Proc Biol Sci 2019; 286:20190716. [PMID: 31575369 DOI: 10.1098/rspb.2019.0716] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Variation across dog breeds presents a unique opportunity to investigate the evolution and biological basis of complex behavioural traits. We integrated behavioural data from more than 14 000 dogs from 101 breeds with breed-averaged genotypic data (n = 5697 dogs) from over 100 000 loci in the dog genome. We found high levels of among-breed heritability for 14 behavioural traits (the proportion of trait variance attributable to genetic similarity among breeds). We next identified 131 single nucleotide polymorphisms associated with breed differences in behaviour, which were found in genes that are highly expressed in the brain and enriched for neurobiological functions and developmental processes, suggesting that they may be functionally associated with behavioural differences. Our results shed light on the heritability and genetic architecture of complex behavioural traits and identify dogs as a powerful model in which to address these questions.
Collapse
Affiliation(s)
- Evan L MacLean
- School of Anthropology, University of Arizona, Tucson, AZ, USA.,Department of Psychology, University of Arizona, Tucson, AZ, USA
| | - Noah Snyder-Mackler
- Department of Psychology, University of Washington, Seattle, WA, USA.,Center for Studies in Demography and Ecology, University of Washington, Seattle, WA, USA.,Washington National Primate Research Center, University of Washington, Seattle, WA, USA
| | - Bridgett M vonHoldt
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA
| | - James A Serpell
- School of Veterinary Medicine, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
37
|
Ding Z, Kihara D. Computational identification of protein-protein interactions in model plant proteomes. Sci Rep 2019; 9:8740. [PMID: 31217453 PMCID: PMC6584649 DOI: 10.1038/s41598-019-45072-8] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2019] [Accepted: 05/30/2019] [Indexed: 12/12/2022] Open
Abstract
Protein-protein interactions (PPIs) play essential roles in many biological processes. A PPI network provides crucial information on how biological pathways are structured and coordinated from individual protein functions. In the past two decades, large-scale PPI networks of a handful of organisms were determined by experimental techniques. However, these experimental methods are time-consuming, expensive, and are not easy to perform on new target organisms. Large-scale PPI data is particularly sparse in plant organisms. Here, we developed a computational approach for detecting PPIs trained and tested on known PPIs of Arabidopsis thaliana and applied to three plants, Arabidopsis thaliana, Glycine max (soybean), and Zea mays (maize) to discover new PPIs on a genome-scale. Our method considers a variety of features including protein sequences, gene co-expression, functional association, and phylogenetic profiles. This is the first work where a PPI prediction method was developed for is the first PPI prediction method applied on benchmark datasets of Arabidopsis. The method showed a high prediction accuracy of over 90% and very high precision of close to 1.0. We predicted 50,220 PPIs in Arabidopsis thaliana, 13,175,414 PPIs in corn, and 13,527,834 PPIs in soybean. Newly predicted PPIs were classified into three confidence levels according to the availability of existing supporting evidence and discussed. Predicted PPIs in the three plant genomes are made available for future reference.
Collapse
Affiliation(s)
- Ziyun Ding
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA.
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA.
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA.
- Department of Pediatrics, University of Cincinnati, Cincinnati, OH, 45229, USA.
| |
Collapse
|
38
|
Pomaznoy M, Ha B, Peters B. GOnet: a tool for interactive Gene Ontology analysis. BMC Bioinformatics 2018; 19:470. [PMID: 30526489 PMCID: PMC6286514 DOI: 10.1186/s12859-018-2533-3] [Citation(s) in RCA: 154] [Impact Index Per Article: 25.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2018] [Accepted: 11/21/2018] [Indexed: 01/28/2023] Open
Abstract
BACKGROUND Biological interpretation of gene/protein lists resulting from -omics experiments can be a complex task. A common approach consists of reviewing Gene Ontology (GO) annotations for entries in such lists and searching for enrichment patterns. Unfortunately, there is a gap between machine-readable output of GO software and its human-interpretable form. This gap can be bridged by allowing users to simultaneously visualize and interact with term-term and gene-term relationships. RESULTS We created the open-source GOnet web-application (available at http://tools.dice-database.org/GOnet/ ), which takes a list of gene or protein entries from human or mouse data and performs GO term annotation analysis (mapping of provided entries to GO subsets) or GO term enrichment analysis (scanning for GO categories overrepresented in the input list). The application is capable of producing parsable data formats and importantly, interactive visualizations of the GO analysis results. The interactive results allow exploration of genes and GO terms as a graph that depicts the natural hierarchy of the terms and retains relationships between terms and genes/proteins. As a result, GOnet provides insight into the functional interconnection of the submitted entries. CONCLUSIONS The application can be used for GO analysis of any biological data sources resulting in gene/protein lists. It can be helpful for experimentalists as well as computational biologists working on biological interpretation of -omics data resulting in such lists.
Collapse
Affiliation(s)
- Mikhail Pomaznoy
- Department of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, CA USA
| | - Brendan Ha
- Department of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, CA USA
| | - Bjoern Peters
- Department of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, CA USA
- Department of Medicine, University of California San Diego, La Jolla, CA 92093 USA
| |
Collapse
|
39
|
Zheng W, Lin H, Liu X, Xu B. A document level neural model integrated domain knowledge for chemical-induced disease relations. BMC Bioinformatics 2018; 19:328. [PMID: 30223767 PMCID: PMC6142695 DOI: 10.1186/s12859-018-2316-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2018] [Accepted: 08/14/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The effective combination of texts and knowledge may improve performances of natural language processing tasks. For the recognition of chemical-induced disease (CID) relations which may span sentence boundaries in an article, although existing CID systems explored the utilization for knowledge bases, the effects of different knowledge on the identification of a special CID haven't been distinguished by these systems. Moreover, systems based on neural network only constructed sentence or mention level models. RESULTS In this work, we proposed an effective document level neural model integrated domain knowledge to extract CID relations from biomedical articles. Basic semantic information of an article with respect to a special CID candidate pair was learned from the document level sub-network module. Furthermore, knowledge attention depending on the representation of the article was proposed to distinguish the influences of different knowledge on the special CID pair and then the final representation of knowledge was formed by aggregating weighed knowledge. Finally, the integrated representations of texts and knowledge were passed to a softmax classifier to perform the CID recognition. Experimental results on the chemical-disease relation corpus proposed by BioCreative V show that our proposed system integrated knowledge achieves a good overall performance compared with other state-of-the-art systems. CONCLUSIONS Experimental analyses demonstrate that the introduced attention mechanism on domain knowledge plays a significant role in distinguishing influences of different knowledge on the judgment for a special CID relation.
Collapse
Affiliation(s)
- Wei Zheng
- College of Computer Science and Technology, Dalian University of Technology, Dalian, China.,College of Software, Dalian JiaoTong University, Dalian, China
| | - Hongfei Lin
- College of Computer Science and Technology, Dalian University of Technology, Dalian, China.
| | - Xiaoxia Liu
- College of Computer Science and Technology, Dalian University of Technology, Dalian, China
| | - Bo Xu
- College of Computer Science and Technology, Dalian University of Technology, Dalian, China.
| |
Collapse
|
40
|
Ding Z, Kihara D. Computational Methods for Predicting Protein-Protein Interactions Using Various Protein Features. CURRENT PROTOCOLS IN PROTEIN SCIENCE 2018; 93:e62. [PMID: 29927082 PMCID: PMC6097941 DOI: 10.1002/cpps.62] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Understanding protein-protein interactions (PPIs) in a cell is essential for learning protein functions, pathways, and mechanism of diseases. PPIs are also important targets for developing drugs. Experimental methods, both small-scale and large-scale, have identified PPIs in several model organisms. However, results cover only a part of PPIs of organisms; moreover, there are many organisms whose PPIs have not yet been investigated. To complement experimental methods, many computational methods have been developed that predict PPIs from various characteristics of proteins. Here we provide an overview of literature reports to classify computational PPI prediction methods that consider different features of proteins, including protein sequence, genomes, protein structure, function, PPI network topology, and those which integrate multiple methods. © 2018 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- Ziyun Ding
- Department of Biological Science, Purdue University, West Lafayette, IN, 47907 USA
| | - Daisuke Kihara
- Department of Biological Science, Purdue University, West Lafayette, IN, 47907 USA
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907 USA
- Corresponding author: DK; , Phone: 1-765-496-2284 (DK)
| |
Collapse
|
41
|
Abstract
Motivation Moonlighting proteins (MPs) are an important class of proteins that perform more than one independent cellular function. MPs are gaining more attention in recent years as they are found to play important roles in various systems including disease developments. MPs also have a significant impact in computational function prediction and annotation in databases. Currently MPs are not labeled as such in biological databases even in cases where multiple distinct functions are known for the proteins. In this work, we propose a novel method named DextMP, which predicts whether a protein is a MP or not based on its textual features extracted from scientific literature and the UniProt database. Results DextMP extracts three categories of textual information for a protein: titles, abstracts from literature, and function description in UniProt. Three language models were applied and compared: a state-of-the-art deep unsupervised learning algorithm along with two other language models of different types, Term Frequency-Inverse Document Frequency in the bag-of-words and Latent Dirichlet Allocation in the topic modeling category. Cross-validation results on a dataset of known MPs and non-MPs showed that DextMP successfully predicted MPs with over 91% accuracy with significant improvement over existing MP prediction methods. Lastly, we ran DextMP with the best performing language models and text-based feature combinations on three genomes, human, yeast and Xenopus laevis, and found that about 2.5–35% of the proteomes are potential MPs. Availability and Implementation Code available at http://kiharalab.org/DextMP.
Collapse
Affiliation(s)
- Ishita K Khan
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Mansurul Bhuiyan
- Department of Computer Science, Indiana University-Purdue University Indianapolis (IUPUI), Indianapolis, IN, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, USA.,Department of Biological Science, Purdue University, West Lafayette, IN, USA
| |
Collapse
|
42
|
Araujo FA, Barh D, Silva A, Guimarães L, Ramos RTJ. GO FEAT: a rapid web-based functional annotation tool for genomic and transcriptomic data. Sci Rep 2018; 8:1794. [PMID: 29379090 PMCID: PMC5789007 DOI: 10.1038/s41598-018-20211-9] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 01/15/2018] [Indexed: 01/28/2023] Open
Abstract
Downstream analysis of genomic and transcriptomic sequence data is often executed by functional annotation that can be performed by various bioinformatics tools and biological databases. However, a full fast integrated tool is not available for such analysis. Besides, the current available software is not able to produce analytic lists of annotations and graphs to help users in evaluating the output results. Therefore, we present the Gene Ontology Functional Enrichment Annotation Tool (GO FEAT), a free web platform for functional annotation and enrichment of genomic and transcriptomic data based on sequence homology search. The analysis can be customized and visualized as per users’ needs and specifications. GO FEAT is freely available at http://computationalbiology.ufpa.br/gofeat/ and its source code is hosted at https://github.com/fabriciopa/gofeat.
Collapse
Affiliation(s)
- Fabricio Almeida Araujo
- Universidade Federal do Pará, Instituto de Ciências Biológicas, Rua Augusto Corrêa, 01 - Guamá, Belém, PA, Brazil
| | - Debmalya Barh
- Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, Purba Medinipur, WB-721172, India
| | - Artur Silva
- Universidade Federal do Pará, Instituto de Ciências Biológicas, Rua Augusto Corrêa, 01 - Guamá, Belém, PA, Brazil
| | - Luis Guimarães
- Universidade Federal do Pará, Instituto de Ciências Biológicas, Rua Augusto Corrêa, 01 - Guamá, Belém, PA, Brazil
| | - Rommel Thiago Juca Ramos
- Universidade Federal do Pará, Instituto de Ciências Biológicas, Rua Augusto Corrêa, 01 - Guamá, Belém, PA, Brazil.
| |
Collapse
|
43
|
Ding Z, Wei Q, Kihara D. Computing and Visualizing Gene Function Similarity and Coherence with NaviGO. Methods Mol Biol 2018; 1807:113-130. [PMID: 30030807 DOI: 10.1007/978-1-4939-8561-6_9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Gene ontology (GO) is a controlled vocabulary of gene functions across all species, which is widely used for functional analyses of individual genes and large-scale proteomic studies. NaviGO is a webserver for visualizing and quantifying the relationship and similarity of GO annotations. Here, we walk through functionality of the NaviGO webserver ( http://kiharalab.org/web/navigo/ ) using an example input and explain what can be learned from analysis results. NaviGO has four main functions, accessed from each page of the webserver: "GO Parents," "GO Set", "GO Enrichment", and "Protein Set." For a given list of GO terms, the "GO Parents" tab visualizes the hierarchical relationship of GO terms, and the "GO Set" tab calculates six functional similarity and association scores and presents results in a network and a multidimensional scaling plot. For a set of proteins and their associated GO terms, the "GO Enrichment" tab calculates protein GO functional enrichment, while the "Protein Set" tab calculates functional association between proteins. The NaviGO source code can be also downloaded and used locally or integrated into other software pipelines.
Collapse
Affiliation(s)
- Ziyun Ding
- Department of Biological Science, Purdue University, West Lafayette, IN, USA
| | - Qing Wei
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Daisuke Kihara
- Department of Biological Science, Purdue University, West Lafayette, IN, USA. .,Department of Computer Science, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|