201
|
Marques ES, Formato E, Liang W, Leonard E, Timme‐Laragy AR. Relationships between type 2 diabetes, cell dysfunction, and redox signaling: A meta-analysis of single-cell gene expression of human pancreatic α- and β-cells. J Diabetes 2022; 14:34-51. [PMID: 34725923 PMCID: PMC8746116 DOI: 10.1111/1753-0407.13236] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Accepted: 10/29/2021] [Indexed: 12/01/2022] Open
Abstract
BACKGROUND Type 2 diabetes mellitus (T2DM) is a chronic disease characterized by insulin resistance and failure of β-cells to meet the metabolic demand for insulin. Recent advances in single-cell RNA sequencing (sc-RNA-Seq) have allowed for in-depth studies to further understand the underlying cellular mechanisms of T2DM. In β-cells, redox signaling is critical for insulin production. A meta-analysis of human pancreas islet sc-RNA-Seq data was conducted to evaluate how T2DM may modify the transcriptomes of α- and β-cells. METHODS Annotated sc-RNA-Seq data from six studies of human pancreatic islets from metabolically healthy and donors with T2DM were collected. α- and β-cells, subpopulations of proliferating α-cells, immature, and senescent β-cells were identified based on expression levels of key marker genes. Each dataset was analyzed individually before combining, using weighted comparisons. Pathways of significant genes and individual redox-related gene expression were then evaluated to further understand the role that redox signaling may play in T2DM-induced β-cell dysfunction. RESULTS α- and β-cells from T2DM donors modified genes involved in energy metabolism, immune response, autophagy, and cellular stress. α- and β-cells also had an increased nuclear factor erythroid 2-related factor 2 (NFE2L2)-mediated antioxidant response in T2DM donors. The proportion of immature and senescent β-cells increased in T2DM donors, and in immature and senescent β-cells, genes regulated by NFE2L2 were further upregulated. CONCLUSIONS These findings suggest that NFE2L2 plays a role in β-cell maturation and dysfunction. Redox singling maybe a key pathway for β-cell restoration and T2DM therapeutics.
Collapse
Affiliation(s)
- Emily Sara Marques
- Department of Environmental Health SciencesUniversity of Massachusetts AmherstAmherstMassachusettsUSA
| | - Emily Formato
- Molecular and Cellular Biology Graduate ProgramUniversity of Massachusetts AmherstAmherstMassachusettsUSA
| | - Wenle Liang
- Department of Environmental Health SciencesUniversity of Massachusetts AmherstAmherstMassachusettsUSA
| | - Emily Leonard
- Department of Environmental Health SciencesUniversity of Massachusetts AmherstAmherstMassachusettsUSA
| | - Alicia R. Timme‐Laragy
- Department of Environmental Health SciencesUniversity of Massachusetts AmherstAmherstMassachusettsUSA
| |
Collapse
|
202
|
Kotlyar M, Wong SWH, Pastrello C, Jurisica I. Improving Analysis and Annotation of Microarray Data with Protein Interactions. Methods Mol Biol 2022; 2401:51-68. [PMID: 34902122 DOI: 10.1007/978-1-0716-1839-4_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Gene expression microarrays are one of the most widely used high-throughput technologies in molecular biology, with applications such as identification of disease mechanisms and development of diagnostic and prognostic gene signatures. However, the success of these tasks is often limited because microarray analysis does not account for the complex relationships among genes, their products, and overall signaling and regulatory cascades. Incorporating protein-protein interaction data into microarray analysis can help address these challenges. This chapter reviews how protein-protein interactions can help with microarray analysis, leading to benefits such as better explanations of disease mechanisms, more complete gene annotations, improved prioritization of genes for future experiments, and gene signatures that generalize better to new data.
Collapse
Affiliation(s)
- Max Kotlyar
- Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute, University Health Network, Toronto, ON, Canada
- Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, Toronto Western Hospital, University Health Network, Toronto, ON, Canada
| | - Serene W H Wong
- Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute, University Health Network, Toronto, ON, Canada
- Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, Toronto Western Hospital, University Health Network, Toronto, ON, Canada
| | - Chiara Pastrello
- Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute, University Health Network, Toronto, ON, Canada
- Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, Toronto Western Hospital, University Health Network, Toronto, ON, Canada
| | - Igor Jurisica
- Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute, University Health Network, Toronto, ON, Canada.
- Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, Toronto Western Hospital, University Health Network, Toronto, ON, Canada.
- Departments of Medical Biophysics and Computer Science, University of Toronto, Toronto, ON, Canada.
- Institute of Neuroimmunology, Slovak Academy of Sciences, Bratislava, Slovakia.
| |
Collapse
|
203
|
Yu L, Ding Y, Wan T, Deng T, Huang H, Liu J. Significance of CD47 and Its Association With Tumor Immune Microenvironment Heterogeneity in Ovarian Cancer. Front Immunol 2021; 12:768115. [PMID: 34966389 PMCID: PMC8710451 DOI: 10.3389/fimmu.2021.768115] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 11/23/2021] [Indexed: 01/11/2023] Open
Abstract
Background It was reported that tumor heterogeneity and the surrounding tumor microenvironment (TME) in ovarian cancer affects immunotherapy efficacy and patient outcomes. And the TME of ovarian cancer is intrinsically heterogeneous. CD47 plays vital roles in cell functional behavior and immune homeostasis relating to cancer prognosis. But how it affects TME and its contribution to heterogeneity in ovarian cancer has not been fully illustrated. Therefore, we aimed to identify a prognostic biomarker which may help explain tumor immune microenvironment heterogeneity of ovarian cancer. Methods Cancer single-cell state atlas (CancerSEA) was used to evaluate functional role of CD47. Several bioinformatics database including Oncomine, Gene Expression Profiling Interaction Analysis (GEPIA), Tumor Immune Estimation Resource (TIMER), The Human Protein Atlas (HPA), Ualcan and Kaplan-Meier plotter (KM plotter) were applied to illustrate correlation of CD47 with ovarian cancer prognosis and immune infiltration. Tumor Immune Single-cell Hub (TISCH) single cell database was employed to evaluate correlation of CD47 with tumor microenvironment. GeneMANIA was implemented to identify regulation networks of CD47. Differentially expressed genes (DEGs) between CD47 high and low expression groups were analyzed with R package DESeq2. Kyoto encyclopedia of genes and genomes (KEGG) and Gene Set Enrichment Analysis (GSEA) were utilized to explore how CD47 affect the immune related cell signaling pathway. Results CD47 expression was upregulated and connected to worse OS and PFS in ovarian cancer. Close relation was found between CD47 expression level and immune infiltration in ovarian cancer, especially with Treg cells, Monocytes, Macrophages and T cell exhaustion (P<0.05). The CD47 expression level was relatively low in plasma cells, dendritic cells and Mono/Macro cells of OV_GSE115007, in myofibroblasts, fibroblasts and endothelial cells of OV_GSE118828, compared to malignant cells of OV_GSE118828 dataset. The cell components and distribution in primary and metastatic ovarian cancer are quite distinct, which may lead to TME heterogeneity of ovarian cancer. Conclusion Our results indicated that CD47 is closely correlated to ovarian cancer immune microenvironment and might induce ovarian cancer heterogeneity. Therefore, CD47 may be used as a candidate prognostic biomarker and provide us with new insights into potential immunotherapy in ovarian cancer patients.
Collapse
Affiliation(s)
- Lan Yu
- Department of Gynecologic Oncology, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-Sen University Cancer Center, Guangzhou, China.,Department of Obstetrics and Gynecology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Yi Ding
- Department of Gynecologic Oncology, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-Sen University Cancer Center, Guangzhou, China
| | - Ting Wan
- Department of Gynecologic Oncology, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-Sen University Cancer Center, Guangzhou, China
| | - Ting Deng
- Department of Gynecologic Oncology, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-Sen University Cancer Center, Guangzhou, China
| | - He Huang
- Department of Gynecologic Oncology, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-Sen University Cancer Center, Guangzhou, China
| | - Jihong Liu
- Department of Gynecologic Oncology, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-Sen University Cancer Center, Guangzhou, China
| |
Collapse
|
204
|
Bao Y, Gabrielpillai J, Dietrich J, Zarbl R, Strieth S, Schröck F, Dietrich D. Fibroblast growth factor (FGF), FGF receptor (FGFR), and cyclin D1 (CCND1) DNA methylation in head and neck squamous cell carcinomas is associated with transcriptional activity, gene amplification, human papillomavirus (HPV) status, and sensitivity to tyrosine kinase inhibitors. Clin Epigenetics 2021; 13:228. [PMID: 34933671 PMCID: PMC8693503 DOI: 10.1186/s13148-021-01212-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 12/08/2021] [Indexed: 12/11/2022] Open
Abstract
Background Dysregulation of fibroblast growth factor receptor (FGFR) signaling pathway has been observed in head and neck squamous cell carcinoma (HNSCC) and is a promising therapeutic target for selective tyrosine kinase inhibitors (TKIs). Potential predictive biomarkers for response to FGFR-targeted therapies are urgently needed. Understanding the epigenetic regulation of FGF pathway related genes, i.e. FGFRs, FGFs, and CCND1, could enlighten the way towards biomarker-selected FGFR-targeted therapies. Methods We performed DNA methylation analysis of the encoding genes FGFR1, FGFR2, FGFR3, FGFR4, FGF1-14, FGF16-23, and CCND1 at single CpG site resolution (840 CpG sites) employing The Cancer Genome Research Atlas (TCGA) HNSCC cohort comprising N = 530 tumor tissue and N = 50 normal adjacent tissue samples. We correlated DNA methylation to mRNA expression with regard to human papilloma virus (HPV) and gene amplification status. Moreover, we investigated the correlation of methylation with sensitivity to the selective FGFR inhibitors PD 173074 and AZD4547 in N = 40 HPV(−) HNSCC cell lines. Results We found sequence-contextually nuanced CpG methylation patterns in concordance with epigenetically regulated genes. High methylation levels were predominantly found in the promoter flank and gene body region, while low methylation levels were present in the central promoter region for most of the analyzed CpG sites. FGFRs, FGFs, and CCND1 methylation differed significantly between tumor and normal adjacent tissue and was associated with HPV and gene amplification status. CCND1 promoter methylation correlated with CCND1 amplification. For most of the analyzed CpG sites, methylation levels correlated to mRNA expression in tumor tissue. Furthermore, we found significant correlations of DNA methylation of specific CpG sites with response to the FGFR1/3–selective inhibitors PD 173074 and AZD4547, predominantly within the transcription start site of CCND1. Conclusions Our results suggest an epigenetic regulation of CCND1, FGFRs, and FGFs via DNA methylation in HNSCC and warrants further investigation of DNA methylation as a potential predictive biomarker for response to selective FGFR inhibitors in clinical trials. Supplementary Information The online version contains supplementary material available at 10.1186/s13148-021-01212-4.
Collapse
Affiliation(s)
- Yilin Bao
- Department of Otorhinolaryngology, Head and Neck Surgery, University Medical Center Bonn (UKB), Sigmund-Freud-Str. 25, 53105, Bonn, Germany.,Department of Otolaryngology, Head and Neck Surgery, The Affiliated Hospital of Southwest Medical University, Luzhou, China
| | - Jennis Gabrielpillai
- Department of Otorhinolaryngology, Head and Neck Surgery, University Medical Center Bonn (UKB), Sigmund-Freud-Str. 25, 53105, Bonn, Germany
| | - Jörn Dietrich
- Department of Otorhinolaryngology, Head and Neck Surgery, University Medical Center Bonn (UKB), Sigmund-Freud-Str. 25, 53105, Bonn, Germany
| | - Romina Zarbl
- Department of Otorhinolaryngology, Head and Neck Surgery, University Medical Center Bonn (UKB), Sigmund-Freud-Str. 25, 53105, Bonn, Germany
| | - Sebastian Strieth
- Department of Otorhinolaryngology, Head and Neck Surgery, University Medical Center Bonn (UKB), Sigmund-Freud-Str. 25, 53105, Bonn, Germany
| | - Friederike Schröck
- Department of Psychiatry and Psychotherapy, University Hospital Bonn, Bonn, Germany
| | - Dimo Dietrich
- Department of Otorhinolaryngology, Head and Neck Surgery, University Medical Center Bonn (UKB), Sigmund-Freud-Str. 25, 53105, Bonn, Germany.
| |
Collapse
|
205
|
Cystic Fibrosis: Systems Biology Analysis from Homozygous p.Phe508del Variant Patients' Samples Reveals Perturbations in Tissue-Specific Pathways. BIOMED RESEARCH INTERNATIONAL 2021; 2021:5262000. [PMID: 34901273 PMCID: PMC8660202 DOI: 10.1155/2021/5262000] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/10/2021] [Accepted: 11/08/2021] [Indexed: 11/24/2022]
Abstract
Cystic fibrosis (CF) is an autosomal recessive disorder, caused by diverse genetic variants for the CF transmembrane conductance regulator (CFTR) protein. Among these, p.Phe508del is the most prevalent variant. The effects of this variant on the physiology of each tissue remains unknown. This study is aimed at predicting cell signaling pathways present in different tissues of fibrocystic patients, homozygous for p.Phe508del. The study involved analysis of two microarray datasets, E-GEOD-15568 and E-MTAB-360 corresponding to the rectal and bronchial epithelium, respectively, obtained from the ArrayExpress repository. Particularly, differentially expressed genes (DEGs) were predicted, protein-protein interaction (PPI) networks were designed, and centrality and functional interaction networks were analyzed. The study reported that p.Phe508del-mutated CFTR-allele in homozygous state influenced the whole gene expression in each tissue differently. Interestingly, gene ontology (GO) term enrichment analysis revealed that only “neutrophil activation” was shared between both tissues; however, nonshared DEGs were grouped into the same GO term. For further verification, functional interaction networks were generated, wherein no shared nodes were reported between these tissues. These results suggested that the p.Phe508del-mutated CFTR-allele in homozygous state promoted tissue-specific pathways in fibrocystic patients. The generated data might further assist in prediction diagnosis to define biomarkers or devising therapeutic strategies.
Collapse
|
206
|
Vlasenkova R, Nurgalieva A, Akberova N, Bogdanov M, Kiyamova R. Characterization of SLC34A2 as a Potential Prognostic Marker of Oncological Diseases. Biomolecules 2021; 11:biom11121878. [PMID: 34944522 PMCID: PMC8699446 DOI: 10.3390/biom11121878] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2021] [Revised: 12/10/2021] [Accepted: 12/10/2021] [Indexed: 12/29/2022] Open
Abstract
The main goal of this study is to consider SLC34A2 as a potential prognostic marker of oncological diseases using the mutational, expression, and survival data of cancer studies which are publicly available online. We collected data from four databases (cBioPortal, The Cancer Genome Atlas; cBioPortal, Genie; International Cancer Genome Consortium; ArrayExpress). In total, 111,283 samples were categorized according to 27 tumor locations. Ninety-nine functionally significant missense mutations and twelve functionally significant indel mutations in SLC34A2 were found. The most frequent mutations were SLC34A2-ROS1, p.T154A, p.P506S/R/L, p.G257A/E/R, p.S318W, p.A396T, p.P410L/S/H, p.S461C, p.A473T/V, and p.Y503H/C/F. The upregulation of SLC34A2 was found in samples of myeloid, bowel, ovarian, and uterine tumors; downregulation was found in tumor samples of breast, liver, lung, and skin cancer tumors. It was found that the life expectancy of breast and thymus cancer patients with an SLC34A2 mutation is lower, and it was revealed that SLC34A2 overexpression reduced the life span of patients with brain, ovarian, and pancreatic tumors. Thereby, for these types of oncological diseases, the mutational profile of SLC34A2 can be a potential prognostic marker for breast and thymus cancers, and the upregulation of SLC34A2 can be a potential prognostic marker for brain, ovarian, and pancreatic cancers.
Collapse
Affiliation(s)
- Ramilia Vlasenkova
- Department of Biochemistry, Biotechnology and Pharmacology, Institute of Fundamental Medicine and Biology, Kazan Federal University, 420008 Kazan, Russia; (R.V.); (A.N.); (N.A.); (M.B.)
| | - Alsina Nurgalieva
- Department of Biochemistry, Biotechnology and Pharmacology, Institute of Fundamental Medicine and Biology, Kazan Federal University, 420008 Kazan, Russia; (R.V.); (A.N.); (N.A.); (M.B.)
| | - Natalia Akberova
- Department of Biochemistry, Biotechnology and Pharmacology, Institute of Fundamental Medicine and Biology, Kazan Federal University, 420008 Kazan, Russia; (R.V.); (A.N.); (N.A.); (M.B.)
| | - Mikhail Bogdanov
- Department of Biochemistry, Biotechnology and Pharmacology, Institute of Fundamental Medicine and Biology, Kazan Federal University, 420008 Kazan, Russia; (R.V.); (A.N.); (N.A.); (M.B.)
- Department of Biochemistry and Molecular Biology, McGovern Medical School, The University of Texas Health Science Center, Houston, TX 77030, USA
| | - Ramziya Kiyamova
- Department of Biochemistry, Biotechnology and Pharmacology, Institute of Fundamental Medicine and Biology, Kazan Federal University, 420008 Kazan, Russia; (R.V.); (A.N.); (N.A.); (M.B.)
- Correspondence:
| |
Collapse
|
207
|
Turner AK, Yasir M, Bastkowski S, Telatin A, Page A, Webber M, Charles I. Chemical biology-whole genome engineering datasets predict new antibacterial combinations. Microb Genom 2021; 7. [PMID: 34874820 PMCID: PMC8767339 DOI: 10.1099/mgen.0.000718] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Trimethoprim and sulfamethoxazole are used commonly together as cotrimoxazole for the treatment of urinary tract and other infections. The evolution of resistance to these and other antibacterials threatens therapeutic options for clinicians. We generated and analysed a chemical-biology-whole-genome data set to predict new targets for antibacterial combinations with trimethoprim and sulfamethoxazole. For this we used a large transposon mutant library in Escherichia coli BW25113 where an outward-transcribing inducible promoter was engineered into one end of the transposon. This approach allows regulated expression of adjacent genes in addition to gene inactivation at transposon insertion sites, a methodology that has been called TraDIS-Xpress. These chemical genomic data sets identified mechanisms for both reduced and increased susceptibility to trimethoprim and sulfamethoxazole. The data identified that over-expression of FolA reduced trimethoprim susceptibility, a known mechanism for reduced susceptibility. In addition, transposon insertions into the genes tdk, deoR, ybbC, hha, ldcA, wbbK and waaS increased susceptibility to trimethoprim and likewise for rsmH, fadR, ddlB, nlpI and prc with sulfamethoxazole, while insertions in ispD, uspC, minC, minD, yebK, truD and umpG increased susceptibility to both these antibiotics. Two of these genes’ products, Tdk and IspD, are inhibited by AZT and fosmidomycin respectively, antibiotics that are known to synergise with trimethoprim. Thus, the data identified two known targets and several new target candidates for the development of co-drugs that synergise with trimethoprim, sulfamethoxazole or cotrimoxazole. We demonstrate that the TraDIS-Xpress technology can be used to generate information-rich chemical-genomic data sets that can be used for antibacterial development.
Collapse
Affiliation(s)
- Arthur K Turner
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | - Muhammad Yasir
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | - Sarah Bastkowski
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | - Andrea Telatin
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | - Andrew Page
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK.,University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK
| | - Mark Webber
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK.,University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK
| | - Ian Charles
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK.,University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK
| |
Collapse
|
208
|
Suzuki T, Ono Y, Bono H. Comparison of Oxidative and Hypoxic Stress Responsive Genes from Meta-Analysis of Public Transcriptomes. Biomedicines 2021; 9:biomedicines9121830. [PMID: 34944646 PMCID: PMC8698900 DOI: 10.3390/biomedicines9121830] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 11/23/2021] [Accepted: 11/30/2021] [Indexed: 01/11/2023] Open
Abstract
Analysis of RNA-sequencing (RNA-seq) data is an effective means to analyze the gene expression levels under specific conditions and discover new biological knowledge. More than 74,000 experimental series with RNA-seq have been stored in public databases as of 20 October 2021. Since this huge amount of expression data accumulated from past studies is a promising source of new biological insights, we focused on a meta-analysis of 1783 runs of RNA-seq data under the conditions of two types of stressors: oxidative stress (OS) and hypoxia. The collected RNA-seq data of OS were organized as the OS dataset to retrieve and analyze differentially expressed genes (DEGs). The OS-induced DEGs were compared with the hypoxia-induced DEGs retrieved from a previous study. The results from the meta-analysis of OS transcriptomes revealed two genes, CRIP1 and CRIP3, which were particularly downregulated, suggesting a relationship between OS and zinc homeostasis. The comparison between meta-analysis of OS and hypoxia showed that several genes were differentially expressed under both stress conditions, and it was inferred that the downregulation of cell cycle-related genes is a mutual biological process in both OS and hypoxia.
Collapse
|
209
|
A. Tindall C, Erkner E, Stichel J, G. Beck-Sickinger A, Hoffmann A, Weiner J, T. Heiker J. Cleavage of the vaspin N-terminus releases cell-penetrating peptides that affect early stages of adipogenesis and inhibit lipolysis in mature adipocytes. Adipocyte 2021; 10:216-231. [PMID: 33866927 PMCID: PMC8078822 DOI: 10.1080/21623945.2021.1910154] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Vaspin expression and function is related to metabolic disorders and comorbidities of obesity. In various cellular and animal models of obesity, diabetes and atherosclerosis vaspin has shown beneficial, protective and/or compensatory action. While testing proteases for inhibition by vaspin, we noticed specific cleavage within the vaspin N-terminus and sequence analysis predicted cell-penetrating activity for the released peptides. These findings raised the question whether these proteolytic peptides exhibit biological activity. We synthesized various N-terminal vaspin peptides to investigate cell-penetrating activity and analyse uptake mechanisms. Focusing on adipocytes, we performed microarray analysis and functional assays to elucidate biological activities of the vaspin–derived peptide, which is released by KLK7 cleavage (vaspin residues 21-30; VaspinN). Our study provides first evidence that proteolytic processing of the vaspin N-terminus releases cell-penetrating and bioactive peptides with effects on adipocyte biology. The VaspinN peptide increased preadipocyte proliferation, interfered with clonal expansion during the early stage of adipogenesis and blunted adrenergic cAMP-signalling, downstream lipolysis as well as insulin signalling in mature adipocytes. Protease-mediated release of functional N-terminal peptides presents an additional facet of vaspin action. Future studies will address the mechanisms underlying the biological activities and clarify, if vaspin-derived peptides may have potential as therapeutic agents for the treatment of metabolic diseases.
Collapse
Affiliation(s)
- Catherine A. Tindall
- Institute of Biochemistry, Faculty of Life Sciences, Leipzig University, Leipzig, Germany
| | - Estelle Erkner
- Institute of Biochemistry, Faculty of Life Sciences, Leipzig University, Leipzig, Germany
| | - Jan Stichel
- Institute of Biochemistry, Faculty of Life Sciences, Leipzig University, Leipzig, Germany
| | | | - Anne Hoffmann
- Helmholtz Institute for Metabolic, Obesity and Vascular Research (HI-MAG) of the Helmholtz Zentrum München at the University of Leipzig and University Hospital Leipzig, Leipzig, Germany
| | - Juliane Weiner
- Medical Department III - Endocrinology, Nephrology, Rheumatology, University of Leipzig Medical Center, Leipzig, Germany
| | - John T. Heiker
- Institute of Biochemistry, Faculty of Life Sciences, Leipzig University, Leipzig, Germany
- Helmholtz Institute for Metabolic, Obesity and Vascular Research (HI-MAG) of the Helmholtz Zentrum München at the University of Leipzig and University Hospital Leipzig, Leipzig, Germany
| |
Collapse
|
210
|
Burri D, Zavolan M. Shortening of 3' UTRs in most cell types composing tumor tissues implicates alternative polyadenylation in protein metabolism. RNA (NEW YORK, N.Y.) 2021; 27:1459-1470. [PMID: 34521731 PMCID: PMC8594477 DOI: 10.1261/rna.078886.121] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Accepted: 08/24/2021] [Indexed: 05/18/2023]
Abstract
During pre-mRNA maturation 3' end processing can occur at different polyadenylation sites in the 3' untranslated region (3' UTR) to give rise to transcript isoforms that differ in the length of their 3' UTRs. Longer 3' UTRs contain additional cis-regulatory elements that impact the fate of the transcript and/or of the resulting protein. Extensive alternative polyadenylation (APA) has been observed in cancers, but the mechanisms and roles remain elusive. In particular, it is unclear whether the APA occurs in the malignant cells or in other cell types that infiltrate the tumor. To resolve this, we developed a computational method, called SCUREL, that quantifies changes in 3' UTR length between groups of cells, including cells of the same type originating from tumor and control tissue. We used this method to study APA in human lung adenocarcinoma (LUAD). SCUREL relies solely on annotated 3' UTRs and on control systems such as T cell activation, and spermatogenesis gives qualitatively similar results at much greater sensitivity compared to the previously published scAPA method. In the LUAD samples, we find a general trend toward 3' UTR shortening not only in cancer cells compared to the cell type of origin, but also when comparing other cell types from the tumor vs. the control tissue environment. However, we also find high variability in the individual targets between patients. The findings help in understanding the extent and impact of APA in LUAD, which may support improvements in diagnosis and treatment.
Collapse
Affiliation(s)
- Dominik Burri
- Computational and Systems Biology, Biozentrum, University of Basel, Basel, CH-4056, Switzerland SIB Swiss Institute of Bioinformatics, Basel, CH-4056, Switzerland
| | - Mihaela Zavolan
- Computational and Systems Biology, Biozentrum, University of Basel, Basel, CH-4056, Switzerland SIB Swiss Institute of Bioinformatics, Basel, CH-4056, Switzerland
| |
Collapse
|
211
|
Defective cytokinin signaling reprograms lipid and flavonoid gene-to-metabolite networks to mitigate high salinity in Arabidopsis. Proc Natl Acad Sci U S A 2021; 118:2105021118. [PMID: 34815339 PMCID: PMC8640937 DOI: 10.1073/pnas.2105021118] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/06/2021] [Indexed: 12/13/2022] Open
Abstract
Cytokinin (CK) in plants regulates both developmental processes and adaptation to environmental stresses. Arabidopsis histidine phosphotransfer ahp2,3,5 and type-B Arabidopsis response regulator arr1,10,12 triple mutants are almost completely defective in CK signaling, and the ahp2,3,5 mutant was reported to be salt tolerant. Here, we demonstrate that the arr1,10,12 mutant is also more tolerant to salt stress than wild-type (WT) plants. A comprehensive metabolite profiling coupled with transcriptome analysis of the ahp2,3,5 and arr1,10,12 mutants was conducted to elucidate the salt tolerance mechanisms mediated by CK signaling. Numerous primary (e.g., sugars, amino acids, and lipids) and secondary (e.g., flavonoids and sterols) metabolites accumulated in these mutants under nonsaline and saline conditions, suggesting that both prestress and poststress accumulations of stress-related metabolites contribute to improved salt tolerance in CK-signaling mutants. Specifically, the levels of sugars (e.g., trehalose and galactinol), amino acids (e.g., branched-chain amino acids and γ-aminobutyric acid), anthocyanins, sterols, and unsaturated triacylglycerols were higher in the mutant plants than in WT plants. Notably, the reprograming of flavonoid and lipid pools was highly coordinated and concomitant with the changes in transcriptional levels, indicating that these metabolic pathways are transcriptionally regulated by CK signaling. The discovery of the regulatory role of CK signaling on membrane lipid reprogramming provides a greater understanding of CK-mediated salt tolerance in plants. This knowledge will contribute to the development of salt-tolerant crops with the ability to withstand salinity as a key driver to ensure global food security in the era of climate crisis.
Collapse
|
212
|
Wilks C, Zheng SC, Chen FY, Charles R, Solomon B, Ling JP, Imada EL, Zhang D, Joseph L, Leek JT, Jaffe AE, Nellore A, Collado-Torres L, Hansen KD, Langmead B. recount3: summaries and queries for large-scale RNA-seq expression and splicing. Genome Biol 2021; 22:323. [PMID: 34844637 PMCID: PMC8628444 DOI: 10.1186/s13059-021-02533-6] [Citation(s) in RCA: 90] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Accepted: 10/29/2021] [Indexed: 12/12/2022] Open
Abstract
We present recount3, a resource consisting of over 750,000 publicly available human and mouse RNA sequencing (RNA-seq) samples uniformly processed by our new Monorail analysis pipeline. To facilitate access to the data, we provide the recount3 and snapcount R/Bioconductor packages as well as complementary web resources. Using these tools, data can be downloaded as study-level summaries or queried for specific exon-exon junctions, genes, samples, or other features. Monorail can be used to process local and/or private data, allowing results to be directly compared to any study in recount3. Taken together, our tools help biologists maximize the utility of publicly available RNA-seq data, especially to improve their understanding of newly collected data. recount3 is available from http://rna.recount.bio .
Collapse
Affiliation(s)
- Christopher Wilks
- Department of Computer Science, Johns Hopkins University, Baltimore, USA
| | - Shijie C Zheng
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, USA
| | | | - Rone Charles
- Department of Computer Science, Johns Hopkins University, Baltimore, USA
| | - Brad Solomon
- Thomas M. Siebel Center for Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Jonathan P Ling
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, USA
| | - Eddie Luidy Imada
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY, USA
| | - David Zhang
- Institute of Child Health, University College London (UCL), London, UK
| | | | - Jeffrey T Leek
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, USA
| | - Andrew E Jaffe
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, USA
- Lieber Institute for Brain Development, Baltimore, USA
- Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, USA
- Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, USA
| | - Abhinav Nellore
- Department of Biomedical Engineering, Oregon Health & Science University, Portland, OR, USA
- Department of Surgery, Oregon Health & Science University, Portland, OR, USA
| | | | - Kasper D Hansen
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, USA.
- Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, USA.
| | - Ben Langmead
- Department of Computer Science, Johns Hopkins University, Baltimore, USA.
| |
Collapse
|
213
|
Kang W, Hu J, Zhao Q, Song F. Identification of an Autophagy-Related Risk Signature Correlates With Immunophenotype and Predicts Immune Checkpoint Blockade Efficacy of Neuroblastoma. Front Cell Dev Biol 2021; 9:731380. [PMID: 34746127 PMCID: PMC8567030 DOI: 10.3389/fcell.2021.731380] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2021] [Accepted: 08/31/2021] [Indexed: 11/30/2022] Open
Abstract
Neuroblastoma is one of the malignant solid tumors with the highest mortality in childhood. Targeted immunotherapy still cannot achieve satisfactory results due to heterogeneity and tolerance. Exploring markers related to prognosis and evaluating the immune microenvironment remain the major obstacles. Herein, we constructed an autophagy-related gene (ATG) risk model by multivariate Cox regression and least absolute shrinkage and selection operator regression, and identified four prognostic ATGs (BIRC5, GRID2, HK2, and RNASEL) in the training cohort, then verified the signature in the internal and external validation cohorts. BIRC5 and HK2 showed higher expression in MYCN amplified cell lines and tumor tissues consistently, whereas GRID2 and RNASEL showed the opposite trends. The correlation between the signature and clinicopathological parameters was further analyzed and showing consistency. A prognostic nomogram using risk score, International Neuroblastoma Staging System stage, age, and MYCN status was built subsequently, and the area under curves, net reclassification improvement, and integrated discrimination improvement showed more satisfactory prognostic predicting performance. The ATG prognostic signature itself can significantly divide patients with neuroblastoma into high- and low-risk groups; differentially expressed genes between the two groups were enriched in autophagy-related behaviors and immune cell reactions in gene set enrichment analysis (false discovery rate q -value < 0.05). Furthermore, we evaluated the relationship of the signature risk score with immune cell infiltration and the cancer-immunity cycle. The low-risk group was characterized by more abundant expression of chemokines and higher immune checkpoints (PDL1, PD1, CTLA-4, and IDO1). The risk score was significantly correlated with the proportions of CD8+ T cells, CD4+ memory resting T cells, follicular helper T cells, memory B cells, plasma cells, and M2 macrophages in tumor tissues. In conclusion, we developed and validated an autophagy-related signature that can accurately predict the prognosis, which might be meaningful to understand the immune microenvironment and guide immune checkpoint blockade.
Collapse
Affiliation(s)
- Wenjuan Kang
- Department of Epidemiology and Biostatistics, Key Laboratory of Molecular Cancer Epidemiology, Tianjin, National Clinical Research Center of Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin, China
| | - Jiajian Hu
- Tianjin Key Laboratory of Cancer Prevention and Therapy, Department of Pediatric Oncology, National Clinical Research Center for Cancer, Tianjin's Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin, China
| | - Qiang Zhao
- Tianjin Key Laboratory of Cancer Prevention and Therapy, Department of Pediatric Oncology, National Clinical Research Center for Cancer, Tianjin's Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin, China
| | - Fengju Song
- Department of Epidemiology and Biostatistics, Key Laboratory of Molecular Cancer Epidemiology, Tianjin, National Clinical Research Center of Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin, China
| |
Collapse
|
214
|
Okido T, Kodama Y, Mashima J, Kosuge T, Fujisawa T, Ogasawara O. DNA Data Bank of Japan (DDBJ) update report 2021. Nucleic Acids Res 2021; 50:D102-D105. [PMID: 34751405 PMCID: PMC8689959 DOI: 10.1093/nar/gkab995] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2021] [Revised: 10/08/2021] [Accepted: 10/12/2021] [Indexed: 11/29/2022] Open
Abstract
The Bioinformation and DDBJ (DNA Data Bank of Japan) Center (DDBJ Center; https://www.ddbj.nig.ac.jp) operates archival databases that collect nucleotide sequences, study and sample information, and distribute them without access restriction to progress life science research as a member of the International Nucleotide Sequence Database Collaboration (INSDC), in collaboration with the National Center for Biotechnology Information (NCBI) and the European Bioinformatics Institute. Besides the INSDC databases, the DDBJ Center also provides the Genomic Expression Archive for functional genomics data and the Japanese Genotype-phenotype Archive for human data requiring controlled access. Additionally, the DDBJ Center started a new public repository, MetaboBank, for experimental raw data and metadata from metabolomics research in October 2020. In response to the COVID-19 pandemic, the DDBJ Center openly shares SARS-CoV-2 genome sequences in collaboration with Shizuoka Prefecture and Keio University. The operation of DDBJ is based on the National Institute of Genetics (NIG) supercomputer, which is open for large-scale sequence data analysis for life science researchers. This paper reports recent updates on the archival databases and the services of DDBJ.
Collapse
Affiliation(s)
- Toshihisa Okido
- Bioinformation and DDBJ Center, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Yuichi Kodama
- Bioinformation and DDBJ Center, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Jun Mashima
- Bioinformation and DDBJ Center, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Takehide Kosuge
- Bioinformation and DDBJ Center, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Takatomo Fujisawa
- Bioinformation and DDBJ Center, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Osamu Ogasawara
- Bioinformation and DDBJ Center, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| |
Collapse
|
215
|
Courtot M, Gupta D, Liyanage I, Xu F, Burdett T. BioSamples database: FAIRer samples metadata to accelerate research data management. Nucleic Acids Res 2021; 50:D1500-D1507. [PMID: 34747489 PMCID: PMC8728232 DOI: 10.1093/nar/gkab1046] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 10/13/2021] [Accepted: 10/14/2021] [Indexed: 12/04/2022] Open
Abstract
The BioSamples database at EMBL-EBI is the central institutional repository for sample metadata storage and connection to EMBL-EBI archives and other resources. The technical improvements to our infrastructure described in our last update have enabled us to scale and accommodate an increasing number of communities, resulting in a higher number of submissions and more heterogeneous data. The BioSamples database now has a valuable set of features and processes to improve data quality in BioSamples, and in particular enriching metadata content and following FAIR principles. In this manuscript, we describe how BioSamples in 2021 handles requirements from our community of users through exemplar use cases: increased findability of samples and improved data management practices support the goals of the ReSOLUTE project, how the plant community benefits from being able to link genotypic to phenotypic information, and we highlight how cumulatively those improvements contribute to more complex multi-omics data integration supporting COVID-19 research. Finally, we present underlying technical features used as pillars throughout those use cases and how they are reused for expanded engagement with communities such as FAIRplus and the Global Alliance for Genomics and Health. Availability: The BioSamples database is freely available at http://www.ebi.ac.uk/biosamples. Content is distributed under the EMBL-EBI Terms of Use available at https://www.ebi.ac.uk/about/terms-of-use. The BioSamples code is available at https://github.com/EBIBioSamples/biosamples-v4 and distributed under the Apache 2.0 license.
Collapse
Affiliation(s)
- Mélanie Courtot
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Dipayan Gupta
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Isuru Liyanage
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Fuqi Xu
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Tony Burdett
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| |
Collapse
|
216
|
Band-based similarity indices for gene expression classification and clustering. Sci Rep 2021; 11:21609. [PMID: 34732744 PMCID: PMC8566472 DOI: 10.1038/s41598-021-00678-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 10/11/2021] [Indexed: 11/16/2022] Open
Abstract
The concept of depth induces an ordering from centre outwards in multivariate data. Most depth definitions are unfeasible for dimensions larger than three or four, but the Modified Band Depth (MBD) is a notable exception that has proven to be a valuable tool in the analysis of high-dimensional gene expression data. This depth definition relates the centrality of each individual to its (partial) inclusion in all possible bands formed by elements of the data set. We assess (dis)similarity between pairs of observations by accounting for such bands and constructing binary matrices associated to each pair. From these, contingency tables are calculated and used to derive standard similarity indices. Our approach is computationally efficient and can be applied to bands formed by any number of observations from the data set. We have evaluated the performance of several band-based similarity indices with respect to that of other classical distances in standard classification and clustering tasks in a variety of simulated and real data sets. However, the use of the method is not restricted to these, the extension to other similarity coefficients being straightforward. Our experiments show the benefits of our technique, with some of the selected indices outperforming, among others, the Euclidean distance.
Collapse
|
217
|
Toufiq M, Huang SSY, Boughorbel S, Alfaki M, Rinchai D, Saraiva LR, Chaussabel D, Garand M. SysInflam HuDB, a Web Resource for Mining Human Blood Cells Transcriptomic Data Associated with Systemic Inflammatory Responses to Sepsis. THE JOURNAL OF IMMUNOLOGY 2021; 207:2195-2202. [PMID: 34663591 DOI: 10.4049/jimmunol.2100697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 08/30/2021] [Indexed: 11/19/2022]
Abstract
Sepsis develops after a dysregulated host inflammatory response to a systemic infection. Identification of sepsis biomarkers has been challenging because of the multifactorial causes of disease susceptibility and progression. Public transcriptomic data are a valuable resource for mechanistic discoveries and cross-studies concordance of heterogeneous diseases. Nonetheless, the approach requires structured methodologies and effective visualization tools for meaningful data interpretation. Currently, no such database exists for sepsis or systemic inflammatory diseases in human. Hence we curated SysInflam HuDB (http://sepsis.gxbsidra.org/dm3/geneBrowser/list), a unique collection of human blood transcriptomic datasets associated with systemic inflammatory responses to sepsis. The transcriptome collection and the associated clinical metadata are integrated onto a user-friendly and Web-based interface that allows the simultaneous exploration, visualization, and interpretation of multiple datasets stemming from different study designs. To date, the collection encompasses 62 datasets and 5719 individual profiles. Concordance of gene expression changes with the associated literature was assessed, and additional analyses are presented to showcase database utility. Combined with custom data visualization at the group and individual levels, SysInflam HuDB facilitates the identification of specific human blood gene signatures in response to infection (e.g., patients with sepsis versus healthy control subjects) and the delineation of major genetic drivers associated with inflammation onset and progression under various conditions.
Collapse
Affiliation(s)
| | - Susie Shih Yin Huang
- Sidra Medicine, Doha, Qatar.,Division of Pediatric Cardiothoracic Surgery, Department of Surgery, Washington University School of Medicine, St. Louis, MO; and
| | | | | | | | - Luis R Saraiva
- Sidra Medicine, Doha, Qatar.,College of Health and Life Sciences, Hamad Bin Khalifa University, Doha, Qatar
| | | | | |
Collapse
|
218
|
Huang R, Wang S, Zhu R, Xian S, Huang Z, Cheng L, Zhang J. Identification of Key eRNAs for Spinal Cord Injury by Integrated Multinomial Bioinformatics Analysis. Front Cell Dev Biol 2021; 9:728242. [PMID: 34708039 PMCID: PMC8542800 DOI: 10.3389/fcell.2021.728242] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Accepted: 08/25/2021] [Indexed: 12/25/2022] Open
Abstract
Background: Spinal cord injury (SCI) is a severe neurological deficit affecting both young and older people worldwide. The potential role of key enhancer RNAs (eRNAs) in SCI remains elusive, which is a prominent challenge in the trauma repair process. This study aims to investigate the roles of key eRNAs, transcription factors (TFs), signaling pathways, and small-molecule inhibitors in SCI using multi-omics bioinformatics analysis. Methods: Microarray data of peripheral blood mononuclear cell (PBMC) samples from 27 healthy volunteers and 25 chronic-phase SCI patients were retrieved from the Gene Expression Omnibus database. Differentially expressed transcription factors (DETFs), differentially expressed enhancer RNAs (DEeRNAs), and differentially expressed target genes (DETGs) were identified using the Linear Models for Microarray Data (limma) package. Fraction of immune cells was estimated using CIBERSORT algorithm. Gene Set Variation Analysis (GSVA) was applied to identify the downstream signaling pathways. The eRNA regulatory network was constructed based on the correlation results. Connectivity Map (CMap) database was used to find potential drugs for SCI patients. The cellular communication analysis was performed to explore the molecular regulation mechanism of SCI based on single-cell RNA sequencing (scRNA-seq) data. Chromatin immunoprecipitation sequencing (ChIP-seq) and Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) data were used to validate the key regulatory mechanisms. scRNA-seq dataset was used to validate the cell subtype localization of the key eRNAs. Results: In total, 21 DETFs, 24 DEeRNAs, and 829 DETGs were identified. A regulatory network of 13 DETFs, six DEeRNAs, seven DETGs, two hallmark pathways, two immune cells, and six immune pathways was constructed. The link of Splicing factor proline and glutamine rich (SFPQ) (TF) and vesicular overexpressed in cancer prosurvival protein 1 (VOPP1) (eRNA) (R = 0.990, p < 0.001, positive), VOPP1 (eRNA) and epidermal growth factor receptor (EGFR) (target gene) (R = 0.974, p < 0.001, positive), VOPP1, and T helper (Th) cells (R = -0.987, p < 0.001, negative), and VOPP1 and hallmark coagulation (R = 0.937, p < 0.001, positive) was selected. Trichostatin A was considered the best compound target to SCI-related eRNAs (specificity = 0.471, p < 0.001). Conclusion: VOPP1, upregulated by SFPQ, strengthened the transient expression of EGFR. Th cells and coagulation were the potential downstream pathways of VOPP1. This regulatory network and potential inhibitors provide novel diagnostic biomarkers and therapeutic targets for SCI.
Collapse
Affiliation(s)
- Runzhi Huang
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Shanghai, China.,Division of Spine Surgery, Department of Orthopedics, Tongji Hospital, Tongji University School of Medicine, Shanghai, China.,Tongji University School of Medicine, Shanghai, China
| | - Siqiao Wang
- Tongji University School of Medicine, Shanghai, China
| | - Rui Zhu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Shanghai, China.,Division of Spine Surgery, Department of Orthopedics, Tongji Hospital, Tongji University School of Medicine, Shanghai, China.,Tongji University School of Medicine, Shanghai, China
| | - Shuyuan Xian
- Tongji University School of Medicine, Shanghai, China
| | - Zongqiang Huang
- Department of Orthopedics, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Liming Cheng
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Shanghai, China.,Division of Spine Surgery, Department of Orthopedics, Tongji Hospital, Tongji University School of Medicine, Shanghai, China
| | - Jie Zhang
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Shanghai, China.,Division of Spine Surgery, Department of Orthopedics, Tongji Hospital, Tongji University School of Medicine, Shanghai, China
| |
Collapse
|
219
|
Ringwald M, Richardson JE, Baldarelli RM, Blake JA, Kadin JA, Smith C, Bult CJ. Mouse Genome Informatics (MGI): latest news from MGD and GXD. Mamm Genome 2021; 33:4-18. [PMID: 34698891 PMCID: PMC8913530 DOI: 10.1007/s00335-021-09921-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 09/21/2021] [Indexed: 12/01/2022]
Abstract
The Mouse Genome Informatics (MGI) database system combines multiple expertly curated community data resources into a shared knowledge management ecosystem united by common metadata annotation standards. MGI's mission is to facilitate the use of the mouse as an experimental model for understanding the genetic and genomic basis of human health and disease. MGI is the authoritative source for mouse gene, allele, and strain nomenclature and is the primary source of mouse phenotype annotations, functional annotations, developmental gene expression information, and annotations of mouse models with human diseases. MGI maintains mouse anatomy and phenotype ontologies and contributes to the development of the Gene Ontology and Disease Ontology and uses these ontologies as standard terminologies for annotation. The Mouse Genome Database (MGD) and the Gene Expression Database (GXD) are MGI's two major knowledgebases. Here, we highlight some of the recent changes and enhancements to MGD and GXD that have been implemented in response to changing needs of the biomedical research community and to improve the efficiency of expert curation. MGI can be accessed freely at http://www.informatics.jax.org .
Collapse
|
220
|
Xie B, Jiang Q, Mora A, Li X. Automatic cell type identification methods for single-cell RNA sequencing. Comput Struct Biotechnol J 2021; 19:5874-5887. [PMID: 34815832 PMCID: PMC8572862 DOI: 10.1016/j.csbj.2021.10.027] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Revised: 09/23/2021] [Accepted: 10/18/2021] [Indexed: 11/24/2022] Open
Abstract
Single-cell RNA sequencing (scRNA-seq) has become a powerful tool for scientists of many research disciplines due to its ability to elucidate the heterogeneous and complex cell-type compositions of different tissues and cell populations. Traditional cell-type identification methods for scRNA-seq data analysis are time-consuming and knowledge-dependent for manual annotation. By contrast, automatic cell-type identification methods may have the advantages of being fast, accurate, and more user friendly. Here, we discuss and evaluate thirty-two published automatic methods for scRNA-seq data analysis in terms of their prediction accuracy, F1-score, unlabeling rate and running time. We highlight the advantages and disadvantages of these methods and provide recommendations of method choice depending on the available information. The challenges and future applications of these automatic methods are further discussed. In addition, we provide a free scRNA-seq data analysis package encompassing the discussed automatic methods to help the easy usage of them in real-world applications.
Collapse
Affiliation(s)
- Bingbing Xie
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou 510060, Guangdong, China
| | - Qin Jiang
- Affiliated Eye Hospital of Nanjing Medical University, Nanjing, China
| | - Antonio Mora
- Joint School of Life Sciences, Guangzhou Medical University and Guangzhou Institutes of Biomedicine and Health (Chinese Academy of Sciences), Xinzao, Panyu District, Guangzhou 511436, Guangdong, China
| | - Xuri Li
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou 510060, Guangdong, China
| |
Collapse
|
221
|
Feller FM, Eilebrecht S, Nedielkov R, Yücel O, Alvincz J, Salinas G, Ludwig KC, Möller H, Philipp B. Investigations on the Degradation of the Bile Salt Cholate via the 9,10- Seco-Pathway Reveals the Formation of a Novel Recalcitrant Steroid Compound by a Side Reaction in Sphingobium sp. Strain Chol11. Microorganisms 2021; 9:microorganisms9102146. [PMID: 34683472 PMCID: PMC8540908 DOI: 10.3390/microorganisms9102146] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 10/07/2021] [Accepted: 10/11/2021] [Indexed: 01/30/2023] Open
Abstract
Bile salts such as cholate are steroid compounds from the digestive tracts of vertebrates, which enter the environment upon excretion, e.g., in manure. Environmental bacteria degrade bile salts aerobically via two pathway variants involving intermediates with Δ1,4- or Δ4,6-3-keto-structures of the steroid skeleton. Recent studies indicated that degradation of bile salts via Δ4,6-3-keto intermediates in Sphingobium sp. strain Chol11 proceeds via 9,10-seco cleavage of the steroid skeleton. For further elucidation, the presumptive product of this cleavage, 3,12β-dihydroxy-9,10-seco-androsta-1,3,5(10),6-tetraene-9,17-dione (DHSATD), was provided to strain Chol11 in a co-culture approach with Pseudomonas stutzeri Chol1 and as purified substrate. Strain Chol11 converted DHSATD to the so far unknown compound 4-methyl-3-deoxy-1,9,12-trihydroxyestra-1,3,5(10)7-tetraene-6,17-dione (MDTETD), presumably in a side reaction involving an unusual ring closure. MDTETD was neither degraded by strains Chol1 and Chol11 nor in enrichment cultures. Functional transcriptome profiling of zebrafish embryos after exposure to MDTETD identified a significant overrepresentation of genes linked to hormone responses. In both pathway variants, steroid degradation intermediates transiently accumulate in supernatants of laboratory cultures. Soil slurry experiments indicated that bacteria using both pathway variants were active and also released their respective intermediates into the environment. This instance could enable the formation of recalcitrant steroid metabolites by interspecies cross-feeding in agricultural soils.
Collapse
Affiliation(s)
- Franziska Maria Feller
- Institute for Molecular Microbiology and Biotechnology, University of Münster, Corrensstr. 3, 48149 Münster, Germany; (F.M.F.); (O.Y.); (K.C.L.)
| | - Sebastian Eilebrecht
- Fraunhofer Attract Eco’n’OMICs, Fraunhofer Institute for Molecular Biology and Applied Ecology IME, Auf dem Aberg 1, 57392 Schmallenberg, Germany; (S.E.); (J.A.)
| | - Ruslan Nedielkov
- Institute for Chemistry, University of Potsdam, Karl-Liebknecht-Straße 24-25, 14476 Potsdam, Germany; (R.N.); (H.M.)
| | - Onur Yücel
- Institute for Molecular Microbiology and Biotechnology, University of Münster, Corrensstr. 3, 48149 Münster, Germany; (F.M.F.); (O.Y.); (K.C.L.)
| | - Julia Alvincz
- Fraunhofer Attract Eco’n’OMICs, Fraunhofer Institute for Molecular Biology and Applied Ecology IME, Auf dem Aberg 1, 57392 Schmallenberg, Germany; (S.E.); (J.A.)
| | - Gabriela Salinas
- NGS-Services for Integrative Genomics, Institute for Human Genetics, University of Göttingen, 37077 Göttingen, Germany;
| | - Kevin Christopher Ludwig
- Institute for Molecular Microbiology and Biotechnology, University of Münster, Corrensstr. 3, 48149 Münster, Germany; (F.M.F.); (O.Y.); (K.C.L.)
| | - Heiko Möller
- Institute for Chemistry, University of Potsdam, Karl-Liebknecht-Straße 24-25, 14476 Potsdam, Germany; (R.N.); (H.M.)
| | - Bodo Philipp
- Institute for Molecular Microbiology and Biotechnology, University of Münster, Corrensstr. 3, 48149 Münster, Germany; (F.M.F.); (O.Y.); (K.C.L.)
- Department for Environmental Microbiology, Fraunhofer Institute for Molecular Biology and Applied Ecology IME, Auf dem Aberg 1, 57392 Schmallenberg, Germany
- Correspondence: ; Tel.: +49-251-8339827; Fax: +49-251-8338388
| |
Collapse
|
222
|
Zeng J, Zhang Y, Shang Y, Mai J, Shi S, Lu M, Bu C, Zhang Z, Zhang Z, Li Y, Du Z, Xiao J. CancerSCEM: a database of single-cell expression map across various human cancers. Nucleic Acids Res 2021; 50:D1147-D1155. [PMID: 34643725 PMCID: PMC8728207 DOI: 10.1093/nar/gkab905] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 09/15/2021] [Accepted: 09/29/2021] [Indexed: 12/11/2022] Open
Abstract
With the proliferating studies of human cancers by single-cell RNA sequencing technique (scRNA-seq), cellular heterogeneity, immune landscape and pathogenesis within diverse cancers have been uncovered successively. The exponential explosion of massive cancer scRNA-seq datasets in the past decade are calling for a burning demand to be integrated and processed for essential investigations in tumor microenvironment of various cancer types. To fill this gap, we developed a database of Cancer Single-cell Expression Map (CancerSCEM, https://ngdc.cncb.ac.cn/cancerscem), particularly focusing on a variety of human cancers. To date, CancerSCE version 1.0 consists of 208 cancer samples across 28 studies and 20 human cancer types. A series of uniformly and multiscale analyses for each sample were performed, including accurate cell type annotation, functional gene expressions, cell interaction network, survival analysis and etc. Plus, we visualized CancerSCEM as a user-friendly web interface for users to browse, search, online analyze and download all the metadata as well as analytical results. More importantly and unprecedentedly, the newly-constructed comprehensive online analyzing platform in CancerSCEM integrates seven analyze functions, where investigators can interactively perform cancer scRNA-seq analyses. In all, CancerSCEM paves an informative and practical way to facilitate human cancer studies, and also provides insights into clinical therapy assessments.
Collapse
Affiliation(s)
- Jingyao Zeng
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yadong Zhang
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yunfei Shang
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jialin Mai
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Shuo Shi
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Mingming Lu
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Congfan Bu
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhewen Zhang
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Zaichao Zhang
- Department of Biology, The University of Western Ontario, London, Ontario N6A 5B7, Canada
| | - Yang Li
- Beijing Tongren Eye Center, Beijing key Laboratory of Intraocular Tumor Diagnosis and Treatment, Beijing Tongren Hospital, Capital Medical University, Beijing 100730, China
| | - Zhenglin Du
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Jingfa Xiao
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
223
|
The network interplay of interferon and Toll-like receptor signaling pathways in the anti-Candida immune response. Sci Rep 2021; 11:20281. [PMID: 34645905 PMCID: PMC8514550 DOI: 10.1038/s41598-021-99838-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Accepted: 09/30/2021] [Indexed: 01/22/2023] Open
Abstract
Fungal infections represent a major global health problem affecting over a billion people that kills more than 1.5 million annually. In this study, we employed an integrative approach to reveal the landscape of the human immune responses to Candida spp. through meta-analysis of microarray, bulk, and single-cell RNA sequencing (scRNA-seq) data for the blood transcriptome. We identified across these different studies a consistent interconnected network interplay of signaling molecules involved in both Toll-like receptor (TLR) and interferon (IFN) signaling cascades that is activated in response to different Candida species (C. albicans, C. auris, C. glabrata, C. parapsilosis, and C. tropicalis). Among these molecules are several types I IFN, indicating an overlap with antiviral immune responses. scRNA-seq data confirmed that genes commonly identified by the three transcriptomic methods show cell type-specific expression patterns in various innate and adaptive immune cells. These findings shed new light on the anti-Candida immune response, providing putative molecular pathways for therapeutic intervention.
Collapse
|
224
|
A proteomics sample metadata representation for multiomics integration and big data analysis. Nat Commun 2021; 12:5854. [PMID: 34615866 PMCID: PMC8494749 DOI: 10.1038/s41467-021-26111-3] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Accepted: 09/16/2021] [Indexed: 11/08/2022] Open
Abstract
The amount of public proteomics data is rapidly increasing but there is no standardized format to describe the sample metadata and their relationship with the dataset files in a way that fully supports their understanding or reanalysis. Here we propose to develop the transcriptomics data format MAGE-TAB into a standard representation for proteomics sample metadata. We implement MAGE-TAB-Proteomics in a crowdsourcing project to manually curate over 200 public datasets. We also describe tools and libraries to validate and submit sample metadata-related information to the PRIDE repository. We expect that these developments will improve the reproducibility and facilitate the reanalysis and integration of public proteomics datasets.
Collapse
|
225
|
Integrating Statistical and Machine-Learning Approach for Meta-Analysis of Bisphenol A-Exposure Datasets Reveals Effects on Mouse Gene Expression within Pathways of Apoptosis and Cell Survival. Int J Mol Sci 2021; 22:ijms221910785. [PMID: 34639124 PMCID: PMC8509605 DOI: 10.3390/ijms221910785] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 09/23/2021] [Accepted: 09/27/2021] [Indexed: 12/19/2022] Open
Abstract
Bisphenols are important environmental pollutants that are extensively studied due to different detrimental effects, while the molecular mechanisms behind these effects are less well understood. Like other environmental pollutants, bisphenols are being tested in various experimental models, creating large expression datasets found in open access storage. The meta-analysis of such datasets is, however, very complicated for various reasons. Here, we developed an integrating statistical and machine-learning model approach for the meta-analysis of bisphenol A (BPA) exposure datasets from different mouse tissues. We constructed three joint datasets following three different strategies for dataset integration: in particular, using all common genes from the datasets, uncorrelated, and not co-expressed genes, respectively. By applying machine learning methods to these datasets, we identified genes whose expression was significantly affected in all of the BPA microanalysis data tested; those involved in the regulation of cell survival include: Tnfr2, Hgf-Met, Agtr1a, Bdkrb2; signaling through Mapk8 (Jnk1)); DNA repair (Hgf-Met, Mgmt); apoptosis (Tmbim6, Bcl2, Apaf1); and cellular junctions (F11r, Cldnd1, Ctnd1 and Yes1). Our results highlight the benefit of combining existing datasets for the integrated analysis of a specific topic when individual datasets are limited in size.
Collapse
|
226
|
Qu J, Zheng B, Ohuchida K, Feng H, Chong SJF, Zhang X, Liang R, Liu Z, Shirahane K, Mizumoto K, Gong P, Nakamura M. PIK3CB is involved in metastasis through the regulation of cell adhesion to collagen I in pancreatic cancer. J Adv Res 2021; 33:127-140. [PMID: 34603784 PMCID: PMC8463925 DOI: 10.1016/j.jare.2021.02.002] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Revised: 01/28/2021] [Accepted: 02/06/2021] [Indexed: 12/15/2022] Open
Abstract
Introduction Pancreatic adenocarcinoma (PAAD) is an aggressive malignancy, with a major mortality resulting from the rapid progression of metastasis. Unfortunately, no effective treatment strategy has been developed for PAAD metastasis to date. Thus, unraveling the mechanisms involved in PAAD metastatic phenotype may facilitate the treatment for PAAD patients. Objectives PIK3CB is an oncogene implicated in cancer development and progression but less is known about whether PIK3CB participates in PAAD metastasis. Therefore, the objective of this study is to explore the mechanism(s) of PIK3CB in PAAD metastasis. Methods In our study, we examined the PIK3CB expression pattern using bioinformatic analysis and clinical material derived from patients with PAAD. Subsequently, a series of biochemical experiments were conducted to investigate the role of PIK3CB as potential mechanism(s) underlying PAAD metastasis in vivo using nude mice and in vitro using cell lines. Results We observed that PIK3CB was involved in PAAD progression. Notably, we identified that PIK3CB was involved in PAAD metastasis. Downregulation of PIK3CB significantly reduced PAAD metastatic potential in vivo. Furthermore, a series of bioinformatic analyses showed that PIK3CB was involved in cell adhesion in PAAD. Notably, PIK3CB depletion inhibited invasion potential specifically via suppressing cell adhesion to collagen I in PAAD cells. Conclusion Collectively, our findings indicate that PIK3CB is involved in PAAD metastasis through cell-matrix adhesion. We proposed that PIK3CB is a potential therapeutic target for PAAD therapy.
Collapse
Affiliation(s)
- Jianhua Qu
- Department of General Surgery & Carson International Cancer Research Center, Shenzhen University General Hospital and Shenzhen University Clinical Medical Academy, Shenzhen, Guangdong 518055, China.,Department of Physiology, Yong Loo Lin School of Medicine, National University of Singapore (NUS), Singapore 117593, Singapore
| | - Biao Zheng
- Department of General Surgery & Carson International Cancer Research Center, Shenzhen University General Hospital and Shenzhen University Clinical Medical Academy, Shenzhen, Guangdong 518055, China.,Department of Surgery and Oncology, Graduate School of Medical Sciences, Kyushu University, Fukuoka 812-8582, Japan
| | - Kenoki Ohuchida
- Department of Surgery and Oncology, Graduate School of Medical Sciences, Kyushu University, Fukuoka 812-8582, Japan.,Advanced Medical Initiatives, Graduate School of Medical Sciences, Kyushu University, Fukuoka 812-8582, Japan
| | - Haimin Feng
- Department of Surgery and Oncology, Graduate School of Medical Sciences, Kyushu University, Fukuoka 812-8582, Japan
| | | | - Xianbin Zhang
- Department of General Surgery & Carson International Cancer Research Center, Shenzhen University General Hospital and Shenzhen University Clinical Medical Academy, Shenzhen, Guangdong 518055, China
| | - Rui Liang
- Department of General Surgery & Carson International Cancer Research Center, Shenzhen University General Hospital and Shenzhen University Clinical Medical Academy, Shenzhen, Guangdong 518055, China
| | - Zhong Liu
- Department of General Surgery & Carson International Cancer Research Center, Shenzhen University General Hospital and Shenzhen University Clinical Medical Academy, Shenzhen, Guangdong 518055, China
| | - Kengo Shirahane
- Department of Surgery and Oncology, Graduate School of Medical Sciences, Kyushu University, Fukuoka 812-8582, Japan
| | - Kazuhiro Mizumoto
- Department of Surgery and Oncology, Graduate School of Medical Sciences, Kyushu University, Fukuoka 812-8582, Japan.,Cancer Center of Kyushu University Hospital, Fukuoka 812-8582, Japan
| | - Peng Gong
- Department of General Surgery & Carson International Cancer Research Center, Shenzhen University General Hospital and Shenzhen University Clinical Medical Academy, Shenzhen, Guangdong 518055, China.,Guangdong Key Laboratory of Regional Immunity and Diseases, Shenzhen University Health Science Center, Shenzhen, Guangdong 518060, China
| | - Masafumi Nakamura
- Department of Surgery and Oncology, Graduate School of Medical Sciences, Kyushu University, Fukuoka 812-8582, Japan
| |
Collapse
|
227
|
Ghandikota S, Sharma M, Jegga AG. Computational workflow for functional characterization of COVID-19 through secondary data analysis. STAR Protoc 2021; 2:100873. [PMID: 34746856 PMCID: PMC8551262 DOI: 10.1016/j.xpro.2021.100873] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
Standard transcriptomic analyses cannot fully capture the molecular mechanisms underlying disease pathophysiology and outcomes. We present a computational heterogeneous data integration and mining protocol that combines transcriptional signatures from multiple model systems, protein-protein interactions, single-cell RNA-seq markers, and phenotype-genotype associations to identify functional feature complexes. These feature modules represent a higher order multifeatured machines collectively working toward common pathophysiological goals. We apply this protocol for functional characterization of COVID-19, but it could be applied to many other diseases. For complete details on the use and execution of this protocol, please refer to Ghandikota et al. (2021).
Collapse
Affiliation(s)
- Sudhir Ghandikota
- Division of Biomedical Informatics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA,Department of Computer Science, University of Cincinnati College of Engineering, Cincinnati, OH, USA,Corresponding author
| | - Mihika Sharma
- Division of Biomedical Informatics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
| | - Anil G. Jegga
- Division of Biomedical Informatics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA,Department of Computer Science, University of Cincinnati College of Engineering, Cincinnati, OH, USA,Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA,Corresponding author
| |
Collapse
|
228
|
Zhou R, Feng Y, Ye J, Han Z, Liang Y, Chen Q, Xu X, Huang Y, Jia Z, Zhong W. Prediction of Biochemical Recurrence-Free Survival of Prostate Cancer Patients Leveraging Multiple Gene Expression Profiles in Tumor Microenvironment. Front Oncol 2021; 11:632571. [PMID: 34631510 PMCID: PMC8495167 DOI: 10.3389/fonc.2021.632571] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Accepted: 08/19/2021] [Indexed: 12/11/2022] Open
Abstract
Tumor-adjacent normal (TAN) tissues, which constitute tumor microenvironment and are different from healthy tissues, provide critical information at molecular levels that can be used to differentiate aggressive tumors from indolent tumors. In this study, we analyzed 52 TAN samples from the Cancer Genome Atlas (TCGA) prostate cancer patients and developed a 10-gene prognostic model that can accurately predict biochemical recurrence-free survival based on the profiles of these genes in TAN tissues. The predictive ability was validated using TAN samples from an independent cohort. These 10 prognostic genes in tumor microenvironment are different from the prognostic genes detected in tumor tissues, indicating distinct progression-related mechanisms in two tissue types. Bioinformatics analysis showed that the prognostic genes in tumor microenvironment were significantly enriched by p53 signaling pathway, which may represent the crosstalk tunnels between tumor and its microenvironment and pathways involving cell-to-cell contact and paracrine/endocrine signaling. The insight acquired by this study has advanced our knowledge of the potential role of tumor microenvironment in prostate cancer progression.
Collapse
Affiliation(s)
- Rui Zhou
- School of Medicine, South China University of Technology, Guangzhou, China
| | - Yuanfa Feng
- Department of Urology, Guangdong Key Laboratory of Clinical Molecular Medicine and Diagnostics, Guangzhou First People’s Hospital, School of Medicine, South China University of Technology, Guangzhou, China
| | - Jianheng Ye
- Department of Urology, Guangdong Key Laboratory of Clinical Molecular Medicine and Diagnostics, Guangzhou First People’s Hospital, School of Medicine, South China University of Technology, Guangzhou, China
| | - Zhaodong Han
- Department of Urology, Guangdong Key Laboratory of Clinical Molecular Medicine and Diagnostics, Guangzhou First People’s Hospital, School of Medicine, South China University of Technology, Guangzhou, China
| | - Yuxiang Liang
- Department of Urology, Guangdong Key Laboratory of Clinical Molecular Medicine and Diagnostics, Guangzhou First People’s Hospital, School of Medicine, South China University of Technology, Guangzhou, China
| | - Qingbiao Chen
- Affiliated Foshan Hospital of Southern Medical University, Southern Medical University, Foshan, China
| | - Xiaoming Xu
- Department of Urology, Hwa Mei Hospital, University of Chinese Academy of Sciences, Ningbo, China
| | - Yuhan Huang
- Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA, United States
| | - Zhenyu Jia
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, United States
| | - Weide Zhong
- School of Medicine, South China University of Technology, Guangzhou, China
- Department of Urology, Guangdong Key Laboratory of Clinical Molecular Medicine and Diagnostics, Guangzhou First People’s Hospital, School of Medicine, South China University of Technology, Guangzhou, China
| |
Collapse
|
229
|
Johnson D, Batista D, Cochrane K, Davey RP, Etuk A, Gonzalez-Beltran A, Haug K, Izzo M, Larralde M, Lawson TN, Minotto A, Moreno P, Nainala VC, O'Donovan C, Pireddu L, Roger P, Shaw F, Steinbeck C, Weber RJM, Sansone SA, Rocca-Serra P. ISA API: An open platform for interoperable life science experimental metadata. Gigascience 2021; 10:giab060. [PMID: 34528664 PMCID: PMC8444265 DOI: 10.1093/gigascience/giab060] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Revised: 03/19/2021] [Accepted: 08/23/2021] [Indexed: 02/04/2023] Open
Abstract
BACKGROUND The Investigation/Study/Assay (ISA) Metadata Framework is an established and widely used set of open source community specifications and software tools for enabling discovery, exchange, and publication of metadata from experiments in the life sciences. The original ISA software suite provided a set of user-facing Java tools for creating and manipulating the information structured in ISA-Tab-a now widely used tabular format. To make the ISA framework more accessible to machines and enable programmatic manipulation of experiment metadata, the JSON serialization ISA-JSON was developed. RESULTS In this work, we present the ISA API, a Python library for the creation, editing, parsing, and validating of ISA-Tab and ISA-JSON formats by using a common data model engineered as Python object classes. We describe the ISA API feature set, early adopters, and its growing user community. CONCLUSIONS The ISA API provides users with rich programmatic metadata-handling functionality to support automation, a common interface, and an interoperable medium between the 2 ISA formats, as well as with other life science data formats required for depositing data in public databases.
Collapse
Affiliation(s)
- David Johnson
- Oxford e-Research Centre, Department of Engineering Science, University of Oxford, 7 Keble Road, Oxford, OX1 3QG, UK
- Department of Informatics and Media, Uppsala University, Box 513, 75120 Uppsala, Sweden
| | - Dominique Batista
- Oxford e-Research Centre, Department of Engineering Science, University of Oxford, 7 Keble Road, Oxford, OX1 3QG, UK
| | - Keeva Cochrane
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Robert P Davey
- Earlham Institute, Data infrastructure and algorithms, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Anthony Etuk
- Earlham Institute, Data infrastructure and algorithms, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Alejandra Gonzalez-Beltran
- Oxford e-Research Centre, Department of Engineering Science, University of Oxford, 7 Keble Road, Oxford, OX1 3QG, UK
- Science and Technology Facilities Council, Scientific Computing Department, Rutherford Appleton Laboratory, Harwell Campus, Didcot, OX11 0QX, UK
| | - Kenneth Haug
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
- Genome Research Limited, Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Saffron Walden, CB10 1RQ, UK
| | - Massimiliano Izzo
- Oxford e-Research Centre, Department of Engineering Science, University of Oxford, 7 Keble Road, Oxford, OX1 3QG, UK
| | - Martin Larralde
- Structural and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Thomas N Lawson
- School of Biosciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
| | - Alice Minotto
- Earlham Institute, Data infrastructure and algorithms, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Pablo Moreno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Venkata Chandrasekhar Nainala
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Claire O'Donovan
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Luca Pireddu
- Distributed Computing Group, CRS4: Center for Advanced Studies, Research & Development in Sardinia, Pula 09050, Italy
| | - Pierrick Roger
- CEA, LIST, Laboratory for Data Analysis and Systems’ Intelligence, MetaboHUB, Gif-Sur-Yvette F-91191, France
| | - Felix Shaw
- Earlham Institute, Data infrastructure and algorithms, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Christoph Steinbeck
- Cheminformatics and Computational Metabolomics, Institute for Analytical Chemistry, Lessingstr. 8, 07743 Jena, Germany
| | - Ralf J M Weber
- School of Biosciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
- Phenome Centre Birmingham, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
| | - Susanna-Assunta Sansone
- Oxford e-Research Centre, Department of Engineering Science, University of Oxford, 7 Keble Road, Oxford, OX1 3QG, UK
| | - Philippe Rocca-Serra
- Oxford e-Research Centre, Department of Engineering Science, University of Oxford, 7 Keble Road, Oxford, OX1 3QG, UK
| |
Collapse
|
230
|
K Ca channel blockers increase effectiveness of the EGF receptor TK inhibitor erlotinib in non-small cell lung cancer cells (A549). Sci Rep 2021; 11:18330. [PMID: 34526525 PMCID: PMC8443639 DOI: 10.1038/s41598-021-97406-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 08/18/2021] [Indexed: 11/08/2022] Open
Abstract
Non-small cell lung cancer (NSCLC) has a poor prognosis with a 5 year survival rate of only ~ 10%. Important driver mutations underlying NSCLC affect the epidermal growth factor receptor (EGFR) causing the constitutive activation of its tyrosine kinase domain. There are efficient EGFR tyrosine kinase inhibitors (TKIs), but patients develop inevitably a resistance against these drugs. On the other hand, KCa3.1 channels contribute to NSCLC progression so that elevated KCa3.1 expression is a strong predictor of poor NSCLC patient prognosis. The present study tests whether blocking KCa3.1 channels increases the sensitivity of NSCLC cells towards the EGFR TKI erlotinib and overcomes drug resistance. mRNA expression of KCa3.1 channels in erlotinib-sensitive and -resistant NSCLC cells was analysed in datasets from Gene expression omnibus (GEO) and ArrayExpress. We assessed proliferation and migration of NSCLC cells. These (live cell-imaging) experiments were complemented by patch clamp experiments and Western blot analyses. We identified three out of four datasets comparing erlotinib-sensitive and -resistant NSCLC cells which revealed an altered expression of KCa3.1 mRNA in erlotinib-resistant NSCLC cells. Therefore, we evaluated the combined effect of erlotinib and the KCa3.1 channel inhibition with sencapoc. Erlotinib elicits a dose-dependent inhibition of migration and proliferation of NSCLC cells. The simultaneous application of the KCa3.1 channel blocker senicapoc increases the sensitivity towards a low dose of erlotinib (300 nmol/L) which by itself has no effect on migration and proliferation. Partial erlotinib resistance can be overcome by KCa3.1 channel blockade. The sensitivity towards erlotinib as well as the potentiating effect of KCa3.1 blockade is further increased by mimicking hypoxia. Our results suggest that KCa3.1 channel blockade may constitute a therapeutic concept for treating NSCLC and overcome EGFR TKI resistance. We propose that this is due to complementary mechanisms of action of both blockers.
Collapse
|
231
|
Hollander M, Do T, Will T, Helms V. Detecting Rewiring Events in Protein-Protein Interaction Networks Based on Transcriptomic Data. FRONTIERS IN BIOINFORMATICS 2021; 1:724297. [PMID: 36303788 PMCID: PMC9581068 DOI: 10.3389/fbinf.2021.724297] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2021] [Accepted: 08/23/2021] [Indexed: 12/25/2022] Open
Abstract
Proteins rarely carry out their cellular functions in isolation. Instead, eukaryotic proteins engage in about six interactions with other proteins on average. The aggregated protein interactome of an organism forms a “hairy ball”-type protein-protein interaction (PPI) network. Yet, in a typical human cell, only about half of all proteins are expressed at a particular time. Hence, it has become common practice to prune the full PPI network to the subset of expressed proteins. If RNAseq data is available, one can further resolve the specific protein isoforms present in a cell or tissue. Here, we review various approaches, software tools and webservices that enable users to construct context-specific or tissue-specific PPI networks and how these are rewired between two cellular conditions. We illustrate their different functionalities on the example of the interactions involving the human TNR6 protein. In an outlook, we describe how PPI networks may be integrated with epigenetic data or with data on the activity of splicing factors.
Collapse
|
232
|
Padakanti S, Tiong KL, Chen YB, Yeang CH. Genotypes of informative loci from 1000 Genomes data allude evolution and mixing of human populations. Sci Rep 2021; 11:17741. [PMID: 34493766 PMCID: PMC8423758 DOI: 10.1038/s41598-021-97129-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2020] [Accepted: 08/13/2021] [Indexed: 11/11/2022] Open
Abstract
Principal Component Analysis (PCA) projects high-dimensional genotype data into a few components that discern populations. Ancestry Informative Markers (AIMs) are a small subset of SNPs capable of distinguishing populations. We integrate these two approaches by proposing an algorithm to identify necessary informative loci whose removal from the data deteriorates the PCA structure. Unlike classical AIMs, necessary informative loci densely cover the genome, hence can illuminate the evolution and mixing history of populations. We conduct a comprehensive analysis to the genotype data of the 1000 Genomes Project using necessary informative loci. Projections along the top seven principal components demarcate populations at distinct geographic levels. Millions of necessary informative loci along each PC are identified. Population identities along each PC are approximately determined by weighted sums of minor (or major) alleles over the informative loci. Variations of allele frequencies are aligned with the history and direction of population evolution. The population distribution of projections along the top three PCs is recapitulated by a simple demographic model based on several waves of founder population separation and mixing. Informative loci possess locational concentration in the genome and functional enrichment. Genes at two hot spots encompassing dense PC 7 informative loci exhibit differential expressions among European populations. The mosaic of local ancestry in the genome of a mixed descendant from multiple populations can be inferred from partial PCA projections of informative loci. Finally, informative loci derived from the 1000 Genomes data well predict the projections of an independent genotype data of South Asians. These results demonstrate the utility and relevance of informative loci to investigate human evolution.
Collapse
Affiliation(s)
- Sridevi Padakanti
- Institute of Statistical Science, Academia Sinica, 128 Academia Road, Section 2, Taipei, Taiwan
| | - Khong-Loon Tiong
- Institute of Statistical Science, Academia Sinica, 128 Academia Road, Section 2, Taipei, Taiwan
| | - Yan-Bin Chen
- Institute of Statistical Science, Academia Sinica, 128 Academia Road, Section 2, Taipei, Taiwan
| | - Chen-Hsiang Yeang
- Institute of Statistical Science, Academia Sinica, 128 Academia Road, Section 2, Taipei, Taiwan.
| |
Collapse
|
233
|
Huang G, Zhang H, Qu Y, Huang K, Gong X, Wei J, Du H. ARMT: An automatic RNA-seq data mining tool based on comprehensive and integrative analysis in cancer research. Comput Struct Biotechnol J 2021; 19:4426-4434. [PMID: 34471489 PMCID: PMC8379379 DOI: 10.1016/j.csbj.2021.08.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Revised: 07/19/2021] [Accepted: 08/06/2021] [Indexed: 11/02/2022] Open
Abstract
The comprehensive and integrative analysis of RNA-seq data, in different molecular layers from diverse samples, holds promise to address the full-scale complexity of biological systems. Recent advances in gene set variant analysis (GSVA) are providing exciting opportunities for revealing the specific biological processes of cancer samples. However, it is still urgently needed to develop a tool, which combines GSVA and different molecular characteristic analysis, as well as prognostic characteristics of cancer patients to reveal the biological processes of disease comprehensively. Here, we develop ARMT, an automatic tool for RNA-Seq data analysis. ARMT is an efficient and integrative tool with user-friendly interface to analyze related molecular characters of single gene and gene set comprehensively based on transcriptome and genomic data, which builds the bridge for deeper information between genes and pathways, to further accelerate scientific findings. ARMT can be installed easily from https://github.com/Dulab2020/ARMT.
Collapse
Affiliation(s)
- Guanda Huang
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou 510006, China
| | - Haibo Zhang
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou 510006, China
| | - Yimo Qu
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou 510006, China
| | - Kaitang Huang
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou 510006, China
| | - Xiaocheng Gong
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou 510006, China
| | - Jinfen Wei
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou 510006, China
| | - Hongli Du
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou 510006, China
| |
Collapse
|
234
|
Nwosu IO, Piccolo SR. A systematic review of datasets that can help elucidate relationships among gene expression, race, and immunohistochemistry-defined subtypes in breast cancer. Cancer Biol Ther 2021; 22:417-429. [PMID: 34412551 DOI: 10.1080/15384047.2021.1953902] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Scholarly requirements have led to a massive increase of transcriptomic data in the public domain, with millions of samples available for secondary research. We identified gene-expression datasets representing 10,214 breast-cancer patients in public databases. We focused on datasets that included patient metadata on race and/or immunohistochemistry (IHC) profiling of the ER, PR, and HER-2 proteins. This review provides a summary of these datasets and describes findings from 32 research articles associated with the datasets. These studies have helped to elucidate relationships between IHC, race, and/or treatment options, as well as relationships between IHC status and the breast-cancer intrinsic subtypes. We have also identified broad themes across the analysis methodologies used in these studies, including breast cancer subtyping, deriving predictive biomarkers, identifying differentially expressed genes, and optimizing data processing. Finally, we discuss limitations of prior work and recommend future directions for reusing these datasets in secondary analyses.
Collapse
Affiliation(s)
| | - Stephen R Piccolo
- Department of Biology, Brigham Young University, Provo, Utah, United States
| |
Collapse
|
235
|
Lee F, Shah I, Soong YT, Xing J, Ng IC, Tasnim F, Yu H. Reproducibility and robustness of high-throughput S1500+ transcriptomics on primary rat hepatocytes for chemical-induced hepatotoxicity assessment. Curr Res Toxicol 2021; 2:282-295. [PMID: 34467220 PMCID: PMC8384775 DOI: 10.1016/j.crtox.2021.07.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Revised: 07/15/2021] [Accepted: 07/31/2021] [Indexed: 11/06/2022] Open
Abstract
TempO-Seq assays of rat hepatocytes in collagen sandwich are highly reproducible. Gene expression analysis shows S1500+ is representative of the whole transcriptome. Connectivity mapping shows consistency between TempO-Seq and Affymetrix data. Gene set enrichment shows consistency between S1500+ and the whole transcriptome. Gene set enrichment using hallmark gene sets informs hepatotoxicity.
Cell-based in vitro models coupled with high-throughput transcriptomics (HTTr) are increasingly utilized as alternative methods to animal-based toxicity testing. Here, using a panel of 14 chemicals with different risks of human drug-induced liver injury (DILI) and two dosing concentrations, we evaluated an HTTr platform comprised of collagen sandwich primary rat hepatocyte culture and the TempO-Seq surrogate S1500+ (ST) assay. First, the HTTr platform was found to exhibit high reproducibility between technical and biological replicates (r greater than 0.85). Connectivity mapping analysis further demonstrated a high level of inter-platform reproducibility between TempO-Seq data and Affymetrix GeneChip data from the Open TG-GATES project. Second, the TempO-Seq ST assay was shown to be a robust surrogate to the whole transcriptome (WT) assay in capturing chemical-induced changes in gene expression, as evident from correlation analysis, PCA and unsupervised hierarchical clustering. Gene set enrichment analysis (GSEA) using the Hallmark gene set collection also demonstrated consistency in enrichment scores between ST and WT assays. Lastly, unsupervised hierarchical clustering of hallmark enrichment scores broadly divided the samples into hepatotoxic, intermediate, and non-hepatotoxic groups. Xenobiotic metabolism, bile acid metabolism, apoptosis, p53 pathway, and coagulation were found to be the key hallmarks driving the clustering. Taken together, our results established the reproducibility and performance of collagen sandwich culture in combination with TempO-Seq S1500+ assay, and demonstrated the utility of GSEA using the hallmark gene set collection to identify potential hepatotoxicants for further validation.
Collapse
Affiliation(s)
- Fan Lee
- Innovations in Food & Chemical Safety Program (IFCS), Institute of Bioengineering and Bioimaging (IBB), Agency for Science Technology and Research, Singapore
| | - Imran Shah
- Center for Computational Toxicology & Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC, United States
| | - Yun Ting Soong
- Innovations in Food & Chemical Safety Program (IFCS), Institute of Bioengineering and Bioimaging (IBB), Agency for Science Technology and Research, Singapore
| | - Jiangwa Xing
- Innovations in Food & Chemical Safety Program (IFCS), Institute of Bioengineering and Bioimaging (IBB), Agency for Science Technology and Research, Singapore
| | - Inn Chuan Ng
- Department of Physiology and Mechanobiology Institute, National University of Singapore, Singapore
| | - Farah Tasnim
- Innovations in Food & Chemical Safety Program (IFCS), Institute of Bioengineering and Bioimaging (IBB), Agency for Science Technology and Research, Singapore
| | - Hanry Yu
- Innovations in Food & Chemical Safety Program (IFCS), Institute of Bioengineering and Bioimaging (IBB), Agency for Science Technology and Research, Singapore.,Department of Physiology and Mechanobiology Institute, National University of Singapore, Singapore.,Critical Analytics for Manufacturing Personalized-Medicine, Singapore-MIT Alliance for Research and Technology, Singapore
| |
Collapse
|
236
|
Kerimov N, Hayhurst JD, Peikova K, Manning JR, Walter P, Kolberg L, Samoviča M, Sakthivel MP, Kuzmin I, Trevanion SJ, Burdett T, Jupp S, Parkinson H, Papatheodorou I, Yates AD, Zerbino DR, Alasoo K. A compendium of uniformly processed human gene expression and splicing quantitative trait loci. Nat Genet 2021; 53:1290-1299. [PMID: 34493866 PMCID: PMC8423625 DOI: 10.1038/s41588-021-00924-w] [Citation(s) in RCA: 152] [Impact Index Per Article: 50.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Accepted: 07/26/2021] [Indexed: 12/15/2022]
Abstract
Many gene expression quantitative trait locus (eQTL) studies have published their summary statistics, which can be used to gain insight into complex human traits by downstream analyses, such as fine mapping and co-localization. However, technical differences between these datasets are a barrier to their widespread use. Consequently, target genes for most genome-wide association study (GWAS) signals have still not been identified. In the present study, we present the eQTL Catalogue ( https://www.ebi.ac.uk/eqtl ), a resource of quality-controlled, uniformly re-computed gene expression and splicing QTLs from 21 studies. We find that, for matching cell types and tissues, the eQTL effect sizes are highly reproducible between studies. Although most QTLs were shared between most bulk tissues, we identified a greater diversity of cell-type-specific QTLs from purified cell types, a subset of which also manifested as new disease co-localizations. Our summary statistics are freely available to enable the systematic interpretation of human GWAS associations across many cell types and tissues.
Collapse
Affiliation(s)
- Nurlan Kerimov
- Institute of Computer Science, University of Tartu, Tartu, Estonia
- Open Targets, Wellcome Genome Campus, Cambridge, UK
| | - James D Hayhurst
- Open Targets, Wellcome Genome Campus, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
| | - Kateryna Peikova
- Institute of Computer Science, University of Tartu, Tartu, Estonia
| | - Jonathan R Manning
- Open Targets, Wellcome Genome Campus, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
| | - Peter Walter
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
| | - Liis Kolberg
- Institute of Computer Science, University of Tartu, Tartu, Estonia
| | - Marija Samoviča
- Institute of Computer Science, University of Tartu, Tartu, Estonia
| | - Manoj Pandian Sakthivel
- Open Targets, Wellcome Genome Campus, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
| | - Ivan Kuzmin
- Institute of Computer Science, University of Tartu, Tartu, Estonia
| | - Stephen J Trevanion
- Open Targets, Wellcome Genome Campus, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
| | - Tony Burdett
- Open Targets, Wellcome Genome Campus, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
| | - Simon Jupp
- Open Targets, Wellcome Genome Campus, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
| | - Helen Parkinson
- Open Targets, Wellcome Genome Campus, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
| | - Irene Papatheodorou
- Open Targets, Wellcome Genome Campus, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
| | - Andrew D Yates
- Open Targets, Wellcome Genome Campus, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
| | - Daniel R Zerbino
- Open Targets, Wellcome Genome Campus, Cambridge, UK.
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK.
| | - Kaur Alasoo
- Institute of Computer Science, University of Tartu, Tartu, Estonia.
- Open Targets, Wellcome Genome Campus, Cambridge, UK.
| |
Collapse
|
237
|
França TT, Al-Sbiei A, Bashir G, Mohamed YA, Salgado RC, Barreiros LA, Maria da Silva Napoleão S, Weber CW, Fernandes Severo Ferreira J, Aranda CS, Prando C, de Barros Dorna MB, Jurisica I, Fernandez-Cabezudo MJ, Ochs HD, Condino-Neto A, Al-Ramadi BK, Cabral-Marques O. CD40L modulates transcriptional signatures of neutrophils in the bone marrow associated with development and trafficking. JCI Insight 2021; 6:e148652. [PMID: 34255742 PMCID: PMC8410015 DOI: 10.1172/jci.insight.148652] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Neutrophils are produced in the BM in a process called granulopoiesis, in which progenitor cells sequentially develop into mature neutrophils. During the developmental process, which is finely regulated by distinct transcription factors, neutrophils acquire the ability to exit the BM, properly distribute throughout the body, and migrate to infection sites. Previous studies have demonstrated that CD40 ligand (CD40L) influences hematopoiesis and granulopoiesis. Here, we investigate the effect of CD40L on neutrophil development and trafficking by performing functional and transcriptome analyses. We found that CD40L signaling plays an essential role in the early stages of neutrophil generation and development in the BM. Moreover, CD40L modulates transcriptional signatures, indicating that this molecule enables neutrophils to traffic throughout the body and to migrate in response to inflammatory signals. Thus, our study provides insights into the complex relationships between CD40L signaling and granulopoiesis, and it suggests a potentially novel and nonredundant role of CD40L signaling in neutrophil development and function.
Collapse
Affiliation(s)
- Tábata Takahashi França
- Department of Immunology, Institute of Biomedical Science, University of São Paulo, São Paulo, São Paulo, Brazil
| | - Ashraf Al-Sbiei
- Department of Medical Microbiology and Immunology, College of Medicine and Health Sciences, United Arab Emirates (UAE) University, Al Ain, Abu Dhabi, United Arab Emirates
| | - Ghada Bashir
- Department of Medical Microbiology and Immunology, College of Medicine and Health Sciences, United Arab Emirates (UAE) University, Al Ain, Abu Dhabi, United Arab Emirates
| | - Yassir Awad Mohamed
- Department of Medical Microbiology and Immunology, College of Medicine and Health Sciences, United Arab Emirates (UAE) University, Al Ain, Abu Dhabi, United Arab Emirates
| | - Ranieri Coelho Salgado
- Department of Immunology, Institute of Biomedical Science, University of São Paulo, São Paulo, São Paulo, Brazil
| | - Lucila Akune Barreiros
- Department of Immunology, Institute of Biomedical Science, University of São Paulo, São Paulo, São Paulo, Brazil
| | | | - Cristina Worm Weber
- Pediatric Allergy & Immunology Clinic, Caxias do Sul, Rio Grande do Sul, Brazil
| | | | - Carolina Sanchez Aranda
- Division of Allergy, Immunology, and Rheumatology, Department of Pediatrics, Federal University of São Paulo, São Paulo, São Paulo, Brazil
| | - Carolina Prando
- Faculdades Pequeno Príncipe, Pelé Pequeno Principe Research Intitute, Curitiba, Paraná, Brazil.,Hospital Pequeno Príncipe, Curitiba, Paraná, Brazil
| | - Mayra B de Barros Dorna
- Division of Allergy and Immunology, Department of Pediatrics, Children's Institute, Hospital das Clínicas, São Paulo, São Paulo, Brazil
| | - Igor Jurisica
- Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute, University Health Network, Krembil Research Institute, University Health Network, Departments of Medical Biophysics and Computer Science, University of Toronto, Toronto, Ontaro, Canada.,Institute of Neuroimmunology, Slovak Academy of Sciences, Bratislava, Slovakia
| | - Maria J Fernandez-Cabezudo
- Department of Biochemistry and Molecular Biology, College of Medicine and Health Sciences, UAE University, Al Ain, Abu Dhabi, United Arab Emirates
| | - Hans D Ochs
- Department of Pediatrics, University of Washington School of Medicine, and Seattle Children's Research Institute, Seattle, Washington, USA
| | - Antonio Condino-Neto
- Department of Immunology, Institute of Biomedical Science, University of São Paulo, São Paulo, São Paulo, Brazil
| | - Basel K Al-Ramadi
- Department of Medical Microbiology and Immunology, College of Medicine and Health Sciences, United Arab Emirates (UAE) University, Al Ain, Abu Dhabi, United Arab Emirates.,Zayed Center for Health Sciences, UAE University, Al Ain, Abu Dhabi, United Arab Emirates
| | - Otavio Cabral-Marques
- Department of Immunology, Institute of Biomedical Science, University of São Paulo, São Paulo, São Paulo, Brazil.,Department of Clinical and Toxicological Analyses, School of Pharmaceutical Sciences, University of São Paulo, São Paulo, São Paulo, Brazil.,Network of Immunity in Infection, Malignancy, and Autoimmunity (NIIMA), Universal Scientific Education and Research Network (USERN), São Paulo, São Paulo, Brazil
| |
Collapse
|
238
|
Chong LC, Gandhi G, Lee JM, Yeo WWY, Choi SB. Drug Discovery of Spinal Muscular Atrophy (SMA) from the Computational Perspective: A Comprehensive Review. Int J Mol Sci 2021; 22:8962. [PMID: 34445667 PMCID: PMC8396480 DOI: 10.3390/ijms22168962] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Accepted: 01/27/2021] [Indexed: 01/02/2023] Open
Abstract
Spinal muscular atrophy (SMA), one of the leading inherited causes of child mortality, is a rare neuromuscular disease arising from loss-of-function mutations of the survival motor neuron 1 (SMN1) gene, which encodes the SMN protein. When lacking the SMN protein in neurons, patients suffer from muscle weakness and atrophy, and in the severe cases, respiratory failure and death. Several therapeutic approaches show promise with human testing and three medications have been approved by the U.S. Food and Drug Administration (FDA) to date. Despite the shown promise of these approved therapies, there are some crucial limitations, one of the most important being the cost. The FDA-approved drugs are high-priced and are shortlisted among the most expensive treatments in the world. The price is still far beyond affordable and may serve as a burden for patients. The blooming of the biomedical data and advancement of computational approaches have opened new possibilities for SMA therapeutic development. This article highlights the present status of computationally aided approaches, including in silico drug repurposing, network driven drug discovery as well as artificial intelligence (AI)-assisted drug discovery, and discusses the future prospects.
Collapse
Affiliation(s)
- Li Chuin Chong
- Centre for Bioinformatics, School of Data Sciences, Perdana University, Suite 9.2, 9th Floor, Wisma Chase Perdana, Changkat Semantan, Kuala Lumpur 50490, Malaysia; (L.C.C.); (J.M.L.)
| | - Gayatri Gandhi
- Perdana University Graduate School of Medicine, Perdana University, Suite 9.2, 9th Floor, Wisma Chase Perdana, Changkat Semantan, Kuala Lumpur 50490, Malaysia; (G.G.); (W.W.Y.Y.)
| | - Jian Ming Lee
- Centre for Bioinformatics, School of Data Sciences, Perdana University, Suite 9.2, 9th Floor, Wisma Chase Perdana, Changkat Semantan, Kuala Lumpur 50490, Malaysia; (L.C.C.); (J.M.L.)
| | - Wendy Wai Yeng Yeo
- Perdana University Graduate School of Medicine, Perdana University, Suite 9.2, 9th Floor, Wisma Chase Perdana, Changkat Semantan, Kuala Lumpur 50490, Malaysia; (G.G.); (W.W.Y.Y.)
| | - Sy-Bing Choi
- Centre for Bioinformatics, School of Data Sciences, Perdana University, Suite 9.2, 9th Floor, Wisma Chase Perdana, Changkat Semantan, Kuala Lumpur 50490, Malaysia; (L.C.C.); (J.M.L.)
| |
Collapse
|
239
|
Huang WJ, He WY, Li JD, He RQ, Huang ZG, Zhou XG, Li JJ, Zeng DT, Chen JT, Wu WZ, Dang YW, Chen G. Clinical significance and molecular mechanism of angiotensin-converting enzyme 2 in hepatocellular carcinoma tissues. Bioengineered 2021; 12:4054-4069. [PMID: 34369278 PMCID: PMC8806523 DOI: 10.1080/21655979.2021.1952791] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
During the pandemic of the coronavirus disease 2019, there exist quite a few studies on angiotensin-converting enzyme 2 (ACE2) and SARS-CoV-2 infection, while little is known about ACE2 in hepatocellular carcinoma (HCC). The detailed mechanism among ACE2 and HCC still remains unclear, which needs to be further investigated. In the current study with a total of 6,926 samples, ACE2 expression was downregulated in HCC compared with non-HCC samples (standardized mean difference = −0.41). With the area under the curve of summary receiver operating characteristic = 0.82, ACE2 expression showed a better ability to differentiate HCC from non-HCC. The mRNA expression of ACE2 was related to the age, alpha-fetoprotein levels and cirrhosis of HCC patients, and it was identified as a protected factor for HCC patients via Kaplan–Meier survival, Cox regression analyses. The potential molecular mechanism of ACE2 may be relevant to catabolic and cell division. In all, decreasing ACE2 expression can be seen in HCC, and its protective role for HCC patients and underlying mechanisms were explored in the study.
Collapse
Affiliation(s)
- Wei-Jian Huang
- Department of Pathology, Redcross Hospital of Yulin, Yulin, P.R. China
| | - Wei-Ying He
- Department of Pathology, First Affiliated Hospital of Guangxi Medical University, Nanning, P.R. China
| | - Jian-Di Li
- Department of Pathology, First Affiliated Hospital of Guangxi Medical University, Nanning, P.R. China
| | - Rong-Quan He
- Department of Medical Oncology, First Affiliated Hospital of Guangxi Medical University, Nanning, P.R. China
| | - Zhi-Guang Huang
- Department of Pathology, First Affiliated Hospital of Guangxi Medical University, Nanning, P.R. China
| | - Xian-Guo Zhou
- Research Department, Guangxi Medical University Cancer Hospital, Nanning, P.R. China
| | - Jian-Jun Li
- Department of General Surgery, Second Affiliated Hospital of Guangxi Medical University, Nanning, P. R. China
| | - Da-Tong Zeng
- Department of Pathology, Redcross Hospital of Yulin, Yulin, P.R. China
| | - Ji-Tian Chen
- Department of Pathology, Lingshan People's Hospital, Qinzhou, P.R. China
| | - Wei-Zi Wu
- Department of Pathology, Lingshan People's Hospital, Qinzhou, P.R. China
| | - Yi-Wu Dang
- Department of Pathology, First Affiliated Hospital of Guangxi Medical University, Nanning, P.R. China
| | - Gang Chen
- Department of Pathology, First Affiliated Hospital of Guangxi Medical University, Nanning, P.R. China
| |
Collapse
|
240
|
Rohr M, Beardsley J, Nakkina SP, Zhu X, Aljabban J, Hadley D, Altomare D. A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression. Sci Data 2021; 8:214. [PMID: 34381057 PMCID: PMC8358057 DOI: 10.1038/s41597-021-00998-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Accepted: 07/09/2021] [Indexed: 12/21/2022] Open
Abstract
Transcriptional profiling of pre- and post-malignant colorectal cancer (CRC) lesions enable temporal monitoring of molecular events underlying neoplastic progression. However, the most widely used transcriptomic dataset for CRC, TCGA-COAD, is devoid of adenoma samples, which increases reliance on an assortment of disparate microarray studies and hinders consensus building. To address this, we developed a microarray meta-dataset comprising 231 healthy, 132 adenoma, and 342 CRC tissue samples from twelve independent studies. Utilizing a stringent analytic framework, select datasets were downloaded from the Gene Expression Omnibus, normalized by frozen robust multiarray averaging and subsequently merged. Batch effects were then identified and removed by empirical Bayes estimation (ComBat). Finally, the meta-dataset was filtered for low variant probes, enabling downstream differential expression as well as quantitative and functional validation through cross-platform correlation and enrichment analyses, respectively. Overall, our meta-dataset provides a robust tool for investigating colorectal adenoma formation and malignant transformation at the transcriptional level with a pipeline that is modular and readily adaptable for similar analyses in other cancer types.
Collapse
Affiliation(s)
- Michael Rohr
- Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida, Orlando, FL, USA
| | - Jordan Beardsley
- Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida, Orlando, FL, USA
| | - Sai Preethi Nakkina
- Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida, Orlando, FL, USA
| | - Xiang Zhu
- Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida, Orlando, FL, USA
| | - Jihad Aljabban
- Department of Medicine, University of Wisconsin Hospital and Clinics, Madison, WI, USA
| | - Dexter Hadley
- Department of Clinical Sciences, College of Medicine, University of Central Florida, Orlando, FL, USA
| | - Deborah Altomare
- Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida, Orlando, FL, USA.
| |
Collapse
|
241
|
Interpreting machine learning models to investigate circadian regulation and facilitate exploration of clock function. Proc Natl Acad Sci U S A 2021; 118:2103070118. [PMID: 34353905 PMCID: PMC8364196 DOI: 10.1073/pnas.2103070118] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
The circadian clock is an internal molecular 24-h timer that is critical to life on Earth. We describe a series of artificial intelligence (AI)– and machine learning (ML)–based approaches that enable more cost-effective analysis and insight into circadian regulation and function. Throughout the manuscript, we illuminate what is inside the ML “black box” via explanation or interpretation of predictive ML models. Using this interpretation of our models, we derive biological insights into why a prediction was made, alongside accurate predictions. Most innovatively, we use only DNA sequence features for accurate circadian gene expression prediction. Using explainable AI, we define possible, responsible regulatory elements as we make these predictions; this critically requires no prior knowledge of regulatory elements. The circadian clock is an important adaptation to life on Earth. Here, we use machine learning to predict complex, temporal, and circadian gene expression patterns in Arabidopsis. Most significantly, we classify circadian genes using DNA sequence features generated de novo from public, genomic resources, facilitating downstream application of our methods with no experimental work or prior knowledge needed. We use local model explanation that is transcript specific to rank DNA sequence features, providing a detailed profile of the potential circadian regulatory mechanisms for each transcript. Furthermore, we can discriminate the temporal phase of transcript expression using the local, explanation-derived, and ranked DNA sequence features, revealing hidden subclasses within the circadian class. Model interpretation/explanation provides the backbone of our methodological advances, giving insight into biological processes and experimental design. Next, we use model interpretation to optimize sampling strategies when we predict circadian transcripts using reduced numbers of transcriptomic timepoints. Finally, we predict the circadian time from a single, transcriptomic timepoint, deriving marker transcripts that are most impactful for accurate prediction; this could facilitate the identification of altered clock function from existing datasets.
Collapse
|
242
|
Cuesta-Astroz Y, Gischkow Rucatti G, Murgas L, SanMartín CD, Sanhueza M, Martin AJM. Filtering of Data-Driven Gene Regulatory Networks Using Drosophila melanogaster as a Case Study. Front Genet 2021; 12:649764. [PMID: 34394179 PMCID: PMC8355599 DOI: 10.3389/fgene.2021.649764] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 04/30/2021] [Indexed: 01/12/2023] Open
Abstract
Gene Regulatory Networks (GRNs) allow the study of regulation of gene expression of whole genomes. Among the most relevant advantages of using networks to depict this key process, there is the visual representation of large amounts of information and the application of graph theory to generate new knowledge. Nonetheless, despite the many uses of GRNs, it is still difficult and expensive to assign Transcription Factors (TFs) to the regulation of specific genes. ChIP-Seq allows the determination of TF Binding Sites (TFBSs) over whole genomes, but it is still an expensive technique that can only be applied one TF at a time and requires replicates to reduce its noise. Once TFBSs are determined, the assignment of each TF and its binding sites to the regulation of specific genes is not trivial, and it is often performed by carrying out site-specific experiments that are unfeasible to perform in all possible binding sites. Here, we addressed these relevant issues with a two-step methodology using Drosophila melanogaster as a case study. First, our protocol starts by gathering all transcription factor binding sites (TFBSs) determined with ChIP-Seq experiments available at ENCODE and FlyBase. Then each TFBS is used to assign TFs to the regulation of likely target genes based on the TFBS proximity to the transcription start site of all genes. In the final step, to try to select the most likely regulatory TF from those previously assigned to each gene, we employ GENIE3, a random forest-based method, and more than 9,000 RNA-seq experiments from D. melanogaster. Following, we employed known TF protein-protein interactions to estimate the feasibility of regulatory events in our filtered networks. Finally, we show how known interactions between co-regulatory TFs of each gene increase after the second step of our approach, and thus, the consistency of the TF-gene assignment. Also, we employed our methodology to create a network centered on the Drosophila melanogaster gene Hr96 to demonstrate the role of this transcription factor on mitochondrial gene regulation.
Collapse
Affiliation(s)
- Yesid Cuesta-Astroz
- Colombian Institute of Tropical Medicine, CES University, Medellin, Colombia
| | | | - Leandro Murgas
- Laboratorio de Biologia de Redes, Centro de Genómica y Bioinformática, Facultad de Ciencias, Universidad Mayor, Santiago, Chile.,Programa de Doctorado en Genómica Integrativa, Vicerrectoría de Investigación, Universidad Mayor, Santiago, Chile
| | - Carol D SanMartín
- Departamento de Neurología y Neurocirugía, Hospital Clínico Universidad de Chile, Santiago, Chile.,Centro de Investigacíon Clínica Avanzada (CICA), Hospital Clínico Universidad de Chile, Santiago, Chile
| | - Mario Sanhueza
- Centro de Biología Integrativa, Facultad de Ciencias, Universidad Mayor, Santiago, Chile.,Escuela de Biotecnología, Facultad de Ciencias, Universidad Mayor, Santiago, Chile
| | - Alberto J M Martin
- Laboratorio de Biologia de Redes, Centro de Genómica y Bioinformática, Facultad de Ciencias, Universidad Mayor, Santiago, Chile.,Escuela de Biotecnología, Facultad de Ciencias, Universidad Mayor, Santiago, Chile
| |
Collapse
|
243
|
Pfaff J, Reinwald H, Ayobahan SU, Alvincz J, Göckener B, Shomroni O, Salinas G, Düring RA, Schäfers C, Eilebrecht S. Toxicogenomic differentiation of functional responses to fipronil and imidacloprid in Daphnia magna. AQUATIC TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2021; 238:105927. [PMID: 34340001 DOI: 10.1016/j.aquatox.2021.105927] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 07/10/2021] [Accepted: 07/19/2021] [Indexed: 06/13/2023]
Abstract
Active substances of pesticides, biocides or pharmaceuticals can induce adverse side effects in the aquatic ecosystem, necessitating environmental hazard and risk assessment prior to substance registration. The freshwater crustacean Daphnia magna is a model organism for acute and chronic toxicity assessment representing aquatic invertebrates. However, standardized tests involving daphnia are restricted to the endpoints immobility and reproduction and thus provide only limited insights into the underlying modes-of-action. Here, we applied transcriptome profiling to a modified D. magna Acute Immobilization test to analyze and compare gene expression profiles induced by the GABA-gated chloride channel blocker fipronil and the nicotinic acetylcholine receptor (nAChR) agonist imidacloprid. Daphnids were expose to two low effect concentrations of each substance followed by RNA sequencing and functional classification of affected gene ontologies and pathways. For both insecticides, we observed a concentration-dependent increase in the number of differentially expressed genes, whose expression changes were highly significantly positively correlated when comparing both test concentrations. These gene expression fingerprints showed virtually no overlap between the test substances and they related well to previous data of diazepam and carbaryl, two substances targeting similar molecular key events. While, based on our results, fipronil predominantly interfered with molecular functions involved in ATPase-coupled transmembrane transport and transcription regulation, imidacloprid primarily affected oxidase and oxidoreductase activity. These findings provide evidence that systems biology approaches can be utilized to identify and differentiate modes-of-action of chemical stressors in D. magna as an invertebrate aquatic non-target organism. The mechanistic knowledge extracted from such data will in future contribute to the development of Adverse Outcome Pathways (AOPs) for read-across and prediction of population effects.
Collapse
Affiliation(s)
- Julia Pfaff
- Fraunhofer Attract Eco'n'OMICs, Fraunhofer Institute for Molecular Biology and Applied Ecology, Schmallenberg, Germany; Institute of Soil Science and Soil Conservation, Research Centre for BioSystems, Land Use and Nutrition (iFZ), Justus Liebig University Giessen, Giessen, Germany
| | - Hannes Reinwald
- Fraunhofer Attract Eco'n'OMICs, Fraunhofer Institute for Molecular Biology and Applied Ecology, Schmallenberg, Germany; Department Evolutionary Ecology and Environmental Toxicology, Faculty Biological Sciences, Goethe University Frankfurt, Frankfurt, Germany
| | - Steve U Ayobahan
- Fraunhofer Attract Eco'n'OMICs, Fraunhofer Institute for Molecular Biology and Applied Ecology, Schmallenberg, Germany
| | - Julia Alvincz
- Fraunhofer Attract Eco'n'OMICs, Fraunhofer Institute for Molecular Biology and Applied Ecology, Schmallenberg, Germany
| | - Bernd Göckener
- Department Environmental and Food Analysis, Fraunhofer Institute for Molecular Biology and Applied Ecology, Schmallenberg, Germany
| | - Orr Shomroni
- NGS-Services for Integrative Genomics, University of Göttingen, Göttingen, Germany
| | - Gabriela Salinas
- NGS-Services for Integrative Genomics, University of Göttingen, Göttingen, Germany
| | - Rolf-Alexander Düring
- Institute of Soil Science and Soil Conservation, Research Centre for BioSystems, Land Use and Nutrition (iFZ), Justus Liebig University Giessen, Giessen, Germany
| | - Christoph Schäfers
- Department of Ecotoxicology, Fraunhofer Institute for Molecular Biology and Applied Ecology, Schmallenberg, Germany
| | - Sebastian Eilebrecht
- Fraunhofer Attract Eco'n'OMICs, Fraunhofer Institute for Molecular Biology and Applied Ecology, Schmallenberg, Germany.
| |
Collapse
|
244
|
Angeloni M, Thievessen I, Engel FB, Magni P, Ferrazzi F. Functional genomics meta-analysis to identify gene set enrichment networks in cardiac hypertrophy. Biol Chem 2021; 402:953-972. [PMID: 33951759 DOI: 10.1515/hsz-2020-0378] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Accepted: 04/19/2021] [Indexed: 12/28/2022]
Abstract
In order to take advantage of the continuously increasing number of transcriptome studies, it is important to develop strategies that integrate multiple expression datasets addressing the same biological question to allow a robust analysis. Here, we propose a meta-analysis framework that integrates enriched pathways identified through the Gene Set Enrichment Analysis (GSEA) approach and calculates for each meta-pathway an empirical p-value. Validation of our approach on benchmark datasets showed comparable or even better performance than existing methods and an increase in robustness with increasing number of integrated datasets. We then applied the meta-analysis framework to 15 functional genomics datasets of physiological and pathological cardiac hypertrophy. Within these datasets we grouped expression sets measured at time points that represent the same hallmarks of heart tissue remodeling ('aggregated time points') and performed meta-analysis on the expression sets assigned to each aggregated time point. To facilitate biological interpretation, results were visualized as gene set enrichment networks. Here, our meta-analysis framework identified well-known biological mechanisms associated with pathological cardiac hypertrophy (e.g., cardiomyocyte apoptosis, cardiac contractile dysfunction, and alteration in energy metabolism). In addition, results highlighted novel, potentially cardioprotective mechanisms in physiological cardiac hypertrophy involving the down-regulation of immune cell response, which are worth further investigation.
Collapse
Affiliation(s)
- Miriam Angeloni
- Department of Nephropathology, Institute of Pathology, Friedrich-Alexander-Universität Erlangen-Nürnberg, Krankenhausstr. 8-10, D-91054 Erlangen, Germany
- Institute of Pathology, Friedrich-Alexander-Universität Erlangen-Nürnberg, Krankenhausstr. 8-10, D-91054 Erlangen, Germany
| | - Ingo Thievessen
- Biophysics Group, Department of Physics, Friedrich-Alexander-Universität Erlangen-Nürnberg, Henkestraße 91, D-91052 Erlangen, Germany
- Muscle Research Center Erlangen (MURCE), D-91052 Erlangen, Germany
| | - Felix B Engel
- Experimental Renal and Cardiovascular Research, Department of Nephropathology, Institute of Pathology, Friedrich-Alexander-Universität Erlangen-Nürnberg, Schwabachanlage 12, D-91054 Erlangen, Germany
- Muscle Research Center Erlangen (MURCE), D-91052 Erlangen, Germany
| | - Paolo Magni
- Department of Electrical, Computer and Biomedical Engineering, University of Pavia, via Ferrata 5, I-27100 Pavia, Italy
| | - Fulvia Ferrazzi
- Department of Nephropathology, Institute of Pathology, Friedrich-Alexander-Universität Erlangen-Nürnberg, Krankenhausstr. 8-10, D-91054 Erlangen, Germany
- Muscle Research Center Erlangen (MURCE), D-91052 Erlangen, Germany
| |
Collapse
|
245
|
Wu D, Miao J, Hu J, Li F, Gao D, Chen H, Feng Y, Shen Y, He A. PSMB7 Is a Key Gene Involved in the Development of Multiple Myeloma and Resistance to Bortezomib. Front Oncol 2021; 11:684232. [PMID: 34367968 PMCID: PMC8343178 DOI: 10.3389/fonc.2021.684232] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Accepted: 07/08/2021] [Indexed: 12/22/2022] Open
Abstract
Multiple myeloma (MM), the second most commonly diagnosed hematologic neoplasm, is the most significant clinical manifestation in a series of plasma cell (PC) dyscrasia. Monoclonal gammopathy of undetermined significance (MGUS) and smoldering MM (SMM), approximately 1% or 10% of which, respectively, can progress to MM per year, are the premalignant stages of MM. The overall survival (OS) of MM is significantly improved by the introduction of proteasome inhibitors (PIs), but almost all MM patients eventually relapse and resist anti-MM drugs. Therefore, it is crucial to explore the progression of MM and the mechanisms related to MM drug resistance. In this study, we used weighted gene co-expression network analysis (WGCNA) to analyze the gene expression of the dynamic process from normal plasma cells (NPC) to malignant profiling PC, and found that the abnormal gene expression was mainly concentrated in the proteasome. We also found that the expression of one of the proteasomal subunits PSMB7 was capable of distinguishing the different stages of PC dyscrasia and was the highest in ISS III. In the bortezomib (BTZ) treated NDMM patients, higher PSMB7 expression was associated with shorter survival time, and the expression of PSMB7 in the BTZ treatment group was significantly higher than in the thalidomide (Thai) treatment group. In summary, we found that PSMB7 is the key gene associated with MM disease progression and drug resistance.
Collapse
Affiliation(s)
- Dong Wu
- Department of Hematology, The Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
| | - Jiyu Miao
- Department of Hematology, The Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
| | - Jinsong Hu
- Department of Cell Biology and Genetics, Xi'an Jiaotong University Health Science Center, Xi'an, China
| | - Fangmei Li
- Department of Hematology, The Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
| | - Dandan Gao
- Department of Hematology, The Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
| | - Hongli Chen
- Department of Hematology, The Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
| | - Yuandong Feng
- Department of Hematology, The Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
| | - Ying Shen
- Department of Hematology, The Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
| | - Aili He
- Department of Hematology, The Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
| |
Collapse
|
246
|
Non-Tumor CCAAT/Enhancer-Binding Protein Delta Potentiates Tumor Cell Extravasation and Pancreatic Cancer Metastasis Formation. Biomolecules 2021; 11:biom11081079. [PMID: 34439745 PMCID: PMC8391339 DOI: 10.3390/biom11081079] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Revised: 07/16/2021] [Accepted: 07/19/2021] [Indexed: 12/16/2022] Open
Abstract
CCAAT/enhancer-binding protein delta (C/EBPδ) is a transcription factor involved in apoptosis and proliferation, which is downregulated in pancreatic ductal adenocarcinoma (PDAC) cells. Loss of nuclear C/EBPδ in PDAC cells is associated with decreased patient survival and pro-tumorigenic properties in vitro. Interestingly however, next to C/EBPδ expression in tumor cells, C/EBPδ is also expressed by cells constituting the tumor microenvironment and by cells comprising the organs and parenchyma. However, the functional relevance of systemic C/EBPδ in carcinogenesis remains elusive. Here, we consequently assessed the potential importance of C/EBPδ in somatic tissues by utilizing an orthotopic pancreatic cancer model. In doing so, we show that genetic ablation of C/EBPδ does not significantly affect primary tumor growth but has a strong impact on metastases; wildtype mice developed metastases at multiple sites, whilst this was not the case in C/EBPδ-/- mice. In line with reduced metastasis formation in C/EBPδ-/- mice, C/EBPδ-deficiency also limited tumor cell dissemination in a specific extravasation model. Tumor cell extravasation was dependent on the platelet-activating factor receptor (PAFR) as a PAFR antagonist inhibited tumor cell extravasation in wildtype mice but not in C/EBPδ-/- mice. Overall, we show that systemic C/EBPδ facilitates pancreatic cancer metastasis, and we suggest this is due to C/EBPδ-PAFR-dependent tumor cell extravasation.
Collapse
|
247
|
Mulder N, Zass L, Hamdi Y, Othman H, Panji S, Allali I, Fakim YJ. African Global Representation in Biomedical Sciences. Annu Rev Biomed Data Sci 2021; 4:57-81. [PMID: 34465182 DOI: 10.1146/annurev-biodatasci-102920-112550] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
African populations are diverse in their ethnicity, language, culture, and genetics. Although plagued by high disease burdens, until recently the continent has largely been excluded from biomedical studies. Along with limitations in research and clinical infrastructure, human capacity, and funding, this omission has resulted in an underrepresentation of African data and disadvantaged African scientists. This review interrogates the relative abundance of biomedical data from Africa, primarily in genomics and other omics. The visibility of African science through publications is also discussed. A challenge encountered in this review is the relative lack of annotation of data on their geographical or population origin, with African countries represented as a single group. In addition to the abovementioned limitations,the global representation of African data may also be attributed to the hesitation to deposit data in public repositories. Whatever the reason, the disparity should be addressed, as African data have enormous value for scientists in Africa and globally.
Collapse
Affiliation(s)
- Nicola Mulder
- Computational Biology Division, Department of Integrative Biomedical Sciences and Institute of Infectious Disease and Molecular Medicine, Faculty of Health Sciences, University of Cape Town, Cape Town 7925, South Africa; .,Wellcome Centre for Infectious Diseases Research in Africa (CIDRI-AFRICA), Faculty of Health Sciences, University of Cape Town, Cape Town 7925, South Africa
| | - Lyndon Zass
- Computational Biology Division, Department of Integrative Biomedical Sciences and Institute of Infectious Disease and Molecular Medicine, Faculty of Health Sciences, University of Cape Town, Cape Town 7925, South Africa;
| | - Yosr Hamdi
- Laboratory of Biomedical Genomics and Oncogenetics and Laboratory of Human and Experimental Pathology, Institut Pasteur de Tunis, University of Tunis El Manar, 1002 Tunis, Tunisia
| | - Houcemeddine Othman
- Sydney Brenner Institute for Molecular Bioscience, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg 2193, South Africa
| | - Sumir Panji
- Computational Biology Division, Department of Integrative Biomedical Sciences and Institute of Infectious Disease and Molecular Medicine, Faculty of Health Sciences, University of Cape Town, Cape Town 7925, South Africa;
| | - Imane Allali
- Laboratory of Human Pathologies Biology, Department of Biology, Faculty of Sciences, and Genomic Center of Human Pathologies, Faculty of Medicine and Pharmacy, Mohammed V University in Rabat, 1014 Rabat, Morocco
| | - Yasmina Jaufeerally Fakim
- Biotechnology Unit, Department of Agricultural and Food Science, Faculty of Agriculture, University of Mauritius, Réduit 80837, Mauritius
| |
Collapse
|
248
|
Day K, Dordevic AL, Truby H, Southey MC, Coort S, Murgia C. Transcriptomic changes in peripheral blood mononuclear cells with weight loss: systematic literature review and primary data synthesis. GENES AND NUTRITION 2021; 16:12. [PMID: 34281497 PMCID: PMC8287703 DOI: 10.1186/s12263-021-00692-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Accepted: 07/08/2021] [Indexed: 12/18/2022]
Abstract
Background and objectives Peripheral blood mononuclear cells (PBMCs) have shown promise as a tissue sensitive to subtle and possibly systemic transcriptomic changes, and as such may be useful in identifying responses to weight loss interventions. The primary aim was to comprehensively evaluate the transcriptomic changes that may occur during weight loss and to determine if there is a consistent response across intervention types in human populations of all ages. Methods Included studies were randomised control trials or cohort studies that administered an intervention primarily designed to decrease weight in any overweight or obese human population. A systematic search of the literature was conducted to obtain studies and gene expression databases were interrogated to locate corresponding transcriptomic datasets. Datasets were normalised using the ArrayAnalysis online tool and differential gene expression was determined using the limma package in R. Over-represented pathways were explored using the PathVisio software. Heatmaps and hierarchical clustering were utilised to visualise gene expression. Results Seven papers met the inclusion criteria, five of which had raw gene expression data available. Of these, three could be grouped into high responders (HR, ≥ 5% body weight loss) and low responders (LR). No genes were consistently differentially expressed between high and low responders across studies. Adolescents had the largest transcriptomic response to weight loss followed by adults who underwent bariatric surgery. Seven pathways were altered in two out of four studies following the intervention and the pathway ‘cytoplasmic ribosomal proteins’ (WikiPathways: WP477) was altered between HR and LR at baseline in the two datasets with both groups. Pathways related to ‘toll-like receptor signalling’ were altered in HR response to the weight loss intervention in two out of three datasets. Conclusions Transcriptomic changes in PBMCs do occur in response to weight change. Transparent and standardised data reporting is needed to realise the potential of transcriptomics for investigating phenotypic features. Registration number PROSPERO: CRD42019106582 Supplementary Information The online version contains supplementary material available at 10.1186/s12263-021-00692-6.
Collapse
Affiliation(s)
- Kaitlin Day
- Department of Nutrition, Dietetics and Food, Monash University, Level 1, 264 Ferntree Gully Road, Notting Hill, Victoria, 3168, Australia.
| | - Aimee L Dordevic
- Department of Nutrition, Dietetics and Food, Monash University, Level 1, 264 Ferntree Gully Road, Notting Hill, Victoria, 3168, Australia
| | - Helen Truby
- School of Human Movement and Nutrition Sciences, University of Queensland, Brisbane, Australia
| | - Melissa C Southey
- Department of Clinical Pathology, Melbourne Medical School, The University of Melbourne, Melbourne, Victoria, Australia.,Cancer Epidemiology Division, Cancer Council Victoria, Melbourne, Victoria, Australia.,Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, Victoria, Australia
| | - Susan Coort
- Department of Bioinformatics-BiGCaT, NUTRIM School of Nutrition and Translational Research in Metabolism, Maastricht University, Maastricht, The Netherlands
| | - Chiara Murgia
- School of Agriculture and Food, The University of Melbourne, Melbourne, Australia
| |
Collapse
|
249
|
Morgan S, Malatras A, Duguez S, Duddy W. Optimized Molecular Interaction Networks for the Study of Skeletal Muscle. J Neuromuscul Dis 2021; 8:S223-S239. [PMID: 34308911 DOI: 10.3233/jnd-210680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
BACKGROUND Molecular interaction networks (MINs) aim to capture the complex relationships between interacting molecules within a biological system. MINs can be constructed from existing knowledge of molecular functional associations, such as protein-protein binding interactions (PPI) or gene co-expression, and these different sources may be combined into a single MIN. A given MIN may be more or less optimal in its representation of the important functional relationships of molecules in a tissue. OBJECTIVE The aim of this study was to establish whether a combined MIN derived from different types of functional association could better capture muscle-relevant biology compared to its constituent single-source MINs. METHODS MINs were constructed from functional association databases for both protein-binding and gene co-expression. The networks were then compared based on the capture of muscle-relevant genes and gene ontology (GO) terms, tested in two different ways using established biological network clustering algorithms. The top performing MINs were combined to test whether an optimal MIN for skeletal muscle could be constructed. RESULTS The STRING PPI network was the best performing single-source MIN among those tested. Combining STRING with interactions from either the MyoMiner or CoXPRESSdb gene co-expression sources resulted in a combined network with improved performance relative to its constituent networks. CONCLUSION MINs constructed from multiple types of functional association can better represent the functional relationships of molecules in a given tissue. Such networks may be used to improve the analysis and interpretation of functional genomics data in the study of skeletal muscle and neuromuscular diseases. Networks and clusters described by this study, including the combinations of STRING with MyoMiner or with CoXPRESSdb, are available for download from https://www.sys-myo.com/myominer/download.php.
Collapse
Affiliation(s)
- Stephen Morgan
- Northern Ireland Centre for Stratified Medicine, Altnagelvin Hospital Campus, Ulster University, Londonderry, Northern Ireland, UK
| | - Apostolos Malatras
- Department of Biological Sciences, Molecular Medicine Research Center, University of Cyprus, University Avenue, Nicosia, Cyprus
| | - Stephanie Duguez
- Northern Ireland Centre for Stratified Medicine, Altnagelvin Hospital Campus, Ulster University, Londonderry, Northern Ireland, UK
| | - William Duddy
- Northern Ireland Centre for Stratified Medicine, Altnagelvin Hospital Campus, Ulster University, Londonderry, Northern Ireland, UK
| |
Collapse
|
250
|
Huang Q, Carrio-Cordo P, Gao B, Paloots R, Baudis M. The Progenetix oncogenomic resource in 2021. Database (Oxford) 2021; 2021:baab043. [PMID: 34272855 PMCID: PMC8285936 DOI: 10.1093/database/baab043] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 06/16/2021] [Accepted: 06/30/2021] [Indexed: 12/02/2022]
Abstract
In cancer, copy number aberrations (CNAs) represent a type of nearly ubiquitous and frequently extensive structural genome variations. To disentangle the molecular mechanisms underlying tumorigenesis as well as identify and characterize molecular subtypes, the comparative and meta-analysis of large genomic variant collections can be of immense importance. Over the last decades, cancer genomic profiling projects have resulted in a large amount of somatic genome variation profiles, however segregated in a multitude of individual studies and datasets. The Progenetix project, initiated in 2001, curates individual cancer CNA profiles and associated metadata from published oncogenomic studies and data repositories with the aim to empower integrative analyses spanning all different cancer biologies. During the last few years, the fields of genomics and cancer research have seen significant advancement in terms of molecular genetics technology, disease concepts, data standard harmonization as well as data availability, in an increasingly structured and systematic manner. For the Progenetix resource, continuous data integration, curation and maintenance have resulted in the most comprehensive representation of cancer genome CNA profiling data with 138 663 (including 115 357 tumor) copy number variation (CNV) profiles. In this article, we report a 4.5-fold increase in sample number since 2013, improvements in data quality, ontology representation with a CNV landscape summary over 51 distinctive National Cancer Institute Thesaurus cancer terms as well as updates in database schemas, and data access including new web front-end and programmatic data access. Database URL: progenetix.org.
Collapse
Affiliation(s)
- Qingyao Huang
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, Zurich 8057, Switzerland
- Swiss Institute of Bioinformatics, Winterthurerstrasse 190, Zurich 8057, Switzerland
| | - Paula Carrio-Cordo
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, Zurich 8057, Switzerland
- Swiss Institute of Bioinformatics, Winterthurerstrasse 190, Zurich 8057, Switzerland
| | - Bo Gao
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, Zurich 8057, Switzerland
- Swiss Institute of Bioinformatics, Winterthurerstrasse 190, Zurich 8057, Switzerland
| | - Rahel Paloots
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, Zurich 8057, Switzerland
- Swiss Institute of Bioinformatics, Winterthurerstrasse 190, Zurich 8057, Switzerland
| | - Michael Baudis
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, Zurich 8057, Switzerland
- Swiss Institute of Bioinformatics, Winterthurerstrasse 190, Zurich 8057, Switzerland
| |
Collapse
|