1
|
A critical review of datasets and computational suites for improving cancer theranostics and biomarker discovery. MEDICAL ONCOLOGY (NORTHWOOD, LONDON, ENGLAND) 2022; 39:206. [PMID: 36175717 DOI: 10.1007/s12032-022-01815-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 07/29/2022] [Indexed: 10/14/2022]
Abstract
Cancer has been constantly evolving and so is the research pertaining to cancer diagnosis and therapeutic regimens. Early detection and specific therapeutics are the key features of modern cancer therapy. These requirements can only be fulfilled with the integration of diverse high-throughput technologies. Integration of advanced omics methodology involving genomics, epigenomics, proteomics, and transcriptomics provide a clear understanding of multi-faceted cancer. In the past few years, tremendous high-throughput data have been generated from cancer genomics and epigenomic analyses, which on further methodological analyses can yield better biological insights. The major epigenetic alterations reported in cancer are DNA methylation levels, histone post-translational modifications, and epi-miRNA regulating the oncogenes and tumor suppressor genes. While the genomic analyses like gene expression profiling, cancer gene prediction, and genome annotation divulge the genetic alterations in oncogenes or tumor suppressor genes. Also, systems biology approach using biological networks is being extensively used to identify novel cancer biomarkers. Therefore, integration of these multi-dimensional approaches will help to identify potential diagnostic and therapeutic biomarkers. Here, we reviewed the critical databases and tools dedicated to various epigenomic and genomic alterations in cancer. The review further focuses on the multi-omics resources available for further validating the identified cancer biomarkers. We also highlighted the tools for cancer biomarker discovery using a systems biology approach utilizing genomic and epigenomic data. Biomarkers predicted using such integrative approaches are shown to be more clinically relevant.
Collapse
|
2
|
Liu C, Cai D, Zeng W, Huang Y. Inferring Differential Networks by Integrating Gene Expression Data With Additional Knowledge. Front Genet 2021; 12:760155. [PMID: 34858477 PMCID: PMC8632038 DOI: 10.3389/fgene.2021.760155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 10/13/2021] [Indexed: 11/23/2022] Open
Abstract
Evidences increasingly indicate the involvement of gene network rewiring in disease development and cell differentiation. With the accumulation of high-throughput gene expression data, it is now possible to infer the changes of gene networks between two different states or cell types via computational approaches. However, the distribution diversity of multi-platform gene expression data and the sparseness and high noise rate of single-cell RNA sequencing (scRNA-seq) data raise new challenges for existing differential network estimation methods. Furthermore, most existing methods are purely rely on gene expression data, and ignore the additional information provided by various existing biological knowledge. In this study, to address these challenges, we propose a general framework, named weighted joint sparse penalized D-trace model (WJSDM), to infer differential gene networks by integrating multi-platform gene expression data and multiple prior biological knowledge. Firstly, a non-paranormal graphical model is employed to tackle gene expression data with missing values. Then we propose a weighted group bridge penalty to integrate multi-platform gene expression data and various existing biological knowledge. Experiment results on synthetic data demonstrate the effectiveness of our method in inferring differential networks. We apply our method to the gene expression data of ovarian cancer and the scRNA-seq data of circulating tumor cells of prostate cancer, and infer the differential network associated with platinum resistance of ovarian cancer and anti-androgen resistance of prostate cancer. By analyzing the estimated differential networks, we find some important biological insights about the mechanisms underlying platinum resistance of ovarian cancer and anti-androgen resistance of prostate cancer.
Collapse
Affiliation(s)
- Chen Liu
- Department of Chemotherapy, The First Affiliated Hospital of Fujian Medical University, Fuzhou, China
| | - Dehan Cai
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong, China
| | - WuCha Zeng
- Department of Chemotherapy, The First Affiliated Hospital of Fujian Medical University, Fuzhou, China
| | - Yun Huang
- Department of Geriatric Medicine, The First Affiliated Hospital of Fujian Medical University, Fuzhou, China
| |
Collapse
|
3
|
Ou-Yang L, Cai D, Zhang XF, Yan H. WDNE: an integrative graphical model for inferring differential networks from multi-platform gene expression data with missing values. Brief Bioinform 2021; 22:6272792. [PMID: 33975339 DOI: 10.1093/bib/bbab086] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2021] [Revised: 02/14/2021] [Accepted: 02/23/2021] [Indexed: 11/14/2022] Open
Abstract
The mechanisms controlling biological process, such as the development of disease or cell differentiation, can be investigated by examining changes in the networks of gene dependencies between states in the process. High-throughput experimental methods, like microarray and RNA sequencing, have been widely used to gather gene expression data, which paves the way to infer gene dependencies based on computational methods. However, most differential network analysis methods are designed to deal with fully observed data, but missing values, such as the dropout events in single-cell RNA-sequencing data, are frequent. New methods are needed to take account of these missing values. Moreover, since the changes of gene dependencies may be driven by certain perturbed genes, considering the changes in gene expression levels may promote the identification of gene network rewiring. In this study, a novel weighted differential network estimation (WDNE) model is proposed to handle multi-platform gene expression data with missing values and take account of changes in gene expression levels. Simulation studies demonstrate that WDNE outperforms state-of-the-art differential network estimation methods. When applied WDNE to infer differential gene networks associated with drug resistance in ovarian tumors, cell differentiation and breast tumor heterogeneity, the hub genes in the estimated differential gene networks can provide important insights into the underlying mechanisms. Furthermore, a Matlab toolbox, differential network analysis toolbox, was developed to implement the WDNE model and visualize the estimated differential networks.
Collapse
Affiliation(s)
- Le Ou-Yang
- Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen Key Laboratory of Media Security, and Guangdong Laboratory of Artificial Intelligence and Digital Economy(SZ), College of Electronics and Information Engineering, Shenzhen University, Shenzhen, 518060, China
| | - Dehan Cai
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong, 999077, China
| | - Xiao-Fei Zhang
- School of Mathematics and Statistics & Hubei Key Laboratory of Mathematical Sciences, Central China Normal University, Wuhan, 430079, China
| | - Hong Yan
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong, 999077, China
| |
Collapse
|
4
|
Zhao J, Song X, Xu T, Yang Q, Liu J, Jiang B, Wu J. Identification of Potential Prognostic Competing Triplets in High-Grade Serous Ovarian Cancer. Front Genet 2021; 11:607722. [PMID: 33519912 PMCID: PMC7839966 DOI: 10.3389/fgene.2020.607722] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 11/19/2020] [Indexed: 12/14/2022] Open
Abstract
Increasing lncRNA-associated competing triplets were found to play important roles in cancers. With the accumulation of high-throughput sequencing data in public databases, the size of available tumor samples is becoming larger and larger, which introduces new challenges to identify competing triplets. Here, we developed a novel method, called LncMiM, to detect the lncRNA–miRNA–mRNA competing triplets in ovarian cancer with tumor samples from the TCGA database. In LncMiM, non-linear correlation analysis is used to cover the problem of weak correlations between miRNA–target pairs, which is mainly due to the difference in the magnitude of the expression level. In addition, besides the miRNA, the impact of lncRNA and mRNA on the interactions in triplets is also considered to improve the identification sensitivity of LncMiM without reducing its accuracy. By using LncMiM, a total of 847 lncRNA-associated competing triplets were found. All the competing triplets form a miRNA–lncRNA pair centered regulatory network, in which ZFAS1, SNHG29, GAS5, AC112491.1, and AC099850.4 are the top five lncRNAs with most connections. The results of biological process and KEGG pathway enrichment analysis indicates that the competing triplets are mainly associated with cell division, cell proliferation, cell cycle, oocyte meiosis, oxidative phosphorylation, ribosome, and p53 signaling pathway. Through survival analysis, 107 potential prognostic biomarkers are found in the competing triplets, including FGD5-AS1, HCP5, HMGN4, TACC3, and so on. LncMiM is available at https://github.com/xiaofengsong/LncMiM.
Collapse
Affiliation(s)
- Jian Zhao
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, China
| | - Xiaofeng Song
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, China
| | - Tianyi Xu
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, China
| | - Qichang Yang
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, China
| | - Jingjing Liu
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, China
| | - Bin Jiang
- College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, China
| | - Jing Wu
- School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China
| |
Collapse
|
5
|
Bhyan SB, Wee Y, Liu Y, Cummins S, Zhao M. Integrative analysis of common genes and driver mutations implicated in hormone stimulation for four cancers in women. PeerJ 2019; 7:e6872. [PMID: 31205821 PMCID: PMC6556371 DOI: 10.7717/peerj.6872] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2019] [Accepted: 03/28/2019] [Indexed: 12/11/2022] Open
Abstract
Cancer is one of the leading cause of death of women worldwide, and breast, ovarian, endometrial and cervical cancers contribute significantly to this every year. Developing early genetic-based diagnostic tools may be an effective approach to increase the chances of survival and provide more treatment opportunities. However, the current cancer genetic studies are mainly conducted independently and, hence lack of common driver genes involved in cancers in women. To explore the potential common molecular mechanism, we integrated four comprehensive literature-based databases to explore the shared implicated genetic effects. Using a total of 460 endometrial, 2,068 ovarian, 2,308 breast and 537 cervical cancer-implicated genes, we identified 52 genes which are common in all four types of cancers in women. Furthermore, we defined their potential functional role in endogenous hormonal regulation pathways within the context of four cancers in women. For example, these genes are strongly associated with hormonal stimulation, which may facilitate rapid diagnosis and treatment management decision making. Additional mutational analyses on combined the cancer genome atlas datasets consisting of 5,919 gynaecological and breast tumor samples were conducted to identify the frequently mutated genes across cancer types. For those common implicated genes for hormonal stimulants, we found that three quarter of 5,919 samples had genomic alteration with the highest frequency in MYC (22%), followed by NDRG1 (19%), ERBB2 (14%), PTEN (13%), PTGS2 (13%) and CDH1 (11%). We also identified 38 hormone related genes, eight of which are associated with the ovulation cycle. Further systems biology approach of the shared genes identified 20 novel genes, of which 12 were involved in the hormone regulation in these four cancers in women. Identification of common driver genes for hormone stimulation provided an unique angle of involving the potential of the hormone stimulants-related genes for cancer diagnosis and prognosis.
Collapse
Affiliation(s)
- Salma Begum Bhyan
- Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Sunshine Coast, QLD, Australia
| | - YongKiat Wee
- Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Sunshine Coast, QLD, Australia
| | - Yining Liu
- The School of Public Health, Institute for Chemical Carcinogenesis, Guangzhou Medical University, Guangzhou, China
| | - Scott Cummins
- Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Sunshine Coast, QLD, Australia
| | - Min Zhao
- Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Sunshine Coast, QLD, Australia
| |
Collapse
|
6
|
Uddin A, Paul N, Chakraborty S. The codon usage pattern of genes involved in ovarian cancer. Ann N Y Acad Sci 2019; 1440:67-78. [PMID: 30843242 DOI: 10.1111/nyas.14019] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 01/04/2019] [Accepted: 01/14/2019] [Indexed: 12/20/2022]
Abstract
In this study, we analyzed the compositional dynamics and codon usage pattern of genes involved in ovarian cancer (OC) using a computational method. Mutations in specific genes are associated with OC, and some genes are risk factors for progression of OC, but no work has been reported yet on the codon usage pattern of genes involved in OC. Nucleotide composition analysis of OC-related genes suggested that the overall GC content was higher than AT content; that is, the genes were GC rich. The improved effective number of codons indicated that the overall extent of codon usage bias of genes involved in OC was low. The codons AGC, CTG, ATC, ACC, GTG, and GCC were overrepresented, while the codons TCG, TTA, CTA, CCG, CAA, CGT, ATA, ACG, GTA, GTT, GCG, and GGT were underrepresented in the genes. Correspondence analysis suggested that the codon usage pattern was different in different genes. A highly significant correlation was observed between GC12 and GC3 (r = 0.587, P < 0.01) of genes, suggesting that directional mutation affected the three codon positions. Our report on the codon usage pattern of genes involved in OC includes a new perspective for elucidating the mechanisms of biased usage of synonymous codons, as well as providing useful clues for molecular genetic engineering.
Collapse
Affiliation(s)
- Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Assam, India
| | - Nirmal Paul
- Department of Biotechnology, Assam University, Assam, India
| | | |
Collapse
|
7
|
Xu T, Ou-Yang L, Hu X, Zhang XF. Identifying Gene Network Rewiring by Integrating Gene Expression and Gene Network Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:2079-2085. [PMID: 29994068 DOI: 10.1109/tcbb.2018.2809603] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Exploring the rewiring pattern of gene regulatory networks between different pathological states is an important task in bioinformatics. Although a number of computational approaches have been developed to infer differential networks from high-throughput data, most of them only focus on gene expression data. The valuable static gene regulatory network data accumulated in recent biomedical researches are neglected. In this study, we propose a new Gaussian graphical model-based method to infer differential networks by integrating gene expression and static gene regulatory network data. We first evaluate the empirical performance of our method by comparing with the state-of-the-art methods using simulation data. We also apply our method to The Cancer Genome Atlas data to identify gene network rewiring between ovarian cancers with different platinum responses, and rewiring between breast cancers of luminal A subtype and basal-like subtype. Hub genes in the estimated differential networks rediscover known genes associated with platinum resistance in ovarian cancer and signatures of the breast cancer intrinsic subtypes.
Collapse
|
8
|
Zhao M, Liu Y, Qu H. Expression of epithelial-mesenchymal transition-related genes increases with copy number in multiple cancer types. Oncotarget 2017; 7:24688-99. [PMID: 27029057 PMCID: PMC5029734 DOI: 10.18632/oncotarget.8371] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2015] [Accepted: 03/04/2016] [Indexed: 01/10/2023] Open
Abstract
Epithelial-mesenchymal transition (EMT) is a cellular process through which epithelial cells transform into mesenchymal cells. EMT-implicated genes initiate and promote cancer metastasis because mesenchymal cells have greater invasive and migration capacities than epithelial cells. In this pan-cancer analysis, we explored the relationship between gene expression changes and copy number variations (CNVs) for EMT-implicated genes. Based on curated 377 EMT-implicated genes from the literature, we identified 212 EMT-implicated genes associated with more frequent copy number gains (CNGs) than copy number losses (CNLs) using data from The Cancer Genome Atlas (TCGA). Then by correlating these CNV data with TCGA gene expression data, we identified 71 EMT-implicated genes with concordant CNGs and gene up-regulation in 20 or more tumor samples. Of those, 14 exhibited such concordance in over 110 tumor samples. These 14 genes were predominantly apoptosis regulators, which may implies that apoptosis is critical during EMT. Moreover, the 71 genes with concordant CNG and up-regulation were largely involved in cellular functions such as phosphorylation cascade signaling. This is the first observation of concordance between CNG and up-regulation of specific genes in hundreds of samples, which may indicate that somatic CNGs activate gene expression by increasing the gene dosage.
Collapse
Affiliation(s)
- Min Zhao
- School of Engineering, Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Maroochydore, Queensland, 4558, Australia
| | - Yining Liu
- School of Engineering, Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Maroochydore, Queensland, 4558, Australia
| | - Hong Qu
- Center for Bioinformatics, State Key Laboratory of Protein and Plant Gene Research, College of Life Sciences, Peking University, Beijing, 100871, P.R. China
| |
Collapse
|
9
|
Copy number alteration of neuropeptides and receptors in multiple cancers. Sci Rep 2017; 7:4598. [PMID: 28676692 PMCID: PMC5496884 DOI: 10.1038/s41598-017-04832-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2016] [Accepted: 05/22/2017] [Indexed: 11/29/2022] Open
Abstract
Neuropeptides are peptide hormones used as chemical signals by the neuroendocrine system to communicate between cells. Recently, neuropeptides have been recognized for their ability to act as potent cellular growth factors on many cell types, including cancer cells. However, the molecular mechanism for how this occurs is unknown. To clarify the relationship between neuropeptides and cancer, we manually curated a total of 127 human neuropeptide genes by integrating information from the literature, homologous sequences, and database searches. Using human ligand-receptor interaction data, we first identified an interactome of 226 interaction pairs between 93 neuropeptides and 133 G-protein coupled receptors. We further identified four neuropeptide-receptor functional modules with ten or more genes, all of which were highly mutated in multiple cancers. We have identified a number of neuropeptide signaling systems with both oncogenic and tumour-suppressing roles for cancer progression, such as the insulin-like growth factors. By focusing on the neuroendocrine prostate cancer mutational data, we found prevalent amplification of neuropeptide and receptors in about 72% of samples. In summary, we report the first observation of abundant copy number variations on neuropeptides and receptors, which will be valuable for the design of peptide-based cancer prognosis, diagnosis and treatment.
Collapse
|
10
|
Sirt1 Inhibits Oxidative Stress in Vascular Endothelial Cells. OXIDATIVE MEDICINE AND CELLULAR LONGEVITY 2017; 2017:7543973. [PMID: 28546854 PMCID: PMC5435972 DOI: 10.1155/2017/7543973] [Citation(s) in RCA: 176] [Impact Index Per Article: 25.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/18/2017] [Revised: 03/15/2017] [Accepted: 03/22/2017] [Indexed: 12/15/2022]
Abstract
The vascular endothelium is a layer of cells lining the inner surface of vessels, serving as a barrier that mediates microenvironment homeostasis. Deterioration of either the structure or function of endothelial cells (ECs) results in a variety of cardiovascular diseases. Previous studies have shown that reactive oxygen species (ROS) is a key factor that contributes to the impairment of ECs and the subsequent endothelial dysfunction. The longevity regulator Sirt1 is a NAD+-dependent deacetylase that has a potential antioxidative stress activity in vascular ECs. The mechanisms underlying the protective effects involve Sirt1/FOXOs, Sirt1/NF-κB, Sirt1/NOX, Sirt1/SOD, and Sirt1/eNOs pathways. In this review, we summarize the most recent reports in this field to recapitulate the potent mechanisms involving the protective role of Sirt1 in oxidative stress and to highlight the beneficial effects of Sirt1 on cardiovascular functions.
Collapse
|
11
|
Baur B, Bozdag S. ProcessDriver: A computational pipeline to identify copy number drivers and associated disrupted biological processes in cancer. Genomics 2017; 109:233-240. [PMID: 28438487 DOI: 10.1016/j.ygeno.2017.04.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2017] [Revised: 04/19/2017] [Accepted: 04/20/2017] [Indexed: 12/12/2022]
Abstract
Copy number amplifications and deletions that are recurrent in cancer samples harbor genes that confer a fitness advantage to cancer tumor proliferation and survival. One important challenge in computational biology is to separate the causal (i.e., driver) genes from passenger genes in large, aberrated regions. Many previous studies focus on the genes within the aberration (i.e., cis genes), but do not utilize the genes that are outside of the aberrated region and dysregulated as a result of the aberration (i.e., trans genes). We propose a computational pipeline, called ProcessDriver, that prioritizes candidate drivers by relating cis genes to dysregulated trans genes and biological processes. ProcessDriver is based on the assumption that a driver cis gene should be closely associated with the dysregulated trans genes and biological processes, as opposed to previous studies that assume a driver cis gene should be the most correlated gene to the copy number of an aberrated region. We applied our method on breast, bladder and ovarian cancer data from the Cancer Genome Atlas database. Our results included previously known driver genes and cancer genes, as well as potentially novel driver genes. Additionally, many genes in the final set of drivers were linked to new tumor events after initial treatment using survival analysis. Our results highlight the importance of selecting driver genes based on their widespread downstream effects in trans.
Collapse
Affiliation(s)
- Brittany Baur
- Department of Mathematics, Statistics and Computer Science, Marquette University, Milwaukee, WI, USA
| | - Serdar Bozdag
- Department of Mathematics, Statistics and Computer Science, Marquette University, Milwaukee, WI, USA.
| |
Collapse
|
12
|
Zhang XF, Ou-Yang L, Yan H. Incorporating prior information into differential network analysis using non-paranormal graphical models. Bioinformatics 2017; 33:2436-2445. [DOI: 10.1093/bioinformatics/btx208] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2016] [Accepted: 04/05/2017] [Indexed: 02/02/2023] Open
Affiliation(s)
- Xiao-Fei Zhang
- Department of Statistics, School of Mathematics and Statistics & Hubei Key Laboratory of Mathematical Sciences, Central China Normal University, Wuhan, China
- Department of Electronic Engineering, City University of Hong Kong, Hong Kong, China
| | - Le Ou-Yang
- Department of Electronic Engineering, College of Information Engineering, Shenzhen University, Shenzhen, China
| | - Hong Yan
- Department of Electronic Engineering, City University of Hong Kong, Hong Kong, China
| |
Collapse
|
13
|
Zhao M, Rotgans B, Wang T, Cummins SF. REGene: a literature-based knowledgebase of animal regeneration that bridge tissue regeneration and cancer. Sci Rep 2016; 6:23167. [PMID: 26975833 PMCID: PMC4791596 DOI: 10.1038/srep23167] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2015] [Accepted: 02/18/2016] [Indexed: 12/13/2022] Open
Abstract
Regeneration is a common phenomenon across multiple animal phyla. Regeneration-related genes (REGs) are critical for fundamental cellular processes such as proliferation and differentiation. Identification of REGs and elucidating their functions may help to further develop effective treatment strategies in regenerative medicine. So far, REGs have been largely identified by small-scale experimental studies and a comprehensive characterization of the diverse biological processes regulated by REGs is lacking. Therefore, there is an ever-growing need to integrate REGs at the genomics, epigenetics, and transcriptome level to provide a reference list of REGs for regeneration and regenerative medicine research. Towards achieving this, we developed the first literature-based database called REGene (REgeneration Gene database). In the current release, REGene contains 948 human (929 protein-coding and 19 non-coding genes) and 8445 homologous genes curated from gene ontology and extensive literature examination. Additionally, the REGene database provides detailed annotations for each REG, including: gene expression, methylation sites, upstream transcription factors, and protein-protein interactions. An analysis of the collected REGs reveals strong links to a variety of cancers in terms of genetic mutation, protein domains, and cellular pathways. We have prepared a web interface to share these regeneration genes, supported by refined browsing and searching functions at http://REGene.bioinfo-minzhao.org/.
Collapse
Affiliation(s)
- Min Zhao
- School of Engineering, Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Maroochydore DC, Queensland, 4558, Australia
| | - Bronwyn Rotgans
- School of Engineering, Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Maroochydore DC, Queensland, 4558, Australia
| | - Tianfang Wang
- School of Engineering, Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Maroochydore DC, Queensland, 4558, Australia
| | - S F Cummins
- School of Engineering, Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Maroochydore DC, Queensland, 4558, Australia
| |
Collapse
|