1
|
Fujisawa K, Shimo M, Taguchi YH, Ikematsu S, Miyata R. PCA-based unsupervised feature extraction for gene expression analysis of COVID-19 patients. Sci Rep 2021; 11:17351. [PMID: 34456333 PMCID: PMC8403676 DOI: 10.1038/s41598-021-95698-w] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2021] [Accepted: 07/23/2021] [Indexed: 01/08/2023] Open
Abstract
Coronavirus disease 2019 (COVID-19) is raging worldwide. This potentially fatal infectious disease is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). However, the complete mechanism of COVID-19 is not well understood. Therefore, we analyzed gene expression profiles of COVID-19 patients to identify disease-related genes through an innovative machine learning method that enables a data-driven strategy for gene selection from a data set with a small number of samples and many candidates. Principal-component-analysis-based unsupervised feature extraction (PCAUFE) was applied to the RNA expression profiles of 16 COVID-19 patients and 18 healthy control subjects. The results identified 123 genes as critical for COVID-19 progression from 60,683 candidate probes, including immune-related genes. The 123 genes were enriched in binding sites for transcription factors NFKB1 and RELA, which are involved in various biological phenomena such as immune response and cell survival: the primary mediator of canonical nuclear factor-kappa B (NF-κB) activity is the heterodimer RelA-p50. The genes were also enriched in histone modification H3K36me3, and they largely overlapped the target genes of NFKB1 and RELA. We found that the overlapping genes were downregulated in COVID-19 patients. These results suggest that canonical NF-κB activity was suppressed by H3K36me3 in COVID-19 patient blood.
Collapse
Affiliation(s)
- Kota Fujisawa
- School of Life Science and Technology, Tokyo Institute of Technology, Tokyo, 152-8550, Japan.
| | - Mamoru Shimo
- Graduate School of Engineering and Science, University of the Ryukyus, Okinawa, 903-0213, Japan
| | - Y-H Taguchi
- Department of Physics, Chuo University, Tokyo, 112-8551, Japan
| | - Shinya Ikematsu
- Department of Bioresources Engineering, National Institute of Technology, OkinawaCollege, Okinawa, 905-2192, Japan
| | - Ryota Miyata
- Faculty of Engineering, University of the Ryukyus, Okinawa, 903-0213, Japan.
| |
Collapse
|
2
|
Zhang FL, Kong L, Zhao AH, Ge W, Yan ZH, Li L, De Felici M, Shen W. Inflammatory cytokines as key players of apoptosis induced by environmental estrogens in the ovary. ENVIRONMENTAL RESEARCH 2021; 198:111225. [PMID: 33971129 DOI: 10.1016/j.envres.2021.111225] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Revised: 01/02/2021] [Accepted: 04/22/2021] [Indexed: 06/12/2023]
Abstract
Natural and synthetic environmental estrogens (EEs), interfering with the physiological functions of the body's estrogens, are widespread and are rising much concern for their possible deleterious effects on human and animal health, in particular on reproduction. In fact, increasing evidence indicate that EEs can be responsible for a variety of disfunctions of the reproductive system especially in females such as premature ovarian insufficiency (POI). Because of their great structural diversity, the modes of action of EEs are controversial. One important way through which EEs exert their effects on reproduction is the induction of apoptosis in the ovary. In general, EEs can exert pro-and anti-apoptotic effects by agonizing or antagonizing numerous estrogen-dependent signaling pathways. In the present work, results concerning apoptotic pathways and diseases induced by representative EEs (such as zearalenone, bisphenol A and di-2-ethylhexyl phthalate), in ovaries throughout development are presented into an integrated network. By reviewing and elaborating these studies, we propose inflammatory factors, centered on the production of tumor necrosis factor (TNF), as a major cause of the induction of apoptosis by EEs in the mammalian ovary. As a consequence, potential strategies to prevent such EE effect are suggested.
Collapse
Affiliation(s)
- Fa-Li Zhang
- College of Life Sciences, Key Laboratory of Animal Reproduction and Germplasm Enhancement in Universities of Shandong, Qingdao Agricultural University, Qingdao, 266109, China
| | - Li Kong
- College of Life Sciences, Key Laboratory of Animal Reproduction and Germplasm Enhancement in Universities of Shandong, Qingdao Agricultural University, Qingdao, 266109, China
| | - Ai-Hong Zhao
- Qingdao Academy of Agricultural Sciences, Qingdao, 266100, China
| | - Wei Ge
- College of Life Sciences, Key Laboratory of Animal Reproduction and Germplasm Enhancement in Universities of Shandong, Qingdao Agricultural University, Qingdao, 266109, China
| | - Zi-Hui Yan
- College of Life Sciences, Key Laboratory of Animal Reproduction and Germplasm Enhancement in Universities of Shandong, Qingdao Agricultural University, Qingdao, 266109, China
| | - Lan Li
- College of Life Sciences, Key Laboratory of Animal Reproduction and Germplasm Enhancement in Universities of Shandong, Qingdao Agricultural University, Qingdao, 266109, China
| | - Massimo De Felici
- Department of Biomedicine and Prevention, University of Rome Tor Vergata, Rome, 00133, Italy.
| | - Wei Shen
- College of Life Sciences, Key Laboratory of Animal Reproduction and Germplasm Enhancement in Universities of Shandong, Qingdao Agricultural University, Qingdao, 266109, China.
| |
Collapse
|
3
|
Taguchi YH, Turki T. Tensor Decomposition-Based Unsupervised Feature Extraction Applied to Single-Cell Gene Expression Analysis. Front Genet 2019; 10:864. [PMID: 31608111 PMCID: PMC6761323 DOI: 10.3389/fgene.2019.00864] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Accepted: 08/19/2019] [Indexed: 12/14/2022] Open
Abstract
Although single-cell RNA sequencing (scRNA-seq) technology is newly invented and a promising one, but because of lack of enough information that labels individual cells, it is hard to interpret the obtained gene expression of each cell. Because of insufficient information available, unsupervised clustering, for example, t-distributed stochastic neighbor embedding and uniform manifold approximation and projection, is usually employed to obtain low-dimensional embedding that can help to understand cell–cell relationship. One possible drawback of this strategy is that the outcome is highly dependent upon genes selected for the usage of clustering. In order to fulfill this requirement, there are many methods that performed unsupervised gene selection. In this study, a tensor decomposition (TD)-based unsupervised feature extraction (FE) was applied to the integration of two scRNA-seq expression profiles that measure human and mouse midbrain development. TD-based unsupervised FE could select not only coincident genes between human and mouse but also biologically reliable genes. Coincidence between two species as well as biological reliability of selected genes is increased compared with that using principal component analysis (PCA)-based FE applied to the same data set in the previous study. Since PCA-based unsupervised FE outperformed the other three popular unsupervised gene selection methods, highly variable genes, bimodal genes, and dpFeature, TD-based unsupervised FE can do so as well. In addition to this, 10 transcription factors (TFs) that might regulate selected genes and might contribute to midbrain development were identified. These 10 TFs, BHLHE40, EGR1, GABPA, IRF3, PPARG, REST, RFX5, STAT3, TCF7L2, and ZBTB33, were previously reported to be related to brain functions and diseases. TD-based unsupervised FE is a promising method to integrate two scRNA-seq profiles effectively.
Collapse
Affiliation(s)
- Y-H Taguchi
- Department of Physics, Chuo University, Tokyo, Japan
| | - Turki Turki
- Department of Computer Science, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|
4
|
Taguchi YH. Drug candidate identification based on gene expression of treated cells using tensor decomposition-based unsupervised feature extraction for large-scale data. BMC Bioinformatics 2019; 19:388. [PMID: 30717646 PMCID: PMC7394334 DOI: 10.1186/s12859-018-2395-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Accepted: 09/25/2018] [Indexed: 02/08/2023] Open
Abstract
Background Although in silico drug discovery is necessary for drug development, two major strategies, a structure-based and ligand-based approach, have not been completely successful. Currently, the third approach, inference of drug candidates from gene expression profiles obtained from the cells treated with the compounds under study requires the use of a training dataset. Here, the purpose was to develop a new approach that does not require any pre-existing knowledge about the drug–protein interactions, but these interactions can be inferred by means of an integrated approach using gene expression profiles obtained from the cells treated with the analysed compounds and the existing data describing gene–gene interactions. Results In the present study, using tensor decomposition-based unsupervised feature extraction, which represents an extension of the recently proposed principal-component analysis-based feature extraction, gene sets and compounds with a significant dose-dependent activity were screened without any training datasets. Next, after these results were combined with the data showing perturbations in single-gene expression profiles, genes targeted by the analysed compounds were inferred. The set of target genes thus identified was shown to significantly overlap with known target genes of the compounds under study. Conclusions The method is specifically designed for large-scale datasets (including hundreds of treatments with compounds), not for conventional small-scale datasets. The obtained results indicate that two compounds that have not been extensively studied, WZ-3105 and CGP-60474, represent promising drug candidates targeting multiple cancers, including melanoma, adenocarcinoma, liver carcinoma, and breast, colon, and prostate cancers, which were analysed in this in silico study. Electronic supplementary material The online version of this article (10.1186/s12859-018-2395-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Y-H Taguchi
- Department of Physics, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, Tokyo, 112-8551, Japan.
| |
Collapse
|
5
|
Taguchi YH. Tensor Decomposition-Based Unsupervised Feature Extraction Can Identify the Universal Nature of Sequence-Nonspecific Off-Target Regulation of mRNA Mediated by MicroRNA Transfection. Cells 2018; 7:cells7060054. [PMID: 29867052 PMCID: PMC6025034 DOI: 10.3390/cells7060054] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Revised: 05/28/2018] [Accepted: 05/31/2018] [Indexed: 12/14/2022] Open
Abstract
MicroRNA (miRNA) transfection is known to degrade target mRNAs and to decrease mRNA expression. In contrast to the notion that most of the gene expression alterations caused by miRNA transfection involve downregulation, they often involve both up- and downregulation; this phenomenon is thought to be, at least partially, mediated by sequence-nonspecific off-target effects. In this study, I used tensor decomposition-based unsupervised feature extraction to identify genes whose expression is likely to be altered by miRNA transfection. These gene sets turned out to largely overlap with one another regardless of the type of miRNA or cell lines used in the experiments. These gene sets also overlap with the gene set associated with altered expression induced by a Dicer knockout. This result suggests that the off-target effect is at least as important as the canonical function of miRNAs that suppress translation. The off-target effect is also suggested to consist of competition for the protein machinery between transfected miRNAs and miRNAs in the cell. Because the identified genes are enriched in various biological terms, these genes are likely to play critical roles in diverse biological processes.
Collapse
Affiliation(s)
- Y-H Taguchi
- Department of Physics, Chuo University, Tokyo 112-8551, Japan.
| |
Collapse
|
6
|
Taguchi YH. Tensor decomposition-based and principal-component-analysis-based unsupervised feature extraction applied to the gene expression and methylation profiles in the brains of social insects with multiple castes. BMC Bioinformatics 2018; 19:99. [PMID: 29745827 PMCID: PMC5998888 DOI: 10.1186/s12859-018-2068-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Background Even though coexistence of multiple phenotypes sharing the same genomic background is interesting, it remains incompletely understood. Epigenomic profiles may represent key factors, with unknown contributions to the development of multiple phenotypes, and social-insect castes are a good model for elucidation of the underlying mechanisms. Nonetheless, previous studies have failed to identify genes associated with aberrant gene expression and methylation profiles because of the lack of suitable methodology that can address this problem properly. Methods A recently proposed principal component analysis (PCA)-based and tensor decomposition (TD)-based unsupervised feature extraction (FE) can solve this problem because these two approaches can deal with gene expression and methylation profiles even when a small number of samples is available. Results PCA-based and TD-based unsupervised FE methods were applied to the analysis of gene expression and methylation profiles in the brains of two social insects, Polistes canadensis and Dinoponera quadriceps. Genes associated with differential expression and methylation between castes were identified, and analysis of enrichment of Gene Ontology terms confirmed reliability of the obtained sets of genes from the biological standpoint. Conclusions Biologically relevant genes, shown to be associated with significant differential gene expression and methylation between castes, were identified here for the first time. The identification of these genes may help understand the mechanisms underlying epigenetic control of development of multiple phenotypes under the same genomic conditions. Electronic supplementary material The online version of this article (10.1186/s12859-018-2068-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Y-H Taguchi
- Department of Physics, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, Tokyo, 112-8551, Japan.
| |
Collapse
|
7
|
Taguchi YH. Tensor decomposition-based unsupervised feature extraction identifies candidate genes that induce post-traumatic stress disorder-mediated heart diseases. BMC Med Genomics 2017; 10:67. [PMID: 29322921 PMCID: PMC5763504 DOI: 10.1186/s12920-017-0302-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
Background Although post-traumatic stress disorder (PTSD) is primarily a mental disorder, it can cause additional symptoms that do not seem to be directly related to the central nervous system, which PTSD is assumed to directly affect. PTSD-mediated heart diseases are some of such secondary disorders. In spite of the significant correlations between PTSD and heart diseases, spatial separation between the heart and brain (where PTSD is primarily active) prevents researchers from elucidating the mechanisms that bridge the two disorders. Our purpose was to identify genes linking PTSD and heart diseases. Methods In this study, gene expression profiles of various murine tissues observed under various types of stress or without stress were analyzed in an integrated manner using tensor decomposition (TD). Results Based upon the obtained features, ∼ 400 genes were identified as candidate genes that may mediate heart diseases associated with PTSD. Various gene enrichment analyses supported biological reliability of the identified genes. Ten genes encoding protein-, DNA-, or mRNA-interacting proteins—ILF2, ILF3, ESR1, ESR2, RAD21, HTT, ATF2, NR3C1, TP53, and TP63—were found to be likely to regulate expression of most of these ∼ 400 genes and therefore are candidate primary genes that cause PTSD-mediated heart diseases. Approximately 400 genes in the heart were also found to be strongly affected by various drugs whose known adverse effects are related to heart diseases and/or fear memory conditioning; these data support the reliability of our findings. Conclusions TD-based unsupervised feature extraction turned out to be a useful method for gene selection and successfully identified possible genes causing PTSD-mediated heart diseases. Electronic supplementary material The online version of this article (doi:10.1186/s12920-017-0302-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Y-H Taguchi
- Department of Physics, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, Tokyo, 112-8551, Japan.
| |
Collapse
|
8
|
Identification of candidate drugs using tensor-decomposition-based unsupervised feature extraction in integrated analysis of gene expression between diseases and DrugMatrix datasets. Sci Rep 2017; 7:13733. [PMID: 29062063 PMCID: PMC5653784 DOI: 10.1038/s41598-017-13003-0] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2017] [Accepted: 09/13/2017] [Indexed: 01/28/2023] Open
Abstract
Identifying drug target genes in gene expression profiles is not straightforward. Because a drug targets proteins and not mRNAs, the mRNA expression of drug target genes is not always altered. In addition, the interaction between a drug and protein can be context dependent; this means that simple drug incubation experiments on cell lines do not always reflect the real situation during active disease. In this paper, I applied tensor-decomposition-based unsupervised feature extraction to the integrated analysis using a mathematical product of gene expression in various diseases and gene expression in the DrugMatrix dataset, where comprehensive data on gene expression during various drug treatments of rats are reported. I found that this strategy, in a fully unsupervised manner, enables researchers to identify a combined set of genes and compounds that significantly overlap with gene and drug interactions identified in the past. As an example illustrating the usefulness of this strategy in drug discovery experiments, I considered cirrhosis, for which no effective drugs have ever been proposed. The present strategy identified two promising therapeutic-target genes, CYPOR and HNFA4; for their protein products, bezafibrate was identified as a promising candidate drug, supported by in silico docking analysis.
Collapse
|
9
|
Taguchi YH. Tensor decomposition-based unsupervised feature extraction applied to matrix products for multi-view data processing. PLoS One 2017; 12:e0183933. [PMID: 28841719 PMCID: PMC5571984 DOI: 10.1371/journal.pone.0183933] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2017] [Accepted: 08/04/2017] [Indexed: 01/17/2023] Open
Abstract
In the current era of big data, the amount of data available is continuously increasing. Both the number and types of samples, or features, are on the rise. The mixing of distinct features often makes interpretation more difficult. However, separate analysis of individual types requires subsequent integration. A tensor is a useful framework to deal with distinct types of features in an integrated manner without mixing them. On the other hand, tensor data is not easy to obtain since it requires the measurements of huge numbers of combinations of distinct features; if there are m kinds of features, each of which has N dimensions, the number of measurements needed are as many as Nm, which is often too large to measure. In this paper, I propose a new method where a tensor is generated from individual features without combinatorial measurements, and the generated tensor was decomposed back to matrices, by which unsupervised feature extraction was performed. In order to demonstrate the usefulness of the proposed strategy, it was applied to synthetic data, as well as three omics datasets. It outperformed other matrix-based methodologies.
Collapse
Affiliation(s)
- Y-h. Taguchi
- Department of Physics, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, Tokyo 112-8551, Japan
- * E-mail:
| |
Collapse
|
10
|
Torday JS. From cholesterol to consciousness. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2017; 132:52-56. [PMID: 28830682 DOI: 10.1016/j.pbiomolbio.2017.08.009] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2017] [Revised: 08/16/2017] [Accepted: 08/18/2017] [Indexed: 11/29/2022]
Abstract
The nature of consciousness has been debated for centuries. It can be understood as part and parcel of the natural progression of life from unicellular to multicellular, calcium fluxes mediating communication within and between cells. Consciousness is the vertical integration of calcium fluxes, mediated by the Target of Rapamycin gene integrated with the cytoskeleton. The premise of this paper is that there is a fundamental physiologic integration of the organism with the environment that constitutes consciousness.
Collapse
Affiliation(s)
- John S Torday
- Department of Pediatrics, Harbor-UCLA Medical Center, 1124 W.Carson Street, Torrance, CA 90502-2006, United States.
| |
Collapse
|
11
|
Beck D, Sadler-Riggleman I, Skinner MK. Generational comparisons (F1 versus F3) of vinclozolin induced epigenetic transgenerational inheritance of sperm differential DNA methylation regions (epimutations) using MeDIP-Seq. ENVIRONMENTAL EPIGENETICS 2017; 3:dvx016. [PMID: 29147574 PMCID: PMC5685552 DOI: 10.1093/eep/dvx016] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Environmentally induced epigenetic transgenerational inheritance of disease and phenotypic variation has been shown to involve DNA methylation alterations in the germline (e.g. sperm). These differential DNA methylation regions (DMRs) are termed epimutations and in part transmit the transgenerational phenotypes. The agricultural fungicide vinclozolin exposure of a gestating female rat has previously been shown to promote transgenerational disease and epimutations in F3 generation (great-grand-offspring) animals. The current study was designed to investigate the actions of direct fetal exposure on the F1 generation rat sperm DMRs compared to the F3 transgenerational sperm DMRs. A protocol involving methylated DNA immunoprecipitation (MeDIP) followed by next-generation sequencing (Seq) was used in the current study. Bioinformatics analysis of the MeDIP-Seq data was developed and several different variations in the bioinformatic analysis were evaluated. Observations indicate needs to be considered. Interestingly, the F1 generation DMRs were found to be fewer in number and for the most part distinct from the F3 generation epimutations. Observations suggest the direct exposure induced F1 generation sperm DMRs appear to promote in subsequent generations alterations in the germ cell developmental programming that leads to the distinct epimutations in the F3 generation. This may help explain the differences in disease and phenotypes between the direct exposure F1 generation and transgenerational F3 generation. Observations demonstrate a distinction between the direct exposure versus transgenerational epigenetic programming induced by environmental exposures and provide insights into the molecular mechanisms involved in the epigenetic transgenerational inheritance phenomenon.
Collapse
Affiliation(s)
- Daniel Beck
- Center for Reproductive Biology, School of Biological Sciences, Washington State University, Pullman, WA 99164-4236, USA
| | - Ingrid Sadler-Riggleman
- Center for Reproductive Biology, School of Biological Sciences, Washington State University, Pullman, WA 99164-4236, USA
| | - Michael K. Skinner
- Center for Reproductive Biology, School of Biological Sciences, Washington State University, Pullman, WA 99164-4236, USA
- Correspondence address. Center for Reproductive Biology, School of Biological Sciences, Washington State University, Pullman, WA 99164-4236, USA. Tel: +1-509-335-1524; Fax: +1-509-335-2176; E-mail:
| |
Collapse
|
12
|
Taguchi YH. Principal Components Analysis Based Unsupervised Feature Extraction Applied to Gene Expression Analysis of Blood from Dengue Haemorrhagic Fever Patients. Sci Rep 2017; 7:44016. [PMID: 28276456 PMCID: PMC5343617 DOI: 10.1038/srep44016] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2016] [Accepted: 02/02/2017] [Indexed: 12/12/2022] Open
Abstract
Dengue haemorrhagic fever (DHF) sometimes occurs after recovery from the disease caused by Dengue virus (DENV), and is often fatal. However, the mechanism of DHF has not been determined, possibly because no suitable methodologies are available to analyse this disease. Therefore, more innovative methods are required to analyse the gene expression profiles of DENV-infected patients. Principal components analysis (PCA)-based unsupervised feature extraction (FE) was applied to the gene expression profiles of DENV-infected patients, and an integrated analysis of two independent data sets identified 46 genes as critical for DHF progression. PCA using only these 46 genes rendered the two data sets highly consistent. The application of PCA to the 46 genes of an independent third data set successfully predicted the progression of DHF. A fourth in vitro data set confirmed the identification of the 46 genes. These 46 genes included interferon- and heme-biosynthesis-related genes. The former are enriched in binding sites for STAT1, STAT2, and IRF1, which are associated with DHF-promoting antibody-dependent enhancement, whereas the latter are considered to be related to the dysfunction of spliceosomes, which may mediate haemorrhage. These results are outcomes that other type of bioinformatic analysis could hardly achieve.
Collapse
Affiliation(s)
- Y-H Taguchi
- Department of Physics, Chuo University, Tokyo, 112-8551, Japan
| |
Collapse
|
13
|
Taguchi YH. Principal component analysis based unsupervised feature extraction applied to publicly available gene expression profiles provides new insights into the mechanisms of action of histone deacetylase inhibitors. ACTA ACUST UNITED AC 2016. [DOI: 10.1016/j.nepig.2016.10.001] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
14
|
Taguchi YH, Iwadate M, Umeyama H. SFRP1 is a possible candidate for epigenetic therapy in non-small cell lung cancer. BMC Med Genomics 2016; 9 Suppl 1:28. [PMID: 27534621 PMCID: PMC4989892 DOI: 10.1186/s12920-016-0196-3] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Background Non-small cell lung cancer (NSCLC) remains a lethal disease despite many proposed treatments. Recent studies have indicated that epigenetic therapy, which targets epigenetic effects, might be a new therapeutic methodology for NSCLC. However, it is not clear which objects (e.g., genes) this treatment specifically targets. Secreted frizzled-related proteins (SFRPs) are promising candidates for epigenetic therapy in many cancers, but there have been no reports of SFRPs targeted by epigenetic therapy for NSCLC. Methods This study performed a meta-analysis of reprogrammed NSCLC cell lines instead of the direct examination of epigenetic therapy treatment to identify epigenetic therapy targets. In addition, mRNA expression/promoter methylation profiles were processed by recently proposed principal component analysis based unsupervised feature extraction and categorical regression analysis based feature extraction. Results The Wnt/β-catenin signalling pathway was extensively enriched among 32 genes identified by feature extraction. Among the genes identified, SFRP1 was specifically indicated to target β-catenin, and thus might be targeted by epigenetic therapy in NSCLC cell lines. A histone deacetylase inhibitor might reactivate SFRP1 based upon the re-analysis of a public domain data set. Numerical computation validated the binding of SFRP1 to WNT1 to suppress Wnt signalling pathway activation in NSCLC. Conclusions The meta-analysis of reprogrammed NSCLC cell lines identified SFRP1 as a promising target of epigenetic therapy for NSCLC. Electronic supplementary material The online version of this article (doi:10.1186/s12920-016-0196-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Y-H Taguchi
- Department of Physics, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, 112-8551, Tokyo, Japan.
| | - Mitsuo Iwadate
- Department of Biological Science, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, 112-8551, Tokyo, Japan
| | - Hideaki Umeyama
- Department of Biological Science, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, 112-8551, Tokyo, Japan
| |
Collapse
|
15
|
Brieño-Enríquez MA, Larriba E, Del Mazo J. Endocrine disrupters, microRNAs, and primordial germ cells: a dangerous cocktail. Fertil Steril 2016; 106:871-9. [PMID: 27521771 DOI: 10.1016/j.fertnstert.2016.07.1100] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2016] [Revised: 07/15/2016] [Accepted: 07/18/2016] [Indexed: 12/23/2022]
Abstract
Endocrine-disrupting chemicals (EDCs) are environmental pollutants that may change the homeostasis of the endocrine system, altering the differentiation of germ cells with consequences for reproduction. In mammals, germ cell differentiation begins with primordial germ cells (PGCs) during embryogenesis. Primordial germ cell development and gametogenesis are genetically regulated processes, in which the posttranscriptional gene regulation could be mediated by small noncoding RNAs (sncRNAs) such as microRNAs (miRNAs). Here, we review the deleterious effects of exposure during fetal life to EDCs mediated by deregulation of ncRNAs, and specifically miRNAs on PGC differentiation. Moreover, the environmental stress induced by exposure to some EDCs during the embryonic window of development could trigger reproductive dysfunctions transgenerationally transmitted by epigenetic mechanisms with the involvement of miRNAs expressed in germ line cells.
Collapse
Affiliation(s)
| | - Eduardo Larriba
- Department of Cellular and Molecular Biology, Centro de Investigaciones Biológicas (CSIC), Madrid, Spain
| | - Jesús Del Mazo
- Department of Cellular and Molecular Biology, Centro de Investigaciones Biológicas (CSIC), Madrid, Spain.
| |
Collapse
|
16
|
Taguchi YH. Principal component analysis based unsupervised feature extraction applied to budding yeast temporally periodic gene expression. BioData Min 2016; 9:22. [PMID: 27366210 PMCID: PMC4928327 DOI: 10.1186/s13040-016-0101-9] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2015] [Accepted: 05/26/2016] [Indexed: 02/08/2023] Open
Abstract
Background The recently proposed principal component analysis (PCA) based unsupervised feature extraction (FE) has successfully been applied to various bioinformatics problems ranging from biomarker identification to the screening of disease causing genes using gene expression/epigenetic profiles. However, the conditions required for its successful use and the mechanisms involved in how it outperforms other supervised methods is unknown, because PCA based unsupervised FE has only been applied to challenging (i.e. not well known) problems. Results In this study, PCA based unsupervised FE was applied to an extensively studied organism, i.e., budding yeast. When applied to two gene expression profiles expected to be temporally periodic, yeast metabolic cycle (YMC) and yeast cell division cycle (YCDC), PCA based unsupervised FE outperformed simple but powerful conventional methods, with sinusoidal fitting with regards to several aspects: (i) feasible biological term enrichment without assuming periodicity for YMC; (ii) identification of periodic profiles whose period was half as long as the cell division cycle for YMC; and (iii) the identification of no more than 37 genes associated with the enrichment of biological terms related to cell division cycle for the integrated analysis of seven YCDC profiles, for which sinusoidal fittings failed. The explantation for differences between methods used and the necessary conditions required were determined by comparing PCA based unsupervised FE with fittings to various periodic (artificial, thus pre-defined) profiles. Furthermore, four popular unsupervised clustering algorithms applied to YMC were not as successful as PCA based unsupervised FE. Conclusions PCA based unsupervised FE is a useful and effective unsupervised method to investigate YMC and YCDC. This study identified why the unsupervised method without pre-judged criteria outperformed supervised methods requiring human defined criteria. Electronic supplementary material The online version of this article (doi:10.1186/s13040-016-0101-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Y-H Taguchi
- Department of Physics, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, Tokyo, 112-8551 Japan
| |
Collapse
|
17
|
Identification of More Feasible MicroRNA-mRNA Interactions within Multiple Cancers Using Principal Component Analysis Based Unsupervised Feature Extraction. Int J Mol Sci 2016; 17:ijms17050696. [PMID: 27171078 PMCID: PMC4881522 DOI: 10.3390/ijms17050696] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2016] [Revised: 04/13/2016] [Accepted: 04/27/2016] [Indexed: 12/28/2022] Open
Abstract
MicroRNA(miRNA)–mRNA interactions are important for understanding many biological processes, including development, differentiation and disease progression, but their identification is highly context-dependent. When computationally derived from sequence information alone, the identification should be verified by integrated analyses of mRNA and miRNA expression. The drawback of this strategy is the vast number of identified interactions, which prevents an experimental or detailed investigation of each pair. In this paper, we overcome this difficulty by the recently proposed principal component analysis (PCA)-based unsupervised feature extraction (FE), which reduces the number of identified miRNA–mRNA interactions that properly discriminate between patients and healthy controls without losing biological feasibility. The approach is applied to six cancers: hepatocellular carcinoma, non-small cell lung cancer, esophageal squamous cell carcinoma, prostate cancer, colorectal/colon cancer and breast cancer. In PCA-based unsupervised FE, the significance does not depend on the number of samples (as in the standard case) but on the number of features, which approximates the number of miRNAs/mRNAs. To our knowledge, we have newly identified miRNA–mRNA interactions in multiple cancers based on a single common (universal) criterion. Moreover, the number of identified interactions was sufficiently small to be sequentially curated by literature searches.
Collapse
|
18
|
Schönbach C, Horton P, Yiu SM, Tan TW, Ranganathan S. GIW and InCoB are advancing bioinformatics in the Asia-Pacific. BMC Bioinformatics 2015; 16:I1. [PMID: 28102114 PMCID: PMC6389036 DOI: 10.1186/1471-2105-16-s18-i1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
GIW/InCoB2015 the joint 26th International Conference on Genome Informatics (GIW) and 14th International Conference on Bioinformatics (InCoB) held in Tokyo, September 9-11, 2015 was attended by over 200 delegates. Fifty-one out of 89 oral presentations were based on research articles accepted for publication in four BMC journal supplements and three other journals. Sixteen articles in this supplement and six articles in the BMC Systems Biology GIW/InCoB2015 Supplement are covered by this introduction. The topics range from genome informatics, protein structure informatics, image analysis to biological networks and biomarker discovery.
Collapse
Affiliation(s)
- Christian Schönbach
- Department of Biology, School of Science and Technology, Nazarbayev University, Astana, 010000 Republic of Kazakhstan
- Center for AIDS Research and International Research Center for Medical Sciences, Kumamoto University, Kumamoto, 860-0811 Japan
| | - Paul Horton
- Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology, Tokyo, 135-0064 Japan
- Department of Computational Biology, Graduate School of Frontier Sciences, University of Tokyo, Japan
| | - Siu-Ming Yiu
- Department of Computer Science, Faculty of Engineering, The University of Hong Kong, Hong Kong, HKSAR
| | - Tin Wee Tan
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, 117599
| | - Shoba Ranganathan
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, 117599
- Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, NSW 2109 Australia
| |
Collapse
|