1
|
Lu Y, Huang J, Liu D, Kong X, Song Y, Jing L. Pangenome Data Analysis Reveals Characteristics of Resistance Gene Analogs Associated with Sclerotinia sclerotiorum Resistance in Sunflower. Life (Basel) 2024; 14:1322. [PMID: 39459622 PMCID: PMC11509514 DOI: 10.3390/life14101322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2024] [Revised: 10/10/2024] [Accepted: 10/15/2024] [Indexed: 10/28/2024] Open
Abstract
The sunflower, an important oilseed crop and food source across the world, is susceptible to several pathogens, which cause severe losses in sunflower production. The utilization of genetic resistance is the most economical, effective measure to prevent infectious diseases. Based on the sunflower pangenome, in this study, we explored the variability of resistance gene analogs (RGAs) within the species. According to a comparative analysis of RGA candidates in the sunflower pangenome using the RGAugury pipeline, a total of 1344 RGAs were identified, comprising 1107 conserved, 199 varied, and 38 rare RGAs. We also identified RGAs associated with resistance against Sclerotinia sclerotiorum (S. sclerotiorum) in sunflower at the quantitative trait locus (QTL). A total of 61 RGAs were found to be located at four quantitative trait loci (QTLs). Through a detailed expression analysis of RGAs in one susceptible and two tolerant sunflower inbred lines (ILs) across various time points post inoculation, we discovered that 348 RGAs exhibited differential expression in response to Sclerotinia head rot (SHR), with 17 of these differentially expressed RGAs being situated within the QTL regions. In addition, 15 RGA candidates had gene introgression. Our data provide a better understanding of RGAs, which facilitate genomics-based improvements in disease resistance in sunflower.
Collapse
Affiliation(s)
| | | | | | | | | | - Lan Jing
- College of Horticulture and Plant Protection, Inner Mongolia Agricultural University, Huhhot 010011, China; (Y.L.); (J.H.); (D.L.); (X.K.); (Y.S.)
| |
Collapse
|
2
|
Tsai YS, Chang YM, Lim YM, Cheong SK, Chung IF, Wong CY. Generating transcriptional regulatory networks from time-ordered stem cell differentiation RNA sequencing data. STAR Protoc 2022; 3:101541. [PMID: 36042881 PMCID: PMC9420390 DOI: 10.1016/j.xpro.2022.101541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
We describe steps to 1) identify ascending and descending monotonic key genes from time-ordered stem cell differentiation expression data, 2) construct time-ordered transcriptional regulatory networks, and 3) infer the involvement of transcription factors along the differentiation process. For complete details on the use and execution of this protocol, please refer to Wong et al. (2020). A protocol to identify ascending and descending monotonic pattern genes Temporal gene regulation in development Pipeline can be adapted to other time-series data sets
Publisher’s note: Undertaking any experimental protocol requires adherence to local institutional guidelines for laboratory safety and ethics.
Collapse
Affiliation(s)
- Yu-Shuen Tsai
- Center for Systems and Synthetic Biology, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Yao-Ming Chang
- Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan
| | - Yang-Mooi Lim
- Faculty of Medicine and Health Sciences, Universiti Tunku Abdul Rahman, Selangor, Malaysia
| | - Soon-Keng Cheong
- Faculty of Medicine and Health Sciences, Universiti Tunku Abdul Rahman, Selangor, Malaysia
| | - I-Fang Chung
- Center for Systems and Synthetic Biology, National Yang Ming Chiao Tung University, Taipei, Taiwan; Institute of Biomedical Informatics, National Yang Ming Chiao Tung University, Taipei, Taiwan; Preventive Medicine Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan.
| | - Chee-Yin Wong
- Faculty of Medicine and Health Sciences, Universiti Tunku Abdul Rahman, Selangor, Malaysia.
| |
Collapse
|
3
|
Khoei MA, Karimi M, Karamian R, Amini S, Soorni A. Identification of the Complex Interplay Between Nematode-Related lncRNAs and Their Target Genes in Glycine max L. FRONTIERS IN PLANT SCIENCE 2021; 12:779597. [PMID: 34956274 PMCID: PMC8705754 DOI: 10.3389/fpls.2021.779597] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2021] [Accepted: 11/08/2021] [Indexed: 05/26/2023]
Abstract
Soybean (Glycine max) is a major plant protein source and oilseed crop. However, plant-parasitic nematodes (PPNs) affect its annual yield. In the current study, in order to better understand the regulation of defense mechanism against PPNs in soybean, we investigated the role of long non-coding RNAs (lncRNAs) in response to two nematode species, Heterodera glycines (SCN: soybean cyst nematode) and Rotylenchulus reniformis (reniform). To this end, two publicly available RNA-seq data sets (SCN data set and RAD: reniform-associated data set) were employed to discover the lncRNAome profile of soybean under SCN and reniform infection, respectively. Upon identification of unannotated transcripts in these data sets, a seven-step pipeline was utilized to sieve these transcripts, which ended up in 384 and 283 potential lncRNAs in SCN data set and RAD, respectively. These transcripts were then used to predict cis and trans nematode-related targets in soybean genome. Computational prediction of target genes function, some of which were also among differentially expressed genes, revealed the involvement of putative nematode-responsive genes as well as enrichment of multiple stress responses in both data sets. Finally, 15 and six lncRNAs were proposed to be involved in microRNA-mediated regulation of gene expression in soybean in response to SNC and reniform infection, respectively. Collectively, this study provides a novel insight into the signaling and regulatory network of soybean-pathogen interactions and opens a new window for further research.
Collapse
Affiliation(s)
| | | | - Roya Karamian
- Department of Biology, Faculty of Sciences, Bu-Ali Sina University, Hamedan, Iran
| | | | - Aboozar Soorni
- Department of Biotechnology, College of Agriculture, Isfahan University of Technology, Isfahan, Iran
| |
Collapse
|
4
|
Rajavel A, Klees S, Schlüter JS, Bertram H, Lu K, Schmitt AO, Gültas M. Unravelling the Complex Interplay of Transcription Factors Orchestrating Seed Oil Content in Brassica napus L. Int J Mol Sci 2021; 22:1033. [PMID: 33494188 PMCID: PMC7864344 DOI: 10.3390/ijms22031033] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Revised: 01/13/2021] [Accepted: 01/17/2021] [Indexed: 11/16/2022] Open
Abstract
Transcription factors (TFs) and their complex interplay are essential for directing specific genetic programs, such as responses to environmental stresses, tissue development, or cell differentiation by regulating gene expression. Knowledge regarding TF-TF cooperations could be promising in gaining insight into the developmental switches between the cultivars of Brassica napus L., namely Zhongshuang11 (ZS11), a double-low accession with high-oil- content, and Zhongyou821 (ZY821), a double-high accession with low-oil-content. In this regard, we analysed a time series RNA-seq data set of seed tissue from both of the cultivars by mainly focusing on the monotonically expressed genes (MEGs). The consideration of the MEGs enables the capturing of multi-stage progression processes that are orchestrated by the cooperative TFs and, thus, facilitates the understanding of the molecular mechanisms determining seed oil content. Our findings show that TF families, such as NAC, MYB, DOF, GATA, and HD-ZIP are highly involved in the seed developmental process. Particularly, their preferential partner choices as well as changes in their gene expression profiles seem to be strongly associated with the differentiation of the oil content between the two cultivars. These findings are essential in enhancing our understanding of the genetic programs in both cultivars and developing novel hypotheses for further experimental studies.
Collapse
Affiliation(s)
- Abirami Rajavel
- Breeding Informatics Group, Department of Animal Sciences, Georg-August University, Margarethe von Wrangell-Weg 7, 37075 Göttingen, Germany; (A.R.); (S.K.); (J.-S.S.); (H.B.); (A.O.S.)
| | - Selina Klees
- Breeding Informatics Group, Department of Animal Sciences, Georg-August University, Margarethe von Wrangell-Weg 7, 37075 Göttingen, Germany; (A.R.); (S.K.); (J.-S.S.); (H.B.); (A.O.S.)
| | - Johanna-Sophie Schlüter
- Breeding Informatics Group, Department of Animal Sciences, Georg-August University, Margarethe von Wrangell-Weg 7, 37075 Göttingen, Germany; (A.R.); (S.K.); (J.-S.S.); (H.B.); (A.O.S.)
| | - Hendrik Bertram
- Breeding Informatics Group, Department of Animal Sciences, Georg-August University, Margarethe von Wrangell-Weg 7, 37075 Göttingen, Germany; (A.R.); (S.K.); (J.-S.S.); (H.B.); (A.O.S.)
| | - Kun Lu
- College of Agronomy and Biotechnology, Southwest University, Beibei, Chongqing 400715, China;
- Academy of Agricultural Sciences, Southwest University, Beibei, Chongqing 400715, China
- State Cultivation Base of Crop Stress Biology, Southern Mountainous Land of Southwest University, Beibei, Chongqing 400715, China
| | - Armin Otto Schmitt
- Breeding Informatics Group, Department of Animal Sciences, Georg-August University, Margarethe von Wrangell-Weg 7, 37075 Göttingen, Germany; (A.R.); (S.K.); (J.-S.S.); (H.B.); (A.O.S.)
- Center for Integrated Breeding Research (CiBreed), Albrecht-Thaer-Weg 3, Georg-August University, 37075 Göttingen, Germany
| | - Mehmet Gültas
- Breeding Informatics Group, Department of Animal Sciences, Georg-August University, Margarethe von Wrangell-Weg 7, 37075 Göttingen, Germany; (A.R.); (S.K.); (J.-S.S.); (H.B.); (A.O.S.)
- Center for Integrated Breeding Research (CiBreed), Albrecht-Thaer-Weg 3, Georg-August University, 37075 Göttingen, Germany
| |
Collapse
|
5
|
Rajavel A, Schmitt AO, Gültas M. Computational Identification of Master Regulators Influencing Trypanotolerance in Cattle. Int J Mol Sci 2021; 22:ijms22020562. [PMID: 33429951 PMCID: PMC7827104 DOI: 10.3390/ijms22020562] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Revised: 12/31/2020] [Accepted: 01/05/2021] [Indexed: 12/15/2022] Open
Abstract
African Animal Trypanosomiasis (AAT) is transmitted by the tsetse fly which carries pathogenic trypanosomes in its saliva, thus causing debilitating infection to livestock health. As the disease advances, a multistage progression process is observed based on the progressive clinical signs displayed in the host’s body. Investigation of genes expressed with regular monotonic patterns (known as Monotonically Expressed Genes (MEGs)) and of their master regulators can provide important clue for the understanding of the molecular mechanisms underlying the AAT disease. For this purpose, we analysed MEGs for three tissues (liver, spleen and lymph node) of two cattle breeds, namely trypanosusceptible Boran and trypanotolerant N’Dama. Our analysis revealed cattle breed-specific master regulators which are highly related to distinguish the genetic programs in both cattle breeds. Especially the master regulators MYC and DBP found in this study, seem to influence the immune responses strongly, thereby susceptibility and trypanotolerance of Boran and N’Dama respectively. Furthermore, our pathway analysis also bolsters the crucial roles of these master regulators. Taken together, our findings provide novel insights into breed-specific master regulators which orchestrate the regulatory cascades influencing the level of trypanotolerance in cattle breeds and thus could be promising drug targets for future therapeutic interventions.
Collapse
Affiliation(s)
- Abirami Rajavel
- Breeding Informatics Group, Department of Animal Sciences, Georg-August University, Margarethe von Wrangell-Weg 7, 37075 Göttingen, Germany; (A.R.); (A.O.S.)
| | - Armin Otto Schmitt
- Breeding Informatics Group, Department of Animal Sciences, Georg-August University, Margarethe von Wrangell-Weg 7, 37075 Göttingen, Germany; (A.R.); (A.O.S.)
- Center for Integrated Breeding Research (CiBreed), Albrecht-Thaer-Weg 3, Georg-August University, 37075 Göttingen, Germany
| | - Mehmet Gültas
- Breeding Informatics Group, Department of Animal Sciences, Georg-August University, Margarethe von Wrangell-Weg 7, 37075 Göttingen, Germany; (A.R.); (A.O.S.)
- Center for Integrated Breeding Research (CiBreed), Albrecht-Thaer-Weg 3, Georg-August University, 37075 Göttingen, Germany
- Correspondence:
| |
Collapse
|
6
|
Identification of Monotonically Differentially Expressed Genes across Pathologic Stages for Cancers. JOURNAL OF ONCOLOGY 2020; 2020:8458190. [PMID: 33273919 PMCID: PMC7676961 DOI: 10.1155/2020/8458190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 10/17/2020] [Accepted: 10/28/2020] [Indexed: 12/09/2022]
Abstract
Given the fact that cancer is a multistage progression process resulting from genetic sequence mutations, the genes whose expression values increase or decrease monotonically across pathologic stages are potentially involved in tumor progression. This may provide insightful clues about how human cancers advance, thereby facilitating more personalized treatments. By replacing the expression values of genes with their GeneRanks, we propose a procedure capable of identifying monotonically differentially expressed genes (MEGs) as the disease advances. Using three real-world gene expression data that cover three distinct cancer types-colon, esophageal, and lung cancers-the proposed procedure has demonstrated excellent performance in detecting the potential MEGs. To conclude, the proposed procedure can detect MEGs across pathologic stages of cancers very efficiently and is thus highly recommended.
Collapse
|
7
|
Wong CY, Chang YM, Tsai YS, Ng WV, Cheong SK, Chang TY, Chung IF, Lim YM. Decoding the differentiation of mesenchymal stem cells into mesangial cells at the transcriptomic level. BMC Genomics 2020; 21:467. [PMID: 32635896 PMCID: PMC7339572 DOI: 10.1186/s12864-020-06868-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Accepted: 06/23/2020] [Indexed: 02/08/2023] Open
Abstract
Background Mesangial cells play an important role in the glomerulus to provide mechanical support and maintaine efficient ultrafiltration of renal plasma. Loss of mesangial cells due to pathologic conditions may lead to impaired renal function. Mesenchymal stem cells (MSC) can differentiate into many cell types, including mesangial cells. However transcriptomic profiling during MSC differentiation into mesangial cells had not been studied yet. The aim of this study is to examine the pattern of transcriptomic changes during MSC differentiation into mesangial cells, to understand the involvement of transcription factor (TF) along the differentiation process, and finally to elucidate the relationship among TF-TF and TF-key gene or biomarkers during the differentiation of MSC into mesangial cells. Results Several ascending and descending monotonic key genes were identified by Monotonic Feature Selector. The identified descending monotonic key genes are related to stemness or regulation of cell cycle while ascending monotonic key genes are associated with the functions of mesangial cells. The TFs were arranged in a co-expression network in order of time by Time-Ordered Gene Co-expression Network (TO-GCN) analysis. TO-GCN analysis can classify the differentiation process into three stages: differentiation preparation, differentiation initiation and maturation. Furthermore, it can also explore TF-TF-key genes regulatory relationships in the muscle contraction process. Conclusions A systematic analysis for transcriptomic profiling of MSC differentiation into mesangial cells has been established. Key genes or biomarkers, TFs and pathways involved in differentiation of MSC-mesangial cells have been identified and the related biological implications have been discussed. Finally, we further elucidated for the first time the three main stages of mesangial cell differentiation, and the regulatory relationships between TF-TF-key genes involved in the muscle contraction process. Through this study, we have increased fundamental understanding of the gene transcripts during the differentiation of MSC into mesangial cells.
Collapse
Affiliation(s)
- Chee-Yin Wong
- Department of Pre-Clinical Sciences, Faculty of Medicine and Health Sciences, Universiti Tunku Abdul Rahman, Jalan Sungai Long, Bandar Sungai Long, 43000 Kajang, Selangor, Malaysia
| | - Yao-Ming Chang
- Institute of Biomedical Sciences, Academia Sinica, 128, Academia Road, Section 2, Nankang, Taipei, Taiwan
| | - Yu-Shuen Tsai
- Center for Systems and Synthetic Biology, National Yang-Ming University, No. 155, Section 2, Linong Street, Taipei, Taiwan
| | - Wailap Victor Ng
- Department of Biotechnology and Laboratory Science in Medicine, National Yang-Ming University, No. 155, Section 2, Linong Street, Taipei, Taiwan
| | - Soon-Keng Cheong
- Department of Pre-Clinical Sciences, Faculty of Medicine and Health Sciences, Universiti Tunku Abdul Rahman, Jalan Sungai Long, Bandar Sungai Long, 43000 Kajang, Selangor, Malaysia
| | - Ting-Yu Chang
- Department of Research, ChangHua Christian Hospital, 135, Nan-Hsiao Street, ChangHua City, Taiwan
| | - I-Fang Chung
- Center for Systems and Synthetic Biology, National Yang-Ming University, No. 155, Section 2, Linong Street, Taipei, Taiwan. .,Institute of Biomedical Informatics, National Yang-Ming University, No. 155, Section 2, Linong Street, Taipei, Taiwan. .,Preventive Medicine Research Center, National Yang-Ming University, No. 155, Section 2, Linong Street, Taipei, Taiwan.
| | - Yang-Mooi Lim
- Department of Pre-Clinical Sciences, Faculty of Medicine and Health Sciences, Universiti Tunku Abdul Rahman, Jalan Sungai Long, Bandar Sungai Long, 43000 Kajang, Selangor, Malaysia.
| |
Collapse
|
8
|
Identifying Cattle Breed-Specific Partner Choice of Transcription Factors during the African Trypanosomiasis Disease Progression Using Bioinformatics Analysis. Vaccines (Basel) 2020; 8:vaccines8020246. [PMID: 32456126 PMCID: PMC7350023 DOI: 10.3390/vaccines8020246] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Revised: 05/13/2020] [Accepted: 05/21/2020] [Indexed: 12/18/2022] Open
Abstract
African Animal Trypanosomiasis (AAT) is a disease caused by pathogenic trypanosomes which affects millions of livestock every year causing huge economic losses in agricultural production especially in sub-Saharan Africa. The disease is spread by the tsetse fly which carries the parasite in its saliva. During the disease progression, the cattle are prominently subjected to anaemia, weight loss, intermittent fever, chills, neuronal degeneration, congestive heart failure, and finally death. According to their different genetic programs governing the level of tolerance to AAT, cattle breeds are classified as either resistant or susceptible. In this study, we focus on the cattle breeds N’Dama and Boran which are known to be resistant and susceptible to trypanosomiasis, respectively. Despite the rich literature on both breeds, the gene regulatory mechanisms of the underlying biological processes for their resistance and susceptibility have not been extensively studied. To address the limited knowledge about the tissue-specific transcription factor (TF) cooperations associated with trypanosomiasis, we investigated gene expression data from these cattle breeds computationally. Consequently, we identified significant cooperative TF pairs (especially DBP−PPARA and DBP−THAP1 in N’Dama and DBP−PAX8 in Boran liver tissue) which could help understand the underlying AAT tolerance/susceptibility mechanism in both cattle breeds.
Collapse
|
9
|
Sun H, Sui Z, Wang D, Ba H, Zhao H, Zhang L, Li C. Identification of interactive molecules between antler stem cells and dermal papilla cells using an in vitro co-culture system. J Mol Histol 2019; 51:15-31. [DOI: 10.1007/s10735-019-09853-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2019] [Accepted: 11/30/2019] [Indexed: 12/25/2022]
|
10
|
Tian S. Identification of monotonically differentially expressed genes for non-small cell lung cancer. BMC Bioinformatics 2019; 20:177. [PMID: 30971213 PMCID: PMC6458730 DOI: 10.1186/s12859-019-2775-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Accepted: 03/22/2019] [Indexed: 12/19/2022] Open
Abstract
Background Monotonically expressed genes (MEGs) are genes whose expression values increase or decrease monotonically as a disease advances or time proceeds. Non-small cell lung cancer (NSCLC) is a multistage progression process resulting from genetic sequences mutations, the identification of MEGs for NSCLC is important. Results With the aid of a feature selection algorithm capable of identifying MEGs – the MFSelector method – two sets of potential MEGs were selected in this study: the MEGs across the different pathologic stages and the MEGs across the risk levels of death for the NSCLC patients at early stages. For the lung adenocarcinoma (AC) subtypes no statistically significant MEGs were identified across pathologic stages, however dozens of MEGs were identified across the risk levels of death. By contrast, for the squamous cell lung carcinoma (SCC) there were no statistically significant MEGs as either stage or risk level advanced. Conclusions The pathologic stage of non-small cell lung cancer patients at early stages has no prognostic value, making the identification of prognostic gene signatures for them more meaningful and highly desirable.
Collapse
Affiliation(s)
- Suyan Tian
- Division of Clinical Research, The First Hospital of Jilin University, 71 Xinmin Street, Changchun, 130021, Jilin, China.
| |
Collapse
|
11
|
Tian S, Wang C, Wang B. Incorporating Pathway Information into Feature Selection towards Better Performed Gene Signatures. BIOMED RESEARCH INTERNATIONAL 2019; 2019:2497509. [PMID: 31073522 PMCID: PMC6470448 DOI: 10.1155/2019/2497509] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Accepted: 03/07/2019] [Indexed: 12/29/2022]
Abstract
To analyze gene expression data with sophisticated grouping structures and to extract hidden patterns from such data, feature selection is of critical importance. It is well known that genes do not function in isolation but rather work together within various metabolic, regulatory, and signaling pathways. If the biological knowledge contained within these pathways is taken into account, the resulting method is a pathway-based algorithm. Studies have demonstrated that a pathway-based method usually outperforms its gene-based counterpart in which no biological knowledge is considered. In this article, a pathway-based feature selection is firstly divided into three major categories, namely, pathway-level selection, bilevel selection, and pathway-guided gene selection. With bilevel selection methods being regarded as a special case of pathway-guided gene selection process, we discuss pathway-guided gene selection methods in detail and the importance of penalization in such methods. Last, we point out the potential utilizations of pathway-guided gene selection in one active research avenue, namely, to analyze longitudinal gene expression data. We believe this article provides valuable insights for computational biologists and biostatisticians so that they can make biology more computable.
Collapse
Affiliation(s)
- Suyan Tian
- Division of Clinical Research, The First Hospital of Jilin University, 71 Xinmin Street, Changchun, Jilin 130021, China
| | - Chi Wang
- Department of Biostatistics, Markey Cancer Center, The University of Kentucky, 800 Rose St., Lexington, KY 40536, USA
| | - Bing Wang
- School of Life Science, Jilin University, 2699 Qianjin Street, Changchun, Jilin 130012, China
| |
Collapse
|
12
|
Fukushima A, Sugimoto M, Hiwa S, Hiroyasu T. Elastic net-based prediction of IFN-β treatment response of patients with multiple sclerosis using time series microarray gene expression profiles. Sci Rep 2019; 9:1822. [PMID: 30755676 PMCID: PMC6372673 DOI: 10.1038/s41598-018-38441-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Accepted: 12/14/2018] [Indexed: 01/08/2023] Open
Abstract
INF-β has been widely used to treat patients with multiple sclerosis (MS) in relapse. Accurate prediction of treatment response is important for effective personalization of treatment. Microarray data have been frequently used to discover new genes and to predict treatment responses. However, conventional analytical methods suffer from three difficulties: high-dimensionality of datasets; high degree of multi-collinearity; and achieving gene identification in time-course data. The use of Elastic net, a sparse modelling method, would decrease the first two issues; however, Elastic net is currently unable to solve these three issues simultaneously. Here, we improved Elastic net to accommodate time-course data analyses. Numerical experiments were conducted using two time-course microarray datasets derived from peripheral blood mononuclear cells collected from patients with MS. The proposed methods successfully identified genes showing a high predictive ability for INF-β treatment response. Bootstrap sampling resulted in an 81% and 78% accuracy for each dataset, which was significantly higher than the 71% and 73% accuracy obtained using conventional methods. Our methods selected genes showing consistent differentiation throughout all time-courses. These genes are expected to provide new predictive biomarkers that can influence INF-β treatment for MS patients.
Collapse
Affiliation(s)
- Arika Fukushima
- Doshisha University, Graduate School of Life and Medical Sciences, Kyoto, Japan
| | - Masahiro Sugimoto
- Research and Development Center for Minimally Invasive Therapies Health Promotion and Preemptive Medicine, Tokyo Medical University, Shinjuku, Tokyo, 160-8402, Japan.,Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata, 997-0052, Japan.,University of Tsukuba, Research and Development Center for Precision Medicine, Tukuba, Ibaraki, 305-8550, Japan
| | - Satoru Hiwa
- Doshisha University, Graduate School of Life and Medical Sciences, Kyoto, Japan
| | - Tomoyuki Hiroyasu
- Doshisha University, Graduate School of Life and Medical Sciences, Kyoto, Japan.
| |
Collapse
|
13
|
Liu B, Chen Y, Yang J. LncRNAs are altered in lung squamous cell carcinoma and lung adenocarcinoma. Oncotarget 2018; 8:24275-24291. [PMID: 27903974 PMCID: PMC5421846 DOI: 10.18632/oncotarget.13651] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2016] [Accepted: 11/14/2016] [Indexed: 01/05/2023] Open
Abstract
Long non-coding RNAs (lncRNAs) have been implicated in pathogenesis of various cancers, including lung squamous cell carcinoma (LUSC) and lung adenocarcinoma (LUAD). We used cBioPortal to analyze lncRNA alteration frequencies and their ability to predict overall survival (OS) using 504 LUSC and 522 LUAD samples from The Cancer Genome Atlas (TCGA) database. In LUSC, 624 lncRNAs had alteration rates > 1% and 64 > 10%. In LUAD 625 lncRNAs had alteration rates > 1% and 36 > 10%. Among those, 620 lncRNAs had alteration frequencies > 1% in both LUSC and LUAD, while 22 were LUSC-specific and 23 were LUAD-specific. Twenty lncRNAs had alteration frequencies > 10% in both LUSC and LUAD, while 44 were LUSC-specific and 16 were LUAD specific. Genome ontology and pathway analyses produced similar results for LUSC and LUAD. Two lncRNAs (IGF2BP2-AS1 and DGCR5) correlated with better OS in LUSC, and three (MIR31HG, CDKN2A-AS1 and LINC01600) predicted poor OS in LUAD. Chip-seq and luciferase reporter assays identified potential IGF2BP2-AS1, DGCR5 and LINC01600 promoters and enhancers. This study presented lncRNA landscapes and revealed differentially expressed, highly altered lncRNAs in LUSC and LUAD. LncRNAs that act as oncogenes and lncRNA-regulating transcription factors provide novel targets for anti-lung cancer therapeutics.
Collapse
Affiliation(s)
- Bing Liu
- Department of Respiratory Medicine, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - Yifei Chen
- Department of Respiratory Medicine, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - Jiong Yang
- Department of Respiratory Medicine, Zhongnan Hospital of Wuhan University, Wuhan, China
| |
Collapse
|
14
|
Zhang A, Tian S. Classification of early-stage non-small cell lung cancer by weighing gene expression profiles with connectivity information. Biom J 2017; 60:537-546. [PMID: 29206308 DOI: 10.1002/bimj.201700010] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2017] [Revised: 09/10/2017] [Accepted: 10/22/2017] [Indexed: 11/11/2022]
Abstract
Pathway-based feature selection algorithms, which utilize biological information contained in pathways to guide which features/genes should be selected, have evolved quickly and become widespread in the field of bioinformatics. Based on how the pathway information is incorporated, we classify pathway-based feature selection algorithms into three major categories-penalty, stepwise forward, and weighting. Compared to the first two categories, the weighting methods have been underutilized even though they are usually the simplest ones. In this article, we constructed three different genes' connectivity information-based weights for each gene and then conducted feature selection upon the resulting weighted gene expression profiles. Using both simulations and a real-world application, we have demonstrated that when the data-driven connectivity information constructed from the data of specific disease under study is considered, the resulting weighted gene expression profiles slightly outperform the original expression profiles. In summary, a big challenge faced by the weighting method is how to estimate pathway knowledge-based weights more accurately and precisely. Only until the issue is conquered successfully will wide utilization of the weighting methods be impossible.
Collapse
Affiliation(s)
- Ao Zhang
- Intensive Care Unit (ICU), The First Hospital of Jilin University, Changchun, 130021, China
| | - Suyan Tian
- Division of Clinical Research, The First Hospital of Jilin University, Changchun, 130021, China
| |
Collapse
|
15
|
Qin N, Wang C, Lu Q, Ma Z, Dai J, Ma H, Jin G, Shen H, Hu Z. Systematic identification of long non-coding RNAs with cancer-testis expression patterns in 14 cancer types. Oncotarget 2017; 8:94769-94779. [PMID: 29212265 PMCID: PMC5706911 DOI: 10.18632/oncotarget.21930] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Accepted: 08/08/2017] [Indexed: 12/25/2022] Open
Abstract
Cancer-testis (CT) genes are a group of genes that are potential targets of immunotherapy and candidate epi-drivers participating in the development of cancers. Previous studies mainly focused on protein-coding genes, neglecting long non-coding RNAs with the same expression patterns. In this study, we performed a systematic investigation of cancer-testis long non-coding RNAs (CT-lncRNAs) with multiple independent open-access databases.We identified 1,325 extremely highly expressed CT-lncRNAs (EECT-lncRNAs) in 14 cancer types. Functional annotation revealed that CT-lncRNAs reactivated in cancers could promote genome instability and the malignant potential of cancers. We observed a mutually exclusive pattern of EECT-lncRNA activation and mutation in known oncogenes, suggesting their potential role as drivers of cancer that complement known mut-driver genes. Additionally, we provided evidence that testis-specific regulatory elements and promoter hypo-methylation may be EECT-lncRNA activation mechanisms, and EECT-lncRNAs may regulate CT gene reactivation. Taken together, our study puts forth a new hypothesis in the research field of CT genes, whereby CT-lncRNAs/EECT-lncRNAs play important roles in the progression and maintenance of tumorigenesis, expanding candidate CT epi-driver genes from coding genes to non-coding RNAs.
Collapse
Affiliation(s)
- Na Qin
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China.,Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, China.,Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Medicine, Nanjing Medical University, Nanjing 211166, China
| | - Cheng Wang
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China.,Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, China.,Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Medicine, Nanjing Medical University, Nanjing 211166, China.,Department of Bioinformatics, School of Basic Medical Sciences, Nanjing Medical University, Nanjing 211116, China
| | - Qun Lu
- Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, China.,Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Medicine, Nanjing Medical University, Nanjing 211166, China
| | - Zijian Ma
- Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, China.,Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Medicine, Nanjing Medical University, Nanjing 211166, China
| | - Juncheng Dai
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China.,Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, China.,Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Medicine, Nanjing Medical University, Nanjing 211166, China
| | - Hongxia Ma
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China.,Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, China.,Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Medicine, Nanjing Medical University, Nanjing 211166, China
| | - Guangfu Jin
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China.,Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, China.,Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Medicine, Nanjing Medical University, Nanjing 211166, China
| | - Hongbing Shen
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China.,Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, China.,Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Medicine, Nanjing Medical University, Nanjing 211166, China
| | - Zhibin Hu
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing 211166, China.,Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, China.,Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Medicine, Nanjing Medical University, Nanjing 211166, China
| |
Collapse
|
16
|
Identification of potential prognostic ceRNA module biomarkers in patients with pancreatic adenocarcinoma. Oncotarget 2017; 8:94493-94504. [PMID: 29212244 PMCID: PMC5706890 DOI: 10.18632/oncotarget.21783] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Accepted: 09/08/2017] [Indexed: 12/13/2022] Open
Abstract
Accumulating evidence suggested that long non-coding RNAs (lncRNAs) can function as competing endogenous RNAs (ceRNAs) to interact with other RNA transcripts and ceRNAs perturbation play important roles in cancer initiation and progression including pancreatic adenocarcinoma (PAAD). In this study, we constructed a PAAD-specific hallmark gene-related ceRNA network (HceNet) using paired genome-wide expression profiles of mRNA, lncRNA and miRNA and regulatory relationships between them. Based on “ceRNA hypothesis”, we analyzed the characteristics of HceNet and identified a ceRNA module comprising of 29 genes (12 lncRNAs, two miRNAs and 15 mRNAs) as potential prognostic biomarkers related to overall survival of patients with PAAD. The prognostic value of ceRNA module biomarkers was further validated in the train (Hazard Ratio (HR) =1.661, 95% CI: 1.275–2.165, p<1.00e-4), test (HR=1.546, 95% CI: 1.238-1.930, p<1.00e-4), and entire (HR=1.559, 95% CI: 1.321-1.839, p<1.00e-4) datasets. Our study provides candidate prognostic biomarkers for PAAD and increases our understanding of ceRNA-related regulatory mechanism in PAAD pathogenesis.
Collapse
|
17
|
LncSubpathway: a novel approach for identifying dysfunctional subpathways associated with risk lncRNAs by integrating lncRNA and mRNA expression profiles and pathway topologies. Oncotarget 2017; 8:15453-15469. [PMID: 28152521 PMCID: PMC5362499 DOI: 10.18632/oncotarget.14973] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2016] [Accepted: 01/10/2017] [Indexed: 02/01/2023] Open
Abstract
Long non-coding RNAs (lncRNAs) play important roles in various biological processes, including the development of many diseases. Pathway analysis is a valuable aid for understanding the cellular functions of these transcripts. We have developed and characterized LncSubpathway, a novel method that integrates lncRNA and protein coding gene (PCG) expression with interactome data to identify disease risk subpathways that functionally associated with risk lncRNAs. LncSubpathway identifies the most relevance regions which are related with risk lncRNA set and implicated with study conditions through simultaneously considering the dysregulation extent of lncRNAs, PCGs and their correlations. Simulation studies demonstrated that the sensitivity and false positive rates of LncSubpathway were within acceptable ranges, and that LncSubpathway could accurately identify dysregulated regions that related with disease risk lncRNAs within pathways. When LncSubpathway was applied to colorectal carcinoma and breast cancer subtype datasets, it identified cancer type- and breast cancer subtype-related meaningful subpathways. Further, analysis of its robustness and reproducibility indicated that LncSubpathway was a reliable means of identifying subpathways that functionally associated with lncRNAs. LncSubpathway is freely available at http://www.bio-bigdata.com/lncSubpathway/.
Collapse
|
18
|
Xu C, Qi R, Ping Y, Li J, Zhao H, Wang L, Du MY, Xiao Y, Li X. Systemically identifying and prioritizing risk lncRNAs through integration of pan-cancer phenotype associations. Oncotarget 2017; 8:12041-12051. [PMID: 28076842 PMCID: PMC5355324 DOI: 10.18632/oncotarget.14510] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Accepted: 12/12/2016] [Indexed: 02/01/2023] Open
Abstract
LncRNAs have emerged as a major class of regulatory molecules involved in normal cellular physiology and disease, our knowledge of lncRNAs is very limited and it has become a major research challenge in discovering novel disease-related lncRNAs in cancers. Based on the assumption that diverse diseases with similar phenotype associations show similar molecular mechanisms, we presented a pan-cancer network-based prioritization approach to systematically identify disease-specific risk lncRNAs by integrating disease phenotype associations. We applied this strategy to approximately 2800 tumor samples from 14 cancer types for prioritizing disease risk lncRNAs. Our approach yielded an average area under the ROC curve (AUC) of 80.66%, with the highest AUC (98.14%) for medulloblastoma. When evaluated using leave-one-out cross-validation (LOOCV) for prioritization of disease candidate genes, the average AUC score of 97.16% was achieved. Moreover, we demonstrated the robustness as well as the integrative importance of this approach, including disease phenotype associations, known disease genes and the numbers of cancer types. Taking glioblastoma multiforme as a case study, we identified a candidate lncRNA gene SNHG1 as a novel disease risk factor for disease diagnosis and prognosis. In summary, we provided a novel lncRNA prioritization approach by integrating pan-cancer phenotype associations that could help researchers better understand the important roles of lncRNAs in human cancers.
Collapse
Affiliation(s)
- Chaohan Xu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang, China
| | - Rui Qi
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang, China
| | - Yanyan Ping
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang, China
| | - Jie Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang, China
| | - Hongying Zhao
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang, China
| | - Li Wang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang, China
| | | | - Yun Xiao
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang, China.,Key Laboratory of Cardiovascular Medicine Research, Harbin Medical University, Ministry of Education, China
| | - Xia Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang, China
| |
Collapse
|
19
|
Shi X, Xu Y, Zhang C, Feng L, Sun Z, Han J, Su F, Zhang Y, Li C, Li X. Subpathway-LNCE: Identify dysfunctional subpathways competitively regulated by lncRNAs through integrating lncRNA-mRNA expression profile and pathway topologies. Oncotarget 2016; 7:69857-69870. [PMID: 27634882 PMCID: PMC5342520 DOI: 10.18632/oncotarget.12005] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2016] [Accepted: 09/02/2016] [Indexed: 12/14/2022] Open
Abstract
Recently, studies have reported that long noncoding RNAs (lncRNAs) can act as modulators of mRNAs through competitively binding to microRNAs (miRNAs) and have relevance to tumorigenesis as well as other diseases. Identify lncRNA competitively regulated subpathway not only can gain insight into the initiation and progression of disease, but also help for understanding the functional roles of lncRNAs in the disease context. Here, we present an effective method, Subpathway-LNCE, which was specifically designed to identify lncRNAs competitively regulated functions and the functional roles of these competitive regulation lncRNAs have not be well characterized in diseases. Moreover, the method integrated lncRNA-mRNA expression profile and pathway topologies. Using prostate cancer datasets and LUAD data sets, we confirmed the effectiveness of our method in identifying disease associated dysfunctional subpathway that regulated by lncRNAs. By analyzing kidney renal clear cell carcinoma related lncRNA competitively regulated subpathway network, we show that Subpathway-LNCE can help uncover disease key lncRNAs. Furthermore, we demonstrated that our method is reproducible and robust. Subpathway-LNCE provide a flexible tool to identify lncRNA competitively regulated signal subpathways underlying certain condition, and help to expound the functional roles of lncRNAs in various status. Subpathway-LNCE has been developed as an R package freely available at https://cran.rstudio.com/web/packages/SubpathwayLNCE/.
Collapse
Affiliation(s)
- Xinrui Shi
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Yanjun Xu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Chunlong Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Li Feng
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Zeguo Sun
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Junwei Han
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Fei Su
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Yunpeng Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Chunquan Li
- Department of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, 163319, China
| | - Xia Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| |
Collapse
|