1
|
Chiricosta L, D’Angiolini S, Gugliandolo A, Mazzon E. Artificial Intelligence Predictor for Alzheimer’s Disease Trained on Blood Transcriptome: The Role of Oxidative Stress. Int J Mol Sci 2022; 23:ijms23095237. [PMID: 35563628 PMCID: PMC9104709 DOI: 10.3390/ijms23095237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Revised: 05/04/2022] [Accepted: 05/05/2022] [Indexed: 02/01/2023] Open
Abstract
Alzheimer’s disease (AD) is an incurable neurodegenerative disease diagnosed by clinicians through healthcare records and neuroimaging techniques. These methods lack sensitivity and specificity, so new antemortem non-invasive strategies to diagnose AD are needed. Herein, we designed a machine learning predictor based on transcriptomic data obtained from the blood of AD patients and individuals without dementia (non-AD) through an 8 × 60 K microarray. The dataset was used to train different models with different hyperparameters. The support vector machines method allowed us to reach a Receiver Operating Characteristic score of 93% and an accuracy of 89%. High score levels were also achieved by the neural network and logistic regression methods. Furthermore, the Gene Ontology enrichment analysis of the features selected to train the model along with the genes differentially expressed between the non-AD and AD transcriptomic profiles shows the “mitochondrial translation” biological process to be the most interesting. In addition, inspection of the KEGG pathways suggests that the accumulation of β-amyloid triggers electron transport chain impairment, enhancement of reactive oxygen species and endoplasmic reticulum stress. Taken together, all these elements suggest that the oxidative stress induced by β-amyloid is a key feature trained by the model for the prediction of AD with high accuracy.
Collapse
|
2
|
Marete A, Ariel O, Ibeagha-Awemu E, Bissonnette N. Identification of Long Non-coding RNA Isolated From Naturally Infected Macrophages and Associated With Bovine Johne's Disease in Canadian Holstein Using a Combination of Neural Networks and Logistic Regression. Front Vet Sci 2021; 8:639053. [PMID: 33969037 PMCID: PMC8100051 DOI: 10.3389/fvets.2021.639053] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Accepted: 02/15/2021] [Indexed: 01/15/2023] Open
Abstract
Mycobacterium avium ssp. paratuberculosis (MAP) causes chronic enteritis in most ruminants. The pathogen MAP causes Johne's disease (JD), a chronic, incurable, wasting disease. Weight loss, diarrhea, and a gradual drop in milk production characterize the disease's clinical phase, culminating in death. Several studies have characterized long non-coding RNA (lncRNA) in bovine tissues, and a previous study characterizes (lncRNA) in macrophages infected with MAP in vitro. In this study, we aim to characterize the lncRNA in macrophages from cows naturally infected with MAP. From 15 herds, feces and blood samples were collected for each cow older than 24 months, twice yearly over 3–5 years. Paired samples were analyzed by fecal PCR and blood ELISA. We used RNA-seq data to study lncRNA in macrophages from 33 JD(+) and 33 JD(–) dairy cows. We performed RNA-seq analysis using the “new Tuxedo” suite. We characterized lncRNA using logistic regression and multilayered neural networks and used DESeq2 for differential expression analysis and Panther and Reactome classification systems for gene ontology (GO) analysis. The study identified 13,301 lncRNA, 605 of which were novel lncRNA. We found seven genes close to differentially expressed lncRNA, including CCDC174, ERI1, FZD1, TWSG1, ZBTB38, ZNF814, and ZSCAN4. None of the genes associated with susceptibility to JD have been cited in the literature. LncRNA target genes were significantly enriched for biological process GO terms involved in immunity and nucleic acid regulation. These include the MyD88 pathway (TLR5), GO:0043312 (neutrophil degranulation), GO:0002446 (neutrophil-mediated immunity), and GO:0042119 (neutrophil activation). These results identified lncRNA with potential roles in host immunity and potential candidate genes and pathways through which lncRNA might function in response to MAP infection.
Collapse
Affiliation(s)
- Andrew Marete
- Agriculture and Agri-Food Canada, Sherbrooke Research and Development Centre, Sherbrooke, QC, Canada
| | - Olivier Ariel
- Agriculture and Agri-Food Canada, Sherbrooke Research and Development Centre, Sherbrooke, QC, Canada.,Faculty of Science, Sherbrooke University, Sherbrooke, QC, Canada
| | - Eveline Ibeagha-Awemu
- Agriculture and Agri-Food Canada, Sherbrooke Research and Development Centre, Sherbrooke, QC, Canada
| | - Nathalie Bissonnette
- Agriculture and Agri-Food Canada, Sherbrooke Research and Development Centre, Sherbrooke, QC, Canada
| |
Collapse
|
3
|
Zheng H, Xu J, Hao S, Liu X, Ning J, Song X, Jiang L, Liu Z. Expression of BANCR promotes papillary thyroid cancer by targeting thyroid stimulating hormone receptor. Oncol Lett 2018; 16:2009-2015. [PMID: 30034553 DOI: 10.3892/ol.2018.8810] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2016] [Accepted: 08/23/2017] [Indexed: 12/28/2022] Open
Abstract
Papillary thyroid carcinoma (PTC) is the most common form of non-medullary thyroid cancer, accounting for ~80% of all cases of thyroid cancer. The aim of the present study was to explore the role of BRAF-activated long noncoding RNA (BANCR) in the development of PTC. Using reverse transcription-quantitative polymerase chain reaction (RT-qPCR), the mRNA expression levels of BANCR, thyroid-stimulating hormone receptor (TSHR) and cyclin D1 between PTC and benign control thyroid nodule tissue samples from 60 patients were determined. Using RT-qPCR and western blot analysis, the expression levels of TSHR and cyclin D1 mRNA and protein were determined in cells transfected with BANCR-small interfering (si)RNA. An MTT assay and flow cytometry were used to analyze the effect of BANCR knockdown on the proliferation and cell cycle distribution of IHH-4 PTC cells. The expression of BANCR, TSHR and cyclin D1 was increased in the PTC group compared with the control group based on the RT-qPCR data. The transfection of IHH-4 cells with BANCR-siRNA induced the inhibition of TSHR and cyclin D1 expression compared with a transfection control. In addition, the proliferation of the IHH-4 cells transfected with BANCR-siRNA was suppressed, relative to the transfection control, and cells arrested in the G0/G1 phase, potentially due to the inhibition of the expression of cyclin D1. The data suggested that the expression of BANCR may promote the development of malignant thyroid nodules via the modulation of TSHR expression and its downstream effector, cyclin D1.
Collapse
Affiliation(s)
- Haitao Zheng
- Department of Thyroid Surgery, Yantai Yuhuangding Hospital, Yantai, Shangdong 264099, P.R. China
| | - Jie Xu
- Department of Thyroid Surgery, Yantai Yuhuangding Hospital, Yantai, Shangdong 264099, P.R. China
| | - Shaolong Hao
- Department of Thyroid Surgery, Yantai Yuhuangding Hospital, Yantai, Shangdong 264099, P.R. China
| | - Xincheng Liu
- Department of Thyroid Surgery, Yantai Yuhuangding Hospital, Yantai, Shangdong 264099, P.R. China
| | - Jinrao Ning
- Department of Thyroid Surgery, Yantai Yuhuangding Hospital, Yantai, Shangdong 264099, P.R. China
| | - Xicheng Song
- Department of Ear, Nose and Throat & Head and Neck Surgery, Yantai Yuhuangding Hospital, Yantai, Shangdong 264099, P.R. China
| | - Lixin Jiang
- Department of Thyroid Surgery, Yantai Yuhuangding Hospital, Yantai, Shangdong 264099, P.R. China
| | - Zongying Liu
- Department of Laboratory, People's Hospital of Pingyi County, Pingyi, Shangdong 273399, P.R. China
| |
Collapse
|
4
|
Bai WL, Zhao SJ, Wang ZY, Zhu YB, Dang YL, Cong YY, Xue HL, Wang W, Deng L, Guo D, Wang SQ, Zhu YX, Yin RH. LncRNAs in Secondary Hair Follicle of Cashmere Goat: Identification, Expression, and Their Regulatory Network in Wnt Signaling Pathway. Anim Biotechnol 2017; 29:199-211. [PMID: 28846493 DOI: 10.1080/10495398.2017.1356731] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Long noncoding RNAs (lncRNAs) are a novel class of eukaryotic transcripts. They are thought to act as a critical regulator of protein-coding gene expression. Herein, we identified and characterized 13 putative lncRNAs from the expressed sequence tags from secondary hair follicle of Cashmere goat. Furthermore, we investigated their transcriptional pattern in secondary hair follicle of Liaoning Cashmere goat during telogen and anagen phases. Also, we generated intracellular regulatory networks of upregulated lncRNAs at anagen in Wnt signaling pathway based on bioinformatics analysis. The relative expression of six putative lncRNAs (lncRNA-599618, -599556, -599554, -599547, -599531, and -599509) at the anagen phase is significantly higher than that at telogen. Compared with anagen, the relative expression of four putative lncRNAs (lncRNA-599528, -599518, -599511, and -599497) was found to be significantly upregulated at telogen phase. The network generated showed that a rich and complex regulatory relationship of the putative lncRNAs and related miRNAs with their target genes in Wnt signaling pathway. Our results from the present study provided a foundation for further elucidating the functional and regulatory mechanisms of these putative lncRNAs in the development of secondary hair follicle and cashmere fiber growth of Cashmere goat.
Collapse
Affiliation(s)
- Wen L Bai
- a College of Animal Science and Veterinary Medicine , Shenyang Agricultural University , Shenyang , P. R. China
| | - Su J Zhao
- b Institute of Biotechnology , Sichuan Animal Science Academy , Chengdu , P. R. China
| | - Ze Y Wang
- a College of Animal Science and Veterinary Medicine , Shenyang Agricultural University , Shenyang , P. R. China
| | - Yu B Zhu
- a College of Animal Science and Veterinary Medicine , Shenyang Agricultural University , Shenyang , P. R. China
| | - Yun L Dang
- a College of Animal Science and Veterinary Medicine , Shenyang Agricultural University , Shenyang , P. R. China
| | - Yu Y Cong
- a College of Animal Science and Veterinary Medicine , Shenyang Agricultural University , Shenyang , P. R. China
| | - Hui L Xue
- a College of Animal Science and Veterinary Medicine , Shenyang Agricultural University , Shenyang , P. R. China
| | - Wei Wang
- a College of Animal Science and Veterinary Medicine , Shenyang Agricultural University , Shenyang , P. R. China
| | - Liang Deng
- a College of Animal Science and Veterinary Medicine , Shenyang Agricultural University , Shenyang , P. R. China
| | - Dan Guo
- c Academy of Animal Husbandry Science of Liaoning Province , Liaoyang , P. R. China
| | - Shi Q Wang
- c Academy of Animal Husbandry Science of Liaoning Province , Liaoyang , P. R. China
| | - Yan X Zhu
- c Academy of Animal Husbandry Science of Liaoning Province , Liaoyang , P. R. China
| | - Rong H Yin
- a College of Animal Science and Veterinary Medicine , Shenyang Agricultural University , Shenyang , P. R. China
| |
Collapse
|
5
|
Dallagiovanna B, Pereira IT, Origa-Alves AC, Shigunov P, Naya H, Spangenberg L. lncRNAs are associated with polysomes during adipose-derived stem cell differentiation. Gene 2017; 610:103-111. [PMID: 28185860 DOI: 10.1016/j.gene.2017.02.004] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2016] [Revised: 01/04/2017] [Accepted: 02/05/2017] [Indexed: 12/22/2022]
Abstract
Over the past few years, an increasing number of long noncoding RNAs (lncRNAs) have been identified in mammalian genomes. Most of these lncRNAs are expressed at low levels in different human cell types. lncRNAs are found not only in the nucleus but are also enriched in the cytosolic fraction and are associated with translating polysomes. Expression of lncRNAs that have putative roles in cell differentiation has been identified in embryonic and adult stem cells. Nevertheless, the mechanisms by which lncRNAs operate in the cell are still poorly understood.Here, we studied the expression of the subpopulation of lncRNAs that are associated with polysomes in adipose-derived stem cells (hASCs) during their commitment to adipocytes. We established that lncRNAs and protein coding genes have similar expression levels. The relatively comparable expression of these transcripts could be a particular feature of hASCs. We then show that lncRNAs are associated with polysomes in undifferentiated and early differentiating cells, which was confirmed by quantitative RT-PCR. The association of lncRNAs with polysomes was also comparable to that of mRNAs. Our results suggest that the presence of lncRNAs in the polysomal RNA fraction is not the result of random association. We observed that a high percentage of lncRNAs are actively mobilized to or from polysomes during early stages of adipogenesis. Moreover, we found several lncRNAs that can potentially target miRNAs relevant to adipogenesis.
Collapse
Affiliation(s)
- Bruno Dallagiovanna
- Laboratorio de Biologia Basica de Celulas-tronco, FIOCRUZ-PR, Rua Professor Algacyr Munhoz Mader, 3775, 81.350-010 Curitiba, Brazil.
| | - Isabela T Pereira
- Laboratorio de Biologia Basica de Celulas-tronco, FIOCRUZ-PR, Rua Professor Algacyr Munhoz Mader, 3775, 81.350-010 Curitiba, Brazil.
| | - Ana Carolina Origa-Alves
- Laboratorio de Biologia Basica de Celulas-tronco, FIOCRUZ-PR, Rua Professor Algacyr Munhoz Mader, 3775, 81.350-010 Curitiba, Brazil.
| | - Patricia Shigunov
- Laboratorio de Biologia Basica de Celulas-tronco, FIOCRUZ-PR, Rua Professor Algacyr Munhoz Mader, 3775, 81.350-010 Curitiba, Brazil.
| | - Hugo Naya
- Bioinformatics Unit, Institut Pasteur de Montevideo, Mataojo 2020, CP 11400 Montevideo, Uruguay.
| | - Lucía Spangenberg
- Bioinformatics Unit, Institut Pasteur de Montevideo, Mataojo 2020, CP 11400 Montevideo, Uruguay.
| |
Collapse
|
6
|
Characterizing and annotating the genome using RNA-seq data. SCIENCE CHINA-LIFE SCIENCES 2016; 60:116-125. [PMID: 27294835 DOI: 10.1007/s11427-015-0349-4] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Accepted: 12/16/2015] [Indexed: 10/21/2022]
Abstract
Bioinformatics methods for various RNA-seq data analyses are in fast evolution with the improvement of sequencing technologies. However, many challenges still exist in how to efficiently process the RNA-seq data to obtain accurate and comprehensive results. Here we reviewed the strategies for improving diverse transcriptomic studies and the annotation of genetic variants based on RNA-seq data. Mapping RNA-seq reads to the genome and transcriptome represent two distinct methods for quantifying the expression of genes/transcripts. Besides the known genes annotated in current databases, many novel genes/transcripts (especially those long noncoding RNAs) still can be identified on the reference genome using RNA-seq. Moreover, owing to the incompleteness of current reference genomes, some novel genes are missing from them. Genome- guided and de novo transcriptome reconstruction are two effective and complementary strategies for identifying those novel genes/transcripts on or beyond the reference genome. In addition, integrating the genes of distinct databases to conduct transcriptomics and genetics studies can improve the results of corresponding analyses.
Collapse
|
7
|
Koufariotis LT, Chen YPP, Chamberlain A, Vander Jagt C, Hayes BJ. A catalogue of novel bovine long noncoding RNA across 18 tissues. PLoS One 2015; 10:e0141225. [PMID: 26496443 PMCID: PMC4619662 DOI: 10.1371/journal.pone.0141225] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2015] [Accepted: 10/05/2015] [Indexed: 11/19/2022] Open
Abstract
Long non-coding RNA (lncRNA) have been implicated in diverse biological roles including gene regulation and genomic imprinting. Identifying lncRNA in bovine across many differing tissue would contribute to the current repertoire of bovine lncRNA, and help further improve our understanding of the evolutionary importance and constraints of these transcripts. Additionally, it could aid in identifying sites in the genome outside of protein coding genes where mutations could contribute to variation in complex traits. This is particularly important in bovine as genomic predictions are increasingly used in genetic improvement for milk and meat production. Our aim was to identify and annotate novel long non coding RNA transcripts in the bovine genome captured from RNA Sequencing (RNA-Seq) data across 18 tissues, sampled in triplicate from a single cow. To address the main challenge in identifying lncRNA, namely distinguishing lncRNA transcripts from unannotated genes and protein coding genes, a lncRNA identification pipeline with a number of filtering steps was developed. A total of 9,778 transcripts passed the filtering pipeline. The bovine lncRNA catalogue includes MALAT1 and HOTAIR, both of which have been well described in human and mouse genomes. We attempted to validate the lncRNA in libraries from three additional cows. 726 (87.47%) liver and 1,668 (55.27%) blood class 3 lncRNA were validated with stranded liver and blood libraries respectively. Additionally, this study identified a large number of novel unknown transcripts in the bovine genome with high protein coding potential, illustrating a clear need for better annotations of protein coding genes.
Collapse
Affiliation(s)
- Lambros T. Koufariotis
- College of Science, Health and Engineering, La Trobe University Bundoora, Melbourne, Victoria, Australia
- Department of Environment and Primary Industries, AgriBio Bundoora, Melbourne, Victoria, Australia
- Dairy Futures Co-operative Research Centre, Melbourne, Victoria, Australia
- * E-mail:
| | - Yi-Ping Phoebe Chen
- College of Science, Health and Engineering, La Trobe University Bundoora, Melbourne, Victoria, Australia
| | - Amanda Chamberlain
- Department of Environment and Primary Industries, AgriBio Bundoora, Melbourne, Victoria, Australia
- Dairy Futures Co-operative Research Centre, Melbourne, Victoria, Australia
| | - Christy Vander Jagt
- Department of Environment and Primary Industries, AgriBio Bundoora, Melbourne, Victoria, Australia
- Dairy Futures Co-operative Research Centre, Melbourne, Victoria, Australia
| | - Ben J. Hayes
- College of Science, Health and Engineering, La Trobe University Bundoora, Melbourne, Victoria, Australia
- Department of Environment and Primary Industries, AgriBio Bundoora, Melbourne, Victoria, Australia
- Dairy Futures Co-operative Research Centre, Melbourne, Victoria, Australia
| |
Collapse
|
8
|
Wang G, Chen H, Liu J. The long noncoding RNA LINC01207 promotes proliferation of lung adenocarcinoma. Am J Cancer Res 2015; 5:3162-3173. [PMID: 26693067 PMCID: PMC4656738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2015] [Accepted: 04/26/2015] [Indexed: 06/05/2023] Open
Abstract
Lung adenocarcinoma (LAD) and lung squamous cell cancer (LSCC) are two most common histological types of lung cancer, while they differ in many aspects. Recent evidence shows that long non-coding RNAs (lncRNAs) play an important role in the process of cancer initiation and progression. Thus, characterization of LAD and LSCC associated lncRNAs may help understand the difference between LAD and LSCC. Here, we analyzed three sets of RNA-seq data, including LAD RNA-seq data from TCGA project. We identified a novel lncRNA, long intergenic non-protein coding RNA 1207 (LINC01207) which was significantly up-regulated in LAD tissues compared with paired non-tumor tissues (5.78 fold increase, P<0.05), while there was no significant differences between LSCC tissues and adjacent non-tumor tissues. The expression level of LINC01207 was associated with TNM stage of LAD patients, and higher LINC01207 level indicated advanced TNM stage (P<0.05) and shorter survival (HR=2.53, P<0.05). By small interfering RNA (siRNA) mediated knockdown of LINC01207, we determined the biological function of LINC01207 in A549 cell line. After knockdown of LINC01207, cell proliferation ability was inhibited. Further analysis showed that after silence of LINC01207, the percentage of apoptotic cells significantly increased. By RNA immunoprecipitation and Chromatin immunoprecipitation assay, we demonstrated that LINC01207 could bind with EZH2 and mediated trimethylation of histone 3 lysine 27 at the promoter region of Bad, an important pro-apoptotic gene. Finally, we developed xenograft tumor models in nude mice and xenograft tumors derived from A549 cells transfected with siRNA-LINC01207 had significantly lower tumor weight and smaller tumor volume. In summary, the novel lncRNA, LINC01207 is specifically up-regulated in LAD but not in LSCC; and LINC01207 could promote LAD cell growth both in vivo and in vitro.
Collapse
Affiliation(s)
- Gongchao Wang
- Department of Surgery, School of Nursing, Shandong University Jinan 250012, China
| | - Hongbo Chen
- Department of Surgery, School of Nursing, Shandong University Jinan 250012, China
| | - Jun Liu
- Department of Surgery, School of Nursing, Shandong University Jinan 250012, China
| |
Collapse
|
9
|
Wang W, Wang X, Zhang Y, Li Z, Xie X, Wang J, Gao M, Zhang S, Hou Y. Transcriptome Analysis of Canine Cardiac Fat Pads: Involvement of Two Novel Long Non-Coding RNAs in Atrial Fibrillation Neural Remodeling. J Cell Biochem 2015; 116:809-21. [PMID: 25559442 DOI: 10.1002/jcb.25037] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2014] [Accepted: 12/11/2014] [Indexed: 02/01/2023]
Affiliation(s)
- Weizong Wang
- Department of Cardiology; Shandong Provincial Qianfoshan Hospital; Shandong University; Jinan 250014 China
| | - Ximin Wang
- Department of Cardiology; Shandong Provincial Qianfoshan Hospital; Shandong University; Jinan 250014 China
| | - Yujiao Zhang
- Department of Cardiology; Shandong Provincial Qianfoshan Hospital; Shandong University; Jinan 250014 China
| | - Zhan Li
- Department of Cardiology; Shandong Provincial Qianfoshan Hospital; Shandong University; Jinan 250014 China
| | - Xinxing Xie
- Department of Cardiology; Shandong Provincial Qianfoshan Hospital; Shandong University; Jinan 250014 China
| | - Jiangrong Wang
- Department of Cardiology; Shandong Provincial Qianfoshan Hospital; Shandong University; Jinan 250014 China
| | - Mei Gao
- Department of Cardiology; Shandong Provincial Qianfoshan Hospital; Shandong University; Jinan 250014 China
| | - Shuyu Zhang
- School of Radiation Medicine and Protection and Collaborative Innovation Center of Radiation Medicine of Jiangsu Higher Education Institutions; Soochow University; Suzhou 215123 China
| | - Yinglong Hou
- Department of Cardiology; Shandong Provincial Qianfoshan Hospital; Shandong University; Jinan 250014 China
| |
Collapse
|
10
|
DNA methylation patterns of protein coding genes and long noncoding RNAs in female schizophrenic patients. Eur J Med Genet 2015; 58:95-104. [DOI: 10.1016/j.ejmg.2014.12.001] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2014] [Accepted: 12/04/2014] [Indexed: 12/11/2022]
|
11
|
Cheng J, Wang X, Cai N, Ma Z, Zhang L, Lv Z. RNAs specifically affect gene expression in a length, position and sequence dependent manner. INTERNATIONAL JOURNAL OF CLINICAL AND EXPERIMENTAL PATHOLOGY 2014; 7:948-958. [PMID: 24696713 PMCID: PMC3971297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Accepted: 02/10/2014] [Indexed: 06/03/2023]
Abstract
We aim to explore if RNA regulating gene expression is affected by length, sequence and position of RNA. HeLa cells were co-transfected with modulator plasmids (derived from pcDNA3.1 vector containing different length regulating sequences that produce RNAs) and reporter plasmids (derived from pEGFP-C1 vector); In addition, HeLa cells were transfected with plasmids that possess different sequences of downstream or adjacent genes of GFP reporter gene. We found that long inserting sequences of modulator plasmids induced stronger GFP gene activation than short inserting sequences. Changing of downstream sequences of GFP gene induced significant effects on GFP gene expression. Short sequences of adjacent genes of GFP activated GFP gene. Bioinformatics analysis of genes which is highly expressed in differentiating cells (thymocyte cells, germinal center B-cells) and quiescent cells (T cells, B cells) shows that differentiating cells produce longer RNA than quiescent cells. These findings demonstrate that the length, sequence and producing position of RNAs are important factors for RNA regulating gene expression.
Collapse
Affiliation(s)
- Jianjun Cheng
- Department of Genetics, Hebei Medical University, Hebei Key Lab of Laboratory AnimalShijiazhuang 050017, Hebei Province, China
| | - Xiufang Wang
- Department of Genetics, Hebei Medical University, Hebei Key Lab of Laboratory AnimalShijiazhuang 050017, Hebei Province, China
| | - Nianguang Cai
- Hebei North UniversityZhangjiakou 075000, Hebei Province, China
| | - Zhihong Ma
- Clinical Laboratory, The Second Hospital of TangshanTangshan 063000, Hebei Province, China
| | - Liyan Zhang
- Hebei North UniversityZhangjiakou 075000, Hebei Province, China
| | - Zhanjun Lv
- Department of Genetics, Hebei Medical University, Hebei Key Lab of Laboratory AnimalShijiazhuang 050017, Hebei Province, China
| |
Collapse
|
12
|
A genetic program theory of aging using an RNA population model. Ageing Res Rev 2014; 13:46-54. [PMID: 24263168 DOI: 10.1016/j.arr.2013.11.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2013] [Accepted: 11/08/2013] [Indexed: 12/11/2022]
Abstract
Aging is a common characteristic of multicellular eukaryotes. Copious hypotheses have been proposed to explain the mechanisms of aging, but no single theory is generally acceptable. In this article, we refine the RNA population gene activating model (Lv et al., 2003) based on existing reports as well as on our own latest findings. We propose the RNA population model as a genetic theory of aging. The new model can also be applied to differentiation and tumorigenesis and could explain the biological significance of non-coding DNA, RNA, and repetitive sequence DNA. We provide evidence from the literature as well as from our own findings for the roles of repetitive sequences in gene activation. In addition, we predict several phenomena related to aging and differentiation based on this model.
Collapse
|
13
|
Weikard R, Hadlich F, Kuehn C. Identification of novel transcripts and noncoding RNAs in bovine skin by deep next generation sequencing. BMC Genomics 2013; 14:789. [PMID: 24225384 PMCID: PMC3833843 DOI: 10.1186/1471-2164-14-789] [Citation(s) in RCA: 96] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2013] [Accepted: 11/04/2013] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Deep RNA sequencing (RNAseq) has opened a new horizon for understanding global gene expression. The functional annotation of non-model mammalian genomes including bovines is still poor compared to that of human and mouse. This particularly applies to tissues without direct significance for milk and meat production, like skin, in spite of its multifunctional relevance for the individual. Thus, applying an RNAseq approach, we performed a whole transcriptome analysis of pigmented and nonpigmented bovine skin to describe the comprehensive transcript catalogue of this tissue. RESULTS A total of 39,577 unique primary skin transcripts were mapped to the bovine reference genome assembly. The majority of the transcripts were mapped to known transcriptional units (65%). In addition to the reannotation of known genes, a substantial number (10,884) of unknown transcripts (UTs) were discovered, which had not previously been annotated. The classification of UTs was based on the prediction of their coding potential and comparative sequence analysis, subsequently followed by meticulous manual curation. The classification analysis and experimental validation of selected UTs confirmed that RNAseq data can be used to amend the annotation of known genes by providing evidence for additional exons, untranslated regions or splice variants, by approving genes predicted in silico and by identifying novel bovine loci. A large group of UTs (4,848) was predicted to potentially represent long noncoding RNA (lncRNA). Predominantly, potential lncRNAs mapped in intergenic chromosome regions (4,365) and therefore, were classified as potential intergenic lncRNA. Our analysis revealed that only about 6% of all UTs displayed interspecies conservation and discovered a variety of unknown transcripts without interspecies homology but specific expression in bovine skin. CONCLUSIONS The results of our study demonstrate a complex transcript pattern for bovine skin and suggest a possible functional relevance of novel transcripts, including lncRNA, in the modulation of pigmentation processes. The results also indicate that the comprehensive identification and annotation of unknown transcripts from whole transcriptome analysis using RNAseq data remains a tremendous future challenge.
Collapse
Affiliation(s)
- Rosemarie Weikard
- Leibniz Institute for Farm Animal Biology (FBN), Institute of Genome Biology, Dummerstorf 18196, Germany.
| | | | | |
Collapse
|
14
|
Biswas AK, Zhang B, Wu X, Gao JX. CNCTDiscriminator: coding and noncoding transcript discriminator - an excursion through hypothesis learning and ensemble learning approaches. J Bioinform Comput Biol 2013; 11:1342002. [PMID: 24131051 DOI: 10.1142/s021972001342002x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The statistics about the open reading frames, the base compositions and the properties of the predicted secondary structures have potential to address the problem of discriminating coding and noncoding transcripts. Again, the Next Generation Sequencing platform, RNA-seq, provides us bounty of data from which expression profiles of the transcripts can be extracted which urged us adding a new set of dimension in this classification task. In this paper, we proposed CNCTDiscriminator -- a coding and noncoding transcript discriminating system where we applied the integration of these four categories of features about the transcripts. The feature integration was done using both hypothesis learning and feature specific ensemble learning approaches. The CNCTDiscriminator model which was trained with composition and ORF features outperforms (precision 83.86%, recall 82.01%) other three popular methods -- CPC (precision 98.31%, recall 25.95%), CPAT (precision 97.74%, recall 52.50%) and PORTRAIT (precision 84.37%, recall 73.2%) when applied to an independent benchmark dataset. However, the CNCTDiscriminator model that was trained using the ensemble approach shows comparable performance (precision 89.85%, recall 71.08%).
Collapse
Affiliation(s)
- Ashis Kumer Biswas
- Computer Science and Engineering, The University of Texas at Arlington, Arlington, Texas 76019, USA
| | | | | | | |
Collapse
|
15
|
Kadakkuzha BM, Puthanveettil SV. Genomics and proteomics in solving brain complexity. MOLECULAR BIOSYSTEMS 2013; 9:1807-21. [PMID: 23615871 PMCID: PMC6425491 DOI: 10.1039/c3mb25391k] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The human brain is extraordinarily complex, composed of billions of neurons and trillions of synaptic connections. Neurons are organized into circuit assemblies that are modulated by specific interneurons and non-neuronal cells, such as glia and astrocytes. Data on human genome sequences predicts that each of these cells in the human brain has the potential of expressing ∼20 000 protein coding genes and tens of thousands of noncoding RNAs. A major challenge in neuroscience is to determine (1) how individual neurons and circuitry utilize this potential during development and maturation of the nervous system, and for higher brain functions such as cognition, and (2) how this potential is altered in neurological and psychiatric disorders. In this review, we will discuss how recent advances in next generation sequencing, proteomics and bioinformatics have transformed our understanding of gene expression and the functions of neural circuitry, memory storage, and disorders of cognition.
Collapse
Affiliation(s)
- Beena M Kadakkuzha
- Department of Neuroscience, The Scripps Research Institute, Scripps Florida 130 Scripps Way, Jupiter, FL 33458, USA
| | | |
Collapse
|
16
|
Chen G, Chen J, Shi C, Shi L, Tong W, Shi T. Dissecting the Characteristics and Dynamics of Human Protein Complexes at Transcriptome Cascade Using RNA-Seq Data. PLoS One 2013; 8:e66521. [PMID: 23824284 PMCID: PMC3688907 DOI: 10.1371/journal.pone.0066521] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2013] [Accepted: 05/06/2013] [Indexed: 11/19/2022] Open
Abstract
Human protein complexes play crucial roles in various biological processes as the functional module. However, the expression features of human protein complexes at the transcriptome cascade are poorly understood. Here, we used the RNA-Seq data from 16 disparate tissues and four types of human cancers to explore the characteristics and dynamics of human protein complexes. We observed that many individual components of human protein complexes can be generated by multiple distinct transcripts. Similar with yeast, the human protein complex constituents are inclined to co-express in diverse tissues. The dominant isoform of the genes involved in protein complexes tend to encode the complex constituents in each tissue. Our results indicate that the protein complex dynamics not only correlate with the presence or absence of complexes, but may also be related to the major isoform switching for complex subunits. Between any two cancers of breast, colon, lung and prostate, we found that only a few of the differentially expressed transcripts associated with complexes were identical, but 5-10 times more protein complexes involved in differentially expressed transcripts were common. Collectively, our study reveals novel properties and dynamics of human protein complexes at the transcriptome cascade in diverse normal tissues and different cancers.
Collapse
Affiliation(s)
- Geng Chen
- The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | - Jiwei Chen
- The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | - Caiping Shi
- The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | - Leming Shi
- National Center for Toxicological Research, US Food and Drug Administration, Jefferson, Arkansas, United States of America
| | - Weida Tong
- National Center for Toxicological Research, US Food and Drug Administration, Jefferson, Arkansas, United States of America
| | - Tieliu Shi
- The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
- * E-mail:
| |
Collapse
|
17
|
Vandernoot VA, Langevin SA, Solberg OD, Lane PD, Curtis DJ, Bent ZW, Williams KP, Patel KD, Schoeniger JS, Branda SS, Lane TW. cDNA normalization by hydroxyapatite chromatography to enrich transcriptome diversity in RNA-seq applications. Biotechniques 2013; 53:373-80. [PMID: 23227988 DOI: 10.2144/000113937] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2012] [Accepted: 11/28/2012] [Indexed: 11/23/2022] Open
Abstract
Second-generation sequencing (SGS) has become the preferred method for RNA transcriptome profiling of organisms and single cells. However, SGS analysis of transcriptome diversity (including protein-coding transcripts and regulatory non-coding RNAs) is inefficient unless the sample of interest is first depleted of nucleic acids derived from ribosomal RNA (rRNA), which typically account for up to 95% of total intracellular RNA content. Here we describe a novel microscale hydroxyapatite chromatography (HAC) normalization method to remove eukaryotic and prokaryotic high abundant rRNA species, thereby increasing sequence coverage depth and transcript diversity across non-rRNA populations. RNA-seq analysis of Escherichia coli K-12 and human intracellular total RNA showed that HAC-based normalization enriched for all non-ribosomal RNA species regardless of RNA transcript abundance or length when compared with untreated controls. Microcolumn HAC normalization generated rRNA-depleted cDNA libraries comparable to the well-established duplex specific nuclease (DSN) normalization and Ribo-Zero rRNA-depletion methods, thus establishing microscale HAC as an effective, cost saving, and non-destructive alternative normalization technique.
Collapse
Affiliation(s)
- Victoria A Vandernoot
- Biotechnology and Bioengineering Department, Sandia National Laboratories, Livermore, CA, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Spradling KD, Glenn JP, Garcia R, Shade RE, Cox LA. The baboon kidney transcriptome: analysis of transcript sequence, splice variants, and abundance. PLoS One 2013; 8:e57563. [PMID: 23637735 PMCID: PMC3634053 DOI: 10.1371/journal.pone.0057563] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2012] [Accepted: 01/24/2013] [Indexed: 12/25/2022] Open
Abstract
The baboon is an invaluable model for the study of human health and disease, including many complex diseases of the kidney. Although scientists have made great progress in developing this animal as a model for numerous areas of biomedical research, genomic resources for the baboon, such as a quality annotated genome, are still lacking. To this end, we characterized the baboon kidney transcriptome using high-throughput cDNA sequencing (RNA-Seq) to identify genes, gene variants, single nucleotide polymorphisms (SNPs), insertion-deletion polymorphisms (InDels), cellular functions, and key pathways in the baboon kidney to provide a genomic resource for the baboon. Analysis of our sequencing data revealed 45,499 high-confidence SNPs and 29,813 InDels comparing baboon cDNA sequences with the human hg18 reference assembly and identified 35,900 cDNAs in the baboon kidney, including 35,150 transcripts representing 15,369 genic genes that are novel for the baboon. Gene ontology analysis of our sequencing dataset also identified numerous biological functions and canonical pathways that were significant in the baboon kidney, including a large number of metabolic pathways that support known functions of the kidney. The results presented in this study catalogues the transcribed mRNAs, noncoding RNAs, and hypothetical proteins in the baboon kidney and establishes a genomic resource for scientists using the baboon as an experimental model.
Collapse
Affiliation(s)
- Kimberly D Spradling
- Department of Genetics, Texas Biomedical Research Institute, San Antonio, Texas, United States of America.
| | | | | | | | | |
Collapse
|
19
|
Long noncoding RNAs in development and disease of the central nervous system. Trends Genet 2013; 29:461-8. [PMID: 23562612 DOI: 10.1016/j.tig.2013.03.002] [Citation(s) in RCA: 253] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2012] [Revised: 02/25/2013] [Accepted: 03/11/2013] [Indexed: 12/29/2022]
Abstract
The central nervous system (CNS) is a complex biological system composed of numerous cell types working in concert. The intricate development and functioning of this highly ordered structure depends upon exquisite spatial and temporal control of gene expression in the cells comprising the CNS. Thus, gene regulatory networks that control cell fates and functions play critical roles in the CNS. Failure to develop and maintain intricate regulatory networks properly leads to impaired development or neural dysfunction, which might manifest as neurological disorders. Long noncoding RNAs (lncRNAs) are emerging as important components of gene regulatory networks, working in concert with transcription factors and epigenetic regulators of gene expression. Interestingly, many lncRNAs are highly expressed in the adult and developing brain, often showing precise temporal and spatial patterns of expression. This specificity of expression and growing awareness of the importance of lncRNAs suggest that they play key roles in CNS development and function. In this review, we highlight the growing evidence for the importance of lncRNAs in the CNS and the indications that their dysregulation underlies some neurological disorders.
Collapse
|
20
|
Chen G, Wang C, Shi L, Qu X, Chen J, Yang J, Shi C, Chen L, Zhou P, Ning B, Tong W, Shi T. Incorporating the human gene annotations in different databases significantly improved transcriptomic and genetic analyses. RNA (NEW YORK, N.Y.) 2013; 19:479-89. [PMID: 23431329 PMCID: PMC3677258 DOI: 10.1261/rna.037473.112] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2012] [Accepted: 01/14/2013] [Indexed: 05/18/2023]
Abstract
Human gene annotation is crucial for conducting transcriptomic and genetic studies; however, the impacts of human gene annotations in diverse databases on related studies have been less evaluated. To enable full use of various human annotation resources and better understand the human transcriptome, here we systematically compare the human annotations present in RefSeq, Ensembl (GENCODE), and AceView on diverse transcriptomic and genetic analyses. We found that the human gene annotations in the three databases are far from complete. Although Ensembl and AceView annotated more genes than RefSeq, more than 15,800 genes from Ensembl (or AceView) are within the intergenic and intronic regions of AceView (or Ensembl) annotation. The human transcriptome annotations in RefSeq, Ensembl, and AceView had distinct effects on short-read mapping, gene and isoform expression profiling, and differential expression calling. Furthermore, our findings indicate that the integrated annotation of these databases can obtain a more complete gene set and significantly enhance those transcriptomic analyses. We also observed that many more known SNPs were located within genes annotated in Ensembl and AceView than in RefSeq. In particular, 1033 of 3041 trait/disease-associated SNPs involved in about 200 human traits/diseases that were previously reported to be in RefSeq intergenic regions could be relocated within Ensembl and AceView genes. Our findings illustrate that a more complete transcriptome generated by incorporating human gene annotations in diverse databases can strikingly improve the overall results of transcriptomic and genetic studies.
Collapse
Affiliation(s)
- Geng Chen
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
| | - Charles Wang
- Functional Genomics Core, Beckman Research Institute, City of Hope Comprehensive Cancer Center, Duarte, California 91010, USA
| | - Leming Shi
- National Center for Toxicological Research, US Food and Drug Administration, Jefferson, Arkansas 72079, USA
| | - Xiongfei Qu
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
| | - Jiwei Chen
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
| | - Jianmin Yang
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
| | - Caiping Shi
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
| | - Long Chen
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
| | - Peiying Zhou
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
| | - Baitang Ning
- National Center for Toxicological Research, US Food and Drug Administration, Jefferson, Arkansas 72079, USA
| | - Weida Tong
- National Center for Toxicological Research, US Food and Drug Administration, Jefferson, Arkansas 72079, USA
| | - Tieliu Shi
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai 200241, China
- Corresponding authorE-mail
| |
Collapse
|
21
|
Chen G, Shi T. Next-generation sequencing technologies for personalized medicine: promising but challenging. SCIENCE CHINA-LIFE SCIENCES 2013; 56:101-3. [PMID: 23393024 DOI: 10.1007/s11427-013-4436-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2013] [Indexed: 11/28/2022]
|
22
|
Kavanagh T, Mills JD, Kim WS, Halliday GM, Janitz M. Pathway analysis of the human brain transcriptome in disease. J Mol Neurosci 2012; 51:28-36. [PMID: 23263795 DOI: 10.1007/s12031-012-9940-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2012] [Accepted: 12/10/2012] [Indexed: 01/10/2023]
Abstract
Pathway analysis is a powerful method for discerning differentially regulated genes and elucidating their biological importance. It allows for the identification of perturbed or aberrantly expressed genes within a biological context from extensive data sets and offers a simplistic approach for interrogating such data sets. With the growing use of microarrays and RNA-Seq, data for genome-wide studies are growing at an alarming rate, and the use of deep sequencing is revealing elements of the genome previously uncharacterised. Through the employment of pathway analysis, mechanisms in complex diseases may be explored and novel causatives found primarily through differentially regulated genes. Further, with the implementation of next generation sequencing, a deeper resolution may be attained, particularly in identification of isoform diversity and SNPs. Here, we look at a broad overview of pathway analysis in the human brain transcriptome and its relevance in teasing out underlying causes of complex diseases. We will outline processes in data gathering and analysis of particular diseases in which these approaches have been successful.
Collapse
Affiliation(s)
- Tomas Kavanagh
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, 2052, Australia
| | | | | | | | | |
Collapse
|
23
|
Ng SY, Stanton LW. Long non-coding RNAs in stem cell pluripotency. WILEY INTERDISCIPLINARY REVIEWS-RNA 2012; 4:121-8. [PMID: 23139157 DOI: 10.1002/wrna.1146] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Pluripotency refers to the self-renewal of undifferentiated embryonic stem cells (ESCs), and is maintained by a tightly regulated gene regulatory network involving an intricate interplay between transcription factors and their genomic targets, as well as epigenetic processes that influence gene expression. Long non-coding RNAs (lncRNAs) are newly discovered members of gene regulatory networks that govern a variety of cell functions. Defined as RNA transcripts larger than 200 nucleotides, lncRNAs have little or no protein-coding capacity and have been shown to act via various mechanisms, and are important in a variety of biological functions. Recent reports have described the discovery of pluripotent lncRNAs involved in the maintenance and induction of stem cell pluripotency. Here, we discuss how lncRNAs may integrate into the pluripotency network, as well as prominent questions in this emerging field.
Collapse
Affiliation(s)
- Shi-Yan Ng
- Genome Institute of Singapore, Stem Cell and Developmental Biology Group, Singapore
| | | |
Collapse
|
24
|
De novo transcriptome assembly of RNA-Seq reads with different strategies. SCIENCE CHINA-LIFE SCIENCES 2012; 54:1129-33. [PMID: 22227905 DOI: 10.1007/s11427-011-4256-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2011] [Accepted: 11/15/2011] [Indexed: 10/14/2022]
Abstract
De novo transcriptome assembly is an important approach in RNA-Seq data analysis and it can help us to reconstruct the transcriptome and investigate gene expression profiles without reference genome sequences. We carried out transcriptome assemblies with two RNA-Seq datasets generated from human brain and cell line, respectively. We then determined an efficient way to yield an optimal overall assembly using three different strategies. We first assembled brain and cell line transcriptome using a single k-mer length. Next we tested a range of values of k-mer length and coverage cutoff in assembling. Lastly, we combined the assembled contigs from a range of k values to generate a final assembly. By comparing these assembly results, we found that using only one k-mer value for assembly is not enough to generate good assembly results, but combining the contigs from different k-mer values could yield longer contigs and greatly improve the overall assembly.
Collapse
|
25
|
Chen G, Wang C, Shi T. Overview of available methods for diverse RNA-Seq data analyses. SCIENCE CHINA-LIFE SCIENCES 2012; 54:1121-8. [PMID: 22227904 DOI: 10.1007/s11427-011-4255-x] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2011] [Accepted: 11/16/2011] [Indexed: 12/11/2022]
Abstract
RNA-Seq technology is becoming widely used in various transcriptomics studies; however, analyzing and interpreting the RNA-Seq data face serious challenges. With the development of high-throughput sequencing technologies, the sequencing cost is dropping dramatically with the sequencing output increasing sharply. However, the sequencing reads are still short in length and contain various sequencing errors. Moreover, the intricate transcriptome is always more complicated than we expect. These challenges proffer the urgent need of efficient bioinformatics algorithms to effectively handle the large amount of transcriptome sequencing data and carry out diverse related studies. This review summarizes a number of frequently-used applications of transcriptome sequencing and their related analyzing strategies, including short read mapping, exon-exon splice junction detection, gene or isoform expression quantification, differential expression analysis and transcriptome reconstruction.
Collapse
Affiliation(s)
- Geng Chen
- Center for Bioinformatics and Computational Biology, Institute of Biomedical Sciences, School of Life Science, East China Normal University, Shanghai 200241, China
| | | | | |
Collapse
|