1
|
Genome-Wide Identification of Putative MicroRNAs in Cassava ( Manihot esculenta Crantz) and Their Functional Landscape in Cellular Regulation. BIOMED RESEARCH INTERNATIONAL 2019; 2019:2019846. [PMID: 31321230 PMCID: PMC6607727 DOI: 10.1155/2019/2019846] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/02/2019] [Revised: 05/14/2019] [Accepted: 05/22/2019] [Indexed: 11/18/2022]
Abstract
MicroRNAs are small noncoding RNAs, involved in the regulation of many cellular processes in plants. Hundreds of miRNAs have been identified in cassava by various techniques, yet these identifications were constrained by a lack of miRNA templates and the narrow range of conditions in transcriptome study. In this research, we conducted genome-wide analysis identification, whereby miRNAs from cassava genome were thoroughly screened using bioinformatics approach independent of predefined templates and studied conditions. Our work provided a catalog of putative mature miRNAs and explored the landscape of miRNAome in cassava. These putative miRNAs were validated using statistical analysis as well as available cassava expression data. We showed that the crowded locations of cassava miRNAs are consistent with other plants and animals and hypothesized to have the same evolutionary origin. At least 10 conserved miRNAs were identified in cassava based on the comparative study of miRNA conservation. Finally, investigation of miRNAs and target gene relationships enabled us to envisage the complexities of cellular regulatory systems modulated at posttranscriptional level.
Collapse
|
2
|
Fehlmann T, Backes C, Kahraman M, Haas J, Ludwig N, Posch AE, Würstle ML, Hübenthal M, Franke A, Meder B, Meese E, Keller A. Web-based NGS data analysis using miRMaster: a large-scale meta-analysis of human miRNAs. Nucleic Acids Res 2017; 45:8731-8744. [PMID: 28911107 PMCID: PMC5587802 DOI: 10.1093/nar/gkx595] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2017] [Accepted: 07/04/2017] [Indexed: 01/21/2023] Open
Abstract
The analysis of small RNA NGS data together with the discovery of new small RNAs is among the foremost challenges in life science. For the analysis of raw high-throughput sequencing data we implemented the fast, accurate and comprehensive web-based tool miRMaster. Our toolbox provides a wide range of modules for quantification of miRNAs and other non-coding RNAs, discovering new miRNAs, isomiRs, mutations, exogenous RNAs and motifs. Use-cases comprising hundreds of samples are processed in less than 5 h with an accuracy of 99.4%. An integrative analysis of small RNAs from 1836 data sets (20 billion reads) indicated that context-specific miRNAs (e.g. miRNAs present only in one or few different tissues / cell types) still remain to be discovered while broadly expressed miRNAs appear to be largely known. In total, our analysis of known and novel miRNAs indicated nearly 22 000 candidates of precursors with one or two mature forms. Based on these, we designed a custom microarray comprising 11 872 potential mature miRNAs to assess the quality of our prediction. MiRMaster is a convenient-to-use tool for the comprehensive and fast analysis of miRNA NGS data. In addition, our predicted miRNA candidates provided as custom array will allow researchers to perform in depth validation of candidates interesting to them.
Collapse
Affiliation(s)
- Tobias Fehlmann
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, Germany
| | - Christina Backes
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, Germany
| | - Mustafa Kahraman
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, Germany.,Hummingbird Diagnostics GmbH, Heidelberg, Germany
| | - Jan Haas
- Department of Internal Medicine III, University Hospital Heidelberg, Heidelberg, Germany.,German Center for Cardiovascular Research (DZHK), Heidelberg, Germany.,Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg, Germany
| | - Nicole Ludwig
- Department of Human Genetics, Saarland University, Homburg, Germany
| | | | | | - Matthias Hübenthal
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel, Germany
| | - Andre Franke
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel, Germany
| | - Benjamin Meder
- Department of Internal Medicine III, University Hospital Heidelberg, Heidelberg, Germany.,German Center for Cardiovascular Research (DZHK), Heidelberg, Germany.,Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg, Germany
| | - Eckart Meese
- Department of Human Genetics, Saarland University, Homburg, Germany
| | - Andreas Keller
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, Germany
| |
Collapse
|
3
|
Amaral AJ, Andrade J, Foxall RB, Matoso P, Matos AM, Soares RS, Rocha C, Ramos CG, Tendeiro R, Serra-Caetano A, Guerra-Assunção JA, Santa-Marta M, Gonçalves J, Gama-Carvalho M, Sousa AE. miRNA profiling of human naive CD4 T cells links miR-34c-5p to cell activation and HIV replication. EMBO J 2017; 36:346-360. [PMID: 27993935 PMCID: PMC5286376 DOI: 10.15252/embj.201694335] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Revised: 10/25/2016] [Accepted: 10/31/2016] [Indexed: 01/16/2023] Open
Abstract
Cell activation is a vital step for T-cell memory/effector differentiation as well as for productive HIV infection. To identify novel regulators of this process, we used next-generation sequencing to profile changes in microRNA expression occurring in purified human naive CD4 T cells in response to TCR stimulation and/or HIV infection. Our results demonstrate, for the first time, the transcriptional up-regulation of miR-34c-5p in response to TCR stimulation in naive CD4 T cells. The induction of this miR was further consistently found to be reduced by both HIV-1 and HIV-2 infections. Overexpression of miR-34c-5p led to changes in the expression of several genes involved in TCR signaling and cell activation, confirming its role as a novel regulator of naive CD4 T-cell activation. We additionally show that miR-34c-5p promotes HIV-1 replication, suggesting that its down-regulation during HIV infection may be part of an anti-viral host response.
Collapse
Affiliation(s)
- Andreia J Amaral
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal
- University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Lisboa, Portugal
| | - Jorge Andrade
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal
- University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Lisboa, Portugal
| | - Russell B Foxall
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal
| | - Paula Matoso
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal
| | - Ana M Matos
- University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Lisboa, Portugal
| | - Rui S Soares
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal
| | - Cheila Rocha
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal
| | - Christian G Ramos
- University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Lisboa, Portugal
| | - Rita Tendeiro
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal
| | - Ana Serra-Caetano
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal
| | - José A Guerra-Assunção
- Bill Lyons Informatics Centre, UCL Cancer Institute, University College London, London, UK
| | - Mariana Santa-Marta
- Research Institute for Medicines (iMed ULisboa), Faculdade de Farmácia, Universidade de Lisboa, Lisboa, Portugal
| | - João Gonçalves
- Research Institute for Medicines (iMed ULisboa), Faculdade de Farmácia, Universidade de Lisboa, Lisboa, Portugal
| | - Margarida Gama-Carvalho
- University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Lisboa, Portugal
| | - Ana E Sousa
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal
| |
Collapse
|
4
|
Prabahar A, Natarajan J. Prediction of microRNAs involved in immune system diseases through network based features. J Biomed Inform 2016; 65:34-45. [PMID: 27871823 DOI: 10.1016/j.jbi.2016.11.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2016] [Revised: 09/28/2016] [Accepted: 11/13/2016] [Indexed: 12/13/2022]
Abstract
MicroRNAs are a class of small non-coding regulatory RNA molecules that modulate the expression of several genes at post-transcriptional level and play a vital role in disease pathogenesis. Recent research shows that a range of miRNAs are involved in the regulation of immunity and its deregulation results in immune mediated diseases such as cancer, inflammation and autoimmune diseases. Computational discovery of these immune miRNAs using a set of specific features is highly desirable. In the current investigation, we present a SVM based classification system which uses a set of novel network based topological and motif features in addition to the baseline sequential and structural features to predict immune specific miRNAs from other non-immune miRNAs. The classifier was trained and tested on a balanced set of equal number of positive and negative examples to show the discriminative power of our network features. Experimental results show that our approach achieves an accuracy of 90.2% and outperforms the classification accuracy of 63.2% reported using the traditional miRNA sequential and structural features. The proposed classifier was further validated with two immune disease sub-class datasets related to multiple sclerosis microarray data and psoriasis RNA-seq data with higher accuracy. These results indicate that our classifier which uses network and motif features along with sequential and structural features will lead to significant improvement in classifying immune miRNAs and hence can be applied to identify other specific classes of miRNAs as an extensible miRNA classification system.
Collapse
Affiliation(s)
- Archana Prabahar
- Data Mining and Text Mining Laboratory, Department of Bioinformatics, Bharathiar University, Coimbatore 641 046, India.
| | - Jeyakumar Natarajan
- Data Mining and Text Mining Laboratory, Department of Bioinformatics, Bharathiar University, Coimbatore 641 046, India.
| |
Collapse
|
5
|
Lertampaiporn S, Thammarongtham C, Nukoolkit C, Kaewkamnerdpong B, Ruengjitchatchawalya M. Identification of non-coding RNAs with a new composite feature in the Hybrid Random Forest Ensemble algorithm. Nucleic Acids Res 2014; 42:e93. [PMID: 24771344 PMCID: PMC4066759 DOI: 10.1093/nar/gku325] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2014] [Revised: 04/02/2014] [Accepted: 04/07/2014] [Indexed: 12/13/2022] Open
Abstract
To identify non-coding RNA (ncRNA) signals within genomic regions, a classification tool was developed based on a hybrid random forest (RF) with a logistic regression model to efficiently discriminate short ncRNA sequences as well as long complex ncRNA sequences. This RF-based classifier was trained on a well-balanced dataset with a discriminative set of features and achieved an accuracy, sensitivity and specificity of 92.11%, 90.7% and 93.5%, respectively. The selected feature set includes a new proposed feature, SCORE. This feature is generated based on a logistic regression function that combines five significant features-structure, sequence, modularity, structural robustness and coding potential-to enable improved characterization of long ncRNA (lncRNA) elements. The use of SCORE improved the performance of the RF-based classifier in the identification of Rfam lncRNA families. A genome-wide ncRNA classification framework was applied to a wide variety of organisms, with an emphasis on those of economic, social, public health, environmental and agricultural significance, such as various bacteria genomes, the Arthrospira (Spirulina) genome, and rice and human genomic regions. Our framework was able to identify known ncRNAs with sensitivities of greater than 90% and 77.7% for prokaryotic and eukaryotic sequences, respectively. Our classifier is available at http://ncrna-pred.com/HLRF.htm.
Collapse
Affiliation(s)
- Supatcha Lertampaiporn
- Biological Engineering Program, Faculty of Engineering, King Mongkut's University of Technology Thonburi, 126 Pracha Uthit Rd, Bangmod, Thung Khru, Bangkok 10140, Thailand
| | - Chinae Thammarongtham
- Biochemical Engineering and Pilot Plant Research and Development Unit, National Center for Genetic Engineering and Biotechnology at King Mongkut's University of Technology Thonburi (Bang Khun Thian Campus), 49 Soi Thian Thale 25, Bang Khun Thian Chai Thale Rd, Tha Kham, Bangkok 10150, Thailand
| | - Chakarida Nukoolkit
- School of Information Technology, King Mongkut's University of Technology Thonburi, 126 Pracha Uthit Rd, Bangmod, Thung Khru, Bangkok 10140, Thailand
| | - Boonserm Kaewkamnerdpong
- Biological Engineering Program, Faculty of Engineering, King Mongkut's University of Technology Thonburi, 126 Pracha Uthit Rd, Bangmod, Thung Khru, Bangkok 10140, Thailand
| | - Marasri Ruengjitchatchawalya
- Biotechnology Program, School of Bioresources and Technology, King Mongkut's University of Technology Thonburi (Bang Khun Thian Campus), 49 Soi Thian Thale 25, Bang Khun Thian Chai Thale Rd, Tha Kham, Bangkok 10150, Thailand Bioinformatics and Systems Biology Program, King Mongkut's University of Technology Thonburi (Bang Khun Thian Campus), 49 Soi Thian Thale 25, Bang Khun Thian Chai Thale Rd, Tha Kham, Bangkok 10150, Thailand
| |
Collapse
|
6
|
Mao Y, Li Q, Zhang Y, Zhang J, Wei G, Tao S. Genome-wide analysis of selective constraints on high stability regions of mRNA reveals multiple compensatory mutations in Escherichia coli. PLoS One 2013; 8:e73299. [PMID: 24086278 PMCID: PMC3785496 DOI: 10.1371/journal.pone.0073299] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2013] [Accepted: 07/18/2013] [Indexed: 12/27/2022] Open
Abstract
Message RNA (mRNA) carries a large number of local secondary structures, with structural stability to participate in the regulations of gene expression. A worthy question is how the local structural stability is maintained under the constraint that multiple selective pressures are imposed on mRNA local regions. Here, we performed the first genome-wide study of natural selection operating on high structural stability regions (HSRs) of mRNAs in Escherichia coli. We found that HSR tends to adjust the folded conformation to reduce the harm of mutations, showing a high level of mutational robustness. Moreover, guanine preference in HSR was observed, supporting the hypothesis that the selective constraint for high structural stability may partly account for the high percentage of G content in Escherichia coli genome. Notably, we found a substantially reduced synonymous substitution rate in HSRs compared with that in their adjacent regions. Surprisingly and interestingly, the non-key sites in HSRs, which have slight effect on structural stability, have synonymous substitution rate equivalent to background regions. To explain this result, we identified compensatory mutations in HSRs based on structural stability, and found that a considerable number of synonymous mutations occur to restore the structural stability decreased heavily by the mutations on key sites. Overall, these results suggest a significant role of local structural stability as a selective force operating on mRNA, which furthers our understanding of the constraints imposed on protein-coding RNAs.
Collapse
Affiliation(s)
- Yuanhui Mao
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China
- Bioinformatics Center, Northwest A&F University, Yangling, Shaanxi, China
| | - Qian Li
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China
- Bioinformatics Center, Northwest A&F University, Yangling, Shaanxi, China
| | - Yinwen Zhang
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China
- Bioinformatics Center, Northwest A&F University, Yangling, Shaanxi, China
| | - Junjie Zhang
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China
- Bioinformatics Center, Northwest A&F University, Yangling, Shaanxi, China
| | - Gehong Wei
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China
- * E-mail: (GW); (ST)
| | - Shiheng Tao
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China
- Bioinformatics Center, Northwest A&F University, Yangling, Shaanxi, China
- * E-mail: (GW); (ST)
| |
Collapse
|
7
|
Mao Y, Li Q, Wang W, Liang P, Tao S. Number variation of high stability regions is correlated with gene functions. Genome Biol Evol 2013; 5:484-93. [PMID: 23407773 PMCID: PMC3622296 DOI: 10.1093/gbe/evt020] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Various regulatory elements in messenger RNAs (mRNAs) carrying the secondary structure play important roles in a wide range of expression processes. Numerous recent works have focused on the discovery of these functional elements that contain the conserved mRNA structures. However, to date, regions with high structural stability have been largely overlooked. In this study, we defined high stability regions (HSRs) in the coding sequences (CDSs) in bacteria based on the normalized folding free energy. We found that CDSs had high number of HSRs, and these HSRs showed high structural context robustness compared with random sequences, indicating a direct selective constraint imposed on HSRs. A reduced ribosome speed was detected near the start position of HSR, implying a possibility that HSR acted as obstacle to drive translational pausing that coordinated protein synthesis. Interestingly, we found that genes with high HSR density were enriched in the processes of translation, protein folding, and cell division. In addition, essential genes exhibited higher HSR density than nonessential genes. Overall, our study presented the previously unappreciated correlation between the number variation of HSRs and cellular processes.
Collapse
Affiliation(s)
- Yuanhui Mao
- College of life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China
| | | | | | | | | |
Collapse
|
8
|
Lertampaiporn S, Thammarongtham C, Nukoolkit C, Kaewkamnerdpong B, Ruengjitchatchawalya M. Heterogeneous ensemble approach with discriminative features and modified-SMOTEbagging for pre-miRNA classification. Nucleic Acids Res 2012; 41:e21. [PMID: 23012261 PMCID: PMC3592496 DOI: 10.1093/nar/gks878] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
An ensemble classifier approach for microRNA precursor (pre-miRNA) classification was proposed based upon combining a set of heterogeneous algorithms including support vector machine (SVM), k-nearest neighbors (kNN) and random forest (RF), then aggregating their prediction through a voting system. Additionally, the proposed algorithm, the classification performance was also improved using discriminative features, self-containment and its derivatives, which have shown unique structural robustness characteristics of pre-miRNAs. These are applicable across different species. By applying preprocessing methods--both a correlation-based feature selection (CFS) with genetic algorithm (GA) search method and a modified-Synthetic Minority Oversampling Technique (SMOTE) bagging rebalancing method--improvement in the performance of this ensemble was observed. The overall prediction accuracies obtained via 10 runs of 5-fold cross validation (CV) was 96.54%, with sensitivity of 94.8% and specificity of 98.3%-this is better in trade-off sensitivity and specificity values than those of other state-of-the-art methods. The ensemble model was applied to animal, plant and virus pre-miRNA and achieved high accuracy, >93%. Exploiting the discriminative set of selected features also suggests that pre-miRNAs possess high intrinsic structural robustness as compared with other stem loops. Our heterogeneous ensemble method gave a relatively more reliable prediction than those using single classifiers. Our program is available at http://ncrna-pred.com/premiRNA.html.
Collapse
Affiliation(s)
- Supatcha Lertampaiporn
- Biological Engineering Program, King Mongkut's University of Technology Thonburi, Bang Mod, Thung Khru, Bangkok 10140, Thailand
| | | | | | | | | |
Collapse
|
9
|
Parts L, Hedman ÅK, Keildson S, Knights AJ, Abreu-Goodger C, van de Bunt M, Guerra-Assunção JA, Bartonicek N, van Dongen S, Mägi R, Nisbet J, Barrett A, Rantalainen M, Nica AC, Quail MA, Small KS, Glass D, Enright AJ, Winn J, Deloukas P, Dermitzakis ET, McCarthy MI, Spector TD, Durbin R, Lindgren CM. Extent, causes, and consequences of small RNA expression variation in human adipose tissue. PLoS Genet 2012; 8:e1002704. [PMID: 22589741 PMCID: PMC3349731 DOI: 10.1371/journal.pgen.1002704] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2012] [Accepted: 03/27/2012] [Indexed: 12/12/2022] Open
Abstract
Small RNAs are functional molecules that modulate mRNA transcripts and have been implicated in the aetiology of several common diseases. However, little is known about the extent of their variability within the human population. Here, we characterise the extent, causes, and effects of naturally occurring variation in expression and sequence of small RNAs from adipose tissue in relation to genotype, gene expression, and metabolic traits in the MuTHER reference cohort. We profiled the expression of 15 to 30 base pair RNA molecules in subcutaneous adipose tissue from 131 individuals using high-throughput sequencing, and quantified levels of 591 microRNAs and small nucleolar RNAs. We identified three genetic variants and three RNA editing events. Highly expressed small RNAs are more conserved within mammals than average, as are those with highly variable expression. We identified 14 genetic loci significantly associated with nearby small RNA expression levels, seven of which also regulate an mRNA transcript level in the same region. In addition, these loci are enriched for variants significant in genome-wide association studies for body mass index. Contrary to expectation, we found no evidence for negative correlation between expression level of a microRNA and its target mRNAs. Trunk fat mass, body mass index, and fasting insulin were associated with more than twenty small RNA expression levels each, while fasting glucose had no significant associations. This study highlights the similar genetic complexity and shared genetic control of small RNA and mRNA transcripts, and gives a quantitative picture of small RNA expression variation in the human population.
Collapse
Affiliation(s)
- Leopold Parts
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | - Åsa K. Hedman
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Sarah Keildson
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | | | - Cei Abreu-Goodger
- European Bioinformatics Institute, Hinxton, United Kingdom
- National Laboratory of Genomics for Biodiversity (Langebio), Cinvestav, Irapuato, Mexico
| | - Martijn van de Bunt
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Oxford Centre for Diabetes, Endocrinology, and Metabolism, University of Oxford, Oxford, United Kingdom
| | - José Afonso Guerra-Assunção
- European Bioinformatics Institute, Hinxton, United Kingdom
- PDBC, Instituto Gulbenkian de Ciência, Oeiras, Portugal
| | | | | | - Reedik Mägi
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Estonian Genome Center, University of Tartu, Tartu, Estonia
| | - James Nisbet
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | - Amy Barrett
- Oxford Centre for Diabetes, Endocrinology, and Metabolism, University of Oxford, Oxford, United Kingdom
| | - Mattias Rantalainen
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Department of Statistics, University of Oxford, Oxford, United Kingdom
| | - Alexandra C. Nica
- Department of Genetic Medicine and Development and Institute of Genetics and Genomics in Geneva, University of Geneva Medical School, Geneva, Switzerland
| | | | - Kerrin S. Small
- Department of Twin Research and Genetic Epidemiology, King's College London, London, United Kingdom
| | - Daniel Glass
- Department of Twin Research and Genetic Epidemiology, King's College London, London, United Kingdom
| | | | - John Winn
- Microsoft Research, Cambridge, United Kingdom
| | | | - Panos Deloukas
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | - Emmanouil T. Dermitzakis
- Department of Genetic Medicine and Development and Institute of Genetics and Genomics in Geneva, University of Geneva Medical School, Geneva, Switzerland
| | - Mark I. McCarthy
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Oxford Centre for Diabetes, Endocrinology, and Metabolism, University of Oxford, Oxford, United Kingdom
| | - Timothy D. Spector
- Department of Twin Research and Genetic Epidemiology, King's College London, London, United Kingdom
| | - Richard Durbin
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | - Cecilia M. Lindgren
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
10
|
Mutation in a primate-conserved retrotransposon reveals a noncoding RNA as a mediator of infantile encephalopathy. Proc Natl Acad Sci U S A 2012; 109:4980-5. [PMID: 22411793 DOI: 10.1073/pnas.1111596109] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
The human genome is densely populated with transposons and transposon-like repetitive elements. Although the impact of these transposons and elements on human genome evolution is recognized, the significance of subtle variations in their sequence remains mostly unexplored. Here we report homozygosity mapping of an infantile neurodegenerative disease locus in a genetic isolate. Complete DNA sequencing of the 400-kb linkage locus revealed a point mutation in a primate-specific retrotransposon that was transcribed as part of a unique noncoding RNA, which was expressed in the brain. In vitro knockdown of this RNA increased neuronal apoptosis, consistent with the inappropriate dosage of this RNA in vivo and with the phenotype. Moreover, structural analysis of the sequence revealed a small RNA-like hairpin that was consistent with the putative gain of a functional site when mutated. We show here that a mutation in a unique transposable element-containing RNA is associated with lethal encephalopathy, and we suggest that RNAs that harbor evolutionarily recent repetitive elements may play important roles in human brain development.
Collapse
|
11
|
Mendes ND, Freitas AT, Vasconcelos AT, Sagot MF. Combination of measures distinguishes pre-miRNAs from other stem-loops in the genome of the newly sequenced Anopheles darlingi. BMC Genomics 2010; 11:529. [PMID: 20920257 PMCID: PMC2997018 DOI: 10.1186/1471-2164-11-529] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2010] [Accepted: 09/29/2010] [Indexed: 12/16/2022] Open
Abstract
Background Efforts using computational algorithms towards the enumeration of the full set of miRNAs of an organism have been limited by strong reliance on arguments of precursor conservation and feature similarity. However, miRNA precursors may arise anew or be lost across the evolutionary history of a species and a newly sequenced genome may be evolutionarily too distant from other genomes for an adequate comparative analysis. In addition, the learning of intricate classification rules based purely on features shared by miRNA precursors that are currently known may reflect a perpetuating identification bias rather than a sound means to tell true miRNAs from other genomic stem-loops. Results We show that there is a strong bias amongst annotated pre-miRNAs towards robust stem-loops in the genomes of Drosophila melanogaster and Anopheles gambiae and we propose a scoring scheme for precursor candidates which combines four robustness measures. Additionally, we identify several known pre-miRNA homologs in the newly sequenced Anopheles darlingi and show that most are found amongst the top-scoring precursor candidates. Furthermore, a comparison of the performance of our approach is made against two single-genome pre-miRNA classification methods. Conclusions In this paper we present a strategy to sieve through the vast amount of stem-loops found in metazoan genomes in search of pre-miRNAs, significantly reducing the set of candidates while retaining most known miRNA precursors. This approach makes no use of conservation data and relies solely on properties derived from our knowledge of miRNA biogenesis.
Collapse
Affiliation(s)
- Nuno D Mendes
- Équipe BAOBAB, Laboratoire de Biométrie et Biologie Évolutive (UMR 5558), CNRS, Univ, Lyon 1, 43 Bd du 11 Nov 1918, 69622, Villeurbanne Cedex, France.
| | | | | | | |
Collapse
|
12
|
Lazzari B, Caprera A, Cestaro A, Merelli I, Del Corvo M, Fontana P, Milanesi L, Velasco R, Stella A. Ontology-oriented retrieval of putative microRNAs in Vitis vinifera via GrapeMiRNA: a web database of de novo predicted grape microRNAs. BMC PLANT BIOLOGY 2009; 9:82. [PMID: 19563653 PMCID: PMC2717091 DOI: 10.1186/1471-2229-9-82] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2009] [Accepted: 06/29/2009] [Indexed: 05/04/2023]
Abstract
BACKGROUND Two complete genome sequences are available for Vitis vinifera Pinot noir. Based on the sequence and gene predictions produced by the IASMA, we performed an in silico detection of putative microRNA genes and of their targets, and collected the most reliable microRNA predictions in a web database. The application is available at http://www.itb.cnr.it/ptp/grapemirna/. DESCRIPTION The program FindMiRNA was used to detect putative microRNA genes in the grape genome. A very high number of predictions was retrieved, calling for validation. Nine parameters were calculated and, based on the grape microRNAs dataset available at miRBase, thresholds were defined and applied to FindMiRNA predictions having targets in gene exons. In the resulting subset, predictions were ranked according to precursor positions and sequence similarity, and to target identity. To further validate FindMiRNA predictions, comparisons to the Arabidopsis genome, to the grape Genoscope genome, and to the grape EST collection were performed. Results were stored in a MySQL database and a web interface was prepared to query the database and retrieve predictions of interest. CONCLUSION The GrapeMiRNA database encompasses 5,778 microRNA predictions spanning the whole grape genome. Predictions are integrated with information that can be of use in selection procedures. Tools added in the web interface also allow to inspect predictions according to gene ontology classes and metabolic pathways of targets. The GrapeMiRNA database can be of help in selecting candidate microRNA genes to be validated.
Collapse
Affiliation(s)
- Barbara Lazzari
- Technology Park Lodi, Località Cascina Codazza, Via Einstein, 26900 Lodi, Italy
| | - Andrea Caprera
- Technology Park Lodi, Località Cascina Codazza, Via Einstein, 26900 Lodi, Italy
| | - Alessandro Cestaro
- IASMA Research Center, Via E. Mach 1, 38010 San Michele all'Adige (TN), Italy
| | - Ivan Merelli
- Institute for Biomedical Technologies (CNR), via Fratelli Cervi 93, 20090 Segrate (MI), Italy
| | - Marcello Del Corvo
- Technology Park Lodi, Località Cascina Codazza, Via Einstein, 26900 Lodi, Italy
| | - Paolo Fontana
- IASMA Research Center, Via E. Mach 1, 38010 San Michele all'Adige (TN), Italy
| | - Luciano Milanesi
- Institute for Biomedical Technologies (CNR), via Fratelli Cervi 93, 20090 Segrate (MI), Italy
| | - Riccardo Velasco
- IASMA Research Center, Via E. Mach 1, 38010 San Michele all'Adige (TN), Italy
| | - Alessandra Stella
- Technology Park Lodi, Località Cascina Codazza, Via Einstein, 26900 Lodi, Italy
- Institute of Agricultural Biology and Biotechnology (CNR), via Bassini 15, 20133 Milan, Italy
| |
Collapse
|