1
|
Cohen-Davidi E, Veksler-Lublinsky I. Benchmarking the negatives: Effect of negative data generation on the classification of miRNA-mRNA interactions. PLoS Comput Biol 2024; 20:e1012385. [PMID: 39186797 PMCID: PMC11379385 DOI: 10.1371/journal.pcbi.1012385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 09/06/2024] [Accepted: 08/04/2024] [Indexed: 08/28/2024] Open
Abstract
MicroRNAs (miRNAs) are small non-coding RNAs that regulate gene expression post-transcriptionally. In animals, this regulation is achieved via base-pairing with partially complementary sequences on mainly 3' UTR region of messenger RNAs (mRNAs). Computational approaches that predict miRNA target interactions (MTIs) facilitate the process of narrowing down potential targets for experimental validation. The availability of new datasets of high-throughput, direct MTIs has led to the development of machine learning (ML) based methods for MTI prediction. To train an ML algorithm, it is beneficial to provide entries from all class labels (i.e., positive and negative). Currently, no high-throughput assays exist for capturing negative examples. Therefore, current ML approaches must rely on either artificially generated or inferred negative examples deduced from experimentally identified positive miRNA-target datasets. Moreover, the lack of uniform standards for generating such data leads to biased results and hampers comparisons between studies. In this comprehensive study, we collected methods for generating negative data for animal miRNA-target interactions and investigated their impact on the classification of true human MTIs. Our study relies on training ML models on a fixed positive dataset in combination with different negative datasets and evaluating their intra- and cross-dataset performance. As a result, we were able to examine each method independently and evaluate ML models' sensitivity to the methodologies utilized in negative data generation. To achieve a deep understanding of the performance results, we analyzed unique features that distinguish between datasets. In addition, we examined whether one-class classification models that utilize solely positive interactions for training are suitable for the task of MTI classification. We demonstrate the importance of negative data in MTI classification, analyze specific methodological characteristics that differentiate negative datasets, and highlight the challenge of ML models generalizing interaction rules from training to testing sets derived from different approaches. This study provides valuable insights into the computational prediction of MTIs that can be further used to establish standards in the field.
Collapse
Affiliation(s)
- Efrat Cohen-Davidi
- Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| | - Isana Veksler-Lublinsky
- Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| |
Collapse
|
2
|
Daniel Thomas S, Vijayakumar K, John L, Krishnan D, Rehman N, Revikumar A, Kandel Codi JA, Prasad TSK, S S V, Raju R. Machine Learning Strategies in MicroRNA Research: Bridging Genome to Phenome. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2024; 28:213-233. [PMID: 38752932 DOI: 10.1089/omi.2024.0047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2024]
Abstract
MicroRNAs (miRNAs) have emerged as a prominent layer of regulation of gene expression. This article offers the salient and current aspects of machine learning (ML) tools and approaches from genome to phenome in miRNA research. First, we underline that the complexity in the analysis of miRNA function ranges from their modes of biogenesis to the target diversity in diverse biological conditions. Therefore, it is imperative to first ascertain the miRNA coding potential of genomes and understand the regulatory mechanisms of their expression. This knowledge enables the efficient classification of miRNA precursors and the identification of their mature forms and respective target genes. Second, and because one miRNA can target multiple mRNAs and vice versa, another challenge is the assessment of the miRNA-mRNA target interaction network. Furthermore, long-noncoding RNA (lncRNA)and circular RNAs (circRNAs) also contribute to this complexity. ML has been used to tackle these challenges at the high-dimensional data level. The present expert review covers more than 100 tools adopting various ML approaches pertaining to, for example, (1) miRNA promoter prediction, (2) precursor classification, (3) mature miRNA prediction, (4) miRNA target prediction, (5) miRNA- lncRNA and miRNA-circRNA interactions, (6) miRNA-mRNA expression profiling, (7) miRNA regulatory module detection, (8) miRNA-disease association, and (9) miRNA essentiality prediction. Taken together, we unpack, critically examine, and highlight the cutting-edge synergy of ML approaches and miRNA research so as to develop a dynamic and microlevel understanding of human health and diseases.
Collapse
Affiliation(s)
- Sonet Daniel Thomas
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
- Centre for Systems Biology and Molecular Medicine (CSBMM), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Krithika Vijayakumar
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Levin John
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Deepak Krishnan
- Centre for Systems Biology and Molecular Medicine (CSBMM), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Niyas Rehman
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Amjesh Revikumar
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
- Kerala Genome Data Centre, Kerala Development and Innovation Strategic Council, Thiruvananthapuram, Kerala, India
| | - Jalaluddin Akbar Kandel Codi
- Department of Surgical Oncology, Yenepoya Medical College, Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | | | - Vinodchandra S S
- Department of Computer Science, University of Kerala, Thiruvananthapuram, Kerala, India
| | - Rajesh Raju
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
- Centre for Systems Biology and Molecular Medicine (CSBMM), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| |
Collapse
|
3
|
Yang TH, Chen JC, Lee YH, Lu SY, Wu SH, Chang FY, Huang YC, Lee MH, Tseng YY, Wu WS. Identifying Human miRNA Target Sites via Learning the Interaction Patterns between miRNA and mRNA Segments. J Chem Inf Model 2024; 64:2445-2453. [PMID: 37903033 DOI: 10.1021/acs.jcim.3c01150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2023]
Abstract
miRNAs (microRNAs) target specific mRNA (messenger RNA) sites to regulate their translation expression. Although miRNA targeting can rely on seed region base pairing, animal miRNAs, including human miRNAs, typically cooperate with several cofactors, leading to various noncanonical pairing rules. Therefore, identifying the binding sites of animal miRNAs remains challenging. Because experiments for mapping miRNA targets are costly, computational methods are preferred for extracting potential miRNA-mRNA fragment binding pairs first. However, existing prediction tools can have significant false positives due to the prevalent noncanonical miRNA binding behaviors and the information-biased training negative sets that were used while constructing these tools. To overcome these obstacles, we first prepared an information-balanced miRNA binding pair ground-truth data set. A miRNA-mRNA interaction-aware model was then designed to help identify miRNA binding events. On the test set, our model (auROC = 94.4%) outperformed existing models by at least 2.8% in auROC. Furthermore, we showed that this model can suggest potential binding patterns for miRNA-mRNA sequence interacting pairs. Finally, we made the prepared data sets and the designed model available at http://cosbi2.ee.ncku.edu.tw/mirna_binding/download.
Collapse
Affiliation(s)
- Tzu-Hsien Yang
- Department of Biomedical Engineering, National Cheng Kung University, No.1, University Road, Tainan 701, Taiwan
- Medical Device Innovation Center, National Cheng Kung University, No.1 University Road, Tainan 701, Taiwan
| | - Jhih-Cheng Chen
- Department of Electrical Engineering, National Cheng Kung University, No.1, University Road, Tainan 701, Taiwan
| | - Yuan-Han Lee
- Department of Electrical Engineering, National Cheng Kung University, No.1, University Road, Tainan 701, Taiwan
| | - Shang-Yi Lu
- Department of Electrical Engineering, National Cheng Kung University, No.1, University Road, Tainan 701, Taiwan
| | - Sheng-Hang Wu
- Department of Information Management, National University of Kaohsiung, Kaohsiung University Rd, Kaohsiung 811, Taiwan
| | - Fang-Yuan Chang
- Department of Information Management, National University of Kaohsiung, Kaohsiung University Rd, Kaohsiung 811, Taiwan
| | - Yan-Cheng Huang
- Department of Electrical Engineering, National Cheng Kung University, No.1, University Road, Tainan 701, Taiwan
| | - Mei-Hsien Lee
- Department of Mathematics, University of Taipei, No.1, Ai-Guo West Road, Taipei 100234, Taiwan
| | - Yan-Yuan Tseng
- Center for Molecular Medicine and Genetics, Wayne State University, School of Medicine, Detroit, Michigan 48201, United States
| | - Wei-Sheng Wu
- Department of Electrical Engineering, National Cheng Kung University, No.1, University Road, Tainan 701, Taiwan
| |
Collapse
|
4
|
Zhan PC, Yang T, Zhang Y, Liu KY, Li Z, Zhang YY, Liu X, Liu NN, Wang HX, Shang B, Chen Y, Jiang HY, Zhao XT, Shao JH, Chen Z, Wang XD, Wang K, Gao JB, Lyu PJ. Radiomics using CT images for preoperative prediction of lymph node metastasis in perihilar cholangiocarcinoma: a multi-centric study. Eur Radiol 2024; 34:1280-1291. [PMID: 37589900 DOI: 10.1007/s00330-023-10108-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 06/07/2023] [Accepted: 06/29/2023] [Indexed: 08/18/2023]
Abstract
OBJECTIVES To develop a CT-based radiomics model for preoperative prediction of lymph node (LN) metastasis in perihilar cholangiocarcinoma (pCCA). METHODS The study enrolled consecutive pCCA patients from three independent Chinese medical centers. The Boruta algorithm was applied to build the radiomics signature for the primary tumor and LN. The k-means algorithm was employed to cluster the selected LNs based on the radiomics signature LN. Support vector machines were used to construct the prediction models. The diagnostic efficiency was measured by the area under the receiver operating characteristic curve (AUC). The optimal model was evaluated in terms of calibration, clinical usefulness, and prognostic value. RESULTS A total of 214 patients were included in the study (mean age: 61.6 years ± 9.4; 130 male). The selected LNs were classified into two clusters, which were significantly correlated with LN metastasis in all cohorts (p < 0.001). The model incorporated the clinical risk factors, radiomics signature primary tumor, and the LN cluster obtained the best discrimination, with AUC values of 0.981 (95% CI: 0.962-1), 0.896 (95% CI: 0.810-0.982), and 0.865 (95% CI: 0.768-0.961) in the training, internal validation, and external validation cohorts, respectively. High-risk patients predicted by the optimal model had shorter overall survival than low-risk patients (median, 13.7 vs. 27.3 months, p < 0.001). CONCLUSIONS The study proposed a radiomics model with good performance to predict LN metastasis in pCCA. As a noninvasive preoperative prediction tool, this model may help in patient risk stratification and personalized treatment. CLINICAL RELEVANCE STATEMENT A CT-based radiomics model accurately predicts lymph node metastasis in perihilar cholangiocarcinoma patients. This noninvasive preoperative tool can aid in patient risk stratification and personalized treatment, potentially improving patient outcomes. KEY POINTS • The radiomics model based on contrast-enhanced CT is a useful tool for preoperative prediction of lymph node metastasis in perihilar cholangiocarcinoma. • Radiomics features extracted from lymph nodes show great potential for predicting lymph node metastasis. • The study is the first to identify a lymph node phenotype with a high probability of metastasis based on radiomics.
Collapse
Affiliation(s)
- Peng-Chao Zhan
- Department of Radiology, The First Affiliated Hospital of Zhengzhou University, No.1 Jianshe Road, ZhengzhouZhengzhou, 450052, China
| | - Ting Yang
- Department of Radiology, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Yuan Zhang
- Department of Radiology, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, 510080, China
| | - Ke-Yan Liu
- Zhengzhou University Medical College, Zhengzhou, 450052, China
| | - Zhen Li
- Department of Interventional Radiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, China.
| | - Yu-Yuan Zhang
- Department of Interventional Radiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, China
| | - Xing Liu
- Department of Radiology, The First Affiliated Hospital of Zhengzhou University, No.1 Jianshe Road, ZhengzhouZhengzhou, 450052, China
| | - Na-Na Liu
- Department of Radiology, The First Affiliated Hospital of Zhengzhou University, No.1 Jianshe Road, ZhengzhouZhengzhou, 450052, China
| | - Hui-Xia Wang
- Department of Radiology, The First Affiliated Hospital of Zhengzhou University, No.1 Jianshe Road, ZhengzhouZhengzhou, 450052, China
| | - Bo Shang
- Department of Radiology, The First Affiliated Hospital of Zhengzhou University, No.1 Jianshe Road, ZhengzhouZhengzhou, 450052, China
| | - Yan Chen
- Department of Radiology, The First Affiliated Hospital of Zhengzhou University, No.1 Jianshe Road, ZhengzhouZhengzhou, 450052, China
| | - Han-Yu Jiang
- Department of Radiology, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Xiang-Tian Zhao
- Department of Radiology, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, 510080, China
| | - Jing-Hai Shao
- Department of Radiology, He Nan Sui Xian People's Hospital, Shangqiu, 476000, China
| | - Zhe Chen
- Department of Radiology, People's Hospital of Tanghe, Nanyang, 473000, China
| | - Xin-Dong Wang
- Department of Radiology, People's Hospital of Tanghe, Nanyang, 473000, China
| | - Kang Wang
- Department of Radiology, People's Hospital of Tanghe, Nanyang, 473000, China
| | - Jian-Bo Gao
- Department of Radiology, The First Affiliated Hospital of Zhengzhou University, No.1 Jianshe Road, ZhengzhouZhengzhou, 450052, China.
| | - Pei-Jie Lyu
- Department of Radiology, The First Affiliated Hospital of Zhengzhou University, No.1 Jianshe Road, ZhengzhouZhengzhou, 450052, China.
| |
Collapse
|
5
|
Chiang TW, Jhong SE, Chen YC, Chen CY, Wu WS, Chuang TJ. FL-circAS: an integrative resource and analysis for full-length sequences and alternative splicing of circular RNAs with nanopore sequencing. Nucleic Acids Res 2024; 52:D115-D123. [PMID: 37823705 PMCID: PMC10767854 DOI: 10.1093/nar/gkad829] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 08/26/2023] [Accepted: 10/02/2023] [Indexed: 10/13/2023] Open
Abstract
Circular RNAs (circRNAs) are RNA molecules with a continuous loop structure characterized by back-splice junctions (BSJs). While analyses of short-read RNA sequencing have identified millions of BSJ events, it is inherently challenging to determine exact full-length sequences and alternatively spliced (AS) isoforms of circRNAs. Recent advances in nanopore long-read sequencing with circRNA enrichment bring an unprecedented opportunity for investigating the issues. Here, we developed FL-circAS (https://cosbi.ee.ncku.edu.tw/FL-circAS/), which collected such long-read sequencing data of 20 cell lines/tissues and thereby identified 884 636 BSJs with 1 853 692 full-length circRNA isoforms in human and 115 173 BSJs with 135 617 full-length circRNA isoforms in mouse. FL-circAS also provides multiple circRNA features. For circRNA expression, FL-circAS calculates expression levels for each circRNA isoform, cell line/tissue specificity at both the BSJ and isoform levels, and AS entropy for each BSJ across samples. For circRNA biogenesis, FL-circAS identifies reverse complementary sequences and RNA binding protein (RBP) binding sites residing in flanking sequences of BSJs. For functional patterns, FL-circAS identifies potential microRNA/RBP binding sites and several types of evidence for circRNA translation on each full-length circRNA isoform. FL-circAS provides user-friendly interfaces for browsing, searching, analyzing, and downloading data, serving as the first resource for discovering full-length circRNAs at the isoform level.
Collapse
Affiliation(s)
- Tai-Wei Chiang
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
| | - Song-En Jhong
- Department of Electrical Engineering, National Cheng Kung University, Tainan, Taiwan
| | - Yu-Chen Chen
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
| | - Chia-Ying Chen
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
| | - Wei-Sheng Wu
- Department of Electrical Engineering, National Cheng Kung University, Tainan, Taiwan
| | | |
Collapse
|
6
|
Przybyszewski J, Malawski M, Lichołai S. GraphTar: applying word2vec and graph neural networks to miRNA target prediction. BMC Bioinformatics 2023; 24:436. [PMID: 37978418 PMCID: PMC10657114 DOI: 10.1186/s12859-023-05564-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Accepted: 11/09/2023] [Indexed: 11/19/2023] Open
Abstract
BACKGROUND MicroRNAs (miRNAs) are short, non-coding RNA molecules that regulate gene expression by binding to specific mRNAs, inhibiting their translation. They play a critical role in regulating various biological processes and are implicated in many diseases, including cardiovascular, oncological, gastrointestinal diseases, and viral infections. Computational methods that can identify potential miRNA-mRNA interactions from raw data use one-dimensional miRNA-mRNA duplex representations and simple sequence encoding techniques, which may limit their performance. RESULTS We have developed GraphTar, a new target prediction method that uses a novel graph-based representation to reflect the spatial structure of the miRNA-mRNA duplex. Unlike existing approaches, we use the word2vec method to accurately encode RNA sequence information. In conjunction with the novel encoding method, we use a graph neural network classifier that can accurately predict miRNA-mRNA interactions based on graph representation learning. As part of a comparative study, we evaluate three different node embedding approaches within the GraphTar framework and compare them with other state-of-the-art target prediction methods. The results show that the proposed method achieves similar performance to the best methods in the field and outperforms them on one of the datasets. CONCLUSIONS In this study, a novel miRNA target prediction approach called GraphTar is introduced. Results show that GraphTar is as effective as existing methods and even outperforms them in some cases, opening new avenues for further research. However, the expansion of available datasets is critical for advancing the field towards real-world applications.
Collapse
Affiliation(s)
- Jan Przybyszewski
- Sano Centre for Computational Medicine, Czarnowiejska 36, 30-054, Cracow, Poland.
| | - Maciej Malawski
- Sano Centre for Computational Medicine, Czarnowiejska 36, 30-054, Cracow, Poland
| | - Sabina Lichołai
- Division of Molecular Biology and Clinical Genetics, Faculty of Medicine, Jagiellonian University Medical College, Skawińska 8, 31-066, Cracow, Poland
| |
Collapse
|
7
|
Schmitz U. Overview of Computational and Experimental Methods to Identify Tissue-Specific MicroRNA Targets. Methods Mol Biol 2023; 2630:155-177. [PMID: 36689183 DOI: 10.1007/978-1-0716-2982-6_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
As ubiquitous posttranscriptional regulators of gene expression, microRNAs (miRNAs) play key roles in cell physiology and function across taxa. In the last two decades, we have gained a good understanding about miRNA biogenesis pathways, modes of action, and consequences of miRNA-mediated gene regulation. More recently, research has focused on exploring causes for miRNA dysregulation, miRNA-mediated crosstalk between genes and signaling pathways, and the role of miRNAs in disease.This chapter discusses methods for the identification of miRNA-target interactions and causes for tissue-specific miRNA-target regulation. Computational approaches for predicting miRNA target sites and assessing tissue-specific target regulation are discussed. Moreover, there is an emphasis on features that affect miRNA target recognition and how high-throughput sequencing protocols can help in assessing miRNA-mediated gene regulation on a genome-wide scale. In addition, this chapter introduces some experimental approaches for the validation of miRNA targets as well as web-based resources sharing predicted and validated miRNA-target interactions.
Collapse
Affiliation(s)
- Ulf Schmitz
- Department of Molecular & Cell Biology, College of Public Health, Medical & Vet Sciences, James Cook University, Douglas, Australia.
- Centre for Tropical Bioinformatics and Molecular Biology, Australian Institute of Tropical Health and Medicine, James Cook University, Cairns, Australia.
| |
Collapse
|
8
|
Small RNA Targets: Advances in Prediction Tools and High-Throughput Profiling. BIOLOGY 2022; 11:biology11121798. [PMID: 36552307 PMCID: PMC9775672 DOI: 10.3390/biology11121798] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 11/27/2022] [Accepted: 12/08/2022] [Indexed: 12/14/2022]
Abstract
MicroRNAs (miRNAs) are an abundant class of small non-coding RNAs that regulate gene expression at the post-transcriptional level. They are suggested to be involved in most biological processes of the cell primarily by targeting messenger RNAs (mRNAs) for cleavage or translational repression. Their binding to their target sites is mediated by the Argonaute (AGO) family of proteins. Thus, miRNA target prediction is pivotal for research and clinical applications. Moreover, transfer-RNA-derived fragments (tRFs) and other types of small RNAs have been found to be potent regulators of Ago-mediated gene expression. Their role in mRNA regulation is still to be fully elucidated, and advancements in the computational prediction of their targets are in their infancy. To shed light on these complex RNA-RNA interactions, the availability of good quality high-throughput data and reliable computational methods is of utmost importance. Even though the arsenal of computational approaches in the field has been enriched in the last decade, there is still a degree of discrepancy between the results they yield. This review offers an overview of the relevant advancements in the field of bioinformatics and machine learning and summarizes the key strategies utilized for small RNA target prediction. Furthermore, we report the recent development of high-throughput sequencing technologies, and explore the role of non-miRNA AGO driver sequences.
Collapse
|
9
|
Feitosa RM, Prieto-Oliveira P, Brentani H, Machado-Lima A. MicroRNA target prediction tools for animals: Where we are at and where we are going to - A systematic review. Comput Biol Chem 2022; 100:107729. [DOI: 10.1016/j.compbiolchem.2022.107729] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Revised: 07/08/2022] [Accepted: 07/09/2022] [Indexed: 11/26/2022]
|
10
|
Mégret L, Mendoza C, Arrieta Lobo M, Brouillet E, Nguyen TTY, Bouaziz O, Chambaz A, Néri C. Precision machine learning to understand micro-RNA regulation in neurodegenerative diseases. Front Mol Neurosci 2022; 15:914830. [PMID: 36157078 PMCID: PMC9500540 DOI: 10.3389/fnmol.2022.914830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Accepted: 08/19/2022] [Indexed: 11/13/2022] Open
Abstract
Micro-RNAs (miRNAs) are short (∼21 nt) non-coding RNAs that regulate gene expression through the degradation or translational repression of mRNAs. Accumulating evidence points to a role of miRNA regulation in the pathogenesis of a wide range of neurodegenerative (ND) diseases such as, for example, Alzheimer’s disease, Parkinson’s disease, amyotrophic lateral sclerosis and Huntington disease (HD). Several systems level studies aimed to explore the role of miRNA regulation in NDs, but these studies remain challenging. Part of the problem may be related to the lack of sufficiently rich or homogeneous data, such as time series or cell-type-specific data obtained in model systems or human biosamples, to account for context dependency. Part of the problem may also be related to the methodological challenges associated with the accurate system-level modeling of miRNA and mRNA data. Here, we critically review the main families of machine learning methods used to analyze expression data, highlighting the added value of using shape-analysis concepts as a solution for precisely modeling highly dimensional miRNA and mRNA data such as the ones obtained in the study of the HD process, and elaborating on the potential of these concepts and methods for modeling complex omics data.
Collapse
Affiliation(s)
- Lucile Mégret
- Sorbonne Université, Centre National de la Recherche Scientifique UMR 8256, Paris, France
- *Correspondence: Lucile Mégret,
| | - Cloé Mendoza
- Sorbonne Université, Centre National de la Recherche Scientifique UMR 8256, Paris, France
| | - Maialen Arrieta Lobo
- Sorbonne Université, Centre National de la Recherche Scientifique UMR 8256, Paris, France
| | - Emmanuel Brouillet
- Sorbonne Université, Centre National de la Recherche Scientifique UMR 8256, Paris, France
| | - Thi-Thanh-Yen Nguyen
- Université Paris Cité, MAP5 (Centre National de la Recherche Scientifique UMR 8145), Paris, France
| | - Olivier Bouaziz
- Université Paris Cité, MAP5 (Centre National de la Recherche Scientifique UMR 8145), Paris, France
| | - Antoine Chambaz
- Université Paris Cité, MAP5 (Centre National de la Recherche Scientifique UMR 8145), Paris, France
| | - Christian Néri
- Sorbonne Université, Centre National de la Recherche Scientifique UMR 8256, Paris, France
- Christian Néri,
| |
Collapse
|
11
|
Chiang TW, Mai TL, Chuang TJ. CircMiMi: a stand-alone software for constructing circular RNA-microRNA-mRNA interactions across species. BMC Bioinformatics 2022; 23:164. [PMID: 35524165 PMCID: PMC9074202 DOI: 10.1186/s12859-022-04692-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Accepted: 04/17/2022] [Indexed: 01/22/2023] Open
Abstract
Background Circular RNAs (circRNAs) are a class of non-coding RNAs formed by pre-mRNA back-splicing, which are widely expressed in animal/plant cells and often play an important role in regulating microRNA (miRNA) activities. While numerous databases have collected a large amount of predicted circRNA candidates and provided the corresponding circRNA-regulated interactions, a stand-alone package for constructing circRNA-miRNA-mRNA interactions based on user-identified circRNAs across species is lacking. Results We present CircMiMi (circRNA-miRNA-mRNA interactions), a modular, Python-based software to identify circRNA-miRNA-mRNA interactions across 18 species (including 16 animals and 2 plants) with the given coordinates of circRNA junctions. The CircMiMi-constructed circRNA-miRNA-mRNA interactions are derived from circRNA-miRNA and miRNA-mRNA axes with the support of computational predictions and/or experimental data. CircMiMi also allows users to examine alignment ambiguity of back-splice junctions for checking circRNA reliability and examine reverse complementary sequences residing in the sequences flanking the circularized exons for investigating circRNA formation. We further employ CircMiMi to identify circRNA-miRNA-mRNA interactions based on the circRNAs collected in NeuroCirc, a large-scale database of circRNAs in the human brain. We construct circRNA-miRNA-mRNA interactions comprising differentially expressed circRNAs, and miRNAs in autism spectrum disorder (ASD) and cross-species analyze the relevance of the targets to ASD. We thus provide a rich set of ASD-associated circRNA-miRNA-mRNA axes and a useful starting point for investigation of regulatory mechanisms in ASD pathophysiology. Conclusions CircMiMi allows users to identify circRNA-mediated interactions in multiple species, shedding light on regulatory roles of circRNAs. The software package and web interface are freely available at https://github.com/TreesLab/CircMiMi and http://circmimi.genomics.sinica.edu.tw/, respectively. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04692-0.
Collapse
Affiliation(s)
- Tai-Wei Chiang
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
| | - Te-Lun Mai
- Department of Life Science, National Taiwan University, Taipei, Taiwan
| | | |
Collapse
|
12
|
Machine Learning Based Methods and Best Practices of microRNA-Target Prediction and Validation. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022; 1385:109-131. [DOI: 10.1007/978-3-031-08356-3_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
13
|
Ben Or G, Veksler-Lublinsky I. Comprehensive machine-learning-based analysis of microRNA-target interactions reveals variable transferability of interaction rules across species. BMC Bioinformatics 2021; 22:264. [PMID: 34030625 PMCID: PMC8146624 DOI: 10.1186/s12859-021-04164-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Accepted: 05/04/2021] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND MicroRNAs (miRNAs) are small non-coding RNAs that regulate gene expression post-transcriptionally via base-pairing with complementary sequences on messenger RNAs (mRNAs). Due to the technical challenges involved in the application of high-throughput experimental methods, datasets of direct bona fide miRNA targets exist only for a few model organisms. Machine learning (ML)-based target prediction models were successfully trained and tested on some of these datasets. There is a need to further apply the trained models to organisms in which experimental training data are unavailable. However, it is largely unknown how the features of miRNA-target interactions evolve and whether some features have remained fixed during evolution, raising questions regarding the general, cross-species applicability of currently available ML methods. RESULTS We examined the evolution of miRNA-target interaction rules and used data science and ML approaches to investigate whether these rules are transferable between species. We analyzed eight datasets of direct miRNA-target interactions in four species (human, mouse, worm, cattle). Using ML classifiers, we achieved high accuracy for intra-dataset classification and found that the most influential features of all datasets overlap significantly. To explore the relationships between datasets, we measured the divergence of their miRNA seed sequences and evaluated the performance of cross-dataset classification. We found that both measures coincide with the evolutionary distance between the compared species. CONCLUSIONS The transferability of miRNA-targeting rules between species depends on several factors, the most associated factors being the composition of seed families and evolutionary distance. Furthermore, our feature-importance results suggest that some miRNA-target features have evolved while others remained fixed during the evolution of the species. Our findings lay the foundation for the future development of target prediction tools that could be applied to "non-model" organisms for which minimal experimental data are available. AVAILABILITY AND IMPLEMENTATION The code is freely available at https://github.com/gbenor/TPVOD .
Collapse
Affiliation(s)
- Gilad Ben Or
- Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Isana Veksler-Lublinsky
- Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer Sheva, Israel
| |
Collapse
|
14
|
Gu T, Zhao X, Barbazuk WB, Lee JH. miTAR: a hybrid deep learning-based approach for predicting miRNA targets. BMC Bioinformatics 2021; 22:96. [PMID: 33639834 PMCID: PMC7912887 DOI: 10.1186/s12859-021-04026-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2020] [Accepted: 02/14/2021] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND microRNAs (miRNAs) have been shown to play essential roles in a wide range of biological processes. Many computational methods have been developed to identify targets of miRNAs. However, the majority of these methods depend on pre-defined features that require considerable efforts and resources to compute and often prove suboptimal at predicting miRNA targets. RESULTS We developed a novel hybrid deep learning-based (DL-based) approach that is capable of predicting miRNA targets at a higher accuracy. This approach integrates convolutional neural networks (CNNs) that excel in learning spatial features and recurrent neural networks (RNNs) that discern sequential features. Therefore, our approach has the advantages of learning both the intrinsic spatial and sequential features of miRNA:target. The inputs for our approach are raw sequences of miRNAs and genes that can be obtained effortlessly. We applied our approach on two human datasets from recently miRNA target prediction studies and trained two models. We demonstrated that the two models consistently outperform the previous methods according to evaluation metrics on test datasets. Comparing our approach with currently available alternatives on independent datasets shows that our approach delivers substantial improvements in performance. We also show with multiple evidences that our approach is more robust than other methods on small datasets. Our study is the first study to perform comparisons across multiple existing DL-based approaches on miRNA target prediction. Furthermore, we examined the contribution of a Max pooling layer in between the CNN and RNN and demonstrated that it improves the performance of all our models. Finally, a unified model was developed that is robust on fitting different input datasets. CONCLUSIONS We present a new DL-based approach for predicting miRNA targets and demonstrate that our approach outperforms the current alternatives. We supplied an easy-to-use tool, miTAR, at https://github.com/tjgu/miTAR . Furthermore, our analysis results support that Max Pooling generally benefits the hybrid models and potentially prevents overfitting for hybrid models.
Collapse
Affiliation(s)
- Tongjun Gu
- Bioinformatics, Interdisciplinary Center for Biotechnology Research, University of Florida, Gainesville, FL, USA. .,Division of Quantitative Sciences, University of Florida Health Cancer Center, University of Florida, Gainesville, FL, USA.
| | - Xiwu Zhao
- Department of Ophthalmology and Visual Sciences, University of Michigan, Ann Arbor, MI, USA
| | - William Bradley Barbazuk
- Bioinformatics, Interdisciplinary Center for Biotechnology Research, University of Florida, Gainesville, FL, USA.,Department of Biology, University of Florida, Gainesville, FL, USA.,Genetics Institute, University of Florida, Gainesville, FL, USA
| | - Ji-Hyun Lee
- Division of Quantitative Sciences, University of Florida Health Cancer Center, University of Florida, Gainesville, FL, USA.,Department of Biostatistics, University of Florida, Gainesville, FL, USA
| |
Collapse
|
15
|
Jiang H, Yang M, Chen X, Li M, Li Y, Wang J. miRTMC: A miRNA Target Prediction Method Based on Matrix Completion Algorithm. IEEE J Biomed Health Inform 2020; 24:3630-3641. [PMID: 32287029 DOI: 10.1109/jbhi.2020.2987034] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
microRNAs (miRNAs) are small non-coding RNAs which modulate the stability of gene targets and their rates of translation into proteins at transcriptional level and post-transcriptional level. miRNA dysfunctions can lead to human diseases because of dysregulation of their targets. Correct miRNA target prediction will lead to better understanding of the mechanisms of human diseases and provide hints on curing them. In recent years, computational miRNA target prediction methods have been proposed according to the interaction rules between miRNAs and targets. However, these methods suffer from high false positive rates due to the complicated relationship between miRNAs and their targets. The rapidly growing number of experimentally validated miRNA targets enables predicting miRNA targets with high precision via accurate data analysis. Taking advantage of these known miRNA targets, a novel recommendation system model (miRTMC) for miRNA target prediction is established using a new matrix completion algorithm. In miRTMC, a heterogeneous network is constructed by integrating the miRNA similarity network, the gene similarity network, and the miRNA-gene interaction network. Our assumption is that the latent factors determining whether a gene is the target of miRNA or not are highly correlated, i.e., the adjacency matrix of the heterogeneous network is low-rank, which is then completed by using a nuclear norm regularized linear least squares model under non-negative constraints. Alternating direction method of multipliers (ADMM) is adopted to numerically solve the matrix completion problem. Our results show that miRTMC outperforms the competing methods in terms of various evaluation metrics. Our software package is available at https://github.com/hjiangcsu/miRTMC.
Collapse
|
16
|
Mégret L, Nair SS, Dancourt J, Aaronson J, Rosinski J, Neri C. Combining feature selection and shape analysis uncovers precise rules for miRNA regulation in Huntington's disease mice. BMC Bioinformatics 2020; 21:75. [PMID: 32093602 PMCID: PMC7041117 DOI: 10.1186/s12859-020-3418-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Accepted: 02/17/2020] [Indexed: 12/12/2022] Open
Abstract
Background MicroRNA (miRNA) regulation is associated with several diseases, including neurodegenerative diseases. Several approaches can be used for modeling miRNA regulation. However, their precision may be limited for analyzing multidimensional data. Here, we addressed this question by integrating shape analysis and feature selection into miRAMINT, a methodology that we used for analyzing multidimensional RNA-seq and proteomic data from a knock-in mouse model (Hdh mice) of Huntington’s disease (HD), a disease caused by CAG repeat expansion in huntingtin (htt). This dataset covers 6 CAG repeat alleles and 3 age points in the striatum and cortex of Hdh mice. Results Remarkably, compared to previous analyzes of this multidimensional dataset, the miRAMINT approach retained only 31 explanatory striatal miRNA-mRNA pairs that are precisely associated with the shape of CAG repeat dependence over time, among which 5 pairs with a strong change of target expression levels. Several of these pairs were previously associated with neuronal homeostasis or HD pathogenesis, or both. Such miRNA-mRNA pairs were not detected in cortex. Conclusions These data suggest that miRNA regulation has a limited global role in HD while providing accurately-selected miRNA-target pairs to study how the brain may compute molecular responses to HD over time. These data also provide a methodological framework for researchers to explore how shape analysis can enhance multidimensional data analytics in biology and disease.
Collapse
Affiliation(s)
- Lucile Mégret
- Sorbonne Université, CNRS UMR8256, INSERM ERL U1164, Brain-C Lab, Paris, France.
| | | | - Julia Dancourt
- Sorbonne Université, CNRS UMR8256, INSERM ERL U1164, Brain-C Lab, Paris, France
| | | | | | - Christian Neri
- Sorbonne Université, CNRS UMR8256, INSERM ERL U1164, Brain-C Lab, Paris, France.
| |
Collapse
|
17
|
Maji RK, Khatua S, Ghosh Z. A Supervised Ensemble Approach for Sensitive microRNA Target Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:37-46. [PMID: 30040648 DOI: 10.1109/tcbb.2018.2858252] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
MicroRNAs, a class of small non-coding RNAs, regulate important biological functions via post-transcriptional regulation of messenger RNAs (mRNAs). Despite rapid development in miRNA research, precise experimental methods to determine miRNA target interactions are still lacking. This motivated us to explore the in silico target interaction features and incorporate them in predictive modeling. We propose a systematic approach towards developing a sensitive miRNA target prediction model to explore the interplay of target recognition features. In the first step, we have employed a supervised ensemble under-sampling approach to address the problem of imbalance in the training dataset due to a larger number of negative instances. Various feature selection techniques were evaluated to obtain the optimal feature subset that best recognizes the true miRNA-mRNA targets. In the second step, we have built our optimal model, miRTPred, a novel blending ensemble-based approach that combines the predictions of the best performing traditional and classical ensemble models, through a weighted voting classifier, achieving a sensitivity of 87 percent and F1-score of 0.88 for 3'UTR region of the mRNA transcript. miRTPred outperforms popular machine learning (ML) and non-ML approaches to target prediction algorithms. miRTPred is freely available at http://bicresources.jcbose.ac.in/zhumur/mirtpred.
Collapse
|
18
|
Kern F, Backes C, Hirsch P, Fehlmann T, Hart M, Meese E, Keller A. What's the target: understanding two decades of in silico microRNA-target prediction. Brief Bioinform 2019; 21:1999-2010. [PMID: 31792536 DOI: 10.1093/bib/bbz111] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Revised: 08/01/2019] [Accepted: 08/02/2019] [Indexed: 12/25/2022] Open
Abstract
MOTIVATION Since the initial discovery of microRNAs as post-transcriptional, regulatory key players in the 1990s, a total number of $2656$ mature microRNAs have been publicly described for Homo sapiens. As discovery of new miRNAs is still on-going, target identification remains to be an essential and challenging step preceding functional annotation analysis. One key challenge for researchers seems to be the selection of the most appropriate tool out of the larger multiverse of published solutions for a given research study set-up. RESULTS In this review we collectively describe the field of in silico target prediction in the course of time and point out long withstanding principles as well as recent developments. By compiling a catalog of characteristics about the 98 prediction methods and identifying common and exclusive traits, we signpost a simplified mechanism to address the problem of application selection. Going further we devised interpretation strategies for common types of output as generated by frequently used computational methods. To this end, our work specifically aims to make prospective users aware of common mistakes and practical questions that arise during the application of target prediction tools. AVAILABILITY An interactive implementation of our recommendations including materials shown in the manuscript is freely available at https://www.ccb.uni-saarland.de/mtguide.
Collapse
Affiliation(s)
- Fabian Kern
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, 66123, Germany and Department of Human Genetics, Saarland University, Homburg, 66424, Germany
| | - Christina Backes
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, 66123, Germany and Department of Human Genetics, Saarland University, Homburg, 66424, Germany
| | - Pascal Hirsch
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, 66123, Germany and Department of Human Genetics, Saarland University, Homburg, 66424, Germany
| | - Tobias Fehlmann
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, 66123, Germany and Department of Human Genetics, Saarland University, Homburg, 66424, Germany
| | - Martin Hart
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, Germany and Department of Human Genetics, Saarland University Hospital, Homburg Germany
| | - Eckart Meese
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, Germany and Department of Human Genetics, Saarland University Hospital, Homburg Germany
| | - Andreas Keller
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, 66123, Germany and Department of Human Genetics, Saarland University, Homburg, 66424, Germany.,Center for Bioinformatics, Saarland University, Saarbrücken, Germany.,School of Medicine Office, Stanford University, Stanford, CA, USA.,Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA
| |
Collapse
|
19
|
Wen M, Cong P, Zhang Z, Lu H, Li T. DeepMirTar: a deep-learning approach for predicting human miRNA targets. Bioinformatics 2019; 34:3781-3787. [PMID: 29868708 DOI: 10.1093/bioinformatics/bty424] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Accepted: 05/28/2018] [Indexed: 12/22/2022] Open
Abstract
Motivation MicroRNAs (miRNAs) are small non-coding RNAs that function in RNA silencing and post-transcriptional regulation of gene expression by targeting messenger RNAs (mRNAs). Because the underlying mechanisms associated with miRNA binding to mRNA are not fully understood, a major challenge of miRNA studies involves the identification of miRNA-target sites on mRNA. In silico prediction of miRNA-target sites can expedite costly and time-consuming experimental work by providing the most promising miRNA-target-site candidates. Results In this study, we reported the design and implementation of DeepMirTar, a deep-learning-based approach for accurately predicting human miRNA targets at the site level. The predicted miRNA-target sites are those having canonical or non-canonical seed, and features, including high-level expert-designed, low-level expert-designed and raw-data-level, were used to represent the miRNA-target site. Comparison with other state-of-the-art machine-learning methods and existing miRNA-target-prediction tools indicated that DeepMirTar improved overall predictive performance. Availability and implementation DeepMirTar is freely available at https://github.com/Bjoux2/DeepMirTar_SdA. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ming Wen
- College of Chemistry and Chemical Engineering, Central South University, Changsha, People's Republic of China
| | - Peisheng Cong
- School of Chemical Science and Engineering, Tongji University, Shanghai, People's Republic of China
| | - Zhimin Zhang
- College of Chemistry and Chemical Engineering, Central South University, Changsha, People's Republic of China
| | - Hongmei Lu
- College of Chemistry and Chemical Engineering, Central South University, Changsha, People's Republic of China
| | - Tonghua Li
- School of Chemical Science and Engineering, Tongji University, Shanghai, People's Republic of China
| |
Collapse
|
20
|
Al-Akwaa FM, Yunits B, Huang S, Alhajaji H, Garmire LX. Lilikoi: an R package for personalized pathway-based classification modeling using metabolomics data. Gigascience 2018; 7:5237705. [PMID: 30535020 PMCID: PMC6290884 DOI: 10.1093/gigascience/giy136] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2018] [Accepted: 11/05/2018] [Indexed: 01/01/2023] Open
Abstract
Lilikoi (the Hawaiian word for passion fruit) is a new and comprehensive R package for personalized pathway-based classification modeling using metabolomics data. Four basic modules are presented as the backbone of the package: feature mapping module, which standardizes the metabolite names provided by users and maps them to pathways; dimension transformation module, which transforms the metabolomic profiles to personalized pathway-based profiles using pathway deregulation scores; feature selection module, which helps to select the significant pathway features related to the disease phenotypes; and classification and prediction module, which offers various machine learning classification algorithms. The package is freely available under the GPLv3 license through the github repository at: https://github.com/lanagarmire/lilikoi and CRAN: https://cran.r-project.org/web/packages/lilikoi/index.html.
Collapse
Affiliation(s)
- Fadhl M Al-Akwaa
- Department of Computational Medicine and Bioinformatics, Building 520, 1600 Huron Parkway, Ann Arbor, MI 48109, USA
| | - Breck Yunits
- University of Hawaii Cancer Center, Department of Epidemiology, 701 Ilalo Street, Honolulu, HI USA 96813
| | - Sijia Huang
- University of Hawaii Cancer Center, Department of Epidemiology, 701 Ilalo Street, Honolulu, HI USA 96813.,Molecular Biology and Bioengineering Graduate Program, University of Hawaii at Monoa, Honolulu, HI, USA 96822
| | - Hassam Alhajaji
- University of Hawaii Cancer Center, Department of Epidemiology, 701 Ilalo Street, Honolulu, HI USA 96813
| | - Lana X Garmire
- Department of Computational Medicine and Bioinformatics, Building 520, 1600 Huron Parkway, Ann Arbor, MI 48109, USA
| |
Collapse
|
21
|
Amirkhah R, Meshkin HN, Farazmand A, Rasko JEJ, Schmitz U. Computational and Experimental Identification of Tissue-Specific MicroRNA Targets. Methods Mol Biol 2017; 1580:127-147. [PMID: 28439832 DOI: 10.1007/978-1-4939-6866-4_11] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
In this chapter we discuss computational methods for the prediction of microRNA (miRNA) targets. More specifically, we consider machine learning-based approaches and explain why these methods have been relatively unsuccessful in reducing the number of false positive predictions. Further we suggest approaches designed to improve their performance by considering tissue-specific target regulation. We argue that the miRNA targetome differs depending on the tissue type and introduce a novel algorithm that predicts miRNA targets specifically for colorectal cancer. We discuss features of miRNAs and target sites that affect target recognition, and how next-generation sequencing data can support the identification of novel miRNAs, differentially expressed miRNAs and their tissue-specific mRNA targets. In addition, we introduce some experimental approaches for the validation of miRNA targets as well as web-based resources sharing predicted and validated miRNA target interactions.
Collapse
Affiliation(s)
- Raheleh Amirkhah
- Reza Institute of Cancer Bioinformatics and Personalized Medicine, Mashhad, Iran
| | - Hojjat Naderi Meshkin
- Stem Cells and Regenerative Medicine Research Group, Academic Center for Education, Culture Research (ACECR), Khorasan Razavi Branch, Mashhad, Iran
| | - Ali Farazmand
- Department of Cell and Molecular Biology, School of Biology, College of Science, University of Tehran, Tehran, Iran
| | - John E J Rasko
- Gene & Stem Cell Therapy Program, Centenary Institute, Camperdown; Sydney Medical School, University of Sydney, Camperdown, NSW, 2050, Australia
| | - Ulf Schmitz
- Gene & Stem Cell Therapy Program, Centenary Institute, Camperdown; Sydney Medical School, University of Sydney, Camperdown, NSW, 2050, Australia.
| |
Collapse
|
22
|
Garmire LX, Gliske S, Nguyen QC, Chen JH, Nemati S, VAN Horn JD, Moore JH, Shreffler C, Dunn M. THE TRAINING OF NEXT GENERATION DATA SCIENTISTS IN BIOMEDICINE. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2016; 22:640-645. [PMID: 27897014 DOI: 10.1142/9789813207813_0059] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
With the booming of new technologies, biomedical science has transformed into digitalized, data intensive science. Massive amount of data need to be analyzed and interpreted, demand a complete pipeline to train next generation data scientists. To meet this need, the transinstitutional Big Data to Knowledge (BD2K) Initiative has been implemented since 2014, complementing other NIH institutional efforts. In this report, we give an overview the BD2K K01 mentored scientist career awards, which have demonstrated early success. We address the specific trainings needed in representative data science areas, in order to make the next generation of data scientists in biomedicine.
Collapse
Affiliation(s)
- Lana X Garmire
- 2Epidemiology Program, University of Hawaii Cancer Center,Honolulu, HI, 96813, USA†Work partially supported by grant NIH Big Data 2 Knowledge Award K01ES025434 (to LXG), K01ES026839 (to SG), K01ES025433 (to QCN), K01ES026837 (to JHC), K01ES025445 (to SN), U24 ES026465 (to JDV), by the National Institute of Environmental Health Sciences through funds provided by the trans-NIH Big Data to Knowledge (BD2K) initiative,*work is also partially supported by P20 COBRE GM103457 awarded by NIH/NIGMS, NICHD R01 HD084633, NLM R01 LM012373, and Hawaii Community Foundation Medical Research Grant 14ADVC-64566.,
| | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Leclercq M, Diallo AB, Blanchette M. Prediction of human miRNA target genes using computationally reconstructed ancestral mammalian sequences. Nucleic Acids Res 2016; 45:556-566. [PMID: 27899600 PMCID: PMC5314757 DOI: 10.1093/nar/gkw1085] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Revised: 09/26/2016] [Accepted: 11/13/2016] [Indexed: 11/14/2022] Open
Abstract
MicroRNAs (miRNA) are short single-stranded RNA molecules derived from hairpin-forming precursors that play a crucial role as post-transcriptional regulators in eukaryotes and viruses. In the past years, many microRNA target genes (MTGs) have been identified experimentally. However, because of the high costs of experimental approaches, target genes databases remain incomplete. Although several target prediction programs have been developed in the recent years to identify MTGs in silico, their specificity and sensitivity remain low. Here, we propose a new approach called MirAncesTar, which uses ancestral genome reconstruction to boost the accuracy of existing MTGs prediction tools for human miRNAs. For each miRNA and each putative human target UTR, our algorithm makes uses of existing prediction tools to identify putative target sites in the human UTR, as well as in its mammalian orthologs and inferred ancestral sequences. It then evaluates evidence in support of selective pressure to maintain target site counts (rather than sequences), accounting for the possibility of target site turnover. It finally integrates this measure with several simpler ones using a logistic regression predictor. MirAncesTar improves the accuracy of existing MTG predictors by 26% to 157%. Source code and prediction results for human miRNAs, as well as supporting evolutionary data are available at http://cs.mcgill.ca/∼blanchem/mirancestar.
Collapse
Affiliation(s)
- Mickael Leclercq
- School of Computer Science and McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, H3A0E9, Canada
| | - Abdoulaye Baniré Diallo
- Laboratoire de bio-informatique du département informatique, Université du Québec à Montréal, Montréal, Québec H2X 3Y7, Canada
| | - Mathieu Blanchette
- School of Computer Science and McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, H3A0E9, Canada
| |
Collapse
|
24
|
Man XF, Tan SW, Tang HN, Guo Y, Tang CY, Tang J, Zhou CL, Zhou HD. MiR-503 inhibits adipogenesis by targeting bone morphogenetic protein receptor 1a. Am J Transl Res 2016; 8:2727-2737. [PMID: 27398155 PMCID: PMC4931166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2016] [Accepted: 05/05/2016] [Indexed: 06/06/2023]
Abstract
Adipogenesis plays a key role in the regulation of whole-body energy homeostasis and is critically related to obesity. To overcome obesity and its associated disorders, it is necessary to elucidate the molecular mechanisms involved in adipogenesis. An adipogenesis-related miRNA array analysis demonstrated that miR-503 was differentially expressed before and after adipocyte differentiation; however, the exact role of miR-503 in adipocyte differentiation is unclear. Thus, the objective of this study was to further examine miR-503 in adipocyte differentiation. We found significantly decreased expression of miR-503 during adipocyte differentiation process. Using bioinformatic analysis, miR-503 was identified as a potential regulator of Bone Morphogenetic Protein Receptor 1a (BMPR1a). We then validated BMPR1a as the target of miR-503 using a dual luciferase assay, and found decreased miR-503 and increased BMPR1a expression during adipogenesis. Overexpression of miR-503 in preadipocytes repressed expression of BMPR1a and adipogenic-related factors such as CCAAT/enhancer binding protein a (C/EBPα), proliferator-activated receptor-gamma (PPARγ), and adipocyte protein 2 (AP2). In addition, miR-503 overexpression impaired the phosphoinositol-3 kinase (PI3K)/Akt pathway. Inhibition of miR-503 had the opposite effect. Additionally, BMPR1a interference by siRNA attenuated adipocyte differentiation and the accumulation of lipid droplets via downregulating the PI3K/Akt signaling pathway. Our study provides the first evidence of the role miR-503 plays in adipocyte differentiation by regulating BMPR1a via the PI3K/Akt pathway, which may become a novel target for obesity therapy.
Collapse
Affiliation(s)
- Xiao-Fei Man
- Institute of Endocrinology and Metabolism, The Second Xiangya Hospital of Central South University Changsha City, Hunan Province, China
| | - Shu-Wen Tan
- Institute of Endocrinology and Metabolism, The Second Xiangya Hospital of Central South University Changsha City, Hunan Province, China
| | - Hao-Neng Tang
- Institute of Endocrinology and Metabolism, The Second Xiangya Hospital of Central South University Changsha City, Hunan Province, China
| | - Yue Guo
- Institute of Endocrinology and Metabolism, The Second Xiangya Hospital of Central South University Changsha City, Hunan Province, China
| | - Chen-Yi Tang
- Institute of Endocrinology and Metabolism, The Second Xiangya Hospital of Central South University Changsha City, Hunan Province, China
| | - Jun Tang
- Institute of Endocrinology and Metabolism, The Second Xiangya Hospital of Central South University Changsha City, Hunan Province, China
| | - Ci-La Zhou
- Institute of Endocrinology and Metabolism, The Second Xiangya Hospital of Central South University Changsha City, Hunan Province, China
| | - Hou-De Zhou
- Institute of Endocrinology and Metabolism, The Second Xiangya Hospital of Central South University Changsha City, Hunan Province, China
| |
Collapse
|
25
|
Ching T, Peplowska K, Huang S, Zhu X, Shen Y, Molnar J, Yu H, Tiirikainen M, Fogelgren B, Fan R, Garmire LX. Pan-Cancer Analyses Reveal Long Intergenic Non-Coding RNAs Relevant to Tumor Diagnosis, Subtyping and Prognosis. EBioMedicine 2016; 7:62-72. [PMID: 27322459 PMCID: PMC4909364 DOI: 10.1016/j.ebiom.2016.03.023] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2015] [Revised: 03/02/2016] [Accepted: 03/16/2016] [Indexed: 12/17/2022] Open
Abstract
Long intergenic noncoding RNAs (lincRNAs) are a relatively new class of non-coding RNAs that have the potential as cancer biomarkers. To seek a panel of lincRNAs as pan-cancer biomarkers, we have analyzed transcriptomes from over 3300 cancer samples with clinical information. Compared to mRNA, lincRNAs exhibit significantly higher tissue specificities that are then diminished in cancer tissues. Moreover, lincRNA clustering results accurately classify tumor subtypes. Using RNA-Seq data from thousands of paired tumor and adjacent normal samples in The Cancer Genome Atlas (TCGA), we identify six lincRNAs as potential pan-cancer diagnostic biomarkers (PCAN-1 to PCAN-6). These lincRNAs are robustly validated using cancer samples from four independent RNA-Seq data sets, and are verified by qPCR in both primary breast cancers and MCF-7 cell line. Interestingly, the expression levels of these six lincRNAs are also associated with prognosis in various cancers. We further experimentally explored the growth and migration dependence of breast and colon cancer cell lines on two of the identified lncRNAs. In summary, our study highlights the emerging role of lincRNAs as potentially powerful and biologically functional pan-cancer biomarkers and represents a significant leap forward in understanding the biological and clinical functions of lincRNAs in cancers. LincRNAs exhibit significantly higher tissue specificities that mRNAs, which are then diminished in cancer tissues. LincRNAs are highly deregulated in cancers and their expression strongly correlates with molecular subtypes A panel of diagnostic lincRNA biomarkers are discovered using the pan-cancer samples of The Cancer Genome Atlas (TCGA), and further validated with multiple independent data sets. Knocking down experiments of some pan-cancer up-regulated lincRNAs slow down the cell growth and migration in some cancer cell lines, suggesting that lincRNAs may be biologically functional.
Most of the work on cancer characterization, diagnosis, prognosis and treatment have been focused on the protein coding genes. Long intergenic non-coding RNAs (lincRNAs) are a relatively new class of RNA molecules that are understudied for their biological and clinical functions. This report aims to expand our understanding on the roles of lincRNA. Specifically, we demonstrate the relevance of lincRNAs to tumor diagnosis, subtyping and prognosis. We further propose a panel of lincRNAs as potentially robust pan-cancer diagnostic biomarkers.
Collapse
Affiliation(s)
- Travers Ching
- Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI 96822, USA; Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI 96813, USA
| | - Karolina Peplowska
- Genomics Shared Resource, University of Hawaii Cancer Center, Honolulu, HI, 96813, USA
| | - Sijia Huang
- Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI 96822, USA; Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI 96813, USA
| | - Xun Zhu
- Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI 96822, USA; Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI 96813, USA
| | - Yi Shen
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI 96813, USA
| | - Janos Molnar
- Genomics Shared Resource, University of Hawaii Cancer Center, Honolulu, HI, 96813, USA
| | - Herbert Yu
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI 96813, USA
| | - Maarit Tiirikainen
- Genomics Shared Resource, University of Hawaii Cancer Center, Honolulu, HI, 96813, USA
| | - Ben Fogelgren
- Department of Anatomy, Biochemistry and Physiology, John A. Burns School of Medicine, University of Hawaii, Honolulu, HI 96813, USA
| | - Rong Fan
- Department of Biomedical Engineering, Yale University, New Haven, CT 06520, USA
| | - Lana X Garmire
- Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI 96822, USA; Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI 96813, USA.
| |
Collapse
|
26
|
Ching T, Masaki J, Weirather J, Garmire LX. Non-coding yet non-trivial: a review on the computational genomics of lincRNAs. BioData Min 2015; 8:44. [PMID: 26697116 PMCID: PMC4687140 DOI: 10.1186/s13040-015-0075-z] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2015] [Accepted: 12/04/2015] [Indexed: 02/01/2023] Open
Abstract
Long intergenic non-coding RNAs (lincRNAs) represent one of the most mysterious RNA species encoded by the human genome. Thanks to next generation sequencing (NGS) technology and its applications, we have recently witnessed a surge in non-coding RNA research, including lincRNA research. Here, we summarize the recent advancement in genomics studies of lincRNAs. We review the emerging characteristics of lincRNAs, the experimental and computational approaches to identify lincRNAs, their known mechanisms of regulation, the computational methods and resources for lincRNA functional predictions, and discuss the challenges to understanding lincRNA comprehensively.
Collapse
Affiliation(s)
- Travers Ching
- Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI 96822 USA
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI 96813 USA
| | - Jayson Masaki
- Laboratory of Immunology and Signal Transduction, Chaminade University of Honolulu, Honolulu, HI 96816 USA
| | - Jason Weirather
- Department of Internal Medicine, University of Iowa, Iowa City, IA 52242 USA
| | - Lana X. Garmire
- Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI 96822 USA
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI 96813 USA
| |
Collapse
|
27
|
Chou CH, Chang NW, Shrestha S, Hsu SD, Lin YL, Lee WH, Yang CD, Hong HC, Wei TY, Tu SJ, Tsai TR, Ho SY, Jian TY, Wu HY, Chen PR, Lin NC, Huang HT, Yang TL, Pai CY, Tai CS, Chen WL, Huang CY, Liu CC, Weng SL, Liao KW, Hsu WL, Huang HD. miRTarBase 2016: updates to the experimentally validated miRNA-target interactions database. Nucleic Acids Res 2015; 44:D239-47. [PMID: 26590260 PMCID: PMC4702890 DOI: 10.1093/nar/gkv1258] [Citation(s) in RCA: 798] [Impact Index Per Article: 88.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2015] [Accepted: 10/30/2015] [Indexed: 02/07/2023] Open
Abstract
MicroRNAs (miRNAs) are small non-coding RNAs of approximately 22 nucleotides, which negatively regulate the gene expression at the post-transcriptional level. This study describes an update of the miRTarBase (http://miRTarBase.mbc.nctu.edu.tw/) that provides information about experimentally validated miRNA-target interactions (MTIs). The latest update of the miRTarBase expanded it to identify systematically Argonaute-miRNA-RNA interactions from 138 crosslinking and immunoprecipitation sequencing (CLIP-seq) data sets that were generated by 21 independent studies. The database contains 4966 articles, 7439 strongly validated MTIs (using reporter assays or western blots) and 348 007 MTIs from CLIP-seq. The number of MTIs in the miRTarBase has increased around 7-fold since the 2014 miRTarBase update. The miRNA and gene expression profiles from The Cancer Genome Atlas (TCGA) are integrated to provide an effective overview of this exponential growth in the miRNA experimental data. These improvements make the miRTarBase one of the more comprehensively annotated, experimentally validated miRNA-target interactions databases and motivate additional miRNA research efforts.
Collapse
Affiliation(s)
- Chih-Hung Chou
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Nai-Wen Chang
- Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei, 106, Taiwan
| | - Sirjana Shrestha
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Sheng-Da Hsu
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Yu-Ling Lin
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, 300, Taiwan Center for Bioinformatics Research, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Wei-Hsiang Lee
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan Clinical Research Center, Chung Shan Medical University Hospital, Taichung, 402, Taiwan
| | - Chi-Dung Yang
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan Institute of Population Health Sciences, National Health Research Institutes, Miaoli, 350, Taiwan
| | - Hsiao-Chin Hong
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Ting-Yen Wei
- Interdisciplinary Program of Life Science, National Tsing Hua University, Hsinchu, 300, Taiwan
| | - Siang-Jyun Tu
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Tzi-Ren Tsai
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Shu-Yi Ho
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Ting-Yan Jian
- Institute of Molecular Medicine and Bioengineering, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Hsin-Yi Wu
- Institute of Molecular Medicine and Bioengineering, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Pin-Rong Chen
- Institute of Molecular Medicine and Bioengineering, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Nai-Chieh Lin
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Hsin-Tzu Huang
- Degree Program of Applied Science and Technology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Tzu-Ling Yang
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Chung-Yuan Pai
- Institute of Molecular Medicine and Bioengineering, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Chun-San Tai
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan Institute of Molecular Medicine and Bioengineering, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Wen-Liang Chen
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Chia-Yen Huang
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, 300, Taiwan Gynecologic Cancer Center, Department of Obstetrics and Gynecology, Cathay General Hospital, Taipei, 106, Taiwan
| | - Chun-Chi Liu
- Institute of Genomics and Bioinformatics, National Chung Hsing University, Taichung, 402, Taiwan
| | - Shun-Long Weng
- Department of Obstetrics and Gynecology, Hsinchu Mackay Memorial Hospital, Hsinchu, 300, Taiwan Mackay Medicine, Nursing and Management College, Taipei, 112, Taiwan Department of Medicine, Mackay Medical College, New Taipei City, 252, Taiwan
| | - Kuang-Wen Liao
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, 300, Taiwan Institute of Molecular Medicine and Bioengineering, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Wen-Lian Hsu
- Institute of Information Science, Academia Sinica, Taipei, 115, Taiwan
| | - Hsien-Da Huang
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, 300, Taiwan Center for Bioinformatics Research, National Chiao Tung University, Hsinchu, 300, Taiwan Department of Biomedical Science and Environmental Biology, Kaohsiung Medical University, Kaohsiung, 807, Taiwan
| |
Collapse
|
28
|
Laganà A. Computational Prediction of microRNA Targets. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2015; 887:231-52. [PMID: 26662994 DOI: 10.1007/978-3-319-22380-3_12] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Computational prediction of microRNA (miRNA) targets is a fundamental step towards the characterization of miRNA function and the understanding of their role in disease. A single miRNA can regulate hundreds of different gene transcripts through partial sequence complementarity and a single gene may be regulated by several miRNAs acting cooperatively. The remarkable advances made in recent years have allowed the identification of key features for functional miRNA binding sites. A plethora of prediction tools are now available, but their accuracies remain rather poor, as miRNA target recognition has revealed itself to be a very complex and dynamic mechanism, still only partially understood.In this chapter, the principles of miRNA target prediction in animals are presented, together with the most up-to-date and effective computational approaches and tools available.
Collapse
Affiliation(s)
- Alessandro Laganà
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L Levy Pl, New York, NY, 10029, USA.
| |
Collapse
|