1
Dong Z, Wang J, Chen G, Guo Y, Zhao N, Wang Z, Zhang B. A high-quality chromosome-level genome assembly of the Chinese medaka Oryzias sinensis. Sci Data 2024; 11:322. [PMID: 38548787 PMCID: PMC10978949 DOI: 10.1038/s41597-024-03173-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/14/2023] [Accepted: 03/21/2024] [Indexed: 04/01/2024]
Abstract
Oryzias sinensis, also known as Chinese medaka or Chinese ricefish, is a commonly used animal model for aquatic environmental assessment in the wild as well as gene function validation or toxicology research in the lab. Here, a high-quality chromosome-level genome assembly of O. sinensis was generated using single-tube long fragment read (stLFR) reads, Nanopore long reads, and Hi-C sequencing data. The genome is 796.58 Mb, and a total of 712.17 Mb of the assembled sequences were anchored to 23 pseudo-chromosomes. A final set of 22,461 genes was annotated, with 98.67% being functionally annotated. The Benchmarking Universal Single-Copy Orthologs (BUSCO) scores for the genome assembly and gene annotation reached 95.1% (93.3% single-copy) and 94.6% (91.7% single-copy), respectively. Furthermore, we used ATAC-seq to profile transposase-accessible chromatin and the functional enrichment of the associated genomic regions in O. sinensis. This study offers a new and improved foundation for future genomics research in Chinese medaka.
Affiliation(s)
- Zhongdian Dong
- Key Laboratory of Aquaculture in the South China Sea for Aquatic Economic Animals of Guangdong Higher Education Institutes, College of Fishery, Guangdong Ocean University, Zhanjiang, 524088, China
- Guangdong Provincial Key Laboratory of Aquatic Animal Disease Control and Healthy Culture, College of Fishery, Guangdong Ocean University, Zhanjiang, 524088, China
- Jiangman Wang
- Qingdao Marine Management Support Center, Qingdao, Shandong, China
- Guozhu Chen
- National Plateau Wetland Research Center, College of Wetlands, Southwest Forestry University, Kunming, 650224, China
- Yusong Guo
- Key Laboratory of Aquaculture in the South China Sea for Aquatic Economic Animals of Guangdong Higher Education Institutes, College of Fishery, Guangdong Ocean University, Zhanjiang, 524088, China
- Na Zhao
- Key Laboratory of Aquaculture in the South China Sea for Aquatic Economic Animals of Guangdong Higher Education Institutes, College of Fishery, Guangdong Ocean University, Zhanjiang, 524088, China
- Southern Marine Science and Engineering Guangdong Laboratory-Zhanjiang, Zhanjiang, 524000, China
- Zhongduo Wang
- Key Laboratory of Aquaculture in the South China Sea for Aquatic Economic Animals of Guangdong Higher Education Institutes, College of Fishery, Guangdong Ocean University, Zhanjiang, 524088, China.
- Guangdong Provincial Key Laboratory of Aquatic Animal Disease Control and Healthy Culture, College of Fishery, Guangdong Ocean University, Zhanjiang, 524088, China.
- Bo Zhang
- Key Laboratory of Aquaculture in the South China Sea for Aquatic Economic Animals of Guangdong Higher Education Institutes, College of Fishery, Guangdong Ocean University, Zhanjiang, 524088, China.
- Southern Marine Science and Engineering Guangdong Laboratory-Zhanjiang, Zhanjiang, 524000, China.
2
Orro A, Trombetti GA. High-Accuracy ncRNA Function Prediction via Deep Learning Using Global and Local Sequence Information. Biomedicines 2023; 11:1631. [PMID: 37371726 DOI: 10.3390/biomedicines11061631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/18/2023] [Revised: 06/01/2023] [Accepted: 06/02/2023] [Indexed: 06/29/2023]
Abstract
The prediction of the biological function of non-coding ribonucleic acid (ncRNA) is an important step towards understanding the regulatory mechanisms underlying many diseases. Since non-coding RNAs are present in great abundance in human cells and are functionally diverse, developing functional prediction tools is necessary. With recent advances in non-coding RNA biology and the availability of complete genome sequences for a large number of species, we now have a window of opportunity for studying non-coding RNA biology. However, the computational methods used to predict the non-coding RNA functions are mostly either scarcely accurate, when based on sequence information alone, or prohibitively expensive in terms of computational burden when a secondary structure prediction is needed. We propose a novel computational method to predict the biological function of non-coding RNA genes that is based on a collection of deep network architectures utilizing solely ncRNA sequence information and which does not rely on or require expensive secondary ncRNA structure information. The approach presented in this work exhibits comparable or superior accuracy to methods that employ both sequence and structural features, at a much lower computational cost.
Affiliation(s)
- Alessandro Orro
- Institute for Biomedical Technologies, National Research Council (ITB-CNR), 20054 Segrate, Italy
- Gabriele A Trombetti
- Institute for Biomedical Technologies, National Research Council (ITB-CNR), 20054 Segrate, Italy
3
Castro-Muñoz LJ, Vázquez Ulloa E, Sahlgren C, Lizano M, De La Cruz-Hernández E, Contreras-Paredes A. Modulating epigenetic modifications for cancer therapy (Review). Oncol Rep 2023; 49:59. [PMID: 36799181 PMCID: PMC9942256 DOI: 10.3892/or.2023.8496] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Received: 08/18/2022] [Accepted: 11/08/2022] [Indexed: 02/12/2023]
Abstract
Cancer is a global public health concern. Alterations in epigenetic processes are among the earliest genomic aberrations occurring during cancer development and are closely related to progression. Unlike genetic mutations, aberrations in epigenetic processes are reversible, which opens the possibility for novel pharmacological treatments. Non‑coding RNAs (ncRNAs) represent an essential epigenetic mechanism, and emerging evidence links ncRNAs to carcinogenesis. Epigenetic drugs (epidrugs) are a group of promising target therapies for cancer treatment acting as coadjuvants to reverse drug resistance in cancer. The present review describes central epigenetic aberrations during malignant transformation and explains how epidrugs target DNA methylation, histone modifications and ncRNAs. Furthermore, clinical trials focused on evaluating the effect of these epidrugs alone or in combination with other anticancer therapies and other ncRNA‑based therapies are discussed. The use of epidrugs promises to be an effective tool for reversing drug resistance in some patients with cancer.
Affiliation(s)
- Elenaé Vázquez Ulloa
- Faculty of Science and Engineering/Cell Biology, University of Turku and Åbo Akademi University, Turku 20500, Finland
- Turku Bioscience, University of Turku and Åbo Akademi University, Turku 20500, Finland
- Cecilia Sahlgren
- Faculty of Science and Engineering/Cell Biology, University of Turku and Åbo Akademi University, Turku 20500, Finland
- Turku Bioscience, University of Turku and Åbo Akademi University, Turku 20500, Finland
- Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven 5600 MB, The Netherlands
- Institute for Complex Molecular Systems, Eindhoven University of Technology, Eindhoven 5600 MB, The Netherlands
- Marcela Lizano
- Unidad de Investigacion Biomedica en Cancer, Instituto Nacional de Cancerología-Universidad Nacional Autonoma de Mexico, Ciudad de Mexico 14080, Mexico
- Departamento de Medicina Genomica y Toxicologia Ambiental, Instituto de Investigaciones Biomedicas, Universidad Nacional Autonoma de Mexico, Mexico 04510, Mexico
- Erick De La Cruz-Hernández
- Laboratory of Research in Metabolic and Infectious Diseases, Multidisciplinary Academic Division of Comalcalco, Juarez Autonomous University of Tabasco, Comalcalco, Tabasco 86650, Mexico
- Adriana Contreras-Paredes
- Unidad de Investigacion Biomedica en Cancer, Instituto Nacional de Cancerología-Universidad Nacional Autonoma de Mexico, Ciudad de Mexico 14080, Mexico
4
Pérez-Rodríguez D, López-Fernández H, Agís-Balboa RC. Application of miRNA-seq in neuropsychiatry: A methodological perspective. Comput Biol Med 2021; 135:104603. [PMID: 34216893 DOI: 10.1016/j.compbiomed.2021.104603] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 05/04/2021] [Revised: 06/21/2021] [Accepted: 06/21/2021] [Indexed: 10/21/2022]
Abstract
MiRNAs are emerging as key molecules to study neuropsychiatric diseases. However, despite the large number of methodologies and software for miRNA-seq analyses, there is little supporting literature for researchers in this area. This review focuses on evaluating how miRNA-seq has been used to study neuropsychiatric diseases to date, analyzing both the main findings discovered and the bioinformatics workflows and tools used from a methodological perspective. The objective of this review is two-fold: first, to evaluate current miRNA-seq procedures used in neuropsychiatry; and second, to offer comprehensive information that can serve as a guide to new researchers in bioinformatics. After conducting a systematic search (from 2016 to June 30, 2020) of articles using miRNA-seq in neuropsychiatry, we have seen that it has already been used for different types of studies in three main categories: diagnosis, prognosis, and mechanism. We carefully analyzed the bioinformatics workflows of each study, observing a high degree of variability with respect to the tools and methods used and several methodological complexities that are identified and discussed in this review.
Affiliation(s)
- Daniel Pérez-Rodríguez
- Translational Neuroscience Group-CIBERSAM, Galicia Sur Health Research Institute (IIS Galicia Sur), Área Sanitaria de Vigo-Hospital Álvaro Cunqueiro, SERGAS-UVIGO, 36213, Vigo, Spain; NeuroEpigenetics Lab. University Hospital Complex of Vigo, SERGAS-UVIGO, 36213, Vigo, Spain
- Hugo López-Fernández
- Instituto de Investigação e Inovação Em Saúde (I3S), Universidade Do Porto, Rua Alfredo Allen, 208, 4200-135, Porto, Portugal; CINBIO, Universidade de Vigo, Department of Computer Science, ESEI - Escuela Superior de Ingeniería Informática, 32004, Ourense, Spain; SING Research Group, Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, Spain.
- Roberto C Agís-Balboa
- Translational Neuroscience Group-CIBERSAM, Galicia Sur Health Research Institute (IIS Galicia Sur), Área Sanitaria de Vigo-Hospital Álvaro Cunqueiro, SERGAS-UVIGO, 36213, Vigo, Spain; NeuroEpigenetics Lab. University Hospital Complex of Vigo, SERGAS-UVIGO, 36213, Vigo, Spain.
5
Gupta M, Chandan K, Sarwat M. Role of microRNA and Long Non-Coding RNA in Hepatocellular Carcinoma. Curr Pharm Des 2020; 26:415-428. [PMID: 31939724 PMCID: PMC7403690 DOI: 10.2174/1381612826666200115093835] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Received: 10/12/2019] [Accepted: 12/04/2019] [Indexed: 02/08/2023]
Abstract
Hepatocellular carcinoma (HCC) accounts for about 80-90% of all liver cancers and is the third most common cause of cancer mortality in the Asia-Pacific region. Risk factors include hepatitis B and C virus, cirrhosis, aflatoxin-contaminated food, alcohol, and diabetes. Surgically removing the tumor tissue seems effective, but a high chance of recurrence has led to an urgent need to develop novel molecules for the treatment of HCC. Clinical management with sorafenib is effective, but it is only able to prolong survival for a few months. Various side effects such as gastrointestinal and abdominal pain, hypertension, and hemorrhage are also associated with sorafenib, which underscores the unmet need for effective therapies against HCC. Similarly, the genetic mechanisms behind the occurrence of HCC are still unknown and need to be elucidated further to develop newer candidates. Transcriptomics has revealed the role of non-coding RNAs (ncRNAs) in many cellular, physiological and pathobiological processes. ncRNAs are also widely associated with, and abundantly expressed in, a variety of cancers. Their aberrant expression and mutations are closely related to tumorigenesis and metastasis, and ncRNAs are hence regarded as novel biomarkers and therapeutic targets for the treatment of cancer, including HCC. This review summarises the relationship between ncRNAs and hepatocellular carcinoma.
Affiliation(s)
- Meenakshi Gupta
- Amity Institute of Pharmacy, Amity University, Noida-201313, Uttar Pradesh, India
- Kumari Chandan
- Amity Institute of Pharmacy, Amity University, Noida-201313, Uttar Pradesh, India
- Maryam Sarwat
- Amity Institute of Pharmacy, Amity University, Noida-201313, Uttar Pradesh, India
6
Rojano E, Seoane P, Ranea JAG, Perkins JR. Regulatory variants: from detection to predicting impact. Brief Bioinform 2019; 20:1639-1654. [PMID: 29893792 PMCID: PMC6917219 DOI: 10.1093/bib/bby039] [Citation(s) in RCA: 65] [Impact Index Per Article: 13.0] [Received: 03/13/2018] [Revised: 04/18/2018] [Indexed: 02/01/2023]
Abstract
Variants within non-coding genomic regions can greatly affect disease. In recent years, increasing focus has been given to these variants, and how they can alter regulatory elements, such as enhancers, transcription factor binding sites and DNA methylation regions. Such variants can be considered regulatory variants. Concurrently, much effort has been put into establishing international consortia to undertake large projects aimed at discovering regulatory elements in different tissues, cell lines and organisms, and probing the effects of genetic variants on regulation by measuring gene expression. Here, we describe methods and techniques for discovering disease-associated non-coding variants using sequencing technologies. We then explain the computational procedures that can be used for annotating these variants using the information from the aforementioned projects, and for predicting their putative effects, including potential pathogenicity, based on rule-based and machine learning approaches. We provide the details of techniques to validate these predictions, by mapping chromatin-chromatin and chromatin-protein interactions, and introduce Clustered Regularly Interspaced Short Palindromic Repeats-Associated Protein 9 (CRISPR-Cas9) technology, which has already been used in this field and is likely to have a big impact on its future evolution. We also give examples of regulatory variants associated with multiple complex diseases. This review is aimed at bioinformaticians interested in the characterization of regulatory variants, molecular biologists and geneticists interested in understanding more about the nature and potential role of such variants from a functional point of view, and clinicians who may wish to learn about variants in non-coding genomic regions associated with a given disease and find out what to do next to uncover how they impact the underlying mechanisms.
Affiliation(s)
- Elena Rojano
- Department of Molecular Biology and Biochemistry, University of Malaga (UMA), 29010 Malaga, Spain
- Pedro Seoane
- Department of Molecular Biology and Biochemistry, University of Malaga (UMA), 29010 Malaga, Spain
- Juan A G Ranea
- CIBER de Enfermedades Raras, ISCIII, Madrid, Spain and Department of Molecular Biology and Biochemistry, University of Malaga (UMA), 29010 Malaga, Spain
- James R Perkins
- Research laboratory, IBIMA-Regional University Hospital of Malaga, UMA, Malaga 29009, Spain
7
Li Y, He Y, Han S, Liang Y. Identification and Functional Inference for Tumor-Associated Long Non-Coding RNA. IEEE/ACM Transactions on Computational Biology and Bioinformatics 2019; 16:1288-1301. [PMID: 28358691 DOI: 10.1109/tcbb.2017.2687442] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Indexed: 06/07/2023]
Abstract
Gastric cancer is one of the leading causes of cancer mortality worldwide, especially in China. In recent years, some long non-coding RNAs (lncRNAs) have been found to be dysregulated in many cancers, and the relationship between lncRNAs and cancer has attracted increasing attention. The molecular mechanism of gastric cancer remains largely unclear, especially with respect to lncRNAs. Experiments can provide the relevant information; however, experimental identification of cancer-related lncRNAs is usually time-consuming and costly. In this paper, a computational method is proposed to determine the relationship between lncRNAs and gastric cancer by reusing exon-based array data for gastric cancer. One specific lncRNA, LINC00365, and its differentially expressed target genes, whose products are predicted to be excreted into blood, urine, or saliva, are identified as candidates for a combined biomarker for gastric cancer. The biological functions and molecular mechanisms of the gastric cancer-related lncRNAs and coding-gene biomarkers are further inferred from multi-source biological knowledge.
8
Huang S, Ichikawa Y, Yoshitake K, Kinoshita S, Igarashi Y, Omori F, Maeyama K, Nagai K, Watabe S, Asakawa S. Identification and Characterization of microRNAs and Their Predicted Functions in Biomineralization in the Pearl Oyster (Pinctada fucata). Biology 2019; 8:biology8020047. [PMID: 31212990 PMCID: PMC6627748 DOI: 10.3390/biology8020047] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Received: 04/18/2019] [Revised: 06/12/2019] [Accepted: 06/13/2019] [Indexed: 11/16/2022]
Abstract
The biological process of pearl formation is an ongoing research topic, and a number of genes associated with this process have been identified. However, the involvement of microRNAs (miRNAs) in biomineralization in the pearl oyster, Pinctada fucata, is not well understood. In order to investigate the divergence and function of miRNAs in P. fucata, we performed a transcriptome analysis of small RNA libraries prepared from adductor muscle, gill, ovary, and mantle tissues. We identified 186 known and 42 novel miRNAs in these tissues. Clustering analysis showed that the expression patterns of miRNAs were similar among the somatic tissues, but they differed significantly between the somatic and ovary tissues. To validate the existence of the identified miRNAs, nine known and three novel miRNAs were verified by stem-loop qRT-PCR using U6 snRNA as an internal reference. The expression abundance and target prediction between miRNAs and biomineralization-related genes indicated that miR-1990c-3p, miR-876, miR-9a-3p, and novel-3 may be key factors in the regulatory network that act by controlling the formation of matrix proteins or the differentiation of mineralogenic cells during shell formation in mantle tissue. Our findings serve to further clarify the processes underlying biomineralization in P. fucata.
Affiliation(s)
- Songqian Huang
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Bunkyo-ku, Tokyo 113-8657, Japan.
- Yuki Ichikawa
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Bunkyo-ku, Tokyo 113-8657, Japan.
- Kazutoshi Yoshitake
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Bunkyo-ku, Tokyo 113-8657, Japan.
- Shigeharu Kinoshita
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Bunkyo-ku, Tokyo 113-8657, Japan.
- Yoji Igarashi
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Bunkyo-ku, Tokyo 113-8657, Japan.
- Fumito Omori
- Mikimoto Pharmaceutical CO., LTD., Kurose 1425, Ise, Mie 516-8581, Japan.
- Kaoru Maeyama
- Mikimoto Pharmaceutical CO., LTD., Kurose 1425, Ise, Mie 516-8581, Japan.
- Kiyohito Nagai
- Pearl Research Laboratory, K. MIKIMOTO & CO., LTD., Osaki Hazako 923, Hamajima, Shima, Mie 517-0403, Japan.
- Shugo Watabe
- School of Marine Biosciences, Kitasato University, Minami-ku, Sagamihara, Kanagawa 252-0313, Japan.
- Shuichi Asakawa
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Bunkyo-ku, Tokyo 113-8657, Japan.
9
Emamjomeh A, Zahiri J, Asadian M, Behmanesh M, Fakheri BA, Mahdevar G. Identification, Prediction and Data Analysis of Noncoding RNAs: A Review. Med Chem 2019; 15:216-230. [PMID: 30484409 DOI: 10.2174/1573406414666181015151610] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 11/12/2017] [Revised: 06/03/2018] [Accepted: 09/30/2018] [Indexed: 12/13/2022]
Abstract
BACKGROUND: Noncoding RNAs (ncRNAs), which play an important role in various cellular processes, are important in medicine as well as in drug design strategies. Different studies have shown that ncRNAs are dysregulated in cancer cells and play an important role in human tumorigenesis. Therefore, it is important to identify and predict such molecules by experimental and computational methods, respectively. However, to avoid expensive experimental methods, computational algorithms have been developed for accurate and fast prediction of ncRNAs. OBJECTIVE: The aim of this review was to introduce the experimental and computational methods used to identify ncRNAs and predict their structure. We also briefly explain the roles of ncRNAs in cellular processes and drug design. METHOD: In this survey, we introduce ncRNAs and their roles in biological and medicinal processes. Then, some important laboratory techniques for identifying ncRNAs are examined. Finally, state-of-the-art models and algorithms are introduced along with important tools and databases. RESULTS: The results showed that integrating experimental and computational approaches improves the identification of ncRNAs. Moreover, highly accurate databases, algorithms and tools for predicting ncRNAs were compared. CONCLUSION: ncRNA prediction is an exciting research field, but it faces various difficulties. It requires accurate and reliable algorithms and tools. The computational costs of such algorithms, including running time and memory usage, are also very important. Finally, some suggestions are presented for improving computational methods for ncRNA gene and structure prediction.
Affiliation(s)
- Abbasali Emamjomeh
- Laboratory of Computational Biotechnology and Bioinformatics (CBB), Department of Plant Breeding and Biotechnology (PBB), University of Zabol, Zabol, Iran
- Javad Zahiri
- Bioinformatics and Computational Omics Lab (BioCOOL), Department of Biophysics, Faculty of Biological Sciences, Tarbiat Modares University, Tehran, Iran
- Mehrdad Asadian
- Department of Plant Breeding and Biotechnology (PBB), Faculty of Agriculture, University of Zabol, Zabol, Iran
- Mehrdad Behmanesh
- Department of Genetics, Faculty of Biological Sciences, Tarbiat Modares University, Tehran, Iran
- Barat A Fakheri
- Department of Plant Breeding and Biotechnology (PBB), Faculty of Agriculture, University of Zabol, Zabol, Iran
- Ghasem Mahdevar
- Department of Mathematics, Faculty of Sciences, University of Isfahan, Isfahan, Iran
10
Abstract
One of the most important resources for researchers of noncoding RNAs is the information available in public databases spread over the internet. However, the effective exploration of this data can be a daunting task, given the large number of databases available and the variety of stored data. This chapter describes a classification of databases based on information source, type of RNA, source organisms, data formats, and the mechanisms for information retrieval, detailing the relevance of each of these classifications and its usability by researchers. This classification is used to update a 2012 review, now indexing more than 229 public databases. This review will include an assessment of the new trends for ncRNA research based on the information that is being offered by the databases. Additionally, we will expand the previous analysis, focusing on the usability and application of these databases in pathogen and disease research. Finally, this chapter will analyze how currently available database schemas can help the development of new and improved web resources.
11
You C, Zhu K, Zhang Q, Yan J, Wang Y, Li J. ODNA: a manually curated database of noncoding RNAs associated with orthopedics. Database: The Journal of Biological Databases and Curation 2019; 2019:5641100. [PMID: 31781773 PMCID: PMC6882730 DOI: 10.1093/database/baz126] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Revised: 04/04/2019] [Accepted: 10/02/2019] [Indexed: 12/20/2022]
Affiliation(s)
- Changcheng You
- Department of Orthopedic Surgery, Second Affiliated Hospital, Harbin Medical University, Harbin, China
- Kai Zhu
- Harbin Children's Hospital, Harbin, China
- Qiuhua Zhang
- Department of Orthopedic Surgery, Second Affiliated Hospital, Harbin Medical University, Harbin, China
- Jnglong Yan
- Department of Orthopedic Surgery, Second Affiliated Hospital, Harbin Medical University, Harbin, China
- Yufu Wang
- Department of Orthopedic Surgery, Second Affiliated Hospital, Harbin Medical University, Harbin, China
- Jing Li
- Department of Pathology and Centre of Electron Microscope, Faculty of Basic Science, Harbin Medical University, Harbin, China; Laboratory Medicine and Pathology, Faculty of Medicine & Dentistry, University of Alberta, Edmonton, Canada
12
Liao P, Li S, Cui X, Zheng Y. A comprehensive review of web-based resources of non-coding RNAs for plant science research. Int J Biol Sci 2018; 14:819-832. [PMID: 29989090 PMCID: PMC6036741 DOI: 10.7150/ijbs.24593] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Received: 12/27/2017] [Accepted: 03/14/2018] [Indexed: 01/06/2023]
Abstract
Non-coding RNAs (ncRNAs) are transcribed from the genome but not translated into proteins. Many ncRNAs are key regulators of plant growth and development, metabolism and stress tolerance. To make web-based ncRNA resources for plant science research more easily accessible and understandable, we comprehensively reviewed 83 web-based resources of three types: genome databases containing ncRNA data, microRNA (miRNA) databases and long non-coding RNA (lncRNA) databases. To facilitate effective usage of these resources, we also suggest some preferred miRNA and lncRNA resources for performing meaningful analysis.
Affiliation(s)
- Peiran Liao
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, Yunnan, 650500, China
- Shipeng Li
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, Yunnan, 650500, China
- Xiuming Cui
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, Yunnan, 650500, China
- Yunnan Key Laboratory of Panax notoginseng, Kunming, Yunnan, 650500, China
- Yun Zheng
- Yunnan Key Laboratory of Primate Biomedical Research, Institute of Primate Translational Medicine, Kunming University of Science and Technology, Kunming, Yunnan, 650500, China
13
Swain TD. Revisiting the phylogeny of Zoanthidea (Cnidaria: Anthozoa): Staggered alignment of hypervariable sequences improves species tree inference. Mol Phylogenet Evol 2017; 118:1-12. [PMID: 28919505 DOI: 10.1016/j.ympev.2017.09.008] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Received: 11/22/2016] [Revised: 09/11/2017] [Accepted: 09/13/2017] [Indexed: 10/18/2022]
Abstract
The recent rapid proliferation of novel taxon identification in the Zoanthidea has been accompanied by a parallel propagation of gene trees as a tool of species discovery, but not a corresponding increase in our understanding of phylogeny. This disparity is caused by the trade-off between the capabilities of automated DNA sequence alignment and data content of genes applied to phylogenetic inference in this group. Conserved genes or segments are easily aligned across the order, but produce poorly resolved trees; hypervariable genes or segments contain the evolutionary signal necessary for resolution and robust support, but sequence alignment is daunting. Staggered alignments are a form of phylogeny-informed sequence alignment composed of a mosaic of local and universal regions that allow phylogenetic inference to be applied to all nucleotides from both hypervariable and conserved gene segments. Comparisons between species tree phylogenies inferred from all data (staggered alignment) and hypervariable-excluded data (standard alignment) demonstrate improved confidence and greater topological agreement with other sources of data for the complete-data tree. This novel phylogeny is the most comprehensive to date (in terms of taxa and data) and can serve as an expandable tool for evolutionary hypothesis testing in the Zoanthidea. Spanish language abstract available in Text S1. Translation by L. O. Swain, DePaul University, Chicago, Illinois, 60604, USA.
Affiliation(s)
- Timothy D Swain
- Integrative Research Center, Field Museum of Natural History, Chicago, IL 60605, USA; Department of Civil and Environmental Engineering, Northwestern University, Evanston, IL 60208, USA.
14
Cohen JE, Lee PR, Fields RD. Systematic identification of 3'-UTR regulatory elements in activity-dependent mRNA stability in hippocampal neurons. Philos Trans R Soc Lond B Biol Sci 2015; 369:rstb.2013.0509. [PMID: 25135970 PMCID: PMC4142030 DOI: 10.1098/rstb.2013.0509] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Indexed: 01/15/2023]
Abstract
Ongoing neuronal activity during development and plasticity acts to refine synaptic connections and contributes to the induction of plasticity and ultimately long-term memory storage. Activity-dependent, post-transcriptional control of mRNAs occurs through transport to axonal and dendritic compartments, local translation and mRNA stability. We have identified a mechanism that contributes to activity-dependent regulation of mRNA stability during synaptic plasticity in rat hippocampal neurons. In this study, we demonstrate rapid, post-transcriptional control over process-enriched mRNAs by neuronal activity. Systematic analysis of the 3'-UTRs of destabilized transcripts identifies enrichment in sequence motifs corresponding to microRNA (miRNA)-binding sites. The miRNAs that were identified (miR-326-3p/miR-330-5p, miR-485-5p, miR-666-3p and miR-761) are predicted to regulate networks of genes important in plasticity and development. We find that these miRNAs are developmentally regulated in the hippocampus, many increasing by postnatal day 14. We further find that miR-485-5p controls NGF-induced neurite outgrowth in PC12 cells, tau expression and axonal development in hippocampal neurons. miRNAs can function at the synapse to rapidly control and affect short- and long-term changes at the synapse. These processes likely occur during refinement of synaptic connections and contribute to the induction of plasticity and learning and memory.
Affiliation(s)
- Jonathan E Cohen
- Section on Nervous System Development and Plasticity, The Eunice Kennedy Shriver National Institute of Child and Human Development, National Institute of Health, Building 35, Room 2A211, Bethesda, MD 20892-3714, USA
| | - Philip R Lee
- Section on Nervous System Development and Plasticity, The Eunice Kennedy Shriver National Institute of Child and Human Development, National Institute of Health, Building 35, Room 2A211, Bethesda, MD 20892-3714, USA
| | - R Douglas Fields
- Section on Nervous System Development and Plasticity, The Eunice Kennedy Shriver National Institute of Child and Human Development, National Institute of Health, Building 35, Room 2A211, Bethesda, MD 20892-3714, USA
| |
|
15
|
Sun M, Kraus WL. From discovery to function: the expanding roles of long noncoding RNAs in physiology and disease. Endocr Rev 2015; 36:25-64. [PMID: 25426780 PMCID: PMC4309736 DOI: 10.1210/er.2014-1034] [Citation(s) in RCA: 314] [Impact Index Per Article: 34.9] [Indexed: 02/07/2023]
Abstract
Long noncoding RNAs (lncRNAs) are a relatively poorly understood class of RNAs with little or no coding capacity transcribed from a set of incompletely annotated genes. They have received considerable attention in the past few years and are emerging as potentially important players in biological regulation. Here we discuss the evolving understanding of this new class of molecular regulators that has emerged from ongoing research, which continues to expand our databases of annotated lncRNAs and provide new insights into their physical properties, molecular mechanisms of action, and biological functions. We outline the current strategies and approaches that have been employed to identify and characterize lncRNAs, which have been instrumental in revealing their multifaceted roles ranging from cis- to trans-regulation of gene expression and from epigenetic modulation in the nucleus to posttranscriptional control in the cytoplasm. In addition, we highlight the molecular and biological functions of some of the best characterized lncRNAs in physiology and disease, especially those relevant to endocrinology, reproduction, metabolism, immunology, neurobiology, muscle biology, and cancer. Finally, we discuss the tremendous diagnostic and therapeutic potential of lncRNAs in cancer and other diseases.
Affiliation(s)
- Miao Sun
- Laboratory of Signaling and Gene Regulation, Cecil H. and Ida Green Center for Reproductive Biology Sciences and Division of Basic Reproductive Biology Research, Department of Obstetrics and Gynecology, University of Texas Southwestern Medical Center, Dallas, Texas 75390
| | | |
|
16
|
Paschoal AR, Maracaja-Coutinho V, Setubal JC, Simões ZLP, Verjovski-Almeida S, Durham AM. Non-coding transcription characterization and annotation. RNA Biol 2014; 9:274-82. [DOI: 10.4161/rna.19352] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Indexed: 12/31/2022] Open
|
17
|
Li C, Yang L, Lin C. Long noncoding RNAs in prostate cancer: mechanisms and applications. Mol Cell Oncol 2014; 1:e963469. [PMID: 27308347 DOI: 10.4161/23723548.2014.963469] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Received: 06/30/2014] [Revised: 08/04/2014] [Accepted: 08/12/2014] [Indexed: 12/26/2022]
Abstract
A large proportion of the control of gene expression in humans is mediated by noncoding elements in the genome. Long noncoding RNAs (lncRNAs) have emerged as a new class of pivotal regulatory components, orchestrating extensive cellular processes and connections. LncRNAs play various roles from chromatin modification to alternative splicing and post-transcriptional processing and are involved in almost all aspects of eukaryotic regulation. LncRNA-based mechanisms modulate cell fates during development, and their dysregulation underlies many human disorders, especially cancer, through chromosomal translocation, deletion, and nucleotide expansions. Recent studies demonstrate that multiple prostate cancer risk loci are associated with lncRNAs and that ectopic expression of these transcripts triggers a cascade of cellular events driving tumor initiation and progression. The recent increased rate of discovery of lncRNAs has been leveraged for application in clinical strategies such as novel biomarkers and therapeutic targets. Despite this potential, many issues remain to be addressed in this fast-growing field.
Affiliation(s)
- Chunlai Li
- Department of Molecular and Cellular Oncology; The University of Texas MD Anderson Cancer Center; Houston, TX, 77030, USA
| | - Liuqing Yang
- Department of Molecular and Cellular Oncology; The University of Texas MD Anderson Cancer Center; Houston, TX, 77030, USA; Program in Cancer Biology; The University of Texas Graduate School of Biomedical Sciences at Houston; Houston, TX, 77030, USA
| | - Chunru Lin
- Department of Molecular and Cellular Oncology; The University of Texas MD Anderson Cancer Center; Houston, TX, 77030, USA; Program in Cancer Biology; The University of Texas Graduate School of Biomedical Sciences at Houston; Houston, TX, 77030, USA
| |
|
18
|
Chan WL, Huang HD, Chang JG. lncRNAMap: A map of putative regulatory functions in the long non-coding transcriptome. Comput Biol Chem 2014; 50:41-9. [DOI: 10.1016/j.compbiolchem.2014.01.003] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Accepted: 12/23/2013] [Indexed: 12/20/2022]
|
19
|
Nielsen MM, Tehler D, Vang S, Sudzina F, Hedegaard J, Nordentoft I, Ørntoft TF, Lund AH, Pedersen JS. Identification of expressed and conserved human noncoding RNAs. RNA (NEW YORK, N.Y.) 2014; 20:236-251. [PMID: 24344320 PMCID: PMC3895275 DOI: 10.1261/rna.038927.113] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Received: 03/02/2013] [Accepted: 11/07/2013] [Indexed: 06/03/2023]
Abstract
The past decade has shown mammalian genomes to be pervasively transcribed and identified thousands of noncoding (nc) transcripts. It is currently unclear to what extent these transcripts are of functional importance, as experimental functional evidence exists for only a small fraction. Here, we characterize the expression and evolutionary conservation properties of 12,115 known and novel nc transcripts, including structural RNAs, long nc RNAs (lncRNAs), antisense RNAs, EvoFold predictions, ultraconserved elements, and expressed nc regions. Expression levels are evaluated across 12 human tissues using a custom-designed microarray, supplemented with RNAseq. Conservation levels are evaluated at both the base level and at the syntenic level. We combine these measures with epigenetic mark annotations to identify subsets of novel nc transcripts that show characteristics similar to known functional ncRNAs. Few novel nc transcripts show both high expression and conservation levels. However, overall, we observe a positive correlation between expression and both conservation and epigenetic annotations, suggesting that a subset of the expressed transcripts are under purifying selection and likely functional. The identified subsets of expressed and conserved novel nc transcripts may form the basis for further functional characterization.
Affiliation(s)
- Morten Muhlig Nielsen
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Skejby, DK-8200 Aarhus N, Denmark
| | - Disa Tehler
- Biotech Research and Innovation Centre, University of Copenhagen, DK-2200 Copenhagen, Denmark
| | - Søren Vang
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Skejby, DK-8200 Aarhus N, Denmark
| | - Frantisek Sudzina
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Skejby, DK-8200 Aarhus N, Denmark
| | - Jakob Hedegaard
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Skejby, DK-8200 Aarhus N, Denmark
| | - Iver Nordentoft
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Skejby, DK-8200 Aarhus N, Denmark
| | - Torben Falck Ørntoft
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Skejby, DK-8200 Aarhus N, Denmark
| | - Anders H. Lund
- Biotech Research and Innovation Centre, University of Copenhagen, DK-2200 Copenhagen, Denmark
| | - Jakob Skou Pedersen
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Skejby, DK-8200 Aarhus N, Denmark
| |
|
20
|
Arrigo P. MicroRNA and noncoding RNA-related data sources. Methods Mol Biol 2014; 1107:73-89. [PMID: 24272432 DOI: 10.1007/978-1-62703-748-8_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/02/2023]
Abstract
Noncoding RNAs (ncRNAs) are ribonucleic acids capable of controlling different genetic and metabolic functions. These molecules have been recently organized into different classes, and among them microRNAs (miRNAs) are extensively studied. MicroRNAs are short oligomers mainly involved in posttranscriptional gene silencing. The specific research field, focused on structural and functional characterization of microRNAs, is commonly called mirnomics. The exploitation of the interest in microRNAs has stimulated the organization of several databases that are often integrated with analytical tools in order to predict microRNA targets, or to find those miRNAs capable of inhibiting the expression of a specific protein. This work attempts to provide an overview of accessible information about microRNAs and other noncoding RNAs that has been gathered in curated databases.
|
21
|
Identification and characterisation of non-coding small RNAs in the pathogenic filamentous fungus Trichophyton rubrum. BMC Genomics 2013; 14:931. [PMID: 24377353 PMCID: PMC3890542 DOI: 10.1186/1471-2164-14-931] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Received: 09/08/2013] [Accepted: 12/20/2013] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Accumulating evidence demonstrates that non-coding RNAs (ncRNAs) are indispensable components of many organisms and play important roles in cellular events, regulation, and development. RESULTS Here, we analysed the small non-coding RNA (ncRNA) transcriptome of Trichophyton rubrum by constructing and sequencing a cDNA library from conidia and mycelia. We identified 352 ncRNAs and their corresponding genomic loci. These ncRNA candidates included 198 entirely novel ncRNAs and 154 known ncRNAs classified as snRNAs, snoRNAs and other known ncRNAs. Further bioinformatic analysis detected 96 snoRNAs, including 56 snoRNAs that had been annotated in other organisms and 40 novel snoRNAs. All snoRNAs belonged to two major classes--C/D box snoRNAs and H/ACA snoRNAs--and their potential target sites in rRNAs and snRNAs were predicted. To analyse the evolutionary conservation of the ncRNAs in T. rubrum, we aligned all 352 ncRNAs to the genomes of six dermatophytes and to the NCBI non-redundant nucleotide database (NT). The results showed that most of the identified snRNAs were conserved in dermatophytes. Of the 352 ncRNAs, 102 also had genomic loci in other dermatophytes, and 27 were dermatophyte-specific. CONCLUSIONS Our systematic analysis may provide important clues to the function and evolution of ncRNAs in T. rubrum. These results also provide important information to complement the current annotation of the T. rubrum genome, which primarily comprises protein-coding genes.
|
22
|
Xie C, Yuan J, Li H, Li M, Zhao G, Bu D, Zhu W, Wu W, Chen R, Zhao Y. NONCODEv4: exploring the world of long non-coding RNA genes. Nucleic Acids Res 2013; 42:D98-103. [PMID: 24285305 PMCID: PMC3965073 DOI: 10.1093/nar/gkt1222] [Citation(s) in RCA: 338] [Impact Index Per Article: 30.7] [Indexed: 02/01/2023] Open
Abstract
NONCODE (http://www.bioinfo.org/noncode/) is an integrated knowledge database dedicated to non-coding RNAs (excluding tRNAs and rRNAs). Non-coding RNAs (ncRNAs) have been implicated in diseases and identified to play important roles in various biological processes. Since NONCODE version 3.0 was released 2 years ago, discovery of novel ncRNAs has been promoted by high-throughput RNA sequencing (RNA-Seq). In this update of NONCODE, we expand the ncRNA data set by collecting newly identified ncRNAs from literature published in the last 2 years and integrating the latest versions of RefSeq and Ensembl. In particular, the number of long non-coding RNAs (lncRNAs) has increased sharply from 73 327 to 210 831. Because lncRNAs show alternative splicing patterns similar to those of mRNAs, the concept of lncRNA genes was put forward to help systematic understanding of lncRNAs. In total, 56 018 and 46 475 lncRNA genes were generated from 95 135 and 67 628 lncRNAs for human and mouse, respectively. Additionally, we present expression profiles of lncRNA genes as graphs based on public RNA-seq data for human and mouse, and predict functions of these lncRNA genes. The improvements brought to the database also include an ID conversion tool from RefSeq or Ensembl ID to NONCODE ID and a service for lncRNA identification. NONCODE is also accessible through http://www.noncode.org/.
Affiliation(s)
- Chaoyong Xie
- Bioinformatics Research Group, Advanced Computing Research Laboratory, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China, University of Chinese Academy of Sciences, Beijing 100049, China, Laboratory of Noncoding RNA, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China and Taicang Institute of Life Sciences Information, Suzhou 215400, China
| | | | | | | | | | | | | | | | | | | |
|
23
|
Severino P, Oliveira LS, Torres N, Andreghetto FM, Klingbeil MDFG, Moyses R, Wünsch-Filho V, Nunes FD, Mathor MB, Paschoal AR, Durham AM. High-throughput sequencing of small RNA transcriptomes reveals critical biological features targeted by microRNAs in cell models used for squamous cell cancer research. BMC Genomics 2013; 14:735. [PMID: 24160351 PMCID: PMC3870990 DOI: 10.1186/1471-2164-14-735] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Received: 10/29/2012] [Accepted: 10/17/2013] [Indexed: 11/11/2022] Open
Abstract
Background The implication of post-transcriptional regulation by microRNAs in molecular mechanisms underlying cancer disease is well documented. However, their interference at the cellular level is not fully explored. Functional in vitro studies are fundamental for the comprehension of their role; nevertheless, results are highly dependent on the adopted cellular model. Next generation small RNA transcriptomic sequencing data of a tumor cell line and keratinocytes derived from primary culture were generated in order to characterize the microRNA content of these systems, thus helping in their understanding. Both constitute cell models for functional studies of microRNAs in head and neck squamous cell carcinoma (HNSCC), a smoking-related cancer. Known microRNAs were quantified and analyzed in the context of gene regulation. New microRNAs were investigated using similarity and structural search, ab initio classification, and prediction of the location of mature microRNAs within would-be precursor sequences. Results were compared with small RNA transcriptomic sequences from HNSCC samples in order to assess the applicability of these cell models for cancer phenotype comprehension and for novel molecule discovery. Results Ten miRNAs represented over 70% of the mature molecules present in each of the cell types. The most expressed molecules were miR-21, miR-24 and miR-205. Accordingly, miR-21 and miR-205 have been previously shown to play a role in epithelial cell biology. Although miR-21 has been implicated in cancer development, and evaluated as a biomarker in HNSCC progression, no significant expression differences were seen between cell types. We demonstrate that differentially expressed mature miRNAs target cell differentiation and apoptosis related biological processes, indicating that they might represent, with acceptable accuracy, the genetic context from which they derive.
Most miRNAs identified in the cancer cell line and in keratinocytes were present in tumor samples and cancer-free samples, respectively, with miR-21, miR-24 and miR-205 still among the most prevalent molecules in all instances. Thirteen miRNA-like structures, containing reads identified by the deep sequencing, were predicted from putative miRNA precursor sequences. Strong evidence suggests that one of them could be a new miRNA. This molecule was mostly expressed in the tumor cell line and HNSCC samples, indicating a possible biological function in cancer. Conclusions Critical biological features of cells must be fully understood before they can be chosen as models for functional studies. Expression levels of miRNAs relate to cell type and tissue context. This study provides insights on the miRNA content of two cell models used for cancer research. Pathways commonly deregulated in HNSCC might be targeted by the most expressed and also by differentially expressed miRNAs. Results indicate that the use of cell models for cancer research demands careful assessment of underlying molecular characteristics for proper data interpretation. Additionally, one new miRNA-like molecule with a potential role in cancer was identified in the cell lines and clinical samples.
Affiliation(s)
- Patricia Severino
- Albert Einstein Research and Education Institute, Hospital Israelita Albert Einstein, Sao Paulo, SP, Brazil.
| | | | | | | | | | | | | | | | | | | | | |
|
24
|
|
25
|
Sridhar S, Sharma A, Kongshaug H, Nilsen F, Jonassen I. Whole genome sequencing of the fish pathogen Francisella noatunensis subsp. orientalis Toba04 gives novel insights into Francisella evolution and pathogenecity. BMC Genomics 2012; 13:598. [PMID: 23131096 PMCID: PMC3532336 DOI: 10.1186/1471-2164-13-598] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Received: 06/07/2012] [Accepted: 10/31/2012] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND Francisella is a genus of gram-negative bacteria highly virulent in fish and humans; F. tularensis causes the serious disease tularaemia in humans. Recently, Francisella species have been reported to cause mortality in aquaculture species like Atlantic cod and tilapia. We have completed the sequencing and draft assembly of the Francisella noatunensis subsp. orientalis Toba04 strain isolated from farmed tilapia. Compared to other available Francisella genomes, it is most similar to the genome of Francisella philomiragia subsp. philomiragia, a free-living bacterium not virulent to humans. RESULTS The genome is rearranged compared to the available Francisella genomes even though we found no IS-elements in the genome. Nearly 16% of the predicted ORFs are pseudogenes. Computational pathway analysis indicates that a number of the metabolic pathways are disrupted due to pseudogenes. Comparing the novel genome with other available Francisella genomes, we found around 2.5% of unique genes present in Francisella noatunensis subsp. orientalis Toba04 and a list of genes uniquely present in the human-pathogenic Francisella subspecies. Most of these genes might have been transferred from bacterial species through horizontal gene transfer. Comparative analysis between human and fish pathogens also provides insights into genes responsible for pathogenicity. Our analysis of pseudogenes indicates that the evolution of pseudogenes in the Francisella subspecies from tilapia is old, with a large number of pseudogenes having more than one inactivating mutation. CONCLUSIONS The fish pathogen lost non-essential genes some time ago. Evolutionary analysis of the Francisella genomes strongly suggests that human and fish pathogenic Francisella species have evolved independently from free-living metabolically competent Francisella species. These findings will contribute to understanding the evolution of Francisella species and pathogenesis.
Affiliation(s)
- Settu Sridhar
- Department of Informatics, University of Bergen, Norway
| | | | | | | | | |
|
26
|
Volders PJ, Helsens K, Wang X, Menten B, Martens L, Gevaert K, Vandesompele J, Mestdagh P. LNCipedia: a database for annotated human lncRNA transcript sequences and structures. Nucleic Acids Res 2012; 41:D246-51. [PMID: 23042674 PMCID: PMC3531107 DOI: 10.1093/nar/gks915] [Citation(s) in RCA: 395] [Impact Index Per Article: 32.9] [Indexed: 01/24/2023] Open
Abstract
Here, we present LNCipedia (http://www.lncipedia.org), a novel database for human long non-coding RNA (lncRNA) transcripts and genes. LncRNAs constitute a large and diverse class of non-coding RNA genes. Although several lncRNAs have been functionally annotated, the majority remain to be characterized. Different high-throughput methods to identify new lncRNAs (including RNA sequencing and annotation of chromatin-state maps) have been applied in various studies, resulting in multiple unrelated lncRNA data sets. LNCipedia offers 21 488 annotated human lncRNA transcripts obtained from different sources. In addition to basic transcript information and gene structure, several statistics are determined for each entry in the database, such as secondary structure information, protein coding potential and microRNA binding sites. Our analyses suggest that, much like microRNAs, many lncRNAs have a significant secondary structure, in line with their presumed association with proteins or protein complexes. Available literature on specific lncRNAs is linked, and users or authors can submit articles through a web interface. Protein coding potential is assessed by two different prediction algorithms: Coding Potential Calculator and HMMER. In addition, a novel strategy has been integrated for detecting potentially coding lncRNAs by automatically re-analysing the large body of publicly available mass spectrometry data in the PRIDE database. LNCipedia is publicly available and allows users to query and download lncRNA sequences and structures based on different search criteria. The database may serve as a resource to initiate small- and large-scale lncRNA studies. As an example, the LNCipedia content was used to develop a custom microarray for expression profiling of all available lncRNAs.
|
27
|
Malkaram SA, Hassan YI, Zempleni J. Online tools for bioinformatics analyses in nutrition sciences. Adv Nutr 2012; 3:654-65. [PMID: 22983844 PMCID: PMC3648747 DOI: 10.3945/an.112.002477] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Indexed: 01/07/2023] Open
Abstract
Recent advances in "omics" research have resulted in the creation of large datasets that were generated by consortiums and centers, small datasets that were generated by individual investigators, and bioinformatics tools for mining these datasets. It is important for nutrition laboratories to take full advantage of the analysis tools to interrogate datasets for information relevant to genomics, epigenomics, transcriptomics, proteomics, and metabolomics. This review provides guidance regarding bioinformatics resources that are currently available in the public domain, with the intent to provide a starting point for investigators who want to take advantage of the opportunities provided by the bioinformatics field.
Affiliation(s)
- Sridhar A. Malkaram
- Department of Nutrition and Health Sciences, University of Nebraska, Lincoln, Nebraska
| | - Yousef I. Hassan
- Nutrition and Food Science Department, Faculty of Health Sciences, University of Kalamoon, Deirattiah, Syria
| | - Janos Zempleni
- Department of Nutrition and Health Sciences, University of Nebraska, Lincoln, Nebraska (corresponding author)
| |
|
28
|
Hou ZC, Sterner KN, Romero R, Than NG, Gonzalez JM, Weckle A, Xing J, Benirschke K, Goodman M, Wildman DE. Elephant transcriptome provides insights into the evolution of eutherian placentation. Genome Biol Evol 2012; 4:713-25. [PMID: 22546564 PMCID: PMC3381679 DOI: 10.1093/gbe/evs045] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Indexed: 12/18/2022] Open
Abstract
The chorioallantoic placenta connects mother and fetus in eutherian pregnancies. In order to understand the evolution of the placenta and provide further understanding of placenta biology, we sequenced the transcriptome of a term placenta of an African elephant (Loxodonta africana) and compared these data with RNA sequence and microarray data from other eutherian placentas including human, mouse, and cow. We characterized the composition of 55,910 expressed sequence tag (i.e., cDNA) contigs using our custom annotation pipeline. A Markov algorithm was used to cluster orthologs of human, mouse, cow, and elephant placenta transcripts. We found 2,963 genes are commonly expressed in the placentas of these eutherian mammals. Gene ontology categories previously suggested to be important for placenta function (e.g., estrogen receptor signaling pathway, cell motion and migration, and adherens junctions) were significantly enriched in these eutherian placenta–expressed genes. Genes duplicated in different lineages and also specifically expressed in the placenta contribute to the great diversity observed in mammalian placenta anatomy. We identified 1,365 human lineage–specific, 1,235 mouse lineage–specific, 436 cow lineage–specific, and 904 elephant-specific placenta-expressed (PE) genes. The most enriched clusters of human-specific PE genes are signal/glycoprotein and immunoglobulin, and humans possess a deeply invasive human hemochorial placenta that comes into direct contact with maternal immune cells. Inference of phylogenetically conserved and derived transcripts demonstrates the power of comparative transcriptomics to trace placenta evolution and variation across mammals and identified candidate genes that may be important in the normal function of the human placenta, and their dysfunction may be related to human pregnancy complications.
Affiliation(s)
- Zhuo-Cheng Hou
- Perinatology Research Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development/NIH/DHHS, Detroit, Michigan, USA
| | | | | | | | | | | | | | | | | | | |
|
29
|
Lee-Liu D, Almonacid LI, Faunes F, Melo F, Larrain J. Transcriptomics using next generation sequencing technologies. Methods Mol Biol 2012; 917:293-317. [PMID: 22956096 DOI: 10.1007/978-1-61779-992-1_18] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Indexed: 01/17/2023]
Abstract
Next generation sequencing technologies may now be applied to the study of transcriptomics. RNA-Seq or RNA sequencing employs high-throughput sequencing of complementary DNA fragments delivering a transcriptional profile. In this chapter, we aim to provide a starting point for Xenopus researchers planning on starting an RNA-Seq transcriptomics study. We begin by providing a section on template isolation and library preparation. The next section comprises the main bioinformatics procedures that need to be performed for raw data processing, normalization, and differential gene expression. Finally, we have included a section on studying deep sequencing results in Xenopus, which offers general guidance as to what can be done in this model.
Affiliation(s)
- Dasfne Lee-Liu
- Center for Aging and Regeneration and Millennium Nucleus in Regenerative Biology, Pontificia Universidad Catolica de Chile, Santiago, Chile
| | | | | | | | | |
|
30
|
Liu W, Zhao Y, Cui P, Lin Q, Ding F, Xin C, Tan X, Song S, Yu J, Hu S. Thousands of Novel Transcripts Identified in Mouse Cerebrum, Testis, and ES Cells Based on ribo-minus RNA Sequencing. Front Genet 2011; 2:93. [PMID: 22303387 PMCID: PMC3268642 DOI: 10.3389/fgene.2011.00093] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Received: 08/01/2011] [Accepted: 12/07/2011] [Indexed: 11/13/2022] Open
Abstract
High-throughput next-generation sequencing technologies provide an excellent opportunity for the detection of low-abundance transcripts that may not be identifiable by previously available techniques. Here, we report the discovery of thousands of novel transcripts (mostly non-coding RNAs) that are expressed in mouse cerebrum, testis, and embryonic stem (ES) cells, through an in-depth analysis of rmRNA-seq data. These transcripts show significant associations with transcriptional start and elongation signals. Upstream of these transcripts, we observed significant enrichment of histone marks (histone H3 lysine 4 trimethylation, H3K4me3), RNAPII binding sites, and cap analysis of gene expression tags that mark transcriptional start sites. Along the length of these transcripts, we also observed enrichment of histone H3 lysine 36 trimethylation (H3K36me3). Moreover, these transcripts show strong purifying selection in their genomic loci, exonic sequences, and promoter regions, implying functional constraints on the evolution of these transcripts. These results define a collection of novel transcripts in the mouse genome and indicate their potential functions in the mouse tissues and cells.
Affiliation(s)
- Wanfei Liu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences Beijing, China
| | | | | | | | | | | | | | | | | | | |
|
31
|
Badisco L, Ott SR, Rogers SM, Matheson T, Knapen D, Vergauwen L, Verlinden H, Marchal E, Sheehy MRJ, Burrows M, Broeck JV. Microarray-based transcriptomic analysis of differences between long-term gregarious and solitarious desert locusts. PLoS One 2011; 6:e28110. [PMID: 22132225 PMCID: PMC3223224 DOI: 10.1371/journal.pone.0028110] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Received: 09/08/2011] [Accepted: 11/01/2011] [Indexed: 12/02/2022] Open
Abstract
Desert locusts (Schistocerca gregaria) show an extreme form of phenotypic plasticity and can transform between a cryptic solitarious phase and a swarming gregarious phase. The two phases differ extensively in behavior, morphology and physiology but very little is known about the molecular basis of these differences. We used our recently generated Expressed Sequence Tag (EST) database derived from S. gregaria central nervous system (CNS) to design oligonucleotide microarrays and compare the expression of thousands of genes in the CNS of long-term gregarious and solitarious adult desert locusts. This identified 214 differentially expressed genes, of which 40% have been annotated to date. These include genes encoding proteins that are associated with CNS development and modeling, sensory perception, stress response and resistance, and fundamental cellular processes. Our microarray analysis has identified genes whose altered expression may enable locusts of either phase to deal with the different challenges they face. Genes for heat shock proteins and proteins which confer protection from infection were upregulated in gregarious locusts, which may allow them to respond to acute physiological challenges. By contrast the longer-lived solitarious locusts appear to be more strongly protected from the slowly accumulating effects of ageing by an upregulation of genes related to anti-oxidant systems, detoxification and anabolic renewal. Gregarious locusts also had a greater abundance of transcripts for proteins involved in sensory processing and in nervous system development and plasticity. Gregarious locusts live in a more complex sensory environment than solitarious locusts and may require a greater turnover of proteins involved in sensory transduction, and possibly greater neuronal plasticity.
Affiliation(s)
- Liesbeth Badisco
- Department of Animal Physiology and Neurobiology, Katholieke Universiteit Leuven, Leuven, Belgium
- Swidbert R. Ott
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
- Stephen M. Rogers
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
- Thomas Matheson
- Department of Biology, University of Leicester, Leicester, United Kingdom
- Dries Knapen
- Department of Biology, Universiteit Antwerpen, Antwerpen, Belgium
- Lucia Vergauwen
- Department of Biology, Universiteit Antwerpen, Antwerpen, Belgium
- Heleen Verlinden
- Department of Animal Physiology and Neurobiology, Katholieke Universiteit Leuven, Leuven, Belgium
- Elisabeth Marchal
- Department of Animal Physiology and Neurobiology, Katholieke Universiteit Leuven, Leuven, Belgium
- Matt R. J. Sheehy
- Department of Biology, University of Leicester, Leicester, United Kingdom
- Faculty of Medicine and Health Sciences, University of Nottingham, Nottingham, United Kingdom
- Malcolm Burrows
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
- Jozef Vanden Broeck
- Department of Animal Physiology and Neurobiology, Katholieke Universiteit Leuven, Leuven, Belgium
- * E-mail:
|
32
|
Gibb EA, Brown CJ, Lam WL. The functional role of long non-coding RNA in human carcinomas. Mol Cancer 2011; 10:38. [PMID: 21489289 PMCID: PMC3098824 DOI: 10.1186/1476-4598-10-38] [Citation(s) in RCA: 1314] [Impact Index Per Article: 101.1] [Received: 01/13/2011] [Accepted: 04/13/2011] [Indexed: 12/15/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) are emerging as new players in the cancer paradigm, demonstrating potential roles in both oncogenic and tumor suppressive pathways. These novel genes are frequently aberrantly expressed in a variety of human cancers; however, the biological functions of the vast majority remain unknown. Recently, evidence has begun to accumulate describing the molecular mechanisms by which these RNA species function, providing insight into the functional roles they may play in tumorigenesis. In this review, we highlight the emerging functional role of lncRNAs in human cancer.
Affiliation(s)
- Ewan A Gibb
- British Columbia Cancer Agency Research Centre, Vancouver, Canada.
|
33
|
Wang Y, Chen J, Wei G, He H, Zhu X, Xiao T, Yuan J, Dong B, He S, Skogerbø G, Chen R. The Caenorhabditis elegans intermediate-size transcriptome shows high degree of stage-specific expression. Nucleic Acids Res 2011; 39:5203-14. [PMID: 21378118 PMCID: PMC3130273 DOI: 10.1093/nar/gkr102] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Indexed: 01/14/2023] Open
Abstract
Earlier studies have revealed a substantial amount of transcriptional activity occurring outside annotated protein-coding genes of the Caenorhabditis elegans genome. One important fraction of this transcriptional activity relates to intermediate-size (70–500 nt) transcripts (is-ncRNAs) of mostly unknown function. Profiling the expression of this segment of the transcriptome on a tiling array through the C. elegans life cycle identified 5866 hitherto unannotated transcripts. The novel loci were distributed across intronic and intergenic space, with some enrichment toward protein-coding gene termini. The majority of the putative is-ncRNAs showed either stage-specific expression, or distinct developmental variation in their expression levels. More than 200 loci showed male-specific expression, and conserved loci were significantly enriched on the X chromosome, both observations strongly suggesting involvement of is-ncRNAs in sex-specific functions. Half of the novel loci were conserved in other nematodes, and numerous loci showed significant conservational correlations to nearby coding genes. Assuming functional roles for most of the novel loci, the data imply a nematode is-ncRNA tool kit of considerable size and variety.
Affiliation(s)
- Yunfei Wang
- Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
|
34
|
Lai CE, Tsai MY, Liu YC, Wang CW, Chen KT, Lu CL. FASTR3D: a fast and accurate search tool for similar RNA 3D structures. Nucleic Acids Res 2009; 37:W287-95. [PMID: 19435878 PMCID: PMC2703968 DOI: 10.1093/nar/gkp330] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Indexed: 12/15/2022] Open
Abstract
FASTR3D is a web-based search tool that allows the user to quickly and accurately search the PDB database for structurally similar RNAs. Currently, it accepts three types of queries: (i) a PDB code of an RNA tertiary structure (default), optionally with a specified residue range, (ii) an RNA secondary structure in dot-bracket notation, optionally with its primary sequence, and (iii) an RNA primary sequence in FASTA format. In addition, the user can run FASTR3D with additional filtering options: (i) the release date of RNA structures in the PDB database, and (ii) the experimental methods used to determine RNA structures and their least resolutions. On the output page, FASTR3D shows the user-queried RNA molecule and the user-specified options, followed by a detailed list of the structurally similar RNAs it identified. In particular, when queried with an RNA tertiary structure, FASTR3D provides a graphical display of the structural superposition of the query structure and each of the identified structures. FASTR3D is now available online at http://bioalgorithm.life.nctu.edu.tw/FASTR3D/.
Affiliation(s)
- Chin-En Lai
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu 300, Taiwan
|
35
|
Rose D, Jöris J, Hackermüller J, Reiche K, Li Q, Stadler PF. Duplicated RNA genes in teleost fish genomes. J Bioinform Comput Biol 2009; 6:1157-75. [PMID: 19090022 DOI: 10.1142/s0219720008003886] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Received: 10/01/2007] [Revised: 06/17/2008] [Accepted: 06/18/2008] [Indexed: 12/29/2022]
Abstract
Teleost fishes share a duplication of their entire genomes. We report here on a computational survey of structured non-coding RNAs (ncRNAs) in teleost genomes, focusing on the fate of fish-specific duplicates. As in other metazoan groups, we find evidence of a large number (11,543) of structured RNAs, most of which (~86%) are clade-specific or evolve so fast that their tetrapod homologs cannot be detected. In surprising contrast to protein-coding genes, the fish-specific genome duplication did not lead to a large number of paralogous ncRNAs: only 188 candidates, mostly microRNAs, appear in a larger copy number in teleosts than in tetrapods, suggesting that large-scale gene duplications do not play a major role in the expansion of the vertebrate ncRNA inventory.
Affiliation(s)
- Dominic Rose
- Bioinformatics Group, Department of Computer Science, Interdisciplinary Center for Bioinformatics, University of Leipzig, Leipzig, Germany.
|
36
|
Abstract
In recent years, sRNAs (small non-coding RNAs) have been found to be abundant in eukaryotes and bacteria and have been recognized as a novel class of gene expression regulators. In contrast, much less is known about sRNAs in archaea, except for snoRNAs (small nucleolar RNAs) that are involved in the modification of bases in stable RNAs. Therefore bioinformatic and experimental RNomics approaches were undertaken to search for the presence of sRNAs in the model archaeon Haloferax volcanii, resulting in more than 150 putative sRNA genes being identified. Northern blot analyses were used to study (differential) expression of sRNA genes. Several chromosomal deletion mutants of sRNA genes were generated and compared with the wild-type. It turned out that two sRNAs are essential for growth at low salt concentrations and high temperatures respectively, and one is involved in the regulation of carbon metabolism. Taken together, it could be shown that sRNAs are as abundant in H. volcanii as they are in well-studied bacterial species and that they fulfil important biological roles under specific conditions.
|
37
|
Espagne E, Lespinet O, Malagnac F, Da Silva C, Jaillon O, Porcel BM, Couloux A, Aury JM, Ségurens B, Poulain J, Anthouard V, Grossetete S, Khalili H, Coppin E, Déquard-Chablat M, Picard M, Contamine V, Arnaise S, Bourdais A, Berteaux-Lecellier V, Gautheret D, de Vries RP, Battaglia E, Coutinho PM, Danchin EG, Henrissat B, Khoury RE, Sainsard-Chanet A, Boivin A, Pinan-Lucarré B, Sellem CH, Debuchy R, Wincker P, Weissenbach J, Silar P. The genome sequence of the model ascomycete fungus Podospora anserina. Genome Biol 2008; 9:R77. [PMID: 18460219 PMCID: PMC2441463 DOI: 10.1186/gb-2008-9-5-r77] [Citation(s) in RCA: 233] [Impact Index Per Article: 14.6] [Received: 11/26/2007] [Revised: 02/12/2008] [Accepted: 05/06/2008] [Indexed: 12/13/2022] Open
Abstract
A 10X draft sequence of Podospora anserina genome shows highly dynamic evolution since its divergence from Neurospora crassa.
Background: The dung-inhabiting ascomycete fungus Podospora anserina is a model used to study various aspects of eukaryotic and fungal biology, such as ageing, prions and sexual development.
Results: We present a 10X draft sequence of P. anserina genome, linked to the sequences of a large expressed sequence tag collection. Similar to higher eukaryotes, the P. anserina transcription/splicing machinery generates numerous non-conventional transcripts. Comparison of the P. anserina genome and orthologous gene set with the one of its close relatives, Neurospora crassa, shows that synteny is poorly conserved, the main result of evolution being gene shuffling in the same chromosome. The P. anserina genome contains fewer repeated sequences and has evolved new genes by duplication since its separation from N. crassa, despite the presence of the repeat induced point mutation mechanism that mutates duplicated sequences. We also provide evidence that frequent gene loss took place in the lineages leading to P. anserina and N. crassa. P. anserina contains a large and highly specialized set of genes involved in utilization of natural carbon sources commonly found in its natural biotope. It includes genes potentially involved in lignin degradation and efficient cellulose breakdown.
Conclusion: The features of the P. anserina genome indicate a highly dynamic evolution since the divergence of P. anserina and N. crassa, leading to the ability of the former to use specific complex carbon sources that match its needs in its natural biotope.
Affiliation(s)
- Eric Espagne
- Univ Paris-Sud, Institut de Génétique et Microbiologie, UMR8621, 91405 Orsay cedex, France
|