1
|
Arlt C, Wachtmeister T, Köhrer K, Stich B. Affordable, accurate and unbiased RNA sequencing by manual library miniaturization: A case study in barley. PLANT BIOTECHNOLOGY JOURNAL 2023; 21:2241-2253. [PMID: 37593840 PMCID: PMC10579711 DOI: 10.1111/pbi.14126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 05/12/2023] [Accepted: 07/01/2023] [Indexed: 08/19/2023]
Abstract
We present an easy-to-reproduce manual miniaturized full-length RNA sequencing (RNAseq) library preparation workflow that does not require the upfront investment in expensive lab equipment or long setup times. With minimal adjustments to an established commercial protocol, we were able to manually miniaturize the RNAseq library preparation by a factor of up to 1:8. This led to cost savings for miniaturized library preparation of up to 86.1% compared to the gold standard. The resulting data were the basis of a rigorous quality control analysis that inspected: sequencing quality metrics, gene body coverage, raw read duplications, alignment statistics, read pair duplications, detected transcripts and sequence variants. We also included a deep dive data analysis identifying rRNA contamination and suggested ways to circumvent these. In the end, we could not find any indication of biases or inaccuracies caused by the RNAseq library miniaturization. The variance in detected transcripts was minimal and not influenced by the miniaturization level. Our results suggest that the workflow is highly reproducible and the sequence data suitable for downstream analyses such as differential gene expression analysis or variant calling.
Collapse
Affiliation(s)
- Christopher Arlt
- Institute of Quantitative Genetics and Genomics of PlantsHeinrich Heine University DuesseldorfDuesseldorfGermany
| | - Thorsten Wachtmeister
- Genomics & Transcriptomics Laboratory, Biological and Medical Research Centre (BMFZ)Heinrich Heine University DuesseldorfDuesseldorfGermany
| | - Karl Köhrer
- Genomics & Transcriptomics Laboratory, Biological and Medical Research Centre (BMFZ)Heinrich Heine University DuesseldorfDuesseldorfGermany
| | - Benjamin Stich
- Institute of Quantitative Genetics and Genomics of PlantsHeinrich Heine University DuesseldorfDuesseldorfGermany
- Cluster of Excellence on Plant Sciences (CEPLAS)DuesseldorfGermany
- Max Planck Institute for Plant Breeding ResearchCologneGermany
- Present address:
Institute for Breeding Research on Agricultural CropsJulius Kühn Institute (JKI) ‐ Federal Research Centre for Cultivated PlantsSanitzGermany
| |
Collapse
|
2
|
Kokubo R, Hirano M, Tajima Y, Yunaiyama D, Saito K. Effects of Β‒Blocker Administration on Cardiac Function: A Coronary Computed Tomography Angiography Study. Curr Med Imaging 2022; 18:1517-1525. [PMID: 35593335 PMCID: PMC9903291 DOI: 10.2174/1573405618666220518104929] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 03/30/2022] [Accepted: 04/07/2022] [Indexed: 01/25/2023]
Abstract
BACKGROUND β-blockers are widely used for lowering heart rate (HR) during coronary computed tomography angiography (CCTA); however, they should be used with caution for patients with heart failure as they may have a negative inotropic effect. OBJECTIVE To clarify the effects of β-blockers (oral and intravenous injection) on cardiac function using CCTA. METHODS A total of 244 patients (men: women = 166: 78; mean age, 64.4 years old) suspected of having ischemic cardiac disease and had undergone echocardiography within 3 months before and after CCTA were included in the study. Systematic errors in ejection fraction (EF) were corrected by calculating ΔEF from the EF difference between echocardiography and CCTA in patients not using β- blockers. Univariate and multivariate analyses were performed for factors affecting ΔEF. In addition, HR between, before, and during CCTA were compared by Wilcoxon's test. RESULTS Temporary oral or intravenous administration of β-blockers at the CCTA had no significant effects on EF (p = 0.70), whereas HR was significantly decreased (p < 0.001). However, regular administration of β-blockers increases the EF on CCTA. CONCLUSION The administration of β-blockers immediately before CCTA affects HR but not EF. Premedication with β-blockers can be safely used for patients who undergo CCTA, and CCTA is useful for EF evaluation, independent of the use of β-blockers.
Collapse
Affiliation(s)
- Reiji Kokubo
- Department of Radiology, Tokyo Medical University, Tokyo, Japan; ,Address correspondence to this author at the Department of Radiology, Tokyo Medical University, 6-7-1 Nishishinjuku, Shinjuku-ku, Tokyo 160-0023, Japan; Tel: +81-3-3342-6111; Fax: +81-3-3348-6314; E-mail:
| | - Masaharu Hirano
- Department of Cardiology, Tokyo Medical University, Tokyo, Japan
| | - Yu Tajima
- Department of Radiology, Tokyo Medical University, Tokyo, Japan;
| | | | - Kazuhiro Saito
- Department of Radiology, Tokyo Medical University, Tokyo, Japan;
| |
Collapse
|
3
|
Tuschl K, White RJ, Trivedi C, Valdivia LE, Niklaus S, Bianco IH, Dadswell C, González-Méndez R, Sealy IM, Neuhauss SCF, Houart C, Rihel J, Wilson SW, Busch-Nentwich EM. Loss of slc39a14 causes simultaneous manganese hypersensitivity and deficiency in zebrafish. Dis Model Mech 2022; 15:dmm044594. [PMID: 35514229 PMCID: PMC9227717 DOI: 10.1242/dmm.044594] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2020] [Accepted: 04/25/2022] [Indexed: 12/15/2022] Open
Abstract
Manganese neurotoxicity is a hallmark of hypermanganesemia with dystonia 2, an inherited manganese transporter defect caused by mutations in SLC39A14. To identify novel potential targets of manganese neurotoxicity, we performed transcriptome analysis of slc39a14-/- mutant zebrafish that were exposed to MnCl2. Differentially expressed genes mapped to the central nervous system and eye, and pathway analysis suggested that Ca2+ dyshomeostasis and activation of the unfolded protein response are key features of manganese neurotoxicity. Consistent with this interpretation, MnCl2 exposure led to decreased whole-animal Ca2+ levels, locomotor defects and changes in neuronal activity within the telencephalon and optic tectum. In accordance with reduced tectal activity, slc39a14-/- zebrafish showed changes in visual phototransduction gene expression, absence of visual background adaptation and a diminished optokinetic reflex. Finally, numerous differentially expressed genes in mutant larvae normalised upon MnCl2 treatment indicating that, in addition to neurotoxicity, manganese deficiency is present either subcellularly or in specific cells or tissues. Overall, we assembled a comprehensive set of genes that mediate manganese-systemic responses and found a highly correlated and modulated network associated with Ca2+ dyshomeostasis and cellular stress. This article has an associated First Person interview with the first author of the paper.
Collapse
Affiliation(s)
- Karin Tuschl
- UCL GOS Institute of Child Health, University College London, 30 Guilford Street, London WC1N 1EH, UK
- Department of Cell and Developmental Biology, University College London, Gower Street, London WC1E 6BT, UK
- Department of Developmental Neurobiology and MRC Centre for Neurodevelopmental Disorders, IoPPN, Kings College London, New Hunt's House, Guy's Campus, London SE1 1UL, UK
| | - Richard J. White
- School of Biological and Behavioural Sciences, Faculty of Science and Engineering, Queen Mary University of London, London E1 4NS, UK
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), Jeffrey Cheah Biomedical Centre, University of Cambridge, Puddicombe Way, Cambridge CB2 0AW, UK
| | - Chintan Trivedi
- Department of Cell and Developmental Biology, University College London, Gower Street, London WC1E 6BT, UK
| | - Leonardo E. Valdivia
- Department of Cell and Developmental Biology, University College London, Gower Street, London WC1E 6BT, UK
- Center for Integrative Biology, Facultad de Ciencias, Universidad Mayor, Camino La Pirámide 5750, Huechuraba 8580745, Chile
- Escuela de Biotecnología, Facultad de Ciencias, Universidad Mayor, Camino La Pirámide 5750, Huechuraba 8580745, Chile
| | - Stephanie Niklaus
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, 8057, Zurich, Switzerland
| | - Isaac H. Bianco
- Department of Neuroscience, Physiology & Pharmacology, University College London, Gower Street, London WC1E 6BT, UK
| | - Chris Dadswell
- School of Life Sciences, University of Sussex, Brighton BN1 9QJ, UK
| | | | - Ian M. Sealy
- School of Biological and Behavioural Sciences, Faculty of Science and Engineering, Queen Mary University of London, London E1 4NS, UK
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), Jeffrey Cheah Biomedical Centre, University of Cambridge, Puddicombe Way, Cambridge CB2 0AW, UK
| | - Stephan C. F. Neuhauss
- Department of Molecular Life Sciences, University of Zurich, Winterthurerstrasse 190, 8057, Zurich, Switzerland
| | - Corinne Houart
- Department of Developmental Neurobiology and MRC Centre for Neurodevelopmental Disorders, IoPPN, Kings College London, New Hunt's House, Guy's Campus, London SE1 1UL, UK
| | - Jason Rihel
- Department of Cell and Developmental Biology, University College London, Gower Street, London WC1E 6BT, UK
| | - Stephen W. Wilson
- Department of Cell and Developmental Biology, University College London, Gower Street, London WC1E 6BT, UK
| | - Elisabeth M. Busch-Nentwich
- School of Biological and Behavioural Sciences, Faculty of Science and Engineering, Queen Mary University of London, London E1 4NS, UK
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), Jeffrey Cheah Biomedical Centre, University of Cambridge, Puddicombe Way, Cambridge CB2 0AW, UK
| |
Collapse
|
4
|
White RJ, Mackay E, Wilson SW, Busch-Nentwich EM. Allele-specific gene expression can underlie altered transcript abundance in zebrafish mutants. eLife 2022; 11:72825. [PMID: 35175196 PMCID: PMC8884726 DOI: 10.7554/elife.72825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Accepted: 02/16/2022] [Indexed: 11/13/2022] Open
Abstract
In model organisms, RNA-sequencing (RNA-seq) is frequently used to assess the effect of genetic mutations on cellular and developmental processes. Typically, animals heterozygous for a mutation are crossed to produce offspring with different genotypes. Resultant embryos are grouped by genotype to compare homozygous mutant embryos to heterozygous and wild-type siblings. Genes that are differentially expressed between the groups are assumed to reveal insights into the pathways affected by the mutation. Here we show that in zebrafish, differentially expressed genes are often over-represented on the same chromosome as the mutation due to different levels of expression of alleles from different genetic backgrounds. Using an incross of haplotype-resolved wild-type fish, we found evidence of widespread allele-specific expression, which appears as differential expression when comparing embryos homozygous for a region of the genome to their siblings. When analysing mutant transcriptomes, this means that the differential expression of genes on the same chromosome as a mutation of interest may not be caused by that mutation. Typically, the genomic location of a differentially expressed gene is not considered when interpreting its importance with respect to the phenotype. This could lead to pathways being erroneously implicated or overlooked due to the noise of spurious differentially expressed genes on the same chromosome as the mutation. These observations have implications for the interpretation of RNA-seq experiments involving outbred animals and non-inbred model organisms.
Collapse
Affiliation(s)
- Richard J White
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), Department of Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Eirinn Mackay
- Department of Cell and Developmental Biology, University College London, London, United Kingdom
| | - Stephen W Wilson
- Department of Cell and Developmental Biology, University College London, London, United Kingdom
| | - Elisabeth M Busch-Nentwich
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), Department of Medicine, University of Cambridge, Cambridge, United Kingdom.,School of Biological and Behavioural Sciences, Faculty of Science and Engineering, Queen Mary University of London, London, United Kingdom
| |
Collapse
|
5
|
Li CY, Zhang WW, Xiang JL, Wang XH, Wang JL, Li J. Integrated analysis highlights multiple long non‑coding RNAs and their potential roles in the progression of human esophageal squamous cell carcinoma. Oncol Rep 2019; 42:2583-2599. [PMID: 31638253 PMCID: PMC6859451 DOI: 10.3892/or.2019.7377] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2019] [Accepted: 09/20/2019] [Indexed: 12/24/2022] Open
Abstract
Esophageal squamous cell carcinoma (ESCC) is a prevalent aggressive malignant tumor with poor prognosis. Investigations into the molecular changes that occur as a result of the disease, as well as identification of novel biomarkers for its diagnosis and prognosis, are urgently required. Long non‑coding RNAs (lncRNAs) have been reported to play a critical role in tumor progression. The present study performed data mining analyses for ESCC via an integrated study of accumulated datasets and identification of the differentially expressed lncRNAs from the Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) databases. The identified intersection of differentially expressed genes (lncRNAs, miRNAs and mRNAs) in ESCC tissues between the GEO and TCGA datasets was investigated. Based on these intersected lncRNAs, the present study constructed a competitive endogenous RNA (ceRNA) network of lncRNAs in ESCC. A total of 81 intersection lncRNAs were identified; 67 of these were included in the ceRNA network. Functional analyses revealed that these 67 key lncRNAs primarily dominated cellular biological processes. The present study then analyzed the associations between the expression levels of these 67 key lncRNAs and the clinicopathological characteristics of the ESCC patients, as well as their survival time using TCGA. The results revealed that 31 of these lncRNAs were associated with tumor grade, tumor‑node‑metastasis (TNM) stage and lymphatic metastasis status (P<0.05). In addition, 15 key lncRNAs were demonstrated to be associated with survival time (P<0.05). Finally, 5 key lncRNAs were selected for validation of their expression levels in 30 patients newly diagnosed with ESCC via reverse transcription‑quantitative PCR (RT‑qPCR). The results suggested that the fold changes in the trends of up‑ and downregulation between GEO, TCGA and RT‑qPCR were consistent. In addition, it was also demonstrated that a select few of these 5 key lncRNAs were significantly associated with TNM stage and lymph node metastasis (P<0.05). The results of the clinically relevant analysis and the aforementioned bioinformatics were similar, hence proving that the bioinformatics analysis used in the present study is credible. Overall, the results from the present study may provide further insight into the functional characteristics of lncRNAs in ESCC through bioinformatics integrative analysis of the GEO and TCGA datasets, and reveal potential diagnostic and prognostic biomarkers for ESCC.
Collapse
Affiliation(s)
- Cheng-Yun Li
- Department of Toxicology, School of Public Health, Lanzhou University, Lanzhou, Gansu 730000, P.R. China
| | - Wen-Wen Zhang
- Department of Toxicology, School of Public Health, Lanzhou University, Lanzhou, Gansu 730000, P.R. China
| | - Ji-Lian Xiang
- Department of Gastroenterology, Third People's Hospital of Gansu Province, Lanzhou, Gansu 730000, P.R. China
| | - Xing-Hua Wang
- Department of Gastrointestinal Surgery, Gansu Wuwei Tumor Hospital, Wuwei, Gansu 733000, P.R. China
| | - Jun-Ling Wang
- Department of Toxicology, School of Public Health, Lanzhou University, Lanzhou, Gansu 730000, P.R. China
| | - Jin Li
- Department of Toxicology, School of Public Health, Lanzhou University, Lanzhou, Gansu 730000, P.R. China
| |
Collapse
|
6
|
Dooley CM, Wali N, Sealy IM, White RJ, Stemple DL, Collins JE, Busch-Nentwich EM. The gene regulatory basis of genetic compensation during neural crest induction. PLoS Genet 2019; 15:e1008213. [PMID: 31199790 PMCID: PMC6594659 DOI: 10.1371/journal.pgen.1008213] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2019] [Revised: 06/26/2019] [Accepted: 05/27/2019] [Indexed: 12/15/2022] Open
Abstract
The neural crest (NC) is a vertebrate-specific cell type that contributes to a wide range of different tissues across all three germ layers. The gene regulatory network (GRN) responsible for the formation of neural crest is conserved across vertebrates. Central to the induction of the NC GRN are AP-2 and SoxE transcription factors. NC induction robustness is ensured through the ability of some of these transcription factors to compensate loss of function of gene family members. However the gene regulatory events underlying compensation are poorly understood. We have used gene knockout and RNA sequencing strategies to dissect NC induction and compensation in zebrafish. We genetically ablate the NC using double mutants of tfap2a;tfap2c or remove specific subsets of the NC with sox10 and mitfa knockouts and characterise genome-wide gene expression levels across multiple time points. We find that compensation through a single wild-type allele of tfap2c is capable of maintaining early NC induction and differentiation in the absence of tfap2a function, but many target genes have abnormal expression levels and therefore show sensitivity to the reduced tfap2 dosage. This separation of morphological and molecular phenotypes identifies a core set of genes required for early NC development. We also identify the 15 somites stage as the peak of the molecular phenotype which strongly diminishes at 24 hpf even as the morphological phenotype becomes more apparent. Using gene knockouts, we associate previously uncharacterised genes with pigment cell development and establish a role for maternal Hippo signalling in melanocyte differentiation. This work extends and refines the NC GRN while also uncovering the transcriptional basis of genetic compensation via paralogues. Embryonic development is an intricate process that requires genes to be active at the right time and place. Organisms have evolved mechanisms that ensure faithful execution of developmental programmes even if genes fail to function. For example, in a process called genetic compensation, one or more genes become activated in response to loss of function of another. In this work we use the zebrafish model to investigate how two related genes, tfap2a and tfap2c, interact to ensure establishment of the neural crest, a vertebrate-specific cell type that contributes to many different tissues. Losing tfap2a activity causes mild morphological defects and losing tfap2c has no visible effect. Yet when both are inactive, embryos are severely abnormal due to lack of neural crest-derived tissues. Here we show that loss of tfap2a triggers upregulation of tfap2c which prevents the loss of neural crest tissue. However, the genes normally regulated by tfap2a respond differently to tfap2c allowing us to identify the first tier of the Ap2 network and new players in neural crest biology. Our work demonstrates that the expression signature of partial, but morphologically sufficient, genetic compensation provides an opportunity to dissect gene regulatory networks.
Collapse
Affiliation(s)
| | - Neha Wali
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Ian M. Sealy
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Richard J. White
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Derek L. Stemple
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - John E. Collins
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Elisabeth M. Busch-Nentwich
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom
- * E-mail:
| |
Collapse
|
7
|
Fuentes R, Letelier J, Tajer B, Valdivia LE, Mullins MC. Fishing forward and reverse: Advances in zebrafish phenomics. Mech Dev 2018; 154:296-308. [PMID: 30130581 PMCID: PMC6289646 DOI: 10.1016/j.mod.2018.08.007] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Revised: 08/06/2018] [Accepted: 08/17/2018] [Indexed: 12/15/2022]
Abstract
Understanding how the genome instructs the phenotypic characteristics of an organism is one of the major scientific endeavors of our time. Advances in genetics have progressively deciphered the inheritance, identity and biological relevance of genetically encoded information, contributing to the rise of several, complementary omic disciplines. One of them is phenomics, an emergent area of biology dedicated to the systematic multi-scale analysis of phenotypic traits. This discipline provides valuable gene function information to the rapidly evolving field of genetics. Current molecular tools enable genome-wide analyses that link gene sequence to function in multi-cellular organisms, illuminating the genome-phenome relationship. Among vertebrates, zebrafish has emerged as an outstanding model organism for high-throughput phenotyping and modeling of human disorders. Advances in both systematic mutagenesis and phenotypic analyses of embryonic and post-embryonic stages in zebrafish have revealed the function of a valuable collection of genes and the general structure of several complex traits. In this review, we summarize multiple large-scale genetic efforts addressing parental, embryonic, and adult phenotyping in the zebrafish. The genetic and quantitative tools available in the zebrafish model, coupled with the broad spectrum of phenotypes that can be assayed, make it a powerful model for phenomics, well suited for the dissection of genotype-phenotype associations in development, physiology, health and disease.
Collapse
Affiliation(s)
- Ricardo Fuentes
- Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Joaquín Letelier
- Centro Andaluz de Biología del Desarrollo (CSIC/UPO/JA), Seville, Spain; Center for Integrative Biology, Facultad de Ciencias, Universidad Mayor, Santiago, Chile
| | - Benjamin Tajer
- Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Leonardo E Valdivia
- Center for Integrative Biology, Facultad de Ciencias, Universidad Mayor, Santiago, Chile.
| | - Mary C Mullins
- Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
8
|
Fu Y, Wu PH, Beane T, Zamore PD, Weng Z. Elimination of PCR duplicates in RNA-seq and small RNA-seq using unique molecular identifiers. BMC Genomics 2018; 19:531. [PMID: 30001700 PMCID: PMC6044086 DOI: 10.1186/s12864-018-4933-1] [Citation(s) in RCA: 92] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2018] [Accepted: 07/08/2018] [Indexed: 12/12/2022] Open
Abstract
Background RNA-seq and small RNA-seq are powerful, quantitative tools to study gene regulation and function. Common high-throughput sequencing methods rely on polymerase chain reaction (PCR) to expand the starting material, but not every molecule amplifies equally, causing some to be overrepresented. Unique molecular identifiers (UMIs) can be used to distinguish undesirable PCR duplicates derived from a single molecule and identical but biologically meaningful reads from different molecules. Results We have incorporated UMIs into RNA-seq and small RNA-seq protocols and developed tools to analyze the resulting data. Our UMIs contain stretches of random nucleotides whose lengths sufficiently capture diverse molecule species in both RNA-seq and small RNA-seq libraries generated from mouse testis. Our approach yields high-quality data while allowing unique tagging of all molecules in high-depth libraries. Conclusions Using simulated and real datasets, we demonstrate that our methods increase the reproducibility of RNA-seq and small RNA-seq data. Notably, we find that the amount of starting material and sequencing depth, but not the number of PCR cycles, determine PCR duplicate frequency. Finally, we show that computational removal of PCR duplicates based only on their mapping coordinates introduces substantial bias into data analysis. Electronic supplementary material The online version of this article (10.1186/s12864-018-4933-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yu Fu
- Bioinformatics Program, Boston University, 44 Cummington Mall, Boston, MA, 02215, USA.,Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, 368 Plantation Street, Worcester, MA, 01605, USA
| | - Pei-Hsuan Wu
- RNA Therapeutics Institute and Howard Hughes Medical Institute, University of Massachusetts Medical School, 368 Plantation Street, Worcester, MA, 01605, USA
| | - Timothy Beane
- RNA Therapeutics Institute and Howard Hughes Medical Institute, University of Massachusetts Medical School, 368 Plantation Street, Worcester, MA, 01605, USA
| | - Phillip D Zamore
- RNA Therapeutics Institute and Howard Hughes Medical Institute, University of Massachusetts Medical School, 368 Plantation Street, Worcester, MA, 01605, USA.
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, 368 Plantation Street, Worcester, MA, 01605, USA. .,Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, 368 Plantation Street, Worcester, MA, 01605, USA.
| |
Collapse
|
9
|
White RJ, Collins JE, Sealy IM, Wali N, Dooley CM, Digby Z, Stemple DL, Murphy DN, Billis K, Hourlier T, Füllgrabe A, Davis MP, Enright AJ, Busch-Nentwich EM. A high-resolution mRNA expression time course of embryonic development in zebrafish. eLife 2017; 6. [PMID: 29144233 PMCID: PMC5690287 DOI: 10.7554/elife.30860] [Citation(s) in RCA: 199] [Impact Index Per Article: 28.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2017] [Accepted: 11/04/2017] [Indexed: 12/18/2022] Open
Abstract
We have produced an mRNA expression time course of zebrafish development across 18 time points from 1 cell to 5 days post-fertilisation sampling individual and pools of embryos. Using poly(A) pulldown stranded RNA-seq and a 3′ end transcript counting method we characterise temporal expression profiles of 23,642 genes. We identify temporal and functional transcript co-variance that associates 5024 unnamed genes with distinct developmental time points. Specifically, a class of over 100 previously uncharacterised zinc finger domain containing genes, located on the long arm of chromosome 4, is expressed in a sharp peak during zygotic genome activation. In addition, the data reveal new genes and transcripts, differential use of exons and previously unidentified 3′ ends across development, new primary microRNAs and temporal divergence of gene paralogues generated in the teleost genome duplication. To make this dataset a useful baseline reference, the data can be browsed and downloaded at Expression Atlas and Ensembl.
Collapse
Affiliation(s)
| | - John E Collins
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | - Ian M Sealy
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | - Neha Wali
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | | | - Zsofia Digby
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | | | - Daniel N Murphy
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Konstantinos Billis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Thibaut Hourlier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Anja Füllgrabe
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Matthew P Davis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Anton J Enright
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Elisabeth M Busch-Nentwich
- Wellcome Trust Sanger Institute, Hinxton, United Kingdom.,Department of Medicine, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
10
|
Smith T, Heger A, Sudbery I. UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy. Genome Res 2017; 27:491-499. [PMID: 28100584 PMCID: PMC5340976 DOI: 10.1101/gr.209601.116] [Citation(s) in RCA: 1002] [Impact Index Per Article: 143.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2016] [Accepted: 01/17/2017] [Indexed: 01/06/2023]
Abstract
Unique Molecular Identifiers (UMIs) are random oligonucleotide barcodes that are increasingly used in high-throughput sequencing experiments. Through a UMI, identical copies arising from distinct molecules can be distinguished from those arising through PCR amplification of the same molecule. However, bioinformatic methods to leverage the information from UMIs have yet to be formalized. In particular, sequencing errors in the UMI sequence are often ignored or else resolved in an ad hoc manner. We show that errors in the UMI sequence are common and introduce network-based methods to account for these errors when identifying PCR duplicates. Using these methods, we demonstrate improved quantification accuracy both under simulated conditions and real iCLIP and single-cell RNA-seq data sets. Reproducibility between iCLIP replicates and single-cell RNA-seq clustering are both improved using our proposed network-based method, demonstrating the value of properly accounting for errors in UMIs. These methods are implemented in the open source UMI-tools software package.
Collapse
Affiliation(s)
- Tom Smith
- Computational Genomics Analysis and Training Programme, MRC WIMM Centre for Computational Biology, University of Oxford, Oxford OX3 9DS, United Kingdom
| | - Andreas Heger
- Computational Genomics Analysis and Training Programme, MRC WIMM Centre for Computational Biology, University of Oxford, Oxford OX3 9DS, United Kingdom
| | - Ian Sudbery
- Department of Molecular Biology and Biotechnology, University of Sheffield, Sheffield S10 2TN, United Kingdom
| |
Collapse
|
11
|
Haug K, Salek RM, Steinbeck C. Global open data management in metabolomics. Curr Opin Chem Biol 2017; 36:58-63. [PMID: 28092796 PMCID: PMC5344029 DOI: 10.1016/j.cbpa.2016.12.024] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2016] [Revised: 12/19/2016] [Accepted: 12/19/2016] [Indexed: 11/18/2022]
Abstract
The metabolome allows accessing the external influences under which an organism exists and develops in a dynamic way. Recent years have seen the establishment of a global network for metabolomics data exchange. Global metabolomics data exchange is leading to an exponential growth of publically available metabolomics data for re-analysis.
Chemical Biology employs chemical synthesis, analytical chemistry and other tools to study biological systems. Recent advances in both molecular biology such as next generation sequencing (NGS) have led to unprecedented insights towards the evolution of organisms’ biochemical repertoires. Because of the specific data sharing culture in Genomics, genomes from all kingdoms of life become readily available for further analysis by other researchers. While the genome expresses the potential of an organism to adapt to external influences, the Metabolome presents a molecular phenotype that allows us to asses the external influences under which an organism exists and develops in a dynamic way. Steady advancements in instrumentation towards high-throughput and highresolution methods have led to a revival of analytical chemistry methods for the measurement and analysis of the metabolome of organisms. This steady growth of metabolomics as a field is leading to a similar accumulation of big data across laboratories worldwide as can be observed in all of the other omics areas. This calls for the development of methods and technologies for handling and dealing with such large datasets, for efficiently distributing them and for enabling re-analysis. Here we describe the recently emerging ecosystem of global open-access databases and data exchange efforts between them, as well as the foundations and obstacles that enable or prevent the data sharing and reanalysis of this data.
Collapse
Affiliation(s)
- Kenneth Haug
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Reza M Salek
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Christoph Steinbeck
- European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
| |
Collapse
|
12
|
Tan H, Onichtchouk D, Winata C. DANIO-CODE: Toward an Encyclopedia of DNA Elements in Zebrafish. Zebrafish 2015; 13:54-60. [PMID: 26671609 PMCID: PMC4742988 DOI: 10.1089/zeb.2015.1179] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
The zebrafish has emerged as a model organism for genomics studies. The symposium “Toward an encyclopedia of DNA elements in zebrafish” held in London in December 2014, was coorganized by Ferenc Müller and Fiona Wardle. This meeting is a follow-up of a similar previous workshop held 2 years earlier and represents a push toward the formalization of a community effort to annotate functional elements in the zebrafish genome. The meeting brought together zebrafish researchers, bioinformaticians, as well as members of established consortia, to exchange scientific findings and experience, as well as to discuss the initial steps toward the formation of a DANIO-CODE consortium. In this study, we provide the latest updates on the current progress of the consortium's efforts, opening up a broad invitation to researchers to join in and contribute to DANIO-CODE.
Collapse
Affiliation(s)
- Haihan Tan
- 1 Randall Division of Cell and Molecular Biophysics, King's College London , London, United Kingdom
| | - Daria Onichtchouk
- 2 Developmental Biology, Institute Biology I, Faculty of Biology, Albert-Ludwigs-University Freiburg , Freiburg, Germany
| | - Cecilia Winata
- 3 International Institute of Molecular and Cell Biology , Warsaw, Poland .,4 Max Planck Institute for Heart and Lung Research , Bad Nauheim, Germany
| |
Collapse
|
13
|
Bielczyk-Maczyńska E, Lam Hung L, Ferreira L, Fleischmann T, Weis F, Fernández-Pevida A, Harvey SA, Wali N, Warren AJ, Barroso I, Stemple DL, Cvejic A. The Ribosome Biogenesis Protein Nol9 Is Essential for Definitive Hematopoiesis and Pancreas Morphogenesis in Zebrafish. PLoS Genet 2015; 11:e1005677. [PMID: 26624285 PMCID: PMC4666468 DOI: 10.1371/journal.pgen.1005677] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Accepted: 10/26/2015] [Indexed: 12/27/2022] Open
Abstract
Ribosome biogenesis is a ubiquitous and essential process in cells. Defects in ribosome biogenesis and function result in a group of human disorders, collectively known as ribosomopathies. In this study, we describe a zebrafish mutant with a loss-of-function mutation in nol9, a gene that encodes a non-ribosomal protein involved in rRNA processing. nol9sa1022/sa1022 mutants have a defect in 28S rRNA processing. The nol9sa1022/sa1022 larvae display hypoplastic pancreas, liver and intestine and have decreased numbers of hematopoietic stem and progenitor cells (HSPCs), as well as definitive erythrocytes and lymphocytes. In addition, ultrastructural analysis revealed signs of pathological processes occurring in endothelial cells of the caudal vein, emphasizing the complexity of the phenotype observed in nol9sa1022/sa1022 larvae. We further show that both the pancreatic and hematopoietic deficiencies in nol9sa1022/sa1022 embryos were due to impaired cell proliferation of respective progenitor cells. Interestingly, genetic loss of Tp53 rescued the HSPCs but not the pancreatic defects. In contrast, activation of mRNA translation via the mTOR pathway by L-Leucine treatment did not revert the erythroid or pancreatic defects. Together, we present the nol9sa1022/sa1022 mutant, a novel zebrafish ribosomopathy model, which recapitulates key human disease characteristics. The use of this genetically tractable model will enhance our understanding of the tissue-specific mechanisms following impaired ribosome biogenesis in the context of an intact vertebrate.
Collapse
Affiliation(s)
- Ewa Bielczyk-Maczyńska
- Department of Haematology, University of Cambridge, Cambridge, United Kingdom
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
- NHS Blood and Transplant, Cambridge, United Kingdom
| | - Laure Lam Hung
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Lauren Ferreira
- Department of Haematology, University of Cambridge, Cambridge, United Kingdom
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Tobias Fleischmann
- Department of Haematology, University of Cambridge, Cambridge, United Kingdom
- Cambridge Institute for Medical Research, Cambridge, United Kingdom
- Wellcome Trust-Medical Research Council Stem Cell Institute, University of Cambridge, Cambridge, United Kingdom
| | - Félix Weis
- Department of Haematology, University of Cambridge, Cambridge, United Kingdom
- Cambridge Institute for Medical Research, Cambridge, United Kingdom
- Wellcome Trust-Medical Research Council Stem Cell Institute, University of Cambridge, Cambridge, United Kingdom
| | - Antonio Fernández-Pevida
- Department of Haematology, University of Cambridge, Cambridge, United Kingdom
- Cambridge Institute for Medical Research, Cambridge, United Kingdom
- Wellcome Trust-Medical Research Council Stem Cell Institute, University of Cambridge, Cambridge, United Kingdom
| | - Steven A. Harvey
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Neha Wali
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Alan J. Warren
- Department of Haematology, University of Cambridge, Cambridge, United Kingdom
- Cambridge Institute for Medical Research, Cambridge, United Kingdom
- Wellcome Trust-Medical Research Council Stem Cell Institute, University of Cambridge, Cambridge, United Kingdom
| | - Inês Barroso
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Derek L. Stemple
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Ana Cvejic
- Department of Haematology, University of Cambridge, Cambridge, United Kingdom
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
- Wellcome Trust-Medical Research Council Stem Cell Institute, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|