1
|
Zhang H, Alarcon C, Cavallari LH, Nutescu E, Carvill GL, Perera MA, Hernandez W. Genomewide Association Study Identifies Copy Number Variants Associated With Warfarin Dose Response and Risk of Venous Thromboembolism in African Americans. Clin Pharmacol Ther 2023; 113:624-633. [PMID: 36507737 PMCID: PMC11238476 DOI: 10.1002/cpt.2820] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 12/05/2022] [Indexed: 12/14/2022]
Abstract
The anticoagulant warfarin is commonly used to control and prevent thrombotic disorders, such as venous thromboembolism (VTE), which disproportionately afflicts African Americans. Despite the importance of copy number variants (CNVs), few studies have focused on characterizing and understanding their role in drug response and disease risk among African Americans. In this study, we conduct the first genome-wide analysis of CNVs to more comprehensively account for the contribution of genetic variation in warfarin dose requirement and VTE risk among African Americans. We used hidden Markov models to detect CNVs from high-throughput single-nucleotide polymorphism arrays for 340 African American participants in the International Warfarin Pharmacogenetics Consortium. We identified 11,570 CNVs resulting in 2,038 copy number variable regions (CNVRs) and found 3 CNVRs associated with warfarin dose requirement and 3 CNVRs associated with VTE risk in African Americans. CNVRs 1q31.2del and 6q14.1del were associated with increased warfarin dose requirement (β = 11.18 and 4.94, respectively; Pemp = < 0.002); CNVR 19p13.31del was associated with decreased warfarin dose requirement (β = -1.41, Pemp = 0.0004); CNVRs (2p22.1del and 5q35.1-q35.2del) were found to be associated with increased risk of VTE (odds ratios (ORs) = 1.88 and 14.9, respectively; Pemp ≤0.02); and CNVR 10q26.12del was associated with a decreased risk of VTE (OR = 0.6; Pemp = 0.05). Modeling of the 10q26.12del in HepG2 cells revealed that this deletion results in decreased fibrinogen gene expression, decreased fibrinogen and WDR11 protein levels, and decreased secretion of fibrinogen into the extracellular matrix. We found robust evidence that CNVRs could contribute to warfarin dose requirement and risk of VTE in African Americans and for 10q26.3del describe a possible pathogenic mechanism.
Collapse
Affiliation(s)
- Honghong Zhang
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
| | - Cristina Alarcon
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
| | - Larisa H. Cavallari
- Department of Pharmacotherapy and Translational Research, Center for Pharmacogenomics and Precision Medicine, College of Pharmacy, University of Florida, Gainesville, Florida, USA
| | - Edith Nutescu
- Department of Pharmacy Practice, College of Pharmacy, University of Illinois Chicago, Chicago, Illinois, USA
| | - Gemma L. Carvill
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
- Department of Neurology, Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
| | - Minoli A. Perera
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
| | - Wenndy Hernandez
- Section of Cardiology, University of Chicago, Chicago, Illinois, USA
| |
Collapse
|
2
|
Hahn O, Heining FM, Janzen J, Becker JCR, Bertlich M, Thelen P, Mansour JJ, Duensing S, Pahernik S, Trojan L, Popeneciu IV. Modulating the Heat Sensitivity of Prostate Cancer Cell Lines In Vitro: A New Impact for Focal Therapies. Biomedicines 2020; 8:E585. [PMID: 33316876 PMCID: PMC7763367 DOI: 10.3390/biomedicines8120585] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Revised: 11/25/2020] [Accepted: 12/04/2020] [Indexed: 11/17/2022] Open
Abstract
Focal therapies such as high-intensity focused ultrasound (HiFU) are an emerging therapeutic option for prostate cancer (PCA). Thermal or mechanical effects mediate most therapies. Moreover, locally administered drugs such as bicalutamide or docetaxel are new focal therapeutic options. We assessed the impact of such focal medical treatments on cell viability and heat sensitivity by pre-treating PCA cell lines and then gradually exposing them to heat. The individual heat response of the cell lines tested differed largely. Vertebral-Cancer of the Prostate (VCaP) cells showed an increase in metabolic activity at 40-50 °C. Androgen receptor (AR)-negative PC3 cells showed an increase at 51.3 °C and were overall more resistant to higher temperatures. Pre-treatment of VCaP cells with testosterone (VCaPrev) leads to a more PC3-like kinetic of the heat response. Pre-treatment with finasteride and bicalutamide did not cause changes in heat sensitivity in any cell line. Mitoxantrone treatment, however, shifted heat-induced proliferation loss to lower temperature in VCaP cells. Further analysis via RNAseq identified a possible correlation of heat resistance with H3K27me3-dependent gene regulation, which could be related to an increase in the histone methyltransferase EZH2 and a possible neuroendocrine differentiation. Pre-treatment with mitoxantrone might be a perspective for HiFU treatment. Further studies are needed to evaluate possible combinations with Hsp90 or EZH2 inhibitors.
Collapse
Affiliation(s)
- Oliver Hahn
- Department of Urology, University Medical Center Göttingen, 37075 Göttingen, Germany; (F.M.H.); (J.J.); (J.C.R.B.); (M.B.); (P.T.); (L.T.)
| | - Franziska M. Heining
- Department of Urology, University Medical Center Göttingen, 37075 Göttingen, Germany; (F.M.H.); (J.J.); (J.C.R.B.); (M.B.); (P.T.); (L.T.)
| | - Jörn Janzen
- Department of Urology, University Medical Center Göttingen, 37075 Göttingen, Germany; (F.M.H.); (J.J.); (J.C.R.B.); (M.B.); (P.T.); (L.T.)
| | - Johanna C. R. Becker
- Department of Urology, University Medical Center Göttingen, 37075 Göttingen, Germany; (F.M.H.); (J.J.); (J.C.R.B.); (M.B.); (P.T.); (L.T.)
| | - Marina Bertlich
- Department of Urology, University Medical Center Göttingen, 37075 Göttingen, Germany; (F.M.H.); (J.J.); (J.C.R.B.); (M.B.); (P.T.); (L.T.)
| | - Paul Thelen
- Department of Urology, University Medical Center Göttingen, 37075 Göttingen, Germany; (F.M.H.); (J.J.); (J.C.R.B.); (M.B.); (P.T.); (L.T.)
| | - Josef J. Mansour
- Department of Urology, Heidelberg School of Medicine, University of Heidelberg, 69120 Heidelberg, Germany; (J.J.M.); (S.D.); (S.P.)
| | - Stefan Duensing
- Department of Urology, Heidelberg School of Medicine, University of Heidelberg, 69120 Heidelberg, Germany; (J.J.M.); (S.D.); (S.P.)
| | - Sascha Pahernik
- Department of Urology, Heidelberg School of Medicine, University of Heidelberg, 69120 Heidelberg, Germany; (J.J.M.); (S.D.); (S.P.)
- Department of Urology, Paracelsus Medical University Nuremberg, 90419 Nuremberg, Germany
| | - Lutz Trojan
- Department of Urology, University Medical Center Göttingen, 37075 Göttingen, Germany; (F.M.H.); (J.J.); (J.C.R.B.); (M.B.); (P.T.); (L.T.)
| | - Ionel V. Popeneciu
- Department of Urology, University Medical Center Göttingen, 37075 Göttingen, Germany; (F.M.H.); (J.J.); (J.C.R.B.); (M.B.); (P.T.); (L.T.)
| |
Collapse
|
3
|
Zhou J, Zhang Y, Cui P, Luo L, Chen H, Liang B, Jiang J, Ning C, Tian L, Zhong X, Ye L, Liang H, Huang J. Gut Microbiome Changes Associated With HIV Infection and Sexual Orientation. Front Cell Infect Microbiol 2020; 10:434. [PMID: 33102244 PMCID: PMC7546801 DOI: 10.3389/fcimb.2020.00434] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Accepted: 07/15/2020] [Indexed: 12/20/2022] Open
Abstract
Background: Many studies have explored changes in the gut microbiome associated with HIV infection, but the consistent pattern of changes has not been clarified. Men who have sex with men (MSM) are very likely to be an independent influencing factor of the gut microbiome, but relevant research is still lacking. Methods: We conducted a meta-analysis by screening 12 published studies of 16S rRNA gene amplicon sequencing of gut microbiomes related to HIV/AIDS (six of these studies contain data that is relevant and available to MSM) from NCBI and EBI databases. The analysis of gut microbiomes related to HIV infection status and MSM status included 1,288 samples (HIV-positive (HIV+) individuals, n = 744; HIV-negative (HIV–) individuals, n = 544) and 632 samples (MSM, n = 328; non-MSM, n = 304), respectively. The alpha diversity indexes, beta diversity indexes, differentially enriched genera, differentially enriched species, and differentially enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) functional pathways related to gut microbiomes were calculated. Finally, the overall trend of the above indicators was evaluated. Results: Our results indicate that HIV+ status is associated with decreased alpha diversity of the gut microbiome. MSM status is an important factor that affects the study of HIV-related gut microbiomes; that is, MSM are associated with alpha diversity changes in the gut microbiome regardless of HIV infection, and the changes in the gut microbiome composition of MSM are more significant than those of HIV+ individuals. A consistent change in Bacteroides caccae, Bacteroides ovatus, Bacteroides uniformis, and Prevotella stercorea was found in HIV+ individuals and MSM. The differential expression of the gut microbiome may be accompanied by changes in functional pathways of carbohydrate metabolism, amino acid metabolism, and lipid Metabolism. Conclusions: This study shows that the changes in the gut microbiome are related to HIV and MSM status. Importantly, MSM status may have a far greater impact on the gut microbiome than HIV status.
Collapse
Affiliation(s)
- Jie Zhou
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, Guangxi Medical University, Nanning, China.,School of Public Health, Guangxi Medical University, Nanning, China
| | - Yu Zhang
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, Guangxi Medical University, Nanning, China.,School of Public Health, Guangxi Medical University, Nanning, China
| | - Ping Cui
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, Guangxi Medical University, Nanning, China.,Guangxi Collaborative Innovation Center for Biomedicine, Life Science Institute, Guangxi Medical University, Nanning, China
| | - Lijia Luo
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, Guangxi Medical University, Nanning, China.,School of Public Health, Guangxi Medical University, Nanning, China
| | - Hui Chen
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, Guangxi Medical University, Nanning, China.,The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Bingyu Liang
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, Guangxi Medical University, Nanning, China.,School of Public Health, Guangxi Medical University, Nanning, China
| | - Junjun Jiang
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, Guangxi Medical University, Nanning, China.,School of Public Health, Guangxi Medical University, Nanning, China
| | - Chuanyi Ning
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, Guangxi Medical University, Nanning, China.,Guangxi Collaborative Innovation Center for Biomedicine, Life Science Institute, Guangxi Medical University, Nanning, China
| | - Li Tian
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, Guangxi Medical University, Nanning, China.,The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Xiaodan Zhong
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, Guangxi Medical University, Nanning, China.,The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Li Ye
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, Guangxi Medical University, Nanning, China.,School of Public Health, Guangxi Medical University, Nanning, China.,Guangxi Collaborative Innovation Center for Biomedicine, Life Science Institute, Guangxi Medical University, Nanning, China
| | - Hao Liang
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, Guangxi Medical University, Nanning, China.,School of Public Health, Guangxi Medical University, Nanning, China.,Guangxi Collaborative Innovation Center for Biomedicine, Life Science Institute, Guangxi Medical University, Nanning, China
| | - Jiegang Huang
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Guangxi Universities Key Laboratory of Prevention and Control of Highly Prevalent Disease, Guangxi Medical University, Nanning, China.,School of Public Health, Guangxi Medical University, Nanning, China
| |
Collapse
|
4
|
Yao E, Buels R, Stein L, Sen TZ, Holmes I. JBrowse Connect: A server API to connect JBrowse instances and users. PLoS Comput Biol 2020; 16:e1007261. [PMID: 32810130 PMCID: PMC7508408 DOI: 10.1371/journal.pcbi.1007261] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Revised: 09/22/2020] [Accepted: 05/27/2020] [Indexed: 11/19/2022] Open
Abstract
We describe JBrowse Connect, an optional expansion to the JBrowse genome browser, targeted at developers. JBrowse Connect allows live messaging, notifications for new annotation tracks, heavy-duty analyses initiated by the user from within the browser, and other dynamic features. We present example applications of JBrowse Connect that allow users 1) to specify and execute BLAST searches by either running on the same host as the webserver, with a self-contained BLAST module leveraging NCBI Blast+ commands, or via a managed Galaxy instance that can optionally run on a different host, and 2) to run the primer design service Primer3. JBrowse Connect allows users to track job progress and view results in the context of the browser. The software is available under a choice of open source licenses including LGPL and the Artistic License.
Collapse
Affiliation(s)
- Eric Yao
- Department of Bioengineering, Stanley Hall, University of California, Berkeley, California, United States of America
- U.S. Department of Agriculture, Agricultural Research Service, Western Regional Research Center, Crop Improvement and Genetics Research Unit, Albany, California, United States of America
| | - Robert Buels
- Department of Bioengineering, Stanley Hall, University of California, Berkeley, California, United States of America
| | - Lincoln Stein
- Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | - Taner Z. Sen
- U.S. Department of Agriculture, Agricultural Research Service, Western Regional Research Center, Crop Improvement and Genetics Research Unit, Albany, California, United States of America
| | - Ian Holmes
- Department of Bioengineering, Stanley Hall, University of California, Berkeley, California, United States of America
- * E-mail:
| |
Collapse
|
5
|
Oh M, Park S, Kim S, Chae H. Machine learning-based analysis of multi-omics data on the cloud for investigating gene regulations. Brief Bioinform 2020; 22:66-76. [PMID: 32227074 DOI: 10.1093/bib/bbaa032] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2019] [Revised: 02/05/2020] [Accepted: 02/25/2020] [Indexed: 02/06/2023] Open
Abstract
Gene expressions are subtly regulated by quantifiable measures of genetic molecules such as interaction with other genes, methylation, mutations, transcription factor and histone modifications. Integrative analysis of multi-omics data can help scientists understand the condition or patient-specific gene regulation mechanisms. However, analysis of multi-omics data is challenging since it requires not only the analysis of multiple omics data sets but also mining complex relations among different genetic molecules by using state-of-the-art machine learning methods. In addition, analysis of multi-omics data needs quite large computing infrastructure. Moreover, interpretation of the analysis results requires collaboration among many scientists, often requiring reperforming analysis from different perspectives. Many of the aforementioned technical issues can be nicely handled when machine learning tools are deployed on the cloud. In this survey article, we first survey machine learning methods that can be used for gene regulation study, and we categorize them according to five different goals: gene regulatory subnetwork discovery, disease subtype analysis, survival analysis, clinical prediction and visualization. We also summarize the methods in terms of multi-omics input types. Then, we explain why the cloud is potentially a good solution for the analysis of multi-omics data, followed by a survey of two state-of-the-art cloud systems, Galaxy and BioVLAB. Finally, we discuss important issues when the cloud is used for the analysis of multi-omics data for the gene regulation study.
Collapse
Affiliation(s)
- Minsik Oh
- Department of Computer Science and Engineering, Seoul National University, Seoul, 08826, Korea
| | - Sungjoon Park
- Department of Computer Science and Engineering, Seoul National University, Seoul, 08826, Korea
| | - Sun Kim
- Department of Computer Science and Engineering, Seoul National University, Seoul, 08826, Korea.,Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826, Korea.,Bioinformatics Institute, Seoul National University, Seoul, 08826, Korea
| | - Heejoon Chae
- Division of Computer Science, Sookmyung Women's University, Seoul, 04310,Korea
| |
Collapse
|
6
|
Prieto C, Barrios D. RaNA-Seq: Interactive RNA-Seq analysis from FASTQ files to functional analysis. Bioinformatics 2019; 36:btz854. [PMID: 31730197 DOI: 10.1093/bioinformatics/btz854] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Revised: 10/04/2019] [Accepted: 11/11/2019] [Indexed: 11/14/2022] Open
Abstract
SUMMARY RaNA-Seq is a cloud platform for the rapid analysis and visualization of RNA-Seq data. It performs a full analysis in minutes by quantifying FASTQ files, calculating quality control metrics, running differential expression analyses and enabling the explanation of results with functional analyses. Our analysis pipeline applies generally accepted and reproducible protocols that can be applied with two simple steps in its web interface. Analysis results are presented as interactive graphics and reports, ready for their interpretation and publication. AVAILABILITY RaNA-Seq web service is freely available online at https://ranaseq.eu. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Carlos Prieto
- Bioinformatics Service, Nucleus, University of Salamanca, Plaza Doctores de la Reina, Salamanca, Spain
| | - David Barrios
- Bioinformatics Service, Nucleus, University of Salamanca, Plaza Doctores de la Reina, Salamanca, Spain
| |
Collapse
|
7
|
Parveen A, Khurana S, Kumar A. Overview of Genomic Tools for Circular Visualization in the Next-generation Genomic Sequencing Era. Curr Genomics 2019; 20:90-99. [PMID: 31555060 PMCID: PMC6728899 DOI: 10.2174/1389202920666190314092044] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Revised: 03/07/2019] [Accepted: 03/07/2019] [Indexed: 12/13/2022] Open
Abstract
After human genome sequencing and rapid changes in genome sequencing methods, we have entered into the era of rapidly accumulating genome-sequencing data. This has derived the development of several types of methods for representing results of genome sequencing data. Circular genome visual-ization tools are also critical in this area as they provide rapid interpretation and simple visualization of overall data. In the last 15 years, we have seen rapid changes in circular visualization tools after the de-velopment of the circos tool with 1-2 tools published per year. Herein we have summarized and revisited all these tools until the third quarter of 2018.
Collapse
Affiliation(s)
- Alisha Parveen
- 1Medical Research Center, Medical Faculty of Mannheim, University of Heidelberg, Mannheim, Germany; 2Pharmacology Department, Central Drug Research Institute - Lucknow, Uttar Pradesh, India; 3Department of Genetics & Molecular Biology in Botany, Institute of Botany, Christian-Albrechts-University at Kiel, Kiel, Germany
| | - Sukant Khurana
- 1Medical Research Center, Medical Faculty of Mannheim, University of Heidelberg, Mannheim, Germany; 2Pharmacology Department, Central Drug Research Institute - Lucknow, Uttar Pradesh, India; 3Department of Genetics & Molecular Biology in Botany, Institute of Botany, Christian-Albrechts-University at Kiel, Kiel, Germany
| | - Abhishek Kumar
- 1Medical Research Center, Medical Faculty of Mannheim, University of Heidelberg, Mannheim, Germany; 2Pharmacology Department, Central Drug Research Institute - Lucknow, Uttar Pradesh, India; 3Department of Genetics & Molecular Biology in Botany, Institute of Botany, Christian-Albrechts-University at Kiel, Kiel, Germany
| |
Collapse
|
8
|
Afgan E, Baker D, Batut B, van den Beek M, Bouvier D, Čech M, Chilton J, Clements D, Coraor N, Grüning BA, Guerler A, Hillman-Jackson J, Hiltemann S, Jalili V, Rasche H, Soranzo N, Goecks J, Taylor J, Nekrutenko A, Blankenberg D. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Res 2018; 46:W537-W544. [PMID: 29790989 PMCID: PMC6030816 DOI: 10.1093/nar/gky379] [Citation(s) in RCA: 2361] [Impact Index Per Article: 337.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2018] [Revised: 04/25/2018] [Accepted: 05/02/2018] [Indexed: 02/06/2023] Open
Abstract
Galaxy (homepage: https://galaxyproject.org, main public server: https://usegalaxy.org) is a web-based scientific analysis platform used by tens of thousands of scientists across the world to analyze large biomedical datasets such as those found in genomics, proteomics, metabolomics and imaging. Started in 2005, Galaxy continues to focus on three key challenges of data-driven biomedical science: making analyses accessible to all researchers, ensuring analyses are completely reproducible, and making it simple to communicate analyses so that they can be reused and extended. During the last two years, the Galaxy team and the open-source community around Galaxy have made substantial improvements to Galaxy's core framework, user interface, tools, and training materials. Framework and user interface improvements now enable Galaxy to be used for analyzing tens of thousands of datasets, and >5500 tools are now available from the Galaxy ToolShed. The Galaxy community has led an effort to create numerous high-quality tutorials focused on common types of genomic analyses. The Galaxy developer and user communities continue to grow and be integral to Galaxy's development. The number of Galaxy public servers, developers contributing to the Galaxy framework and its tools, and users of the main Galaxy server have all increased substantially.
Collapse
Affiliation(s)
- Enis Afgan
- Department of Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Dannon Baker
- Department of Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Bérénice Batut
- Department of Computer Science, Albert-Ludwigs-University, Freiburg, Freiburg, Germany
| | | | - Dave Bouvier
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Martin Čech
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - John Chilton
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Dave Clements
- Department of Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Nate Coraor
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Björn A Grüning
- Department of Computer Science, Albert-Ludwigs-University, Freiburg, Freiburg, Germany
- Center for Biological Systems Analysis (ZBSA), University of Freiburg, Freiburg, Germany
| | - Aysam Guerler
- Department of Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Jennifer Hillman-Jackson
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Saskia Hiltemann
- Department of Pathology, Erasmus Medical Center, Rotterdam, The Netherlands
| | - Vahid Jalili
- Department of Biomedical Engineering, Oregon Health and Science University, OR, USA
| | - Helena Rasche
- Department of Computer Science, Albert-Ludwigs-University, Freiburg, Freiburg, Germany
| | | | - Jeremy Goecks
- Department of Biomedical Engineering, Oregon Health and Science University, OR, USA
| | - James Taylor
- Department of Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Anton Nekrutenko
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Daniel Blankenberg
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
| |
Collapse
|
9
|
Thanki AS, Soranzo N, Haerty W, Davey RP. GeneSeqToFamily: a Galaxy workflow to find gene families based on the Ensembl Compara GeneTrees pipeline. Gigascience 2018; 7:1-10. [PMID: 29425291 PMCID: PMC5863215 DOI: 10.1093/gigascience/giy005] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2017] [Revised: 07/31/2017] [Accepted: 01/18/2018] [Indexed: 11/13/2022] Open
Abstract
Background Gene duplication is a major factor contributing to evolutionary novelty, and the contraction or expansion of gene families has often been associated with morphological, physiological, and environmental adaptations. The study of homologous genes helps us to understand the evolution of gene families. It plays a vital role in finding ancestral gene duplication events as well as identifying genes that have diverged from a common ancestor under positive selection. There are various tools available, such as MSOAR, OrthoMCL, and HomoloGene, to identify gene families and visualize syntenic information between species, providing an overview of syntenic regions evolution at the family level. Unfortunately, none of them provide information about structural changes within genes, such as the conservation of ancestral exon boundaries among multiple genomes. The Ensembl GeneTrees computational pipeline generates gene trees based on coding sequences, provides details about exon conservation, and is used in the Ensembl Compara project to discover gene families. Findings A certain amount of expertise is required to configure and run the Ensembl Compara GeneTrees pipeline via command line. Therefore, we converted this pipeline into a Galaxy workflow, called GeneSeqToFamily, and provided additional functionality. This workflow uses existing tools from the Galaxy ToolShed, as well as providing additional wrappers and tools that are required to run the workflow. Conclusions GeneSeqToFamily represents the Ensembl GeneTrees pipeline as a set of interconnected Galaxy tools, so they can be run interactively within the Galaxy's user-friendly workflow environment while still providing the flexibility to tailor the analysis by changing configurations and tools if necessary. Additional tools allow users to subsequently visualize the gene families produced by the workflow, using the Aequatus.js interactive tool, which has been developed as part of the Aequatus software project.
Collapse
Affiliation(s)
- Anil S Thanki
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Nicola Soranzo
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Wilfried Haerty
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Robert P Davey
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| |
Collapse
|
10
|
Ko G, Kim PG, Yoon J, Han G, Park SJ, Song W, Lee B. Closha: bioinformatics workflow system for the analysis of massive sequencing data. BMC Bioinformatics 2018; 19:43. [PMID: 29504905 PMCID: PMC5836837 DOI: 10.1186/s12859-018-2019-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND While next-generation sequencing (NGS) costs have fallen in recent years, the cost and complexity of computation remain substantial obstacles to the use of NGS in bio-medical care and genomic research. The rapidly increasing amounts of data available from the new high-throughput methods have made data processing infeasible without automated pipelines. The integration of data and analytic resources into workflow systems provides a solution to the problem by simplifying the task of data analysis. RESULTS To address this challenge, we developed a cloud-based workflow management system, Closha, to provide fast and cost-effective analysis of massive genomic data. We implemented complex workflows making optimal use of high-performance computing clusters. Closha allows users to create multi-step analyses using drag and drop functionality and to modify the parameters of pipeline tools. Users can also import the Galaxy pipelines into Closha. Closha is a hybrid system that enables users to use both analysis programs providing traditional tools and MapReduce-based big data analysis programs simultaneously in a single pipeline. Thus, the execution of analytics algorithms can be parallelized, speeding up the whole process. We also developed a high-speed data transmission solution, KoDS, to transmit a large amount of data at a fast rate. KoDS has a file transfer speed of up to 10 times that of normal FTP and HTTP. The computer hardware for Closha is 660 CPU cores and 800 TB of disk storage, enabling 500 jobs to run at the same time. CONCLUSIONS Closha is a scalable, cost-effective, and publicly available web service for large-scale genomic data analysis. Closha supports the reliable and highly scalable execution of sequencing analysis workflows in a fully automated manner. Closha provides a user-friendly interface to all genomic scientists to try to derive accurate results from NGS platform data. The Closha cloud server is freely available for use from http://closha.kobic.re.kr/ .
Collapse
Affiliation(s)
| | | | - Jongcheol Yoon
- Korean BioInformation Center (KOBIC), KRIBB, 125 Gwahangno, Yuseong-gu, Daejeon, 34141, South Korea
| | - Gukhee Han
- Korean BioInformation Center (KOBIC), KRIBB, 125 Gwahangno, Yuseong-gu, Daejeon, 34141, South Korea
| | - Seong-Jin Park
- Korean BioInformation Center (KOBIC), KRIBB, 125 Gwahangno, Yuseong-gu, Daejeon, 34141, South Korea
| | - Wangho Song
- Korean BioInformation Center (KOBIC), KRIBB, 125 Gwahangno, Yuseong-gu, Daejeon, 34141, South Korea
| | - Byungwook Lee
- Korean BioInformation Center (KOBIC), KRIBB, 125 Gwahangno, Yuseong-gu, Daejeon, 34141, South Korea.
| |
Collapse
|
11
|
Hassan Z, Schneeweis C, Wirth M, Veltkamp C, Dantes Z, Feuerecker B, Ceyhan GO, Knauer SK, Weichert W, Schmid RM, Stauber R, Arlt A, Krämer OH, Rad R, Reichert M, Saur D, Schneider G. MTOR inhibitor-based combination therapies for pancreatic cancer. Br J Cancer 2018; 118:366-377. [PMID: 29384525 PMCID: PMC5808033 DOI: 10.1038/bjc.2017.421] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Revised: 10/25/2017] [Accepted: 10/26/2017] [Indexed: 12/13/2022] Open
Abstract
Background: Although the mechanistic target of rapamycin (MTOR) kinase, included in the mTORC1 and mTORC2 signalling hubs, has been demonstrated to be active in a significant fraction of patients with pancreatic ductal adenocarcinoma (PDAC), the value of the kinase as a therapeutic target needs further clarification. Methods: We used Mtor floxed mice to analyse the function of the kinase in context of the pancreas at the genetic level. Using a dual-recombinase system, which is based on the flippase-FRT (Flp-FRT) and Cre-loxP recombination technologies, we generated a novel cellular model, allowing the genetic analysis of MTOR functions in tumour maintenance. Cross-species validation and pharmacological intervention studies were used to recapitulate genetic data in human models, including primary human 3D PDAC cultures. Results: Genetic deletion of the Mtor gene in the pancreas results in exocrine and endocrine insufficiency. In established murine PDAC cells, MTOR is linked to metabolic pathways and maintains the glucose uptake and growth. Importantly, blocking MTOR genetically as well as pharmacologically results in adaptive rewiring of oncogenic signalling with activation of canonical extracellular signal-regulated kinase and phosphoinositide 3-kinase-AKT pathways. We provide evidence that interfering with such adaptive signalling in murine and human PDAC models is important in a subgroup. Conclusions: Our data suggest developing dual MTORC1/TORC2 inhibitor-based therapies for subtype-specific intervention.
Collapse
Affiliation(s)
- Zonera Hassan
- Medical Clinic and Polyclinic II, Klinikum rechts der Isar, Technical University Munich, 81675 München, Germany
| | - Christian Schneeweis
- Medical Clinic and Polyclinic II, Klinikum rechts der Isar, Technical University Munich, 81675 München, Germany
| | - Matthias Wirth
- Institute of Pathology, Heinrich-Heine University and University Hospital Düsseldorf, 40225 Düsseldorf, Germany
| | - Christian Veltkamp
- Medical Clinic and Polyclinic II, Klinikum rechts der Isar, Technical University Munich, 81675 München, Germany
| | - Zahra Dantes
- Medical Clinic and Polyclinic II, Klinikum rechts der Isar, Technical University Munich, 81675 München, Germany
| | - Benedikt Feuerecker
- Department of Nuclear Medicine, Klinikum rechts der Isar, Technical University Munich, 81675 München, Germany.,German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), 69120 Heidelberg, Germany
| | - Güralp O Ceyhan
- Department of Surgery, Klinikum rechts der Isar, Technical University of Munich, 81675 München, Germany
| | - Shirley K Knauer
- Molecular Biology, Centre for Medical Biotechnology (ZMB), University Duisburg-Essen, 45141 Essen, Germany
| | - Wilko Weichert
- Institute of Pathology, Technische Universität München, 81675 München, Germany
| | - Roland M Schmid
- Medical Clinic and Polyclinic II, Klinikum rechts der Isar, Technical University Munich, 81675 München, Germany
| | - Roland Stauber
- Molecular and Cellular Oncology/ENT, University Medical Center Mainz, Langenbeckstrasse 1, Mainz 55131, Germany
| | - Alexander Arlt
- Laboratory of Molecular Gastroenterology and Hepatology, 1st Department of Internal Medicine, University Hospital Schleswig-Holstein, Kiel, Germany
| | - Oliver H Krämer
- Department of Toxicology, University of Mainz Medical Center, Mainz 55131, Germany
| | - Roland Rad
- Medical Clinic and Polyclinic II, Klinikum rechts der Isar, Technical University Munich, 81675 München, Germany.,German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), 69120 Heidelberg, Germany
| | - Maximilian Reichert
- Medical Clinic and Polyclinic II, Klinikum rechts der Isar, Technical University Munich, 81675 München, Germany.,Division of Gastroenterology and Abramson Cancer Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Dieter Saur
- Medical Clinic and Polyclinic II, Klinikum rechts der Isar, Technical University Munich, 81675 München, Germany.,German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), 69120 Heidelberg, Germany
| | - Günter Schneider
- Medical Clinic and Polyclinic II, Klinikum rechts der Isar, Technical University Munich, 81675 München, Germany.,German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), 69120 Heidelberg, Germany
| |
Collapse
|
12
|
Barrios D, Prieto C. D3GB: An Interactive Genome Browser for R, Python, and WordPress. J Comput Biol 2017; 24:447-449. [PMID: 28437135 DOI: 10.1089/cmb.2016.0213] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Genome browsers are useful not only for showing final results but also for improving analysis protocols, testing data quality, and generating result drafts. Its integration in analysis pipelines allows the optimization of parameters, which leads to better results. New developments that facilitate the creation and utilization of genome browsers could contribute to improving analysis results and supporting the quick visualization of genomic data. D3 Genome Browser is an interactive genome browser that can be easily integrated in analysis protocols and shared on the Web. It is distributed as an R package, a Python module, and a WordPress plugin to facilitate its integration in pipelines and the utilization of platform capabilities. It is compatible with popular data formats such as GenBank, GFF, BED, FASTA, and VCF, and enables the exploration of genomic data with a Web browser.
Collapse
Affiliation(s)
- David Barrios
- 1 Bioinformatics Service, Nucleus, University of Salamanca (USAL) , Salamanca, Spain
| | - Carlos Prieto
- 1 Bioinformatics Service, Nucleus, University of Salamanca (USAL) , Salamanca, Spain .,2 Institute of Biomedical Research of Salamanca (IBSAL) , Salamanca, Spain
| |
Collapse
|
13
|
Guerrero CR, Jagtap PD, Johnson JE, Griffin TJ. Using Galaxy for Proteomics. PROTEOME INFORMATICS 2016. [DOI: 10.1039/9781782626732-00289] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
The area of informatics for mass spectrometry (MS)-based proteomics data has steadily grown over the last two decades. Numerous, effective software programs now exist for various aspects of proteomic informatics. However, many researchers still have difficulties in using these software. These difficulties arise from problems with running and integrating disparate software programs, scalability issues when dealing with large data volumes, and lack of ability to share and reproduce workflows comprised of different software. The Galaxy framework for bioinformatics provides an attractive option for solving many of these current issues in proteomic informatics. Originally developed as a workbench to enable genomic data analysis, numerous researchers are now turning to Galaxy to implement software for MS-based proteomics applications. Here, we provide an introduction to Galaxy and its features, and describe how software tools are deployed, published and shared via the scalable framework. We also describe some of the existing tools in Galaxy for basic MS-based proteomics data analysis and informatics. Finally, we describe how proteomics tools in Galaxy can be combined with other existing tools for genomic and transcriptomic data analysis to enable powerful multi-omic data analysis applications.
Collapse
Affiliation(s)
- Candace R. Guerrero
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota 321 Church St SE/6-155 Jackson Hall Minneapolis MN 55455 USA
| | - Pratik D. Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota 321 Church St SE/6-155 Jackson Hall Minneapolis MN 55455 USA
- Center for Mass Spectrometry and Proteomics, University of Minnesota 1479 Gortner Avenue, St. Paul MN 55108 USA
| | - James E. Johnson
- Minnesota Supercomputing Institute, University of Minnesota 512 Walter Library, 117 Pleasant Street SE Minneapolis MN 55455 USA
| | - Timothy J. Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota 321 Church St SE/6-155 Jackson Hall Minneapolis MN 55455 USA
- Center for Mass Spectrometry and Proteomics, University of Minnesota 1479 Gortner Avenue, St. Paul MN 55108 USA
| |
Collapse
|
14
|
Reid CD, Karra K, Chang J, Piskol R, Li Q, Li JB, Cherry JM, Baker JC. XenMine: A genomic interaction tool for the Xenopus community. Dev Biol 2016; 426:155-164. [PMID: 27157655 DOI: 10.1016/j.ydbio.2016.02.034] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2015] [Revised: 02/06/2016] [Accepted: 02/26/2016] [Indexed: 11/17/2022]
Abstract
The Xenopus community has embraced recent advances in sequencing technology, resulting in the accumulation of numerous RNA-Seq and ChIP-Seq datasets. However, easily accessing and comparing datasets generated by multiple laboratories is challenging. Thus, we have created a central space to view, search and analyze data, providing essential information on gene expression changes and regulatory elements present in the genome. XenMine (www.xenmine.org) is a user-friendly website containing published genomic datasets from both Xenopus tropicalis and Xenopus laevis. We have established an analysis pipeline where all published datasets are uniformly processed with the latest genome releases. Information from these datasets can be extracted and compared using an array of pre-built or custom templates. With these search tools, users can easily extract sequences for all putative regulatory domains surrounding a gene of interest, identify the expression values of a gene of interest over developmental time, and analyze lists of genes for gene ontology terms and publications. Additionally, XenMine hosts an in-house genome browser that allows users to visualize all available ChIP-Seq data, extract specifically marked sequences, and aid in identifying important regulatory elements within the genome. Altogether, XenMine is an excellent tool for visualizing, accessing and querying analyzed datasets rapidly and efficiently.
Collapse
Affiliation(s)
- Christine D Reid
- Department of Genetics, Stanford University, Stanford CA 94305, USA
| | - Kalpana Karra
- Department of Genetics, Stanford University, Stanford CA 94305, USA
| | - Jessica Chang
- Department of Genetics, Stanford University, Stanford CA 94305, USA
| | - Robert Piskol
- Department of Genetics, Stanford University, Stanford CA 94305, USA
| | - Qin Li
- Department of Genetics, Stanford University, Stanford CA 94305, USA
| | - Jin Billy Li
- Department of Genetics, Stanford University, Stanford CA 94305, USA
| | - J Michael Cherry
- Department of Genetics, Stanford University, Stanford CA 94305, USA
| | - Julie C Baker
- Department of Genetics, Stanford University, Stanford CA 94305, USA.
| |
Collapse
|
15
|
Goold HD, Nguyen HM, Kong F, Beyly-Adriano A, Légeret B, Billon E, Cuiné S, Beisson F, Peltier G, Li-Beisson Y. Whole Genome Re-Sequencing Identifies a Quantitative Trait Locus Repressing Carbon Reserve Accumulation during Optimal Growth in Chlamydomonas reinhardtii. Sci Rep 2016; 6:25209. [PMID: 27141848 PMCID: PMC4855234 DOI: 10.1038/srep25209] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2016] [Accepted: 04/13/2016] [Indexed: 02/08/2023] Open
Abstract
Microalgae have emerged as a promising source for biofuel production. Massive oil and starch accumulation in microalgae is possible, but occurs mostly when biomass growth is impaired. The molecular networks underlying the negative correlation between growth and reserve formation are not known. Thus isolation of strains capable of accumulating carbon reserves during optimal growth would be highly desirable. To this end, we screened an insertional mutant library of Chlamydomonas reinhardtii for alterations in oil content. A mutant accumulating five times more oil and twice more starch than wild-type during optimal growth was isolated and named constitutive oil accumulator 1 (coa1). Growth in photobioreactors under highly controlled conditions revealed that the increase in oil and starch content in coa1 was dependent on light intensity. Genetic analysis and DNA hybridization pointed to a single insertional event responsible for the phenotype. Whole genome re-sequencing identified in coa1 a >200 kb deletion on chromosome 14 containing 41 genes. This study demonstrates that, 1), the generation of algal strains accumulating higher reserve amount without compromising biomass accumulation is feasible; 2), light is an important parameter in phenotypic analysis; and 3), a chromosomal region (Quantitative Trait Locus) acts as suppressor of carbon reserve accumulation during optimal growth.
Collapse
Affiliation(s)
- Hugh Douglas Goold
- CEA, BIAM, Lab Bioenerget Biotechnol Bacteries &Microalgues, Saint-Paul-lez-Durance, 13108, France.,CNRS, UMR 7265 Biol Veget &Microbiol Environ, Saint-Paul-lez-Durance, 13108, France.,Aix Marseille Université, BVME UMR7265, Marseille, 13284, France.,Faculty of Agriculture and the Environment, University of Sydney, Australia
| | - Hoa Mai Nguyen
- CEA, BIAM, Lab Bioenerget Biotechnol Bacteries &Microalgues, Saint-Paul-lez-Durance, 13108, France.,CNRS, UMR 7265 Biol Veget &Microbiol Environ, Saint-Paul-lez-Durance, 13108, France.,Aix Marseille Université, BVME UMR7265, Marseille, 13284, France
| | - Fantao Kong
- CEA, BIAM, Lab Bioenerget Biotechnol Bacteries &Microalgues, Saint-Paul-lez-Durance, 13108, France.,CNRS, UMR 7265 Biol Veget &Microbiol Environ, Saint-Paul-lez-Durance, 13108, France.,Aix Marseille Université, BVME UMR7265, Marseille, 13284, France
| | - Audrey Beyly-Adriano
- CEA, BIAM, Lab Bioenerget Biotechnol Bacteries &Microalgues, Saint-Paul-lez-Durance, 13108, France.,CNRS, UMR 7265 Biol Veget &Microbiol Environ, Saint-Paul-lez-Durance, 13108, France.,Aix Marseille Université, BVME UMR7265, Marseille, 13284, France
| | - Bertrand Légeret
- CEA, BIAM, Lab Bioenerget Biotechnol Bacteries &Microalgues, Saint-Paul-lez-Durance, 13108, France.,CNRS, UMR 7265 Biol Veget &Microbiol Environ, Saint-Paul-lez-Durance, 13108, France.,Aix Marseille Université, BVME UMR7265, Marseille, 13284, France
| | - Emmanuelle Billon
- CEA, BIAM, Lab Bioenerget Biotechnol Bacteries &Microalgues, Saint-Paul-lez-Durance, 13108, France.,CNRS, UMR 7265 Biol Veget &Microbiol Environ, Saint-Paul-lez-Durance, 13108, France.,Aix Marseille Université, BVME UMR7265, Marseille, 13284, France
| | - Stéphan Cuiné
- CEA, BIAM, Lab Bioenerget Biotechnol Bacteries &Microalgues, Saint-Paul-lez-Durance, 13108, France.,CNRS, UMR 7265 Biol Veget &Microbiol Environ, Saint-Paul-lez-Durance, 13108, France.,Aix Marseille Université, BVME UMR7265, Marseille, 13284, France
| | - Fred Beisson
- CEA, BIAM, Lab Bioenerget Biotechnol Bacteries &Microalgues, Saint-Paul-lez-Durance, 13108, France.,CNRS, UMR 7265 Biol Veget &Microbiol Environ, Saint-Paul-lez-Durance, 13108, France.,Aix Marseille Université, BVME UMR7265, Marseille, 13284, France
| | - Gilles Peltier
- CEA, BIAM, Lab Bioenerget Biotechnol Bacteries &Microalgues, Saint-Paul-lez-Durance, 13108, France.,CNRS, UMR 7265 Biol Veget &Microbiol Environ, Saint-Paul-lez-Durance, 13108, France.,Aix Marseille Université, BVME UMR7265, Marseille, 13284, France
| | - Yonghua Li-Beisson
- CEA, BIAM, Lab Bioenerget Biotechnol Bacteries &Microalgues, Saint-Paul-lez-Durance, 13108, France.,CNRS, UMR 7265 Biol Veget &Microbiol Environ, Saint-Paul-lez-Durance, 13108, France.,Aix Marseille Université, BVME UMR7265, Marseille, 13284, France
| |
Collapse
|
16
|
Afgan E, Baker D, van den Beek M, Blankenberg D, Bouvier D, Čech M, Chilton J, Clements D, Coraor N, Eberhard C, Grüning B, Guerler A, Hillman-Jackson J, Von Kuster G, Rasche E, Soranzo N, Turaga N, Taylor J, Nekrutenko A, Goecks J. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic Acids Res 2016; 44:W3-W10. [PMID: 27137889 PMCID: PMC4987906 DOI: 10.1093/nar/gkw343] [Citation(s) in RCA: 1282] [Impact Index Per Article: 142.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2016] [Accepted: 04/18/2016] [Indexed: 02/07/2023] Open
Abstract
High-throughput data production technologies, particularly ‘next-generation’ DNA sequencing, have ushered in widespread and disruptive changes to biomedical research. Making sense of the large datasets produced by these technologies requires sophisticated statistical and computational methods, as well as substantial computational power. This has led to an acute crisis in life sciences, as researchers without informatics training attempt to perform computation-dependent analyses. Since 2005, the Galaxy project has worked to address this problem by providing a framework that makes advanced computational tools usable by non experts. Galaxy seeks to make data-intensive research more accessible, transparent and reproducible by providing a Web-based environment in which users can perform computational analyses and have all of the details automatically tracked for later inspection, publication, or reuse. In this report we highlight recently added features enabling biomedical analyses on a large scale.
Collapse
Affiliation(s)
- Enis Afgan
- Department of Biology, Johns Hopkins University, Baltimore, MD USA
| | - Dannon Baker
- Department of Biology, Johns Hopkins University, Baltimore, MD USA
| | - Marius van den Beek
- Institut de Biologie Paris-Seine, Université Pierre et Marie Curie, Paris, France
| | - Daniel Blankenberg
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Dave Bouvier
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Martin Čech
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - John Chilton
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Dave Clements
- Department of Biology, Johns Hopkins University, Baltimore, MD USA
| | - Nate Coraor
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Carl Eberhard
- Department of Biology, Johns Hopkins University, Baltimore, MD USA
| | - Björn Grüning
- Department of Computer Science, Albert-Ludwigs-University, Freiburg, Freiburg, Germany Center for Biological Systems Analysis (ZBSA), University of Freiburg, Freiburg, Germany
| | - Aysam Guerler
- Department of Biology, Johns Hopkins University, Baltimore, MD USA
| | - Jennifer Hillman-Jackson
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Greg Von Kuster
- Academic Computing Services, Penn State University, University Park, PA, USA
| | - Eric Rasche
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, USA
| | | | - Nitesh Turaga
- Department of Biology, Johns Hopkins University, Baltimore, MD USA
| | - James Taylor
- Department of Biology, Johns Hopkins University, Baltimore, MD USA
| | - Anton Nekrutenko
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, USA
| | - Jeremy Goecks
- The Computational Biology Institute, George Washington University, Washington DC, USA
| |
Collapse
|
17
|
Lin G, Chai J, Yuan S, Mai C, Cai L, Murphy RW, Zhou W, Luo J. VennPainter: A Tool for the Comparison and Identification of Candidate Genes Based on Venn Diagrams. PLoS One 2016; 11:e0154315. [PMID: 27120465 PMCID: PMC4847855 DOI: 10.1371/journal.pone.0154315] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2015] [Accepted: 04/12/2016] [Indexed: 12/21/2022] Open
Abstract
VennPainter is a program for depicting unique and shared sets of genes lists and generating Venn diagrams, by using the Qt C++ framework. The software produces Classic Venn, Edwards’ Venn and Nested Venn diagrams and allows for eight sets in a graph mode and 31 sets in data processing mode only. In comparison, previous programs produce Classic Venn and Edwards’ Venn diagrams and allow for a maximum of six sets. The software incorporates user-friendly features and works in Windows, Linux and Mac OS. Its graphical interface does not require a user to have programing skills. Users can modify diagram content for up to eight datasets because of the Scalable Vector Graphics output. VennPainter can provide output results in vertical, horizontal and matrix formats, which facilitates sharing datasets as required for further identification of candidate genes. Users can obtain gene lists from shared sets by clicking the numbers on the diagram. Thus, VennPainter is an easy-to-use, highly efficient, cross-platform and powerful program that provides a more comprehensive tool for identifying candidate genes and visualizing the relationships among genes or gene families in comparative analysis.
Collapse
Affiliation(s)
- Guoliang Lin
- Key Laboratory for Animal Genetic Diversity and Evolution of High Education in Yunnan Province, School of Life Sciences, Yunnan University, Kunming, 650091, China
- State Key Laboratory for Conservation and Utilization of Bio-resource, Yunnan University, Kunming, 650091, Yunnan, China
| | - Jing Chai
- Key Laboratory for Animal Genetic Diversity and Evolution of High Education in Yunnan Province, School of Life Sciences, Yunnan University, Kunming, 650091, China
- State Key Laboratory for Conservation and Utilization of Bio-resource, Yunnan University, Kunming, 650091, Yunnan, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, The Chinese Academy of Sciences, Kunming, 650223, Yunnan, China
- Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650000, China
| | - Shuo Yuan
- School of Software, Yunnan University, Kunming, 650091, Yunnan, China
| | - Chao Mai
- School of Software, Yunnan University, Kunming, 650091, Yunnan, China
| | - Li Cai
- School of Software, Yunnan University, Kunming, 650091, Yunnan, China
- School of Computer and Science, Fudan University, Shanghai, 200433, China
| | - Robert W. Murphy
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, The Chinese Academy of Sciences, Kunming, 650223, Yunnan, China
- Centre for Biodiversity and Conservation Biology, Royal Ontario Museum, Toronto, M5S 2C6, Canada
| | - Wei Zhou
- School of Software, Yunnan University, Kunming, 650091, Yunnan, China
- * E-mail: (WZ); (JL)
| | - Jing Luo
- Key Laboratory for Animal Genetic Diversity and Evolution of High Education in Yunnan Province, School of Life Sciences, Yunnan University, Kunming, 650091, China
- State Key Laboratory for Conservation and Utilization of Bio-resource, Yunnan University, Kunming, 650091, Yunnan, China
- * E-mail: (WZ); (JL)
| |
Collapse
|
18
|
Buels R, Yao E, Diesh CM, Hayes RD, Munoz-Torres M, Helt G, Goodstein DM, Elsik CG, Lewis SE, Stein L, Holmes IH. JBrowse: a dynamic web platform for genome visualization and analysis. Genome Biol 2016; 17:66. [PMID: 27072794 PMCID: PMC4830012 DOI: 10.1186/s13059-016-0924-1] [Citation(s) in RCA: 519] [Impact Index Per Article: 57.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2016] [Accepted: 03/15/2016] [Indexed: 02/07/2023] Open
Abstract
Background JBrowse is a fast and full-featured genome browser built with JavaScript and HTML5. It is easily embedded into websites or apps but can also be served as a standalone web page. Results Overall improvements to speed and scalability are accompanied by specific enhancements that support complex interactive queries on large track sets. Analysis functions can readily be added using the plugin framework; most visual aspects of tracks can also be customized, along with clicks, mouseovers, menus, and popup boxes. JBrowse can also be used to browse local annotation files offline and to generate high-resolution figures for publication. Conclusions JBrowse is a mature web application suitable for genome visualization and analysis.
Collapse
Affiliation(s)
- Robert Buels
- Department of Bioengineering, University of California, Berkeley, California, USA
| | - Eric Yao
- Department of Bioengineering, University of California, Berkeley, California, USA
| | - Colin M Diesh
- Division of Animal Sciences, University of Missouri, Columbia, Missouri, USA
| | - Richard D Hayes
- Lawrence Berkeley National Laboratory, Berkeley, California, USA.,US Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | | | - Gregg Helt
- Lawrence Berkeley National Laboratory, Berkeley, California, USA.,Current affiliation: Genomancer Consulting, Healdsburg, California, USA
| | - David M Goodstein
- Lawrence Berkeley National Laboratory, Berkeley, California, USA.,US Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Christine G Elsik
- Division of Animal Sciences, University of Missouri, Columbia, Missouri, USA
| | - Suzanna E Lewis
- Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Lincoln Stein
- Ontario Institute of Cancer Research, Toronto, Ontario, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Ian H Holmes
- Department of Bioengineering, University of California, Berkeley, California, USA. .,Lawrence Berkeley National Laboratory, Berkeley, California, USA.
| |
Collapse
|
19
|
Cock PJA, Chilton JM, Grüning B, Johnson JE, Soranzo N. NCBI BLAST+ integrated into Galaxy. Gigascience 2015; 4:39. [PMID: 26336600 PMCID: PMC4557756 DOI: 10.1186/s13742-015-0080-7] [Citation(s) in RCA: 148] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2014] [Accepted: 08/18/2015] [Indexed: 01/29/2023] Open
Abstract
Background The NCBI BLAST suite has become ubiquitous in modern molecular biology and is used for small tasks such as checking capillary sequencing results of single PCR products, genome annotation or even larger scale pan-genome analyses. For early adopters of the Galaxy web-based biomedical data analysis platform, integrating BLAST into Galaxy was a natural step for sequence comparison workflows. Findings The command line NCBI BLAST+ tool suite was wrapped for use within Galaxy. Appropriate datatypes were defined as needed. The integration of the BLAST+ tool suite into Galaxy has the goal of making common BLAST tasks easy and advanced tasks possible. Conclusions This project is an informal international collaborative effort, and is deployed and used on Galaxy servers worldwide. Several examples of applications are described here.
Collapse
Affiliation(s)
- Peter J A Cock
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA Scotland UK
| | - John M Chilton
- Minnesota Supercomputing Institute, University of Minnesota, 599 Walter Library, 117 Pleasant St. SE, 55455 Minneapolis, MN USA
| | - Björn Grüning
- Department of Computer Science, Albert-Ludwigs-University of Freiburg, Georges-Köhler-Allee 106, Freiburg, 79110 Germany
| | - James E Johnson
- Minnesota Supercomputing Institute, University of Minnesota, 599 Walter Library, 117 Pleasant St. SE, 55455 Minneapolis, MN USA
| | | |
Collapse
|
20
|
Greineisen WE, Maaetoft-Udsen K, Speck M, Balajadia J, Shimoda LMN, Sung C, Turner H. Chronic Insulin Exposure Induces ER Stress and Lipid Body Accumulation in Mast Cells at the Expense of Their Secretory Degranulation Response. PLoS One 2015; 10:e0130198. [PMID: 26263026 PMCID: PMC4532411 DOI: 10.1371/journal.pone.0130198] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2014] [Accepted: 05/17/2015] [Indexed: 12/11/2022] Open
Abstract
Lipid bodies (LB) are reservoirs of precursors to inflammatory lipid mediators in immunocytes, including mast cells. LB numbers are dynamic, increasing dramatically under conditions of immunological challenge. We have previously shown in vitro that insulin-influenced lipogenic pathways induce LB biogenesis in mast cells, with their numbers attaining steatosis-like levels. Here, we demonstrate that in vivo hyperinsulinemia resulting from high fat diet is associated with LB accumulation in murine mast cells and basophils. We characterize the lipidome of purified insulin-induced LB, and the shifts in the whole cell lipid landscape in LB that are associated with their accumulation, in both model (RBL2H3) and primary mast cells. Lipidomic analysis suggests a gain of function associated with LB accumulation, in terms of elevated levels of eicosanoid precursors that translate to enhanced antigen-induced LTC4 release. Loss-of-function in terms of a suppressed degranulation response was also associated with LB accumulation, as were ER reprogramming and ER stress, analogous to observations in the obese hepatocyte and adipocyte. Taken together, these data suggest that chronic insulin elevation drives mast cell LB enrichment in vitro and in vivo, with associated effects on the cellular lipidome, ER status and pro-inflammatory responses.
Collapse
Affiliation(s)
- William E. Greineisen
- Laboratory of Immunology and Signal Transduction, Chaminade University, Honolulu, Hawaii, United States of America
| | - Kristina Maaetoft-Udsen
- Laboratory of Immunology and Signal Transduction, Chaminade University, Honolulu, Hawaii, United States of America
| | - Mark Speck
- Laboratory of Immunology and Signal Transduction, Chaminade University, Honolulu, Hawaii, United States of America
| | - Januaria Balajadia
- Laboratory of Immunology and Signal Transduction, Chaminade University, Honolulu, Hawaii, United States of America
| | - Lori M. N. Shimoda
- Laboratory of Immunology and Signal Transduction, Chaminade University, Honolulu, Hawaii, United States of America
| | - Carl Sung
- Laboratory of Immunology and Signal Transduction, Chaminade University, Honolulu, Hawaii, United States of America
| | - Helen Turner
- Laboratory of Immunology and Signal Transduction, Chaminade University, Honolulu, Hawaii, United States of America
- Department of Cell and Molecular Biology, John A. Burns School of Medicine, University of Hawaii, Honolulu, Hawaii, United States of America
- * E-mail:
| |
Collapse
|
21
|
Comparative, transcriptome analysis of self-organizing optic tissues. Sci Data 2015; 2:150030. [PMID: 26110066 PMCID: PMC4477696 DOI: 10.1038/sdata.2015.30] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2014] [Accepted: 05/14/2015] [Indexed: 01/03/2023] Open
Abstract
Embryonic stem (ES) cells have a remarkable capacity to self-organize complex, multi-layered optic cups in vitro via a culture technique called SFEBq. During both SFEBq and in vivo optic cup development, Rax (Rx) expressing neural retina epithelial (NRE) tissues utilize Fgf and Wnt/β-catenin signalling pathways to differentiate into neural retina (NR) and retinal-pigmented epithelial (RPE) tissues, respectively. How these signaling pathways affect gene expression during optic tissue formation has remained largely unknown, especially at the transcriptome scale. Here, we address this question using RNA-Seq. We generated Rx+ optic tissue using SFEBq, exposed these tissues to either Fgf or Wnt/β-catenin stimulation, and assayed their gene expression across multiple time points using RNA-Seq. This comparative dataset will help elucidate how Fgf and Wnt/β-catenin signaling affect gene expression during optic tissue differentiation and will help inform future efforts to optimize in vitro optic tissue culture technology.
Collapse
|
22
|
Trends in IT Innovation to Build a Next Generation Bioinformatics Solution to Manage and Analyse Biological Big Data Produced by NGS Technologies. BIOMED RESEARCH INTERNATIONAL 2015; 2015:904541. [PMID: 26125026 PMCID: PMC4466500 DOI: 10.1155/2015/904541] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2014] [Revised: 04/01/2015] [Accepted: 04/01/2015] [Indexed: 02/07/2023]
Abstract
Sequencing the human genome began in 1994, and 10 years of work were necessary in order to provide a nearly complete sequence. Nowadays, NGS technologies allow sequencing of a whole human genome in a few days. This deluge of data challenges scientists in many ways, as they are faced with data management issues and analysis and visualization drawbacks due to the limitations of current bioinformatics tools. In this paper, we describe how the NGS Big Data revolution changes the way of managing and analysing data. We present how biologists are confronted with abundance of methods, tools, and data formats. To overcome these problems, focus on Big Data Information Technology innovations from web and business intelligence. We underline the interest of NoSQL databases, which are much more efficient than relational databases. Since Big Data leads to the loss of interactivity with data during analysis due to high processing time, we describe solutions from the Business Intelligence that allow one to regain interactivity whatever the volume of data is. We illustrate this point with a focus on the Amadea platform. Finally, we discuss visualization challenges posed by Big Data and present the latest innovations with JavaScript graphic libraries.
Collapse
|
23
|
Bretaudeau A, Monjeaud C, Le Bras Y, Legeai F, Collin O. BioMAJ2Galaxy: automatic update of reference data in Galaxy using BioMAJ. Gigascience 2015; 4:22. [PMID: 25960870 PMCID: PMC4425870 DOI: 10.1186/s13742-015-0063-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2014] [Accepted: 04/22/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Many bioinformatics tools use reference data, such as genome assemblies or sequence databanks. Galaxy offers multiple ways to give access to this data through its web interface. However, the process of adding new reference data was customarily manual and time consuming, even more so when this data needed to be indexed in a variety of formats (e.g. Blast, Bowtie, BWA, or 2bit). BioMAJ is a widely used and stable software that is designed to automate the download and transformation of data from various sources. This data can be used directly from the command line, in more complex systems, such as Mobyle, or by using a REST API. FINDINGS To ease the process of giving access to reference data in Galaxy, we have developed the BioMAJ2Galaxy module, which enables the gap between BioMAJ and Galaxy to be bridged. With this module, it is now possible to configure BioMAJ to automatically download some reference data, to then convert and/or index it in various formats, and then make this data available in a Galaxy server using data libraries or data managers. CONCLUSIONS The developments presented in this paper allow us to integrate the reference data in Galaxy in an automatic, reliable, and diskspace-saving way. The code is freely available on the GenOuest GitHub account (https://github.com/genouest/biomaj2galaxy).
Collapse
Affiliation(s)
- Anthony Bretaudeau
- INRA, UMR Institut de Génétique, Environnement et Protection des Plantes (IGEPP), BioInformatics Platform for Agroecosystems Arthropods (BIPAA), Campus Beaulieu, Rennes, 35042 France ; INRIA, IRISA, GenOuest Core Facility, Campus de Beaulieu, Rennes, 35042 France
| | - Cyril Monjeaud
- INRIA, IRISA, GenOuest Core Facility, Campus de Beaulieu, Rennes, 35042 France
| | - Yvan Le Bras
- INRIA, IRISA, GenOuest Core Facility, Campus de Beaulieu, Rennes, 35042 France
| | - Fabrice Legeai
- INRA, UMR Institut de Génétique, Environnement et Protection des Plantes (IGEPP), BioInformatics Platform for Agroecosystems Arthropods (BIPAA), Campus Beaulieu, Rennes, 35042 France ; INRIA, IRISA, GenScale, Campus de Beaulieu, Rennes, 35042 France
| | - Olivier Collin
- INRIA, IRISA, GenOuest Core Facility, Campus de Beaulieu, Rennes, 35042 France
| |
Collapse
|
24
|
Goecks J, El-Rayes BF, Maithel SK, Khoury HJ, Taylor J, Rossi MR. Open pipelines for integrated tumor genome profiles reveal differences between pancreatic cancer tumors and cell lines. Cancer Med 2015; 4:392-403. [PMID: 25594743 PMCID: PMC4380965 DOI: 10.1002/cam4.360] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2014] [Revised: 07/22/2014] [Accepted: 08/21/2014] [Indexed: 01/06/2023] Open
Abstract
We describe open, reproducible pipelines that create an integrated genomic profile of a cancer and use the profile to find mutations associated with disease and potentially useful drugs. These pipelines analyze high-throughput cancer exome and transcriptome sequence data together with public databases to find relevant mutations and drugs. The three pipelines that we have developed are: (1) an exome analysis pipeline, which uses whole or targeted tumor exome sequence data to produce a list of putative variants (no matched normal data are needed); (2) a transcriptome analysis pipeline that processes whole tumor transcriptome sequence (RNA-seq) data to compute gene expression and find potential gene fusions; and (3) an integrated variant analysis pipeline that uses the tumor variants from the exome pipeline and tumor gene expression from the transcriptome pipeline to identify deleterious and druggable mutations in all genes and in highly expressed genes. These pipelines are integrated into the popular Web platform Galaxy at http://usegalaxy.org/cancer to make them accessible and reproducible, thereby providing an approach for doing standardized, distributed analyses in clinical studies. We have used our pipeline to identify similarities and differences between pancreatic adenocarcinoma cancer cell lines and primary tumors.
Collapse
Affiliation(s)
- Jeremy Goecks
- Computational Biology Institute, George Washington University, Ashburn, Virginia, 20147
| | | | | | | | | | | |
Collapse
|
25
|
Hiltemann S, Hoogstrate Y, der Spek PV, Jenster G, Stubbs A. iReport: a generalised Galaxy solution for integrated experimental reporting. Gigascience 2014; 3:19. [PMID: 25374662 PMCID: PMC4220639 DOI: 10.1186/2047-217x-3-19] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2014] [Accepted: 09/29/2014] [Indexed: 11/22/2022] Open
Abstract
Background Galaxy offers a number of visualisation options with components, such as Trackster, Circster and Galaxy Charts, but currently lacks the ability to easily combine outputs from different tools into a single view or report. A number of tools produce HTML reports as output in order to combine the various output files from a single tool; however, this requires programming and knowledge of HTML, and the reports must be custom-made for each new tool. Findings We have developed a generic and flexible reporting tool for Galaxy, iReport, that allows users to create interactive HTML reports directly from the Galaxy UI, with the ability to combine an arbitrary number of outputs from any number of different tools. Content can be organised into different tabs, and interactivity can be added to components. To demonstrate the capability of iReport we provide two publically available examples, the first is an iReport explaining about iReports, created for, and using content from the recent Galaxy Community Conference 2014. The second is a genetic report based on a trio analysis to determine candidate pathogenic variants which uses our previously developed Galaxy toolset for whole-genome NGS analysis, CGtag. These reports may be adapted for outputs from any sequencing platform and any results, such as omics data, non-high throughput results and clinical variables. Conclusions iReport provides a secure, collaborative, and flexible web-based reporting system that is compatible with Galaxy (and non-Galaxy) generated content. We demonstrate its value with a real-life example of reporting genetic trio-analysis.
Collapse
Affiliation(s)
- Saskia Hiltemann
- Department of Bioinformatics, Erasmus Medical Center, Wytemaweg 80, 3015 CN Rotterdam, The Netherlands ; Department of Urology, Erasmus Medical Center, Wytemaweg 80, 3015 CN Rotterdam, The Netherlands
| | - Youri Hoogstrate
- Department of Bioinformatics, Erasmus Medical Center, Wytemaweg 80, 3015 CN Rotterdam, The Netherlands ; Department of Urology, Erasmus Medical Center, Wytemaweg 80, 3015 CN Rotterdam, The Netherlands
| | - Peter van der Spek
- Department of Bioinformatics, Erasmus Medical Center, Wytemaweg 80, 3015 CN Rotterdam, The Netherlands
| | - Guido Jenster
- Department of Urology, Erasmus Medical Center, Wytemaweg 80, 3015 CN Rotterdam, The Netherlands
| | - Andrew Stubbs
- Department of Bioinformatics, Erasmus Medical Center, Wytemaweg 80, 3015 CN Rotterdam, The Netherlands
| |
Collapse
|
26
|
Epiviz: interactive visual analytics for functional genomics data. Nat Methods 2014; 11:938-40. [PMID: 25086505 PMCID: PMC4149593 DOI: 10.1038/nmeth.3038] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2014] [Accepted: 06/12/2014] [Indexed: 12/19/2022]
Abstract
Visualization is an integral aspect of genomics data analysis. Algorithmic-statistical analysis and interactive visualization are most effective when used iteratively. Epiviz (http://epiviz.cbcb.umd.edu/), a web-based genome browser, and the Epivizr Bioconductor package allow interactive, extensible and reproducible visualization within a state-of-the-art data-analysis platform.
Collapse
|
27
|
Crabtree J, Agrawal S, Mahurkar A, Myers GS, Rasko DA, White O. Circleator: flexible circular visualization of genome-associated data with BioPerl and SVG. ACTA ACUST UNITED AC 2014; 30:3125-7. [PMID: 25075113 PMCID: PMC4201160 DOI: 10.1093/bioinformatics/btu505] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Summary: Circleator is a Perl application that generates circular figures of genome-associated data. It leverages BioPerl to support standard annotation and sequence file formats and produces publication-quality SVG output. It is designed to be both flexible and easy to use. It includes a library of circular track types and predefined configuration files for common use-cases, including. (i) visualizing gene annotation and DNA sequence data from a GenBank flat file, (ii) displaying patterns of gene conservation in related microbial strains, (iii) showing Single Nucleotide Polymorphisms (SNPs) and indels relative to a reference genome and gene set and (iv) viewing RNA-Seq plots. Availability and implementation: Circleator is freely available under the Artistic License 2.0 from http://jonathancrabtree.github.io/Circleator/ and is integrated with the CloVR cloud-based sequence analysis Virtual Machine (VM), which can be downloaded from http://clovr.org or run on Amazon EC2. Contact:jcrabtree@som.umaryland.edu Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jonathan Crabtree
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, i3 Institute, University of Technology, Sydney, PO Box 123 Broadway NSW 2007, Australia, Department of Microbial Pathogenesis, University of Maryland Dental School, Baltimore, MD 21201, Center for Health-Related Informatics and Bioimaging, University of Maryland, College Park, MD 20740 and Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Sonia Agrawal
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, i3 Institute, University of Technology, Sydney, PO Box 123 Broadway NSW 2007, Australia, Department of Microbial Pathogenesis, University of Maryland Dental School, Baltimore, MD 21201, Center for Health-Related Informatics and Bioimaging, University of Maryland, College Park, MD 20740 and Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Anup Mahurkar
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, i3 Institute, University of Technology, Sydney, PO Box 123 Broadway NSW 2007, Australia, Department of Microbial Pathogenesis, University of Maryland Dental School, Baltimore, MD 21201, Center for Health-Related Informatics and Bioimaging, University of Maryland, College Park, MD 20740 and Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Garry S Myers
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, i3 Institute, University of Technology, Sydney, PO Box 123 Broadway NSW 2007, Australia, Department of Microbial Pathogenesis, University of Maryland Dental School, Baltimore, MD 21201, Center for Health-Related Informatics and Bioimaging, University of Maryland, College Park, MD 20740 and Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, MD 21201, USA Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, i3 Institute, University of Technology, Sydney, PO Box 123 Broadway NSW 2007, Australia, Department of Microbial Pathogenesis, University of Maryland Dental School, Baltimore, MD 21201, Center for Health-Related Informatics and Bioimaging, University of Maryland, College Park, MD 20740 and Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, MD 21201, USA Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, i3 Institute, University of Technology, Sydney, PO Box 123 Broadway NSW 2007, Australia, Department of Microbial Pathogenesis, University of Maryland Dental School, Baltimore, MD 21201, Center for Health-Related Informatics and Bioimaging, University of Maryland, College Park, MD 20740 and Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, MD 21201, USA Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, Department of Microbiology and Immunology, University of Maryland School of Medicine
| | - David A Rasko
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, i3 Institute, University of Technology, Sydney, PO Box 123 Broadway NSW 2007, Australia, Department of Microbial Pathogenesis, University of Maryland Dental School, Baltimore, MD 21201, Center for Health-Related Informatics and Bioimaging, University of Maryland, College Park, MD 20740 and Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, MD 21201, USA Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, i3 Institute, University of Technology, Sydney, PO Box 123 Broadway NSW 2007, Australia, Department of Microbial Pathogenesis, University of Maryland Dental School, Baltimore, MD 21201, Center for Health-Related Informatics and Bioimaging, University of Maryland, College Park, MD 20740 and Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, MD 21201, USA Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, i3 Institute, University of Technology, Sydney, PO Box 123 Broadway NSW 2007, Australia, Department of Microbial Pathogenesis, University of Maryland Dental School, Baltimore, MD 21201, Center for Health-Related Informatics and Bioimaging, University of Maryland, College Park, MD 20740 and Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Owen White
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, i3 Institute, University of Technology, Sydney, PO Box 123 Broadway NSW 2007, Australia, Department of Microbial Pathogenesis, University of Maryland Dental School, Baltimore, MD 21201, Center for Health-Related Informatics and Bioimaging, University of Maryland, College Park, MD 20740 and Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, MD 21201, USA Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, i3 Institute, University of Technology, Sydney, PO Box 123 Broadway NSW 2007, Australia, Department of Microbial Pathogenesis, University of Maryland Dental School, Baltimore, MD 21201, Center for Health-Related Informatics and Bioimaging, University of Maryland, College Park, MD 20740 and Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, MD 21201, USA Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, i3 Institute, University of Technology, Sydney, PO Box 123 Broadway NSW 2007, Australia, Department of Microbial Pathogenesis, University of Maryland Dental School, Baltimore, MD 21201, Center for Health-Related Informatics and Bioimaging, University of Maryland, College Park, MD 20740 and Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| |
Collapse
|
28
|
Farrell A, Coleman BI, Benenati B, Brown KM, Blader IJ, Marth GT, Gubbels MJ. Whole genome profiling of spontaneous and chemically induced mutations in Toxoplasma gondii. BMC Genomics 2014; 15:354. [PMID: 24885922 PMCID: PMC4035079 DOI: 10.1186/1471-2164-15-354] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2013] [Accepted: 05/02/2014] [Indexed: 12/18/2022] Open
Abstract
Background Next generation sequencing is helping to overcome limitations in organisms less accessible to classical or reverse genetic methods by facilitating whole genome mutational analysis studies. One traditionally intractable group, the Apicomplexa, contains several important pathogenic protozoan parasites, including the Plasmodium species that cause malaria. Here we apply whole genome analysis methods to the relatively accessible model apicomplexan, Toxoplasma gondii, to optimize forward genetic methods for chemical mutagenesis using N-ethyl-N-nitrosourea (ENU) and ethylmethane sulfonate (EMS) at varying dosages. Results By comparing three different lab-strains we show that spontaneously generated mutations reflect genome composition, without nucleotide bias. However, the single nucleotide variations (SNVs) are not distributed randomly over the genome; most of these mutations reside either in non-coding sequence or are silent with respect to protein coding. This is in contrast to the random genomic distribution of mutations induced by chemical mutagenesis. Additionally, we report a genome wide transition vs transversion ratio (ti/tv) of 0.91 for spontaneous mutations in Toxoplasma, with a slightly higher rate of 1.20 and 1.06 for variants induced by ENU and EMS respectively. We also show that in the Toxoplasma system, surprisingly, both ENU and EMS have a proclivity for inducing mutations at A/T base pairs (78.6% and 69.6%, respectively). Conclusions The number of SNVs between related laboratory strains is relatively low and managed by purifying selection away from changes to amino acid sequence. From an experimental mutagenesis point of view, both ENU (24.7%) and EMS (29.1%) are more likely to generate variation within exons than would naturally accumulate over time in culture (19.1%), demonstrating the utility of these approaches for yielding proportionally greater changes to the amino acid sequence. These results will not only direct the methods of future chemical mutagenesis in Toxoplasma, but also aid in designing forward genetic approaches in less accessible pathogenic protozoa as well. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-354) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Marc-Jan Gubbels
- Department of Biology, Boston College, Higgins Hall 355, 140 Commonwealth Avenue, Chestnut Hill, MA 02467, USA.
| |
Collapse
|
29
|
Nguyen QV, Nelmes G, Huang ML, Simoff S, Catchpoole D. Interactive Visualization for Patient-to-Patient Comparison. Genomics Inform 2014; 12:21-34. [PMID: 24748858 PMCID: PMC3990763 DOI: 10.5808/gi.2014.12.1.21] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Revised: 02/19/2014] [Accepted: 02/20/2014] [Indexed: 12/20/2022] Open
Abstract
A visual analysis approach and the developed supporting technology provide a comprehensive solution for analyzing large and complex integrated genomic and biomedical data. This paper presents a methodology that is implemented as an interactive visual analysis technology for extracting knowledge from complex genetic and clinical data and then visualizing it in a meaningful and interpretable way. By synergizing the domain knowledge into development and analysis processes, we have developed a comprehensive tool that supports a seamless patient-to-patient analysis, from an overview of the patient population in the similarity space to the detailed views of genes. The system consists of multiple components enabling the complete analysis process, including data mining, interactive visualization, analytical views, and gene comparison. We demonstrate our approach with medical scientists on a case study of childhood cancer patients on how they use the tool to confirm existing hypotheses and to discover new scientific insights.
Collapse
Affiliation(s)
- Quang Vinh Nguyen
- MARCS Institute & School of Computing, Engineering and Mathematics, University of Western Sydney, South Penrith DC, NSW 1979, Australia
| | - Guy Nelmes
- The Kids Research Institute, The Children's Hospital at Westmead, Westmead, NSW 2145, Australia
| | - Mao Lin Huang
- School of Software, Faculty of Engineering & IT, University of Technology, Sydney, NSW 2007, Australia
| | - Simeon Simoff
- MARCS Institute & School of Computing, Engineering and Mathematics, University of Western Sydney, South Penrith DC, NSW 1979, Australia
| | - Daniel Catchpoole
- The Kids Research Institute, The Children's Hospital at Westmead, Westmead, NSW 2145, Australia
| |
Collapse
|
30
|
Mader M, Simon R, Kurtz S. FISH Oracle 2: a web server for integrative visualization of genomic data in cancer research. J Clin Bioinforma 2014; 4:5. [PMID: 24684958 PMCID: PMC4230720 DOI: 10.1186/2043-9113-4-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2013] [Accepted: 03/26/2014] [Indexed: 01/21/2023] Open
Abstract
Background A comprehensive view on all relevant genomic data is instrumental for understanding the complex patterns of molecular alterations typically found in cancer cells. One of the most effective ways to rapidly obtain an overview of genomic alterations in large amounts of genomic data is the integrative visualization of genomic events. Results We developed FISH Oracle 2, a web server for the interactive visualization of different kinds of downstream processed genomics data typically available in cancer research. A powerful search interface and a fast visualization engine provide a highly interactive visualization for such data. High quality image export enables the life scientist to easily communicate their results. A comprehensive data administration allows to keep track of the available data sets. We applied FISH Oracle 2 to published data and found evidence that, in colorectal cancer cells, the gene TTC28 may be inactivated in two different ways, a fact that has not been published before. Conclusions The interactive nature of FISH Oracle 2 and the possibility to store, select and visualize large amounts of downstream processed data support life scientists in generating hypotheses. The export of high quality images supports explanatory data visualization, simplifying the communication of new biological findings. A FISH Oracle 2 demo server and the software is available at
http://www.zbh.uni-hamburg.de/fishoracle.
Collapse
Affiliation(s)
| | | | - Stefan Kurtz
- Center for Bioinformatics, University of Hamburg, Bundesstrasse 43, 20146 Hamburg, Germany.
| |
Collapse
|
31
|
Lin J, Kreisberg R, Kallio A, Dudley AM, Nykter M, Shmulevich I, May P, Autio R. POMO--Plotting Omics analysis results for Multiple Organisms. BMC Genomics 2013; 14:918. [PMID: 24365393 PMCID: PMC3880012 DOI: 10.1186/1471-2164-14-918] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2013] [Accepted: 12/18/2013] [Indexed: 12/15/2022] Open
Abstract
Background Systems biology experiments studying different topics and organisms produce thousands of data values across different types of genomic data. Further, data mining analyses are yielding ranked and heterogeneous results and association networks distributed over the entire genome. The visualization of these results is often difficult and standalone web tools allowing for custom inputs and dynamic filtering are limited. Results We have developed POMO (http://pomo.cs.tut.fi), an interactive web-based application to visually explore omics data analysis results and associations in circular, network and grid views. The circular graph represents the chromosome lengths as perimeter segments, as a reference outer ring, such as cytoband for human. The inner arcs between nodes represent the uploaded network. Further, multiple annotation rings, for example depiction of gene copy number changes, can be uploaded as text files and represented as bar, histogram or heatmap rings. POMO has built-in references for human, mouse, nematode, fly, yeast, zebrafish, rice, tomato, Arabidopsis, and Escherichia coli. In addition, POMO provides custom options that allow integrated plotting of unsupported strains or closely related species associations, such as human and mouse orthologs or two yeast wild types, studied together within a single analysis. The web application also supports interactive label and weight filtering. Every iterative filtered result in POMO can be exported as image file and text file for sharing or direct future input. Conclusions The POMO web application is a unique tool for omics data analysis, which can be used to visualize and filter the genome-wide networks in the context of chromosomal locations as well as multiple network layouts. With the several illustration and filtering options the tool supports the analysis and visualization of any heterogeneous omics data analysis association results for many organisms. POMO is freely available and does not require any installation or registration.
Collapse
Affiliation(s)
- Jake Lin
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Luxembourg, Luxembourg.
| | | | | | | | | | | | | | | |
Collapse
|
32
|
Feehery GR, Yigit E, Oyola SO, Langhorst BW, Schmidt VT, Stewart FJ, Dimalanta ET, Amaral-Zettler LA, Davis T, Quail MA, Pradhan S. A method for selectively enriching microbial DNA from contaminating vertebrate host DNA. PLoS One 2013; 8:e76096. [PMID: 24204593 PMCID: PMC3810253 DOI: 10.1371/journal.pone.0076096] [Citation(s) in RCA: 129] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2013] [Accepted: 08/20/2013] [Indexed: 12/05/2022] Open
Abstract
DNA samples derived from vertebrate skin, bodily cavities and body fluids contain both host and microbial DNA; the latter often present as a minor component. Consequently, DNA sequencing of a microbiome sample frequently yields reads originating from the microbe(s) of interest, but with a vast excess of host genome-derived reads. In this study, we used a methyl-CpG binding domain (MBD) to separate methylated host DNA from microbial DNA based on differences in CpG methylation density. MBD fused to the Fc region of a human antibody (MBD-Fc) binds strongly to protein A paramagnetic beads, forming an effective one-step enrichment complex that was used to remove human or fish host DNA from bacterial and protistan DNA for subsequent sequencing and analysis. We report enrichment of DNA samples from human saliva, human blood, a mock malaria-infected blood sample and a black molly fish. When reads were mapped to reference genomes, sequence reads aligning to host genomes decreased 50-fold, while bacterial and Plasmodium DNA sequences reads increased 8-11.5-fold. The Shannon-Wiener diversity index was calculated for 149 bacterial species in saliva before and after enrichment. Unenriched saliva had an index of 4.72, while the enriched sample had an index of 4.80. The similarity of these indices demonstrates that bacterial species diversity and relative phylotype abundance remain conserved in enriched samples. Enrichment using the MBD-Fc method holds promise for targeted microbiome sequence analysis across a broad range of sample types.
Collapse
Affiliation(s)
- George R. Feehery
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | - Erbay Yigit
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | | | | | - Victor T. Schmidt
- The Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, Massachusetts, United States of America
- Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island, United States of America
| | - Fiona J. Stewart
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | | | - Linda A. Amaral-Zettler
- The Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, Massachusetts, United States of America
- Department of Geological Sciences, Brown University, Providence, Rhode Island, United States of America
| | - Theodore Davis
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| | | | - Sriharsa Pradhan
- New England Biolabs Inc., Ipswich, Massachusetts, United States of America
| |
Collapse
|
33
|
Abstract
Over the last ten years, genome sequencing capabilities have expanded exponentially. There have been tremendous advances in sequencing technology, DNA sample preparation, genome assembly, and data analysis. This has led to advances in a number of facets of bacterial genomics, including metagenomics, clinical medicine, bacterial archaeology, and bacterial evolution. This review examines the strengths and weaknesses of techniques in bacterial genome sequencing, upcoming technologies, and assembly techniques, as well as highlighting recent studies that highlight new applications for bacterial genomics.
Collapse
Affiliation(s)
- Michael J Dark
- Department of Infectious Diseases and Pathology and Emerging Pathogens Institute, University of Florida, Gainesville, FL, USA
| |
Collapse
|