101
|
Phelan JE, Coll F, Bergval I, Anthony RM, Warren R, Sampson SL, Gey van Pittius NC, Glynn JR, Crampin AC, Alves A, Bessa TB, Campino S, Dheda K, Grandjean L, Hasan R, Hasan Z, Miranda A, Moore D, Panaiotov S, Perdigao J, Portugal I, Sheen P, de Oliveira Sousa E, Streicher EM, van Helden PD, Viveiros M, Hibberd ML, Pain A, McNerney R, Clark TG. Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages. BMC Genomics 2016; 17:151. [PMID: 26923687 PMCID: PMC4770551 DOI: 10.1186/s12864-016-2467-y] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2015] [Accepted: 02/12/2016] [Indexed: 11/23/2022] Open
Abstract
Background Approximately 10 % of the Mycobacterium tuberculosis genome is made up of two families of genes that are poorly characterized due to their high GC content and highly repetitive nature. The PE and PPE families are typified by their highly conserved N-terminal domains that incorporate proline-glutamate (PE) and proline-proline-glutamate (PPE) signature motifs. They are hypothesised to be important virulence factors involved with host-pathogen interactions, but their high genetic variability and complexity of analysis means they are typically disregarded in genome studies. Results To elucidate the structure of these genes, 518 genomes from a diverse international collection of clinical isolates were de novo assembled. A further 21 reference M. tuberculosis complex genomes and long read sequence data were used to validate the approach. SNP analysis revealed that variation in the majority of the 168 pe/ppe genes studied was consistent with lineage. Several recombination hotspots were identified, notably pe_pgrs3 and pe_pgrs17. Evidence of positive selection was revealed in 65 pe/ppe genes, including epitopes potentially binding to major histocompatibility complex molecules. Conclusions This, the first comprehensive study of the pe and ppe genes, provides important insight into M. tuberculosis diversity and has significant implications for vaccine development. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2467-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jody E Phelan
- Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, WC1E 7HT, London, UK.
| | - Francesc Coll
- Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, WC1E 7HT, London, UK.
| | - Indra Bergval
- KIT Biomedical Research, Royal Tropical Institute, Amsterdam, Netherlands.
| | - Richard M Anthony
- KIT Biomedical Research, Royal Tropical Institute, Amsterdam, Netherlands.
| | - Rob Warren
- Department of Science and Technology and National Research Foundation Centre of Excellence for Biomedical Tuberculosis Research, and Medical Research Council Centre for Molecular and Cellular Biology, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Tygerberg, South Africa.
| | - Samantha L Sampson
- Department of Science and Technology and National Research Foundation Centre of Excellence for Biomedical Tuberculosis Research, and Medical Research Council Centre for Molecular and Cellular Biology, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Tygerberg, South Africa.
| | - Nicolaas C Gey van Pittius
- Department of Science and Technology and National Research Foundation Centre of Excellence for Biomedical Tuberculosis Research, and Medical Research Council Centre for Molecular and Cellular Biology, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Tygerberg, South Africa.
| | - Judith R Glynn
- Faculty of Epidemiology and Population Health, London School of Hygiene & Tropical Medicine, WC1E 7HT, London, UK.
| | - Amelia C Crampin
- Faculty of Epidemiology and Population Health, London School of Hygiene & Tropical Medicine, WC1E 7HT, London, UK. .,Karonga Prevention Study, Lilongwe, Malawi.
| | - Adriana Alves
- National Mycobacterium Reference Laboratory, Porto, Portugal.
| | - Theolis Barbosa Bessa
- Centro de Pesquisas Goncalo Moniz, Fundacao Oswaldo Cruz Bahia R, Salvador, Bahia, Brazil.
| | - Susana Campino
- Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, WC1E 7HT, London, UK.
| | - Keertan Dheda
- Department of Medicine, Lung Infection and Immunity Unit, Division of Pulmonology & UCT Lung Institute, University of Cape Town, Cape Town, Western Cape, South Africa. .,Institute of Infectious Diseases and Molecular Medicine, University of Cape Town, Cape Town, Western Cape, South Africa.
| | - Louis Grandjean
- Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, WC1E 7HT, London, UK. .,Laboratorio de Enfermedades Infecciosas, Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, Peru.
| | - Rumina Hasan
- Department of Pathology and Laboratory Medicine, The Aga Khan University, Stadium Road, Karachi, Pakistan.
| | - Zahra Hasan
- Department of Pathology and Laboratory Medicine, The Aga Khan University, Stadium Road, Karachi, Pakistan.
| | - Anabela Miranda
- National Mycobacterium Reference Laboratory, Porto, Portugal.
| | - David Moore
- Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, WC1E 7HT, London, UK.
| | - Stefan Panaiotov
- National Center of Infectious and Parasitic Diseases, 1504, Sofia, Bulgaria.
| | | | | | - Patricia Sheen
- Laboratorio de Enfermedades Infecciosas, Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, Peru.
| | | | - Elizabeth M Streicher
- Department of Science and Technology and National Research Foundation Centre of Excellence for Biomedical Tuberculosis Research, and Medical Research Council Centre for Molecular and Cellular Biology, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Tygerberg, South Africa.
| | - Paul D van Helden
- Department of Science and Technology and National Research Foundation Centre of Excellence for Biomedical Tuberculosis Research, and Medical Research Council Centre for Molecular and Cellular Biology, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Tygerberg, South Africa.
| | - Miguel Viveiros
- Grupo de Micobactérias, Unidade de Microbiologia Médica, Global Health and Tropical Medicine (GHTM), Instituto de Higiene e Medicina Tropical, Universidade NOVA de Lisboa (IHMT/UNL), Lisbon, Portugal.
| | - Martin L Hibberd
- Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, WC1E 7HT, London, UK.
| | - Arnab Pain
- Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Kingdom of Saudi Arabia.
| | - Ruth McNerney
- Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, WC1E 7HT, London, UK.
| | - Taane G Clark
- Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, WC1E 7HT, London, UK. .,Faculty of Epidemiology and Population Health, London School of Hygiene & Tropical Medicine, WC1E 7HT, London, UK.
| |
Collapse
|
102
|
Kwak MJ, Kim MS, Kwon SK, Cho SH, Kim JF. Genome sequence of Escherichia coli NCCP15653, a group D strain isolated from a diarrhea patient. Gut Pathog 2016; 8:7. [PMID: 26913081 PMCID: PMC4765171 DOI: 10.1186/s13099-016-0084-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Accepted: 01/04/2016] [Indexed: 11/22/2022] Open
Abstract
Background Pathogenic strains in Escherichia coli can be divided into several pathotypes according to their virulence features. Among them, uropathogenic E. coli causes most of the urinary tract infections and has a genotype distinct from other virulent strains of E. coli. In this study, we sequenced and analyzed the genome of E. coli NCCP15653 isolated from the feces of a diarrhea patient in 2007 in South Korea. Results A phylogenetic tree based on MLST showed that NCCP15653 belongs to the D group of E. coli and located in the lineage containing strains ST2747, UMN026 and 042. In the genome of NCCP15653, genes encoding major virulence factors of uropathogenic E. coli were detected. They include type I fimbriae, hemin uptake proteins, iron/manganese transport proteins, yersiniabactin siderophore proteins, type VI secretion proteins, and hemolysin. On the other hand, genes encoding AslA, OmpA, and the K1 capsule, which are virulence factors associated with invasion of neonatal meningitis-causing E. coli, were also present, while a gene encoding CNF-1 protein, which is a cytotoxic necrotizing factor 1, was not detected. Conclusions Through the genome analysis of NCCP15653, we report an example of a genome of chimeric pathogenic properties. The gene content of NCCP15653, a group D strain, demonstrates that it could be both uropathogenic E. coli and neonatal meningitis-causing E. coli. Our results suggest the dynamic nature of plastic genomes in pathogenic strains of E. coli.
Collapse
Affiliation(s)
- Min-Jung Kwak
- Department of Systems Biology and Division of Life Sciences, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul, 120-749 Republic of Korea
| | - Myung-Soo Kim
- Department of Systems Biology and Division of Life Sciences, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul, 120-749 Republic of Korea
| | - Soon-Kyeong Kwon
- Department of Systems Biology and Division of Life Sciences, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul, 120-749 Republic of Korea
| | - Seung-Hak Cho
- Division of Enteric Diseases, Center for Infectious Diseases, Korea National Institute of Health, Heungdeok-Gu, Cheongju, 363-951 Republic of Korea
| | - Jihyun F Kim
- Department of Systems Biology and Division of Life Sciences, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul, 120-749 Republic of Korea
| |
Collapse
|
103
|
Cha JY, Han S, Hong HJ, Cho H, Kim D, Kwon Y, Kwon SK, Crüsemann M, Bok Lee Y, Kim JF, Giaever G, Nislow C, Moore BS, Thomashow LS, Weller DM, Kwak YS. Microbial and biochemical basis of a Fusarium wilt-suppressive soil. THE ISME JOURNAL 2016; 10:119-29. [PMID: 26057845 PMCID: PMC4681868 DOI: 10.1038/ismej.2015.95] [Citation(s) in RCA: 208] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2014] [Revised: 04/26/2015] [Accepted: 05/03/2015] [Indexed: 01/21/2023]
Abstract
Crops lack genetic resistance to most necrotrophic pathogens. To compensate for this disadvantage, plants recruit antagonistic members of the soil microbiome to defend their roots against pathogens and other pests. The best examples of this microbially based defense of roots are observed in disease-suppressive soils in which suppressiveness is induced by continuously growing crops that are susceptible to a pathogen, but the molecular basis of most is poorly understood. Here we report the microbial characterization of a Korean soil with specific suppressiveness to Fusarium wilt of strawberry. In this soil, an attack on strawberry roots by Fusarium oxysporum results in a response by microbial defenders, of which members of the Actinobacteria appear to have a key role. We also identify Streptomyces genes responsible for the ribosomal synthesis of a novel heat-stable antifungal thiopeptide antibiotic inhibitory to F. oxysporum and the antibiotic's mode of action against fungal cell wall biosynthesis. Both classical- and community-oriented approaches were required to dissect this suppressive soil from the field to the molecular level, and the results highlight the role of natural antibiotics as weapons in the microbial warfare in the rhizosphere that is integral to plant health, vigor and development.
Collapse
Affiliation(s)
- Jae-Yul Cha
- IALS and Department of Plant Medicine, Gyeongsang National University, Jinju, Republic of Korea
| | - Sangjo Han
- Bioinformatics Tech Lab, SK Telecom, Sungnam, Republic of Korea
| | - Hee-Jeon Hong
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Hyunji Cho
- RILS and Division of Applied Life Science, Gyeongsang National University, Jinju, Republic of Korea
| | - Daran Kim
- IALS and Department of Plant Medicine, Gyeongsang National University, Jinju, Republic of Korea
| | - Youngho Kwon
- IALS and Department of Plant Medicine, Gyeongsang National University, Jinju, Republic of Korea
| | - Soon-Kyeong Kwon
- Department of Systems Biology and Division of Life Sciences, Yonsei University, Seoul, Republic of Korea
| | - Max Crüsemann
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Yong Bok Lee
- RILS and Division of Applied Life Science, Gyeongsang National University, Jinju, Republic of Korea
| | - Jihyun F Kim
- Department of Systems Biology and Division of Life Sciences, Yonsei University, Seoul, Republic of Korea
| | - Guri Giaever
- Pharmaceutical Sciences, University of British Columbia, Vancouver, British Columbia, Canada
| | - Corey Nislow
- Pharmaceutical Sciences, University of British Columbia, Vancouver, British Columbia, Canada
| | - Bradley S Moore
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Linda S Thomashow
- US Department of Agriculture, Agricultural Research Service, Root Disease and Biological Control Research Unit, Pullman, WA, USA
| | - David M Weller
- US Department of Agriculture, Agricultural Research Service, Root Disease and Biological Control Research Unit, Pullman, WA, USA
| | - Youn-Sig Kwak
- IALS and Department of Plant Medicine, Gyeongsang National University, Jinju, Republic of Korea
- RILS and Division of Applied Life Science, Gyeongsang National University, Jinju, Republic of Korea
| |
Collapse
|
104
|
Kinetoplastid Phylogenomics Reveals the Evolutionary Innovations Associated with the Origins of Parasitism. Curr Biol 2015; 26:161-172. [PMID: 26725202 PMCID: PMC4728078 DOI: 10.1016/j.cub.2015.11.055] [Citation(s) in RCA: 112] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2015] [Revised: 11/17/2015] [Accepted: 11/19/2015] [Indexed: 01/01/2023]
Abstract
The evolution of parasitism is a recurrent event in the history of life and a core problem in evolutionary biology. Trypanosomatids are important parasites and include the human pathogens Trypanosoma brucei, Trypanosoma cruzi, and Leishmania spp., which in humans cause African trypanosomiasis, Chagas disease, and leishmaniasis, respectively. Genome comparison between trypanosomatids reveals that these parasites have evolved specialized cell-surface protein families, overlaid on a well-conserved cell template. Understanding how these features evolved and which ones are specifically associated with parasitism requires comparison with related non-parasites. We have produced genome sequences for Bodo saltans, the closest known non-parasitic relative of trypanosomatids, and a second bodonid, Trypanoplasma borreli. Here we show how genomic reduction and innovation contributed to the character of trypanosomatid genomes. We show that gene loss has “streamlined” trypanosomatid genomes, particularly with respect to macromolecular degradation and ion transport, but consistent with a widespread loss of functional redundancy, while adaptive radiations of gene families involved in membrane function provide the principal innovations in trypanosomatid evolution. Gene gain and loss continued during trypanosomatid diversification, resulting in the asymmetric assortment of ancestral characters such as peptidases between Trypanosoma and Leishmania, genomic differences that were subsequently amplified by lineage-specific innovations after divergence. Finally, we show how species-specific, cell-surface gene families (DGF-1 and PSA) with no apparent structural similarity are independent derivations of a common ancestral form, which we call “bodonin.” This new evidence defines the parasitic innovations of trypanosomatid genomes, revealing how a free-living phagotroph became adapted to exploiting hostile host environments. The Bodo saltans genome reveals evolutionary changes at the origin of parasitism Parasite genomes are streamlined, consistent with a loss of functional redundancy Expanded parasite transporter genes reflect a reorientation of membrane function Non-homologous, parasite cell-surface proteins evolved from a common ancestor
Collapse
|
105
|
Kwak MJ, Kwon SK, Yoon JH, Kim JF. Genome sequence of Lysobacter dokdonensis DS-58(T), a gliding bacterium isolated from soil in Dokdo, Korea. Stand Genomic Sci 2015; 10:123. [PMID: 26664701 PMCID: PMC4673725 DOI: 10.1186/s40793-015-0116-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2015] [Accepted: 11/25/2015] [Indexed: 11/10/2022] Open
Abstract
Lysobacter dokdonensis DS-58, belonging to the family Xanthomonadaceae, was isolated from a soil sample in Dokdo, Korea in 2011. Strain DS-58 is the type strain of L. dokdonensis. In this study, we determined the genome sequence to describe the genomic features including annotation information and COG functional categorization. The draft genome sequence consists of 25 contigs totaling 3,274,406 bp (67.24 % G + C) and contains 3,155 protein coding genes, 2 copies of ribosomal RNA operons, and 48 transfer RNA genes. Among the protein coding genes, 75.91 % of the genes were annotated with a putative function and 87.39 % of the genes were assigned to the COG category. In the genome of L. dokdonensis, a large number of genes associated with protein degradation and antibiotic resistance were detected.
Collapse
Affiliation(s)
- Min-Jung Kwak
- Department of Systems Biology and Division of Life Sciences, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 120-749 Republic of Korea
| | - Soon-Kyeong Kwon
- Department of Systems Biology and Division of Life Sciences, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 120-749 Republic of Korea
| | - Jung-Hoon Yoon
- Department of Food Science and Biotechnology, Sungkyunkwan University, Suwon, Republic of Korea
| | - Jihyun F Kim
- Department of Systems Biology and Division of Life Sciences, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 120-749 Republic of Korea
| |
Collapse
|
106
|
Chan CL, Yew SM, Ngeow YF, Na SL, Lee KW, Hoh CC, Yee WY, Ng KP. Genome analysis of Daldinia eschscholtzii strains UM 1400 and UM 1020, wood-decaying fungi isolated from human hosts. BMC Genomics 2015; 16:966. [PMID: 26581579 PMCID: PMC4650942 DOI: 10.1186/s12864-015-2200-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2014] [Accepted: 11/10/2015] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Daldinia eschscholtzii is a wood-inhabiting fungus that causes wood decay under certain conditions. It has a broad host range and produces a large repertoire of potentially bioactive compounds. However, there is no extensive genome analysis on this fungal species. RESULTS Two fungal isolates (UM 1400 and UM 1020) from human specimens were identified as Daldinia eschscholtzii by morphological features and ITS-based phylogenetic analysis. Both genomes were similar in size with 10,822 predicted genes in UM 1400 (35.8 Mb) and 11,120 predicted genes in UM 1020 (35.5 Mb). A total of 751 gene families were shared among both UM isolates, including gene families associated with fungus-host interactions. In the CAZyme comparative analysis, both genomes were found to contain arrays of CAZyme related to plant cell wall degradation. Genes encoding secreted peptidases were found in the genomes, which encode for the peptidases involved in the degradation of structural proteins in plant cell wall. In addition, arrays of secondary metabolite backbone genes were identified in both genomes, indicating of their potential to produce bioactive secondary metabolites. Both genomes also contained an abundance of gene encoding signaling components, with three proposed MAPK cascades involved in cell wall integrity, osmoregulation, and mating/filamentation. Besides genomic evidence for degrading capability, both isolates also harbored an array of genes encoding stress response proteins that are potentially significant for adaptation to living in the hostile environments. CONCLUSIONS Our genomic studies provide further information for the biological understanding of the D. eschscholtzii and suggest that these wood-decaying fungi are also equipped for adaptation to adverse environments in the human host.
Collapse
Affiliation(s)
- Chai Ling Chan
- Department of Medical Microbiology, Faculty of Medicine, University of Malaya, 50603, Kuala Lumpur, Malaysia.
| | - Su Mei Yew
- Department of Medical Microbiology, Faculty of Medicine, University of Malaya, 50603, Kuala Lumpur, Malaysia.
| | - Yun Fong Ngeow
- Department of Pre-Clinical Sciences, Faculty of Medicine and Health Sciences, University Tunku Abdul Rahman, Bandar Sungai Long, 43000, Kajang, Selangor Darul Ehsan, Malaysia.
| | - Shiang Ling Na
- Department of Medical Microbiology, Faculty of Medicine, University of Malaya, 50603, Kuala Lumpur, Malaysia.
| | - Kok Wei Lee
- Codon Genomics S/B, No 26, Jalan Dutamas 7, Taman Dutamas, Balakong, Seri Kembangan, 43200, Selangor Darul Ehsan, Malaysia.
| | - Chee-Choong Hoh
- Codon Genomics S/B, No 26, Jalan Dutamas 7, Taman Dutamas, Balakong, Seri Kembangan, 43200, Selangor Darul Ehsan, Malaysia.
| | - Wai-Yan Yee
- Codon Genomics S/B, No 26, Jalan Dutamas 7, Taman Dutamas, Balakong, Seri Kembangan, 43200, Selangor Darul Ehsan, Malaysia.
| | - Kee Peng Ng
- Department of Medical Microbiology, Faculty of Medicine, University of Malaya, 50603, Kuala Lumpur, Malaysia.
| |
Collapse
|
107
|
Isaza JP, Galván AL, Polanco V, Huang B, Matveyev AV, Serrano MG, Manque P, Buck GA, Alzate JF. Revisiting the reference genomes of human pathogenic Cryptosporidium species: reannotation of C. parvum Iowa and a new C. hominis reference. Sci Rep 2015; 5:16324. [PMID: 26549794 PMCID: PMC4637869 DOI: 10.1038/srep16324] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2015] [Accepted: 10/08/2015] [Indexed: 11/09/2022] Open
Abstract
Cryptosporidium parvum and C. hominis are the most relevant species of this genus for human health. Both cause a self-limiting diarrhea in immunocompetent individuals, but cause potentially life-threatening disease in the immunocompromised. Despite the importance of these pathogens, only one reference genome of each has been analyzed and published. These two reference genomes were sequenced using automated capillary sequencing; as of yet, no next generation sequencing technology has been applied to improve their assemblies and annotations. For C. hominis, the main challenge that prevents a larger number of genomes to be sequenced is its resistance to axenic culture. In the present study, we employed next generation technology to analyse the genomic DNA and RNA to generate a new reference genome sequence of a C. hominis strain isolated directly from human stool and a new genome annotation of the C. parvum Iowa reference genome.
Collapse
Affiliation(s)
- Juan P Isaza
- Grupo de Parasitología, Facultad de Medicina, Universidad de Antioquia Carrera 53 No. 61-30, Medellin, Antioquia 05001, Colombia.,Centro Nacional de Secuenciación Genómica-CNSG, Universidad de Antioquia Carrera 53 No. 61-30, Medellin, Antioquia 05001, Colombia
| | - Ana Luz Galván
- Grupo de Parasitología, Facultad de Medicina, Universidad de Antioquia Carrera 53 No. 61-30, Medellin, Antioquia 05001, Colombia
| | - Victor Polanco
- Universidad Mayor de Chile-Centro de Genómica y Bioinformatica Camino La piramide 5750 Huechuraba, Santiago de Chile, 8580000, Chile
| | - Bernice Huang
- Virginia Commonwealth University - Center for the Study of Biological Complexity 1101 E. Marshall St., Virginia 23298-0678, US
| | - Andrey V Matveyev
- Virginia Commonwealth University - Center for the Study of Biological Complexity 1101 E. Marshall St., Virginia 23298-0678, US
| | - Myrna G Serrano
- Virginia Commonwealth University - Center for the Study of Biological Complexity 1101 E. Marshall St., Virginia 23298-0678, US
| | - Patricio Manque
- Universidad Mayor de Chile-Centro de Genómica y Bioinformatica Camino La piramide 5750 Huechuraba, Santiago de Chile, 8580000, Chile
| | - Gregory A Buck
- Virginia Commonwealth University - Center for the Study of Biological Complexity 1101 E. Marshall St., Virginia 23298-0678, US
| | - Juan F Alzate
- Grupo de Parasitología, Facultad de Medicina, Universidad de Antioquia Carrera 53 No. 61-30, Medellin, Antioquia 05001, Colombia.,Centro Nacional de Secuenciación Genómica-CNSG, Universidad de Antioquia Carrera 53 No. 61-30, Medellin, Antioquia 05001, Colombia
| |
Collapse
|
108
|
Pightling AW, Petronella N, Pagotto F. The Listeria monocytogenes Core-Genome Sequence Typer (LmCGST): a bioinformatic pipeline for molecular characterization with next-generation sequence data. BMC Microbiol 2015; 15:224. [PMID: 26490433 PMCID: PMC4618880 DOI: 10.1186/s12866-015-0526-1] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2014] [Accepted: 09/21/2015] [Indexed: 11/28/2022] Open
Abstract
BACKGROUND Next-generation sequencing provides a powerful means of molecular characterization. However, methods such as single-nucleotide polymorphism detection or whole-chromosome sequence analysis are computationally expensive, prone to errors, and are still less accessible than traditional typing methods. Here, we present the Listeria monocytogenes core-genome sequence typing method for molecular characterization. This method uses a high-confidence core (HCC) genome, calculated to ensure accurate identification of orthologs. We also developed an evolutionarily relevant nomenclature based upon phylogenetic analysis of HCC genomes. Finally, we created a pipeline (LmCGST; https://sourceforge.net/projects/lmcgst/files/) that takes in raw next-generation sequencing reads, calculates a subject HCC profile, compares it to an expandable database, assigns a sequence type, and performs a phylogenetic analysis. RESULTS We analyzed 29 high-quality, closed Listeria monocytogenes chromosome sequences and identified loci that are reliable targets for automated molecular characterization methods. We identified 1013 open-reading frames that comprise our high-confidence core (HCC) genome. We then populated a database with HCC profiles from 114 taxa. We sequenced 84 randomly selected isolates from the Listeriosis Reference Service for Canada's collection and analysed them with the LmCGST pipeline. In addition, we generated pulsed-field gel electrophoresis, ribotyping, and in silico multi-locus sequence typing (MLST) data for the 84 isolates and compared the results to those obtained using the CGST method. We found that all of the methods yielded results that are generally congruent. However, due to the increased numbers of categories, the CGST method provides much greater discriminatory power than the other methods tested here. CONCLUSIONS We show that the CGST method provides increased discriminatory power relative to typing methods such as pulsed-field gel electrophoresis, ribotyping, and multi-locus sequence typing while it addresses several shortcomings of other methods of molecular characterization with next-generation sequence data. It uses discrete, well-defined groupings (types) of organisms that are phylogenetically relevant and easily interpreted. In addition, the CGST scheme can be expanded to include additional loci and HCC profiles in the future. In total, the CGST method provides an approach to the molecular characterization of Listeria monocytogenes with next-generation sequence data that is highly reproducible, easily standardized, portable, and accessible.
Collapse
Affiliation(s)
- Arthur W Pightling
- Office of Analytics and Outreach, Center for Food Safety and Applied Nutrition, U.S. Food and Drug Administration, 5100 Paint Branch Parkway, College Park, MD, 20740, USA.
| | - Nicholas Petronella
- Biostatistics and Modelling Division, Bureau of Food Surveillance and Science Integration, Food Directorate, Health Products and Food Branch, Health Canada, 251 Sir Frederick Banting Driveway, Ottawa, K1A 0K9, ON, Canada.
| | - Franco Pagotto
- Office of Analytics and Outreach, Center for Food Safety and Applied Nutrition, U.S. Food and Drug Administration, 5100 Paint Branch Parkway, College Park, MD, 20740, USA.
| |
Collapse
|
109
|
Castro DB, Pereira LB, Silva MVME, Silva BPD, Palermo BRZ, Carlos C, Belgini DR, Limache EEG, Lacerda GVJ, Nery MB, Gomes MB, Souza SSD, Silva TMD, Rodrigues VD, Paulino LC, Vicentini R, Ferraz LF, Ottoboni LM. High-quality draft genome sequence of Kocuria marina SO9-6, an actinobacterium isolated from a copper mine. GENOMICS DATA 2015; 5:34-5. [PMID: 26484219 PMCID: PMC4583624 DOI: 10.1016/j.gdata.2015.05.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Revised: 05/06/2015] [Accepted: 05/10/2015] [Indexed: 11/17/2022]
Abstract
An actinobacterial strain, designated SO9-6, was isolated from a copper iron sulfide mineral. The organism is Gram-positive, facultatively anaerobic, and coccoid. Chemotaxonomic and phylogenetic properties were consistent with its classification in the genus Kocuria. Here, we report the first draft genome sequence of Kocuria marina SO9-6 under accession JROM00000000 (http://www.ncbi.nlm.nih.gov/nuccore/725823918), which provides insights for heavy metal bioremediation and production of compounds of biotechnological interest.
Collapse
Affiliation(s)
- Daniel B.A. Castro
- Center for Molecular Biology and Genetic Engineering (CBMEG), State University of Campinas — UNICAMP, Campinas, Brazil
| | - Letícia Bianca Pereira
- Center for Molecular Biology and Genetic Engineering (CBMEG), State University of Campinas — UNICAMP, Campinas, Brazil
| | | | - Bárbara P. da Silva
- Chemical, Biological and Agricultural Pluridisciplinary Research Center, State University of Campinas — UNICAMP, Campinas, Brazil
| | - Bruna Rafaella Z. Palermo
- Center for Molecular Biology and Genetic Engineering (CBMEG), State University of Campinas — UNICAMP, Campinas, Brazil
| | - Camila Carlos
- Center for Molecular Biology and Genetic Engineering (CBMEG), State University of Campinas — UNICAMP, Campinas, Brazil
| | - Daiane R.B. Belgini
- Chemical, Biological and Agricultural Pluridisciplinary Research Center, State University of Campinas — UNICAMP, Campinas, Brazil
| | - Elmer Erasmo G. Limache
- Chemical, Biological and Agricultural Pluridisciplinary Research Center, State University of Campinas — UNICAMP, Campinas, Brazil
| | - Gileno V. Jr Lacerda
- Chemical, Biological and Agricultural Pluridisciplinary Research Center, State University of Campinas — UNICAMP, Campinas, Brazil
| | - Mariana B.P. Nery
- Chemical, Biological and Agricultural Pluridisciplinary Research Center, State University of Campinas — UNICAMP, Campinas, Brazil
| | - Milene B. Gomes
- Chemical, Biological and Agricultural Pluridisciplinary Research Center, State University of Campinas — UNICAMP, Campinas, Brazil
| | - Salatiel S. de Souza
- Department of Animal Biology, Institute of Biology, State University of Campinas — UNICAMP, Campinas, Brazil
| | - Thiago M. da Silva
- Center for Molecular Biology and Genetic Engineering (CBMEG), State University of Campinas — UNICAMP, Campinas, Brazil
| | - Viviane D. Rodrigues
- Center for Molecular Biology and Genetic Engineering (CBMEG), State University of Campinas — UNICAMP, Campinas, Brazil
| | - Luciana C. Paulino
- Center of Natural and Human Sciences, Federal University of ABC, Santo André, Brazil
| | - Renato Vicentini
- Center for Molecular Biology and Genetic Engineering (CBMEG), State University of Campinas — UNICAMP, Campinas, Brazil
| | - Lúcio F.C. Ferraz
- Department of Molecular Biology and Pharmacology, University of São Francisco, Bragança Paulista, Brazil
| | - Laura M.M. Ottoboni
- Center for Molecular Biology and Genetic Engineering (CBMEG), State University of Campinas — UNICAMP, Campinas, Brazil
- Corresponding author. Tel.: + 551935211099.
| |
Collapse
|
110
|
Pereira UP, Gouran H, Nascimento R, Adaskaveg JE, Goulart LR, Dandekar AM. Complete Genome Sequence of Xanthomonas arboricola pv. juglandis 417, a Copper-Resistant Strain Isolated from Juglans regia L. GENOME ANNOUNCEMENTS 2015; 3:e01126-15. [PMID: 26430043 PMCID: PMC4591315 DOI: 10.1128/genomea.01126-15] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/17/2015] [Accepted: 08/19/2015] [Indexed: 11/20/2022]
Abstract
Here, we report the complete genome sequence of Xanthomonas arboricola pv. juglandis 417, a copper-resistant strain isolated from a blighted walnut fruit (Juglans regia L. cv. Chandler). The genome consists of a single chromosome (5,218 kb).
Collapse
Affiliation(s)
- Ulisses P Pereira
- Preventive Veterinary Medicine Department, State University of Londrina, Campus UEL, Londrina, PR, Brazil
| | - Hossein Gouran
- Plant Sciences Department, University of California Davis, Davis, California, USA
| | - Rafael Nascimento
- Institute of Genetics and Biochemistry, Federal University of Uberlândia, Campus Umuarama, Uberlândia, MG, Brazil
| | - James E Adaskaveg
- Department of Plant Pathology and Microbiology, University of California Riverside, California, USA
| | - Luiz Ricardo Goulart
- Institute of Genetics and Biochemistry, Federal University of Uberlândia, Campus Umuarama, Uberlândia, MG, Brazil
| | - Abhaya M Dandekar
- Plant Sciences Department, University of California Davis, Davis, California, USA
| |
Collapse
|
111
|
Dvořák J, Fajtová P, Ulrychová L, Leontovyč A, Rojo-Arreola L, Suzuki BM, Horn M, Mareš M, Craik CS, Caffrey CR, O'Donoghue AJ. Excretion/secretion products from Schistosoma mansoni adults, eggs and schistosomula have unique peptidase specificity profiles. Biochimie 2015; 122:99-109. [PMID: 26409899 DOI: 10.1016/j.biochi.2015.09.025] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2015] [Accepted: 09/22/2015] [Indexed: 02/03/2023]
Abstract
Schistosomiasis is one of a number of chronic helminth diseases of poverty that severely impact personal and societal well-being and productivity. Peptidases (proteases) are vital to successful parasitism, and can modulate host physiology and immunology. Interference of peptidase action by specific drugs or vaccines can be therapeutically beneficial. To date, research on peptidases in the schistosome parasite has focused on either the functional characterization of individual peptidases or their annotation as part of global genome or transcriptome studies. We were interested in functionally characterizing the complexity of peptidase activity operating at the host-parasite interface, therefore the excretory-secretory products of key developmental stages of Schistosoma mansoni that parasitize the human were examined. Using class specific peptidase inhibitors in combination with a multiplex substrate profiling assay, a number of unique activities derived from endo- and exo-peptidases were revealed in the excretory-secretory products of schistosomula (larval migratory worms), adults and eggs. The data highlight the complexity of the functional degradome for each developmental stage of this parasite and facilitate further enquiry to establish peptidase identity, physiological and immunological function, and utility as drug or vaccine candidates.
Collapse
Affiliation(s)
- Jan Dvořák
- Institute of Molecular Genetics, The Czech Academy of Sciences, Prague CZ - 142 20, Czech Republic; Institute of Parasitology, Biology Center, The Czech Academy of Sciences, České Budějovice CZ - 370 05, Czech Republic; Institute of Organic Chemistry and Biochemistry, The Czech Academy of Sciences, Prague CZ - 166 10, Czech Republic
| | - Pavla Fajtová
- Institute of Organic Chemistry and Biochemistry, The Czech Academy of Sciences, Prague CZ - 166 10, Czech Republic; First Faculty of Medicine, Charles University in Prague, Prague CZ - 121 08, Czech Republic
| | - Lenka Ulrychová
- Institute of Molecular Genetics, The Czech Academy of Sciences, Prague CZ - 142 20, Czech Republic; Dept. of Parasitology, Faculty of Science, Charles University in Prague, Prague CZ - 128 44, Czech Republic
| | - Adrian Leontovyč
- Institute of Organic Chemistry and Biochemistry, The Czech Academy of Sciences, Prague CZ - 166 10, Czech Republic; First Faculty of Medicine, Charles University in Prague, Prague CZ - 121 08, Czech Republic
| | - Liliana Rojo-Arreola
- Center for Discovery and Innovation in Parasitic Diseases and the Department of Pathology, University of California San Francisco, San Francisco, CA 94158, USA
| | - Brian M Suzuki
- Center for Discovery and Innovation in Parasitic Diseases and the Department of Pathology, University of California San Francisco, San Francisco, CA 94158, USA
| | - Martin Horn
- Institute of Organic Chemistry and Biochemistry, The Czech Academy of Sciences, Prague CZ - 166 10, Czech Republic
| | - Michael Mareš
- Institute of Organic Chemistry and Biochemistry, The Czech Academy of Sciences, Prague CZ - 166 10, Czech Republic
| | - Charles S Craik
- Dept. of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158, USA
| | - Conor R Caffrey
- Center for Discovery and Innovation in Parasitic Diseases and the Department of Pathology, University of California San Francisco, San Francisco, CA 94158, USA
| | - Anthony J O'Donoghue
- Dept. of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158, USA.
| |
Collapse
|
112
|
Valdivia HO, Reis-Cunha JL, Rodrigues-Luiz GF, Baptista RP, Baldeviano GC, Gerbasi RV, Dobson DE, Pratlong F, Bastien P, Lescano AG, Beverley SM, Bartholomeu DC. Comparative genomic analysis of Leishmania (Viannia) peruviana and Leishmania (Viannia) braziliensis. BMC Genomics 2015; 16:715. [PMID: 26384787 PMCID: PMC4575464 DOI: 10.1186/s12864-015-1928-z] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2015] [Accepted: 09/09/2015] [Indexed: 12/25/2022] Open
Abstract
BACKGROUND The Leishmania (Viannia) braziliensis complex is responsible for most cases of New World tegumentary leishmaniasis. This complex includes two closely related species but with different geographic distribution and disease phenotypes, L. (V.) peruviana and L. (V.) braziliensis. However, the genetic basis of these differences is not well understood and the status of L. (V.) peruviana as distinct species has been questioned by some. Here we sequenced the genomes of two L. (V.) peruviana isolates (LEM1537 and PAB-4377) using Illumina high throughput sequencing and performed comparative analyses against the L. (V.) braziliensis M2904 reference genome. Comparisons were focused on the detection of Single Nucleotide Polymorphisms (SNPs), insertions and deletions (INDELs), aneuploidy and gene copy number variations. RESULTS We found 94,070 variants shared by both L. (V.) peruviana isolates (144,079 in PAB-4377 and 136,946 in LEM1537) against the L. (V.) braziliensis M2904 reference genome while only 26,853 variants separated both L. (V.) peruviana genomes. Analysis in coding sequences detected 26,750 SNPs and 1,513 indels shared by both L. (V.) peruviana isolates against L. (V.) braziliensis M2904 and revealed two L. (V.) braziliensis pseudogenes that are likely to have coding potential in L. (V.) peruviana. Chromosomal read density and allele frequency profiling showed a heterogeneous pattern of aneuploidy with an overall disomic tendency in both L. (V.) peruviana isolates, in contrast with a trisomic pattern in the L. (V.) braziliensis M2904 reference. Read depth analysis allowed us to detect more than 368 gene expansions and 14 expanded gene arrays in L. (V.) peruviana, and the likely absence of expanded amastin gene arrays. CONCLUSIONS The greater numbers of interspecific SNP/indel differences between L. (V.) peruviana and L. (V.) braziliensis and the presence of different gene and chromosome copy number variations support the classification of both organisms as closely related but distinct species. The extensive nucleotide polymorphisms and differences in gene and chromosome copy numbers in L. (V.) peruviana suggests the possibility that these may contribute to some of the unique features of its biology, including a lower pathology and lack of mucosal development.
Collapse
Affiliation(s)
- Hugo O Valdivia
- Laboratório de Imunologia e Genômica de Parasitos, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil. .,Department of Parasitology, U.S. Naval Medical Research Unit No. 6, Lima, Peru.
| | - João L Reis-Cunha
- Laboratório de Imunologia e Genômica de Parasitos, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil.
| | - Gabriela F Rodrigues-Luiz
- Laboratório de Imunologia e Genômica de Parasitos, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil.
| | - Rodrigo P Baptista
- Laboratório de Imunologia e Genômica de Parasitos, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil.
| | | | - Robert V Gerbasi
- Department of Parasitology, U.S. Naval Medical Research Unit No. 6, Lima, Peru.
| | - Deborah E Dobson
- Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, Missouri, USA.
| | - Francine Pratlong
- Centre Hospitalier Universitaire de Montpellier, Departement de Parasitologie-Mycologie, Centre National de Reference des Leishmanioses, Montpellier, France.
| | - Patrick Bastien
- Centre Hospitalier Universitaire de Montpellier, Departement de Parasitologie-Mycologie, Centre National de Reference des Leishmanioses, Montpellier, France.
| | - Andrés G Lescano
- Department of Parasitology, U.S. Naval Medical Research Unit No. 6, Lima, Peru. .,Universidad Peruana Cayetano Heredia, School of Public Health and Management, Lima, Peru.
| | - Stephen M Beverley
- Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, Missouri, USA.
| | - Daniella C Bartholomeu
- Laboratório de Imunologia e Genômica de Parasitos, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil.
| |
Collapse
|
113
|
Ormerod KL, George NM, Fraser JA, Wainwright C, Hugenholtz P. Comparative genomics of non-pseudomonal bacterial species colonising paediatric cystic fibrosis patients. PeerJ 2015; 3:e1223. [PMID: 26401445 PMCID: PMC4579023 DOI: 10.7717/peerj.1223] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2015] [Accepted: 08/10/2015] [Indexed: 12/16/2022] Open
Abstract
The genetic disorder cystic fibrosis is a life-limiting condition affecting ∼70,000 people worldwide. Targeted, early, treatment of the dominant infecting species, Pseudomonas aeruginosa, has improved patient outcomes; however, there is concern that other species are now stepping in to take its place. In addition, the necessarily long-term antibiotic therapy received by these patients may be providing a suitable environment for the emergence of antibiotic resistance. To investigate these issues, we employed whole-genome sequencing of 28 non-Pseudomonas bacterial strains isolated from three paediatric patients. We did not find any trend of increasing antibiotic resistance (either by mutation or lateral gene transfer) in these isolates in comparison with other examples of the same species. In addition, each isolate contained a virulence gene repertoire that was similar to other examples of the relevant species. These results support the impaired clearance of the CF lung not demanding extensive virulence for survival in this habitat. By analysing serial isolates of the same species we uncovered several examples of strain persistence. The same strain of Staphylococcus aureus persisted for nearly a year, despite administration of antibiotics to which it was shown to be sensitive. This is consistent with previous studies showing antibiotic therapy to be inadequate in cystic fibrosis patients, which may also explain the lack of increasing antibiotic resistance over time. Serial isolates of two naturally multi-drug resistant organisms, Achromobacter xylosoxidans and Stenotrophomonas maltophilia, revealed that while all S. maltophilia strains were unique, A. xylosoxidans persisted for nearly five years, making this a species of particular concern. The data generated by this study will assist in developing an understanding of the non-Pseudomonas species associated with cystic fibrosis.
Collapse
Affiliation(s)
- Kate L. Ormerod
- Australian Infectious Diseases Research Centre, School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, Queensland, Australia
| | - Narelle M. George
- Health Support Queensland, Department of Health, Queensland Government, Herston, Queensland, Australia
| | - James A. Fraser
- Australian Infectious Diseases Research Centre, School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, Queensland, Australia
| | - Claire Wainwright
- School of Medicine, University of Queensland, Brisbane, Queensland, Australia
- Department of Respiratory and Sleep Medicine, Lady Cilento Children’s Hospital, South Brisbane, Queensland, Australia
- Queensland Children’s Medical Research Insitute, Herston, Queensland, Australia
| | - Philip Hugenholtz
- Australian Infectious Diseases Research Centre, School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, Queensland, Australia
- Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, Queensland, Australia
| |
Collapse
|
114
|
Farrant GK, Hoebeke M, Partensky F, Andres G, Corre E, Garczarek L. WiseScaffolder: an algorithm for the semi-automatic scaffolding of Next Generation Sequencing data. BMC Bioinformatics 2015; 16:281. [PMID: 26335184 PMCID: PMC4559175 DOI: 10.1186/s12859-015-0705-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2015] [Accepted: 08/17/2015] [Indexed: 01/12/2023] Open
Abstract
Background The sequencing depth provided by high-throughput sequencing technologies has allowed a rise in the number of de novo sequenced genomes that could potentially be closed without further sequencing. However, genome scaffolding and closure require costly human supervision that often results in genomes being published as drafts. A number of automatic scaffolders were recently released, which improved the global quality of genomes published in the last few years. Yet, none of them reach the efficiency of manual scaffolding. Results Here, we present an innovative semi-automatic scaffolder that additionally helps with chimerae resolution and generates valuable contig maps and outputs for manual improvement of the automatic scaffolding. This software was tested on the newly sequenced marine cyanobacterium Synechococcus sp. WH8103 as well as two reference datasets used in previous studies, Rhodobacter sphaeroides and Homo sapiens chromosome 14 (http://gage.cbcb.umd.edu/). The quality of resulting scaffolds was compared to that of three other stand-alone scaffolders: SSPACE, SOPRA and SCARPA. For all three model organisms, WiseScaffolder produced better results than other scaffolders in terms of contiguity statistics (number of genome fragments, N50, LG50, etc.) and, in the case of WH8103, the reliability of the scaffolds was confirmed by whole genome alignment against a closely related reference genome. We also propose an efficient computer-assisted strategy for manual improvement of the scaffolding, using outputs generated by WiseScaffolder, as well as for genome finishing that in our hands led to the circularization of the WH8103 genome. Conclusion Altogether, WiseScaffolder proved more efficient than three other scaffolders for both prokaryotic and eukaryotic genomes and is thus likely applicable to most genome projects. The scaffolding pipeline described here should be of particular interest to biologists wishing to take advantage of the high added value of complete genomes. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0705-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Gregory K Farrant
- Sorbonne Universités, UPMC Univ. Paris 06, UMR 7144, Station Biologique, CS 90074, 29688, Roscoff cedex, France.,CNRS, UMR 7144 Adaptation and Diversity in the Marine Environment, Oceanic Plankton Group, Marine Phototrophic Prokaryotes team, Place Georges Teissier, CS 90074, 29688, Roscoff cedex, France
| | - Mark Hoebeke
- CNRS, FR 2424, ABiMS Platform, Station Biologique, Place Georges Teissier, CS 90074, 29688, Roscoff cedex, France
| | - Frédéric Partensky
- Sorbonne Universités, UPMC Univ. Paris 06, UMR 7144, Station Biologique, CS 90074, 29688, Roscoff cedex, France.,CNRS, UMR 7144 Adaptation and Diversity in the Marine Environment, Oceanic Plankton Group, Marine Phototrophic Prokaryotes team, Place Georges Teissier, CS 90074, 29688, Roscoff cedex, France
| | - Gwendoline Andres
- CNRS, FR 2424, ABiMS Platform, Station Biologique, Place Georges Teissier, CS 90074, 29688, Roscoff cedex, France
| | - Erwan Corre
- CNRS, FR 2424, ABiMS Platform, Station Biologique, Place Georges Teissier, CS 90074, 29688, Roscoff cedex, France
| | - Laurence Garczarek
- Sorbonne Universités, UPMC Univ. Paris 06, UMR 7144, Station Biologique, CS 90074, 29688, Roscoff cedex, France. .,CNRS, UMR 7144 Adaptation and Diversity in the Marine Environment, Oceanic Plankton Group, Marine Phototrophic Prokaryotes team, Place Georges Teissier, CS 90074, 29688, Roscoff cedex, France.
| |
Collapse
|
115
|
The Genome of a Tortoise Herpesvirus (Testudinid Herpesvirus 3) Has a Novel Structure and Contains a Large Region That Is Not Required for Replication In Vitro or Virulence In Vivo. J Virol 2015; 89:11438-56. [PMID: 26339050 DOI: 10.1128/jvi.01794-15] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2015] [Accepted: 08/27/2015] [Indexed: 11/20/2022] Open
Abstract
UNLABELLED Testudinid herpesvirus 3 (TeHV-3) is the causative agent of a lethal disease affecting several tortoise species. The threat that this virus poses to endangered animals is focusing efforts on characterizing its properties, in order to enable the development of prophylactic methods. We have sequenced the genomes of the two most studied TeHV-3 strains (1976 and 4295). TeHV-3 strain 1976 has a novel genome structure and is most closely related to a turtle herpesvirus, thus supporting its classification into genus Scutavirus, subfamily Alphaherpesvirinae, family Herpesviridae. The sequence of strain 1976 also revealed viral counterparts of cellular interleukin-10 and semaphorin, which have not been described previously in members of subfamily Alphaherpesvirinae. TeHV-3 strain 4295 is a mixture of three forms (m1, m2, and M), in which, in comparison to strain 1976, the genomes exhibit large, partially overlapping deletions of 12.5 to 22.4 kb. Viral subclones representing these forms were isolated by limiting dilution assays, and each replicated in cell culture comparably to strain 1976. With the goal of testing the potential of the three forms as attenuated vaccine candidates, strain 4295 was inoculated intranasally into Hermann's tortoises (Testudo hermanni). All inoculated subjects died, and PCR analyses demonstrated the ability of the m2 and M forms to spread and invade the brain. In contrast, the m1 form was detected in none of the organs tested, suggesting its potential as the basis of an attenuated vaccine candidate. Our findings represent a major step toward characterizing TeHV-3 and developing prophylactic methods against it. IMPORTANCE Testudinid herpesvirus 3 (TeHV-3) causes a lethal disease in tortoises, several species of which are endangered. We have characterized the viral genome and used this information to take steps toward developing an attenuated vaccine. We have sequenced the genomes of two strains (1976 and 4295), compared their growth in vitro, and investigated the pathogenesis of strain 4295, which consists of three deletion mutants. The major findings are that (i) TeHV-3 has a novel genome structure, (ii) its closest relative is a turtle herpesvirus, (iii) it contains interleukin-10 and semaphorin genes (the first time these have been reported in an alphaherpesvirus), (iv) a sizeable region of the genome is not required for viral replication in vitro or virulence in vivo, and (v) one of the components of strain 4295, which has a deletion of 22.4 kb, exhibits properties indicating that it may serve as the starting point for an attenuated vaccine.
Collapse
|
116
|
Schirrmeister BE, Gugger M, Donoghue PCJ. Cyanobacteria and the Great Oxidation Event: evidence from genes and fossils. PALAEONTOLOGY 2015. [PMID: 26924853 DOI: 10.5061/dryad.h75ht] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Cyanobacteria are among the most ancient of evolutionary lineages, oxygenic photosynthesizers that may have originated before 3.0 Ga, as evidenced by free oxygen levels. Throughout the Precambrian, cyanobacteria were one of the most important drivers of biological innovations, strongly impacting early Earth's environments. At the end of the Archean Eon, they were responsible for the rapid oxygenation of Earth's atmosphere during an episode referred to as the Great Oxidation Event (GOE). However, little is known about the origin and diversity of early cyanobacterial taxa, due to: (1) the scarceness of Precambrian fossil deposits; (2) limited characteristics for the identification of taxa; and (3) the poor preservation of ancient microfossils. Previous studies based on 16S rRNA have suggested that the origin of multicellularity within cyanobacteria might have been associated with the GOE. However, single-gene analyses have limitations, particularly for deep branches. We reconstructed the evolutionary history of cyanobacteria using genome scale data and re-evaluated the Precambrian fossil record to get more precise calibrations for a relaxed clock analysis. For the phylogenomic reconstructions, we identified 756 conserved gene sequences in 65 cyanobacterial taxa, of which eight genomes have been sequenced in this study. Character state reconstructions based on maximum likelihood and Bayesian phylogenetic inference confirm previous findings, of an ancient multicellular cyanobacterial lineage ancestral to the majority of modern cyanobacteria. Relaxed clock analyses provide firm support for an origin of cyanobacteria in the Archean and a transition to multicellularity before the GOE. It is likely that multicellularity had a greater impact on cyanobacterial fitness and thus abundance, than previously assumed. Multicellularity, as a major evolutionary innovation, forming a novel unit for selection to act upon, may have served to overcome evolutionary constraints and enabled diversification of the variety of morphotypes seen in cyanobacteria today.
Collapse
Affiliation(s)
- Bettina E Schirrmeister
- School of Earth Sciences University of Bristol Life Science Building 24 Tyndall Avenue Bristol BS8 1TQ UK
| | - Muriel Gugger
- Institut Pasteur Collection des Cyanobactéries 75724 Paris Cedex 15 France
| | - Philip C J Donoghue
- School of Earth Sciences University of Bristol Life Science Building 24 Tyndall Avenue Bristol BS8 1TQ UK
| |
Collapse
|
117
|
Schirrmeister BE, Gugger M, Donoghue PCJ. Cyanobacteria and the Great Oxidation Event: evidence from genes and fossils. PALAEONTOLOGY 2015; 58:769-785. [PMID: 26924853 PMCID: PMC4755140 DOI: 10.1111/pala.12178] [Citation(s) in RCA: 123] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Accepted: 05/26/2015] [Indexed: 05/22/2023]
Abstract
Cyanobacteria are among the most ancient of evolutionary lineages, oxygenic photosynthesizers that may have originated before 3.0 Ga, as evidenced by free oxygen levels. Throughout the Precambrian, cyanobacteria were one of the most important drivers of biological innovations, strongly impacting early Earth's environments. At the end of the Archean Eon, they were responsible for the rapid oxygenation of Earth's atmosphere during an episode referred to as the Great Oxidation Event (GOE). However, little is known about the origin and diversity of early cyanobacterial taxa, due to: (1) the scarceness of Precambrian fossil deposits; (2) limited characteristics for the identification of taxa; and (3) the poor preservation of ancient microfossils. Previous studies based on 16S rRNA have suggested that the origin of multicellularity within cyanobacteria might have been associated with the GOE. However, single-gene analyses have limitations, particularly for deep branches. We reconstructed the evolutionary history of cyanobacteria using genome scale data and re-evaluated the Precambrian fossil record to get more precise calibrations for a relaxed clock analysis. For the phylogenomic reconstructions, we identified 756 conserved gene sequences in 65 cyanobacterial taxa, of which eight genomes have been sequenced in this study. Character state reconstructions based on maximum likelihood and Bayesian phylogenetic inference confirm previous findings, of an ancient multicellular cyanobacterial lineage ancestral to the majority of modern cyanobacteria. Relaxed clock analyses provide firm support for an origin of cyanobacteria in the Archean and a transition to multicellularity before the GOE. It is likely that multicellularity had a greater impact on cyanobacterial fitness and thus abundance, than previously assumed. Multicellularity, as a major evolutionary innovation, forming a novel unit for selection to act upon, may have served to overcome evolutionary constraints and enabled diversification of the variety of morphotypes seen in cyanobacteria today.
Collapse
Affiliation(s)
- Bettina E. Schirrmeister
- School of Earth SciencesUniversity of BristolLife Science Building24 Tyndall AvenueBristolBS8 1TQUK
| | - Muriel Gugger
- Institut PasteurCollection des Cyanobactéries75724Paris Cedex 15France
| | - Philip C. J. Donoghue
- School of Earth SciencesUniversity of BristolLife Science Building24 Tyndall AvenueBristolBS8 1TQUK
| |
Collapse
|
118
|
Hadfield SJ, Pachebat JA, Swain MT, Robinson G, Cameron SJ, Alexander J, Hegarty MJ, Elwin K, Chalmers RM. Generation of whole genome sequences of new Cryptosporidium hominis and Cryptosporidium parvum isolates directly from stool samples. BMC Genomics 2015; 16:650. [PMID: 26318339 PMCID: PMC4552982 DOI: 10.1186/s12864-015-1805-9] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2015] [Accepted: 07/31/2015] [Indexed: 12/20/2022] Open
Abstract
Background Whole genome sequencing (WGS) of Cryptosporidium spp. has previously relied on propagation of the parasite in animals to generate enough oocysts from which to extract DNA of sufficient quantity and purity for analysis. We have developed and validated a method for preparation of genomic Cryptosporidium DNA suitable for WGS directly from human stool samples and used it to generate 10 high-quality whole Cryptosporidium genome assemblies. Our method uses a combination of salt flotation, immunomagnetic separation (IMS), and surface sterilisation of oocysts prior to DNA extraction, with subsequent use of the transposome-based Nextera XT kit to generate libraries for sequencing on Illumina platforms. IMS was found to be superior to caesium chloride density centrifugation for purification of oocysts from small volume stool samples and for reducing levels of contaminant DNA. Results The IMS-based method was used initially to sequence whole genomes of Cryptosporidium hominis gp60 subtype IbA10G2 and Cryptosporidium parvum gp60 subtype IIaA19G1R2 from small amounts of stool left over from diagnostic testing of clinical cases of cryptosporidiosis. The C. parvum isolate was sequenced to a mean depth of 51.8X with reads covering 100 % of the bases of the C. parvum Iowa II reference genome (Bioproject PRJNA 15586), while the C. hominis isolate was sequenced to a mean depth of 34.7X with reads covering 98 % of the bases of the C. hominis TU502 v1 reference genome (Bioproject PRJNA 15585). The method was then applied to a further 17 stools, successfully generating another eight new whole genome sequences, of which two were C. hominis (gp60 subtypes IbA10G2 and IaA14R3) and six C. parvum (gp60 subtypes IIaA15G2R1 from three samples, and one each of IIaA17G1R1, IIaA18G2R1, and IIdA22G1), demonstrating the utility of this method to sequence Cryptosporidium genomes directly from clinical samples. This development is especially important as it reduces the requirement to propagate Cryptosporidium oocysts in animal models prior to genome sequencing. Conclusion This represents the first report of high-quality whole genome sequencing of Cryptosporidium isolates prepared directly from human stool samples.
Collapse
Affiliation(s)
- Stephen J Hadfield
- Cryptosporidium Reference Unit, Public Health Wales Microbiology ABM, Singleton Hospital, Sgeti, Swansea, SA2 8QA, United Kingdom.
| | - Justin A Pachebat
- Institute of Biological, Environmental & Rural Sciences, Aberystwyth University, Penglais, Aberystwyth, Ceredigion, SY23 3DA, United Kingdom.
| | - Martin T Swain
- Institute of Biological, Environmental & Rural Sciences, Aberystwyth University, Penglais, Aberystwyth, Ceredigion, SY23 3DA, United Kingdom.
| | - Guy Robinson
- Cryptosporidium Reference Unit, Public Health Wales Microbiology ABM, Singleton Hospital, Sgeti, Swansea, SA2 8QA, United Kingdom.
| | - Simon Js Cameron
- Institute of Biological, Environmental & Rural Sciences, Aberystwyth University, Penglais, Aberystwyth, Ceredigion, SY23 3DA, United Kingdom.
| | - Jenna Alexander
- Institute of Biological, Environmental & Rural Sciences, Aberystwyth University, Penglais, Aberystwyth, Ceredigion, SY23 3DA, United Kingdom.
| | - Matthew J Hegarty
- Institute of Biological, Environmental & Rural Sciences, Aberystwyth University, Penglais, Aberystwyth, Ceredigion, SY23 3DA, United Kingdom.
| | - Kristin Elwin
- Cryptosporidium Reference Unit, Public Health Wales Microbiology ABM, Singleton Hospital, Sgeti, Swansea, SA2 8QA, United Kingdom.
| | - Rachel M Chalmers
- Cryptosporidium Reference Unit, Public Health Wales Microbiology ABM, Singleton Hospital, Sgeti, Swansea, SA2 8QA, United Kingdom.
| |
Collapse
|
119
|
Kosugi S, Hirakawa H, Tabata S. GMcloser: closing gaps in assemblies accurately with a likelihood-based selection of contig or long-read alignments. Bioinformatics 2015; 31:3733-41. [PMID: 26261222 DOI: 10.1093/bioinformatics/btv465] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2015] [Accepted: 08/04/2015] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Genome assemblies generated with next-generation sequencing (NGS) reads usually contain a number of gaps. Several tools have recently been developed to close the gaps in these assemblies with NGS reads. Although these gap-closing tools efficiently close the gaps, they entail a high rate of misassembly at gap-closing sites. RESULTS We have found that the assembly error rates caused by these tools are 20-500-fold higher than the rate of errors introduced into contigs by de novo assemblers. We here describe GMcloser, a tool that accurately closes these gaps with a preassembled contig set or a long read set (i.e., error-corrected PacBio reads). GMcloser uses likelihood-based classifiers calculated from the alignment statistics between scaffolds, contigs and paired-end reads to correctly assign contigs or long reads to gap regions of scaffolds, thereby achieving accurate and efficient gap closure. We demonstrate with sequencing data from various organisms that the gap-closing accuracy of GMcloser is 3-100-fold higher than those of other available tools, with similar efficiency. AVAILABILITY AND IMPLEMENTATION GMcloser and an accompanying tool (GMvalue) for evaluating the assembly and correcting misassemblies except SNPs and short indels in the assembly are available at https://sourceforge.net/projects/gmcloser/. CONTACT shunichi.kosugi@riken.jp. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Shunichi Kosugi
- Department of Technology Development, Kazusa DNA Research Institute, Kisarazu, Chiba 292-0818, Japan
| | - Hideki Hirakawa
- Department of Technology Development, Kazusa DNA Research Institute, Kisarazu, Chiba 292-0818, Japan
| | - Satoshi Tabata
- Department of Technology Development, Kazusa DNA Research Institute, Kisarazu, Chiba 292-0818, Japan
| |
Collapse
|
120
|
The barber's pole worm CAP protein superfamily--A basis for fundamental discovery and biotechnology advances. Biotechnol Adv 2015; 33:1744-54. [PMID: 26239368 DOI: 10.1016/j.biotechadv.2015.07.003] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2015] [Revised: 07/02/2015] [Accepted: 07/11/2015] [Indexed: 01/22/2023]
Abstract
Parasitic worm proteins that belong to the cysteine-rich secretory proteins, antigen 5 and pathogenesis-related 1 (CAP) superfamily are proposed to play key roles in the infection process and the modulation of immune responses in host animals. However, there is limited information on these proteins for most socio-economically important worms. Here, we review the CAP protein superfamily of Haemonchus contortus (barber's pole worm), a highly significant parasitic roundworm (order Strongylida) of small ruminants. To do this, we mined genome and transcriptomic datasets, predicted and curated full-length amino acid sequences (n=45), undertook systematic phylogenetic analyses of these data and investigated transcription throughout the life cycle of H. contortus. We inferred functions for selected Caenorhabditis elegans orthologs (including vap-1, vap-2, scl-5 and lon-1) based on genetic networking and by integrating data and published information, and were able to infer that a subset of orthologs and their interaction partners play pivotal roles in growth and development via the insulin-like and/or the TGF-beta signalling pathways. The identification of the important and conserved growth regulator LON-1 led us to appraise the three-dimensional structure of this CAP protein by comparative modelling. This model revealed the presence of different topological moieties on the canonical fold of the CAP domain, which coincide with an overall charge separation as indicated by the electrostatic surface potential map. These observations suggest the existence of separate sites for effector binding and receptor interactions, and thus support the proposal that these worm molecules act in similar ways as venoms act as ligands for chemokine receptors or G protein-coupled receptor effectors. In conclusion, this review should guide future molecular studies of these molecules, and could support the development of novel interventions against haemonchosis.
Collapse
|
121
|
Paulino D, Warren RL, Vandervalk BP, Raymond A, Jackman SD, Birol I. Sealer: a scalable gap-closing application for finishing draft genomes. BMC Bioinformatics 2015. [PMID: 26209068 PMCID: PMC4515008 DOI: 10.1186/s12859-015-0663-4] [Citation(s) in RCA: 96] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND While next-generation sequencing technologies have made sequencing genomes faster and more affordable, deciphering the complete genome sequence of an organism remains a significant bioinformatics challenge, especially for large genomes. Low sequence coverage, repetitive elements and short read length make de novo genome assembly difficult, often resulting in sequence and/or fragment "gaps" - uncharacterized nucleotide (N) stretches of unknown or estimated lengths. Some of these gaps can be closed by re-processing latent information in the raw reads. Even though there are several tools for closing gaps, they do not easily scale up to processing billion base pair genomes. RESULTS Here we describe Sealer, a tool designed to close gaps within assembly scaffolds by navigating de Bruijn graphs represented by space-efficient Bloom filter data structures. We demonstrate how it scales to successfully close 50.8% and 13.8% of gaps in human (3 Gbp) and white spruce (20 Gbp) draft assemblies in under 30 and 27 h, respectively - a feat that is not possible with other leading tools with the breadth of data used in our study. CONCLUSION Sealer is an automated finishing application that uses the succinct Bloom filter representation of a de Bruijn graph to close gaps in draft assemblies, including that of very large genomes. We expect Sealer to have broad utility for finishing genomes across the tree of life, from bacterial genomes to large plant genomes and beyond. Sealer is available for download at https://github.com/bcgsc/abyss/tree/sealer-release.
Collapse
Affiliation(s)
- Daniel Paulino
- Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC, V5Z 4S6, Canada.
| | - René L Warren
- Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC, V5Z 4S6, Canada.
| | - Benjamin P Vandervalk
- Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC, V5Z 4S6, Canada.
| | - Anthony Raymond
- Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC, V5Z 4S6, Canada.
| | - Shaun D Jackman
- Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC, V5Z 4S6, Canada.
| | - Inanç Birol
- Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC, V5Z 4S6, Canada. .,Department of Medical Genetics, University of British Columbia, Vancouver, BC, V6H 3N1, Canada.
| |
Collapse
|
122
|
Woo YH, Ansari H, Otto TD, Klinger CM, Kolisko M, Michálek J, Saxena A, Shanmugam D, Tayyrov A, Veluchamy A, Ali S, Bernal A, del Campo J, Cihlář J, Flegontov P, Gornik SG, Hajdušková E, Horák A, Janouškovec J, Katris NJ, Mast FD, Miranda-Saavedra D, Mourier T, Naeem R, Nair M, Panigrahi AK, Rawlings ND, Padron-Regalado E, Ramaprasad A, Samad N, Tomčala A, Wilkes J, Neafsey DE, Doerig C, Bowler C, Keeling PJ, Roos DS, Dacks JB, Templeton TJ, Waller RF, Lukeš J, Oborník M, Pain A. Chromerid genomes reveal the evolutionary path from photosynthetic algae to obligate intracellular parasites. eLife 2015; 4:e06974. [PMID: 26175406 PMCID: PMC4501334 DOI: 10.7554/elife.06974] [Citation(s) in RCA: 143] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2015] [Accepted: 06/16/2015] [Indexed: 12/18/2022] Open
Abstract
The eukaryotic phylum Apicomplexa encompasses thousands of obligate intracellular parasites of humans and animals with immense socio-economic and health impacts. We sequenced nuclear genomes of Chromera velia and Vitrella brassicaformis, free-living non-parasitic photosynthetic algae closely related to apicomplexans. Proteins from key metabolic pathways and from the endomembrane trafficking systems associated with a free-living lifestyle have been progressively and non-randomly lost during adaptation to parasitism. The free-living ancestor contained a broad repertoire of genes many of which were repurposed for parasitic processes, such as extracellular proteins, components of a motility apparatus, and DNA- and RNA-binding protein families. Based on transcriptome analyses across 36 environmental conditions, Chromera orthologs of apicomplexan invasion-related motility genes were co-regulated with genes encoding the flagellar apparatus, supporting the functional contribution of flagella to the evolution of invasion machinery. This study provides insights into how obligate parasites with diverse life strategies arose from a once free-living phototrophic marine alga. DOI:http://dx.doi.org/10.7554/eLife.06974.001 Single-celled parasites cause many severe diseases in humans and animals. The apicomplexans form probably the most successful group of these parasites and include the parasites that cause malaria. Apicomplexans infect a broad range of hosts, including humans, reptiles, birds, and insects, and often have complicated life cycles. For example, the malaria-causing parasites spread by moving from humans to female mosquitoes and then back to humans. Despite significant differences amongst apicomplexans, these single-celled parasites also share a number of features that are not seen in other living species. How and when these features arose remains unclear. It is known from previous work that apicomplexans are closely related to single-celled algae. But unlike apicomplexans, which depend on a host animal to survive, these algae live freely in their environment, often in close association with corals. Woo et al. have now sequenced the genomes of two photosynthetic algae that are thought to be close living relatives of the apicomplexans. These genomes were then compared to each other and to the genomes of other algae and apicomplexans. These comparisons reconfirmed that the two algae that were studied were close relatives of the apicomplexans. Further analyses suggested that thousands of genes were lost as an ancient free-living algae evolved into the apicomplexan ancestor, and further losses occurred as these early parasites evolved into modern species. The lost genes were typically those that are important for free-living organisms, but are either a hindrance to, or not needed in, a parasitic lifestyle. Some of the ancestor's genes, especially those that coded for the building blocks of flagella (structures which free-living algae use to move around), were repurposed in ways that helped the apicomplexans to invade their hosts. Understanding this repurposing process in greater detail will help to identify key molecules in these deadly parasites that could be targeted by drug treatments. It will also offer answers to one of the most fascinating questions in evolutionary biology: how parasites have evolved from free-living organisms. DOI:http://dx.doi.org/10.7554/eLife.06974.002
Collapse
Affiliation(s)
- Yong H Woo
- Pathogen Genomics Laboratory, Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Hifzur Ansari
- Pathogen Genomics Laboratory, Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Thomas D Otto
- Parasite Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, United Kingdom
| | | | - Martin Kolisko
- Canadian Institute for Advanced Research, Department of Botany, University of British Columbia, Vancouver, Canada
| | - Jan Michálek
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic
| | - Alka Saxena
- Pathogen Genomics Laboratory, Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | | | - Annageldi Tayyrov
- Pathogen Genomics Laboratory, Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Alaguraj Veluchamy
- Ecology and Evolutionary Biology Section, Institut de Biologie de l'Ecole Normale Supérieure, CNRS UMR8197 INSERM U1024, Paris, France
| | - Shahjahan Ali
- Bioscience Core Laboratory, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Axel Bernal
- Department of Biology, University of Pennsylvania, Philadelphia, United States
| | - Javier del Campo
- Canadian Institute for Advanced Research, Department of Botany, University of British Columbia, Vancouver, Canada
| | - Jaromír Cihlář
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic
| | - Pavel Flegontov
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic
| | | | - Eva Hajdušková
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic
| | - Aleš Horák
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic
| | - Jan Janouškovec
- Canadian Institute for Advanced Research, Department of Botany, University of British Columbia, Vancouver, Canada
| | | | - Fred D Mast
- Seattle Biomedical Research Institute, Seattle, United States
| | - Diego Miranda-Saavedra
- Centro de Biología Molecular Severo Ochoa, CSIC/Universidad Autónoma de Madrid, Madrid, Spain
| | - Tobias Mourier
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Raeece Naeem
- Pathogen Genomics Laboratory, Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Mridul Nair
- Pathogen Genomics Laboratory, Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Aswini K Panigrahi
- Bioscience Core Laboratory, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Neil D Rawlings
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Eriko Padron-Regalado
- Pathogen Genomics Laboratory, Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Abhinay Ramaprasad
- Pathogen Genomics Laboratory, Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Nadira Samad
- School of Botany, University of Melbourne, Parkville, Australia
| | - Aleš Tomčala
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic
| | - Jon Wilkes
- Wellcome Trust Centre For Molecular Parasitology, Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
| | - Daniel E Neafsey
- Broad Genome Sequencing and Analysis Program, Broad Institute of MIT and Harvard, Cambridge, United States
| | - Christian Doerig
- Department of Microbiology, Monash University, Clayton, Australia
| | - Chris Bowler
- Ecology and Evolutionary Biology Section, Institut de Biologie de l'Ecole Normale Supérieure, CNRS UMR8197 INSERM U1024, Paris, France
| | - Patrick J Keeling
- Canadian Institute for Advanced Research, Department of Botany, University of British Columbia, Vancouver, Canada
| | - David S Roos
- Department of Biology, University of Pennsylvania, Philadelphia, United States
| | - Joel B Dacks
- Department of Cell Biology, University of Alberta, Edmonton, Canada
| | - Thomas J Templeton
- Department of Microbiology and Immunology, Weill Cornell Medical College, New York, United States
| | - Ross F Waller
- School of Botany, University of Melbourne, Parkville, Australia
| | - Julius Lukeš
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic
| | - Miroslav Oborník
- Institute of Parasitology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic
| | - Arnab Pain
- Pathogen Genomics Laboratory, Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| |
Collapse
|
123
|
Lennon FE, Cianci GC, Cipriani NA, Hensing TA, Zhang HJ, Chen CT, Murgu SD, Vokes EE, Vannier MW, Salgia R. Lung cancer-a fractal viewpoint. Nat Rev Clin Oncol 2015; 12:664-75. [PMID: 26169924 DOI: 10.1038/nrclinonc.2015.108] [Citation(s) in RCA: 97] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Fractals are mathematical constructs that show self-similarity over a range of scales and non-integer (fractal) dimensions. Owing to these properties, fractal geometry can be used to efficiently estimate the geometrical complexity, and the irregularity of shapes and patterns observed in lung tumour growth (over space or time), whereas the use of traditional Euclidean geometry in such calculations is more challenging. The application of fractal analysis in biomedical imaging and time series has shown considerable promise for measuring processes as varied as heart and respiratory rates, neuronal cell characterization, and vascular development. Despite the advantages of fractal mathematics and numerous studies demonstrating its applicability to lung cancer research, many researchers and clinicians remain unaware of its potential. Therefore, this Review aims to introduce the fundamental basis of fractals and to illustrate how analysis of fractal dimension (FD) and associated measurements, such as lacunarity (texture) can be performed. We describe the fractal nature of the lung and explain why this organ is particularly suited to fractal analysis. Studies that have used fractal analyses to quantify changes in nuclear and chromatin FD in primary and metastatic tumour cells, and clinical imaging studies that correlated changes in the FD of tumours on CT and/or PET images with tumour growth and treatment responses are reviewed. Moreover, the potential use of these techniques in the diagnosis and therapeutic management of lung cancer are discussed.
Collapse
Affiliation(s)
- Frances E Lennon
- Section of Hematology/Oncology, University of Chicago, 5841 South Maryland Avenue, MC 2115 Chicago, IL 60637, USA
| | - Gianguido C Cianci
- Department of Cell and Molecular Biology, Feinberg School of Medicine, Northwestern University, 303 East Chicago Avenue, Chicago, IL 60611, USA
| | - Nicole A Cipriani
- Department of Pathology, University of Chicago, 5841 South Maryland Avenue, MC 2115 Chicago, IL 60637, USA
| | - Thomas A Hensing
- NorthShore University Health System, 2650 Ridge Avenue, Evanston, IL 60201, USA
| | - Hannah J Zhang
- Department of Radiology, University of Chicago, 5841 South Maryland Avenue, MC 2115 Chicago, IL 60637, USA
| | - Chin-Tu Chen
- Department of Radiology, University of Chicago, 5841 South Maryland Avenue, MC 2115 Chicago, IL 60637, USA
| | - Septimiu D Murgu
- Department of Medicine, University of Chicago, 5841 South Maryland Avenue, MC 2115 Chicago, IL 60637, USA
| | - Everett E Vokes
- Section of Hematology/Oncology, University of Chicago, 5841 South Maryland Avenue, MC 2115 Chicago, IL 60637, USA
| | - Michael W Vannier
- Department of Radiology, University of Chicago, 5841 South Maryland Avenue, MC 2115 Chicago, IL 60637, USA
| | - Ravi Salgia
- Section of Hematology/Oncology, University of Chicago, 5841 South Maryland Avenue, MC 2115 Chicago, IL 60637, USA
| |
Collapse
|
124
|
Kinjo Y, Saitoh S, Tokuda G. An Efficient Strategy Developed for Next-Generation Sequencing of Endosymbiont Genomes Performed Using Crude DNA Isolated from Host Tissues: A Case Study of Blattabacterium cuenoti Inhabiting the Fat Bodies of Cockroaches. Microbes Environ 2015; 30:208-20. [PMID: 26156552 PMCID: PMC4567559 DOI: 10.1264/jsme2.me14153] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Whole-genome sequencing has emerged as one of the most effective means to elucidate the biological roles and molecular features of obligate intracellular symbionts (endosymbionts). However, the de novo assembly of an endosymbiont genome remains a challenge when host and/or mitochondrial DNA sequences are present in a dataset and hinder the assembly of the genome. By focusing on the traits of genome evolution in endosymbionts, we herein developed and investigated a genome-assembly strategy that consisted of two consecutive procedures: the selection of endosymbiont contigs from an output obtained from a de novo assembly performed using a TBLASTX search against a reference genome, named TBLASTX Contig Selection and Filtering (TCSF), and the iterative reassembling of the genome from reads mapped on the selected contigs, named Iterative Mapping and ReAssembling (IMRA), to merge the contigs. In order to validate this approach, we sequenced two strains of the cockroach endosymbiont Blattabacterium cuenoti and applied this strategy to the datasets. TCSF was determined to be highly accurate and sensitive in contig selection even when the genome of a distantly related free-living bacterium was used as a reference genome. Furthermore, the use of IMRA markedly improved sequence assemblies: the genomic sequence of an endosymbiont was almost completed from a dataset containing only 3% of the sequences of the endosymbiont’s genome. The efficiency of our strategy may facilitate further studies on endosymbionts.
Collapse
Affiliation(s)
- Yukihiro Kinjo
- Tropical Biosphere Research Center, University of the Ryukyus
| | | | | |
Collapse
|
125
|
The structure of the Cyberlindnera jadinii genome and its relation to Candida utilis analyzed by the occurrence of single nucleotide polymorphisms. J Biotechnol 2015; 211:20-30. [PMID: 26150016 DOI: 10.1016/j.jbiotec.2015.06.423] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2015] [Revised: 06/18/2015] [Accepted: 06/29/2015] [Indexed: 11/22/2022]
Abstract
The yeast Cyberlindnera jadinii is a close relative of Candida utilis that is being used in the food and feed industries. Here, we present the 12.7Mb genome sequence of C. jadinii strain CBS 1600 generated by next generation sequencing. The deduced draft genome sequence consists of seven large scaffolds analogous to the seven largest chromosomes of C. utilis. An automated annotation of the C. jadinii genome identified 6147 protein-coding sequences. The level of ploidy for both genomes was analyzed by calling single nucleotide polymorphisms (SNPs) and was verified measuring nuclear DNA contents by florescence activated cell sorting (FACS). Both analyses determined the level of ploidy to diploid for C. jadinii and to triploid for C. utilis. However, SNP calling for C. jadinii also identified scaffold regions that seem to be haploid, triploid or tetraploid.
Collapse
|
126
|
Gowda M, Shirke MD, Mahesh H, Chandarana P, Rajamani A, Chattoo BB. Genome analysis of rice-blast fungus Magnaporthe oryzae field isolates from southern India. GENOMICS DATA 2015; 5:284-91. [PMID: 26484270 PMCID: PMC4583678 DOI: 10.1016/j.gdata.2015.06.018] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/07/2015] [Accepted: 06/03/2015] [Indexed: 12/18/2022]
Abstract
The Indian subcontinent is the center of origin and diversity for rice (Oryza sativa L.). The O. sativa ssp. indica is a major food crop grown in India, which occupies the first and second position in area and production, respectively. Blast disease caused by Magnaporthe oryzae is a major constraint to rice production. Here, we report the analysis of genome architecture and sequence variation of two field isolates, B157 and MG01, of the blast fungus from southern India. The 40 Mb genome of B157 and 43 Mb genome of MG01 contained 11,344 and 11,733 predicted genes, respectively. Genomic comparisons unveiled a large set of SNPs and several isolate specific genes in the Indian blast isolates. Avr genes were analyzed in several sequenced Magnaporthe strains; this analysis revealed the presence of Avr-Pizt and Avr-Ace1 genes in all the sequenced isolates. Availability of whole genomes of field isolates from India will contribute to global efforts to understand genetic diversity of M. oryzae population and to track the emergence of virulent pathotypes. The first genomic study of Magnaporthe from Indian subcontinent Provided information about genomic variations in terms of SNPs, InDels and ICVs due to transposable elements Identified novel genes specific to Indian isolates Genome wide antisense transcripts identified from this study Identified Magnaporthe specific pathogenicity genes that are absent in non-pathogenic Ascomycetes fungi
Collapse
Affiliation(s)
- Malali Gowda
- Genomics Laboratory, Centre for Cellular and Molecular Platforms, Bangalore 560065, India
- Corresponding authors. Tel.: + 91 80 67185113.
| | - Meghana D. Shirke
- Genomics Laboratory, Centre for Cellular and Molecular Platforms, Bangalore 560065, India
| | - H.B. Mahesh
- Genomics Laboratory, Centre for Cellular and Molecular Platforms, Bangalore 560065, India
- Marker Assisted Selection Laboratory, Department of Genetics and Plant Breeding, University of Agricultural Sciences, Bangalore, India
| | - Pinal Chandarana
- Centre for Genome Research, Department of Microbiology and Biotechnology Centre, Faculty of Science, Maharaja Sayajirao University of Baroda, Vadodara 390002, India
| | | | - Bharat B. Chattoo
- Centre for Genome Research, Department of Microbiology and Biotechnology Centre, Faculty of Science, Maharaja Sayajirao University of Baroda, Vadodara 390002, India
- Corresponding authors. Tel.: + 91 80 67185113.
| |
Collapse
|
127
|
Hjerde E, Karlsen C, Sørum H, Parkhill J, Willassen NP, Thomson NR. Co-cultivation and transcriptome sequencing of two co-existing fish pathogens Moritella viscosa and Aliivibrio wodanis. BMC Genomics 2015; 16:447. [PMID: 26059548 PMCID: PMC4462113 DOI: 10.1186/s12864-015-1669-z] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2014] [Accepted: 05/29/2015] [Indexed: 11/10/2022] Open
Abstract
Background Aliivibrio wodanis and Moritella viscosa have often been isolated concurrently from fish with winter-ulcer disease. Little is known about the interaction between the two bacterial species and how the presence of one bacterial species affects the behaviour of the other. Results The impact on bacterial growth in co-culture was investigated in vitro, and the presence of A. wodanis has an inhibitorial effect on M. viscosa. Further, we have sequenced the complete genomes of these two marine Gram-negative species, and have performed transcriptome analysis of the bacterial gene expression levels from in vivo samples. Using bacterial implants in the fish abdomen, we demonstrate that the presence of A. wodanis is altering the gene expression levels of M. viscosa compared to when the bacteria are implanted separately. Conclusions From expression profiling of the transcriptomes, it is evident that the presence of A. wodanis is altering the global gene expression of M. viscosa. Co-cultivation studies showed that A. wodanis is impeding the growth of M. viscosa, and that the inhibitorial effect is not contact-dependent. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1669-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Erik Hjerde
- Department of Chemistry, Faculty of Science and Technology, University of Tromsø, N-9037, Tromsø, Norway.
| | - Christian Karlsen
- Section of Microbiology, Immunology and Parasitology, Department of Food Safety and Infection Biology, Faculty of Veterinary Medicine and Biosciences, Norwegian University of Life Sciences, Ullevålsveien 72, Oslo, Norway.
| | - Henning Sørum
- Section of Microbiology, Immunology and Parasitology, Department of Food Safety and Infection Biology, Faculty of Veterinary Medicine and Biosciences, Norwegian University of Life Sciences, Ullevålsveien 72, Oslo, Norway.
| | - Julian Parkhill
- The Pathogen Sequencing Unit, The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | - Nils Peder Willassen
- Department of Chemistry, Faculty of Science and Technology, University of Tromsø, N-9037, Tromsø, Norway. .,The Norwegian Structural Biology Centre, University of Tromsø, N-9037, Tromsø, Norway.
| | - Nicholas R Thomson
- The Pathogen Sequencing Unit, The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| |
Collapse
|
128
|
Phelan J, Maitra A, McNerney R, Nair M, Gupta A, Coll F, Pain A, Bhakta S, Clark TG. The draft genome of Mycobacterium aurum, a potential model organism for investigating drugs against Mycobacterium tuberculosis and Mycobacterium leprae. Int J Mycobacteriol 2015; 4:207-16. [PMID: 27649868 DOI: 10.1016/j.ijmyco.2015.05.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2015] [Accepted: 05/03/2015] [Indexed: 10/23/2022] Open
Abstract
Mycobacterium aurum (M. aurum) is an environmental mycobacteria that has previously been used in studies of anti-mycobacterial drugs due to its fast growth rate and low pathogenicity. The M. aurum genome has been sequenced and assembled into 46 contigs, with a total length of 6.02Mb containing 5684 annotated protein-coding genes. A phylogenetic analysis using whole genome alignments positioned M. aurum close to Mycobacterium vaccae and Mycobacterium vanbaalenii, within a clade related to fast-growing mycobacteria. Large-scale genomic rearrangements were identified by comparing the M. aurum genome to those of Mycobacterium tuberculosis and Mycobacterium leprae. M. aurum orthologous genes implicated in resistance to anti-tuberculosis drugs in M. tuberculosis were observed. The sequence identity at the DNA level varied from 68.6% for pncA (pyrazinamide drug-related) to 96.2% for rrs (streptomycin, capreomycin). We observed two homologous genes encoding the catalase-peroxidase enzyme (katG) that is associated with resistance to isoniazid. Similarly, two embB homologues were identified in the M. aurum genome. In addition to describing for the first time the genome of M. aurum, this work provides a resource to aid the use of M. aurum in studies to develop improved drugs for the pathogenic mycobacteria M. tuberculosis and M. leprae.
Collapse
Affiliation(s)
- Jody Phelan
- Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London WC1E 7HT, United Kingdom.
| | - Arundhati Maitra
- Mycobacteria Research Laboratory, Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom
| | - Ruth McNerney
- Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London WC1E 7HT, United Kingdom
| | - Mridul Nair
- Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia
| | - Antima Gupta
- Mycobacteria Research Laboratory, Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom
| | - Francesc Coll
- Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London WC1E 7HT, United Kingdom
| | - Arnab Pain
- Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia
| | - Sanjib Bhakta
- Mycobacteria Research Laboratory, Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom
| | - Taane G Clark
- Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London WC1E 7HT, United Kingdom; Faculty of Epidemiology and Population Health, London School of Hygiene & Tropical Medicine, Keppel Street, London WC1E 7HT, United Kingdom
| |
Collapse
|
129
|
Draft genome sequences of clostridium strains native to Colombia with the potential to produce solvents. GENOME ANNOUNCEMENTS 2015; 3:3/3/e00486-15. [PMID: 25999575 PMCID: PMC4440955 DOI: 10.1128/genomea.00486-15] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Genomes from four Clostridium sp. strains considered to be mesophilic anaerobic bacteria, isolated from crop soil in Colombia, with a strong potential to produce alcohols like 1,3-propanediol, were analyzed. We present the draft genome of these strains, which will be useful for developing genetic engineering strategies.
Collapse
|
130
|
Paijmans JLA, Fickel J, Courtiol A, Hofreiter M, Förster DW. Impact of enrichment conditions on cross-species capture of fresh and degraded DNA. Mol Ecol Resour 2015; 16:42-55. [DOI: 10.1111/1755-0998.12420] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2014] [Revised: 04/21/2015] [Accepted: 04/23/2015] [Indexed: 12/12/2022]
Affiliation(s)
- Johanna L. A. Paijmans
- Department of Biology; University of York; York YO10 5DD UK
- Institute for Biochemistry and Biology; University of Potsdam; Karl-Liebknecht-Str 24-25 14476 Potsdam Germany
| | - Joerns Fickel
- Institute for Biochemistry and Biology; University of Potsdam; Karl-Liebknecht-Str 24-25 14476 Potsdam Germany
- Evolutionary Genetics Department, Leibniz-Institute for Zoo and Wildlife Research, Alfred-Kowalke-Str. 17; 10315 Berlin Germany
| | - Alexandre Courtiol
- Evolutionary Genetics Department, Leibniz-Institute for Zoo and Wildlife Research, Alfred-Kowalke-Str. 17; 10315 Berlin Germany
| | - Michael Hofreiter
- Department of Biology; University of York; York YO10 5DD UK
- Institute for Biochemistry and Biology; University of Potsdam; Karl-Liebknecht-Str 24-25 14476 Potsdam Germany
| | - Daniel W. Förster
- Evolutionary Genetics Department, Leibniz-Institute for Zoo and Wildlife Research, Alfred-Kowalke-Str. 17; 10315 Berlin Germany
| |
Collapse
|
131
|
Kwak MJ, Kwon SK, Cho SH, Kim JF. Genome sequences of the Shiga-like toxin-producing Escherichia coli NCCP15655 and NCCP15656. Gut Pathog 2015; 7:13. [PMID: 25991925 PMCID: PMC4436796 DOI: 10.1186/s13099-015-0060-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Accepted: 04/23/2015] [Indexed: 12/16/2022] Open
Abstract
Background Virulence genes can spread among commensal bacteria through horizontal gene transfer. The bacterium with novel virulence factors may pose a severe threat to public health because of the absence of a management system unlike known pathogens. Especially, when a pathogenic bacterium acquires a new kind of virulence genes, it tends to exhibit stronger virulence. In this study, we analyzed the genomes of the two strains of Escherichia coli that were isolated from the feces of patients with diarrhea and produce Shiga-like toxin. Results Phylogenetic analysis of conserved genes and average nucleotide identity values of the draft genome sequences indicate that strains NCCP15655 and NCCP15656, isolated from diarrhea patients, belong to the B1 group of E. coli and form a sister clade with strain E24377A. However, the proportion the genes belonging to the subsystem category “phages, prophages, transposable elements, plasmids” and “virulence, disease and defense” are higher than E24377A. Indeed, in their genomes, genes encoding Shiga toxin type 1, Shiga toxin type 2, and type 1 fimbriae were detected. Moreover, a plasmid encoding hemolysin and entropathogenic E. coli secreted protein C was identified in both genomes. Conclusions Through the genome analysis of NCCP15655 and NCCP15656, we identified two types of Shiga-like toxin genes that could be responsible for the manifestation of the diarrhea symptom. However, the LEE island, which is one of the major virulence factors of enterohemorrhagic E. coli, was not detected and they are most similar with non-Shiga-like toxin-producing E. coli at the genomic level. NCCP15655 and NCCP15656 will be good examples of Shiga-like toxin-producing E. coli whose genomes are not as similar with typical enterohemorrhagic E. coli as non-Shiga-like toxin-producing E. coli.
Collapse
Affiliation(s)
- Min-Jung Kwak
- Department of Systems Biology and Division of Life Sciences, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul, 120-749, Republic of Korea ; Biosystems and Bioengineering Program, University of Science and Technology, 217 Gajung-ro, Yuseong-gu, Daejeon, 305-350, Republic of Korea
| | - Soon-Kyeong Kwon
- Department of Systems Biology and Division of Life Sciences, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul, 120-749, Republic of Korea
| | - Seung-Hak Cho
- Division of Enteric Diseases, Center for Infectious Diseases, Korea National Institute of Health, Heungdeok-Gu, Cheongju, 363-951 Republic of Korea
| | - Jihyun F Kim
- Department of Systems Biology and Division of Life Sciences, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul, 120-749, Republic of Korea
| |
Collapse
|
132
|
Palser AL, Grayson NE, White RE, Corton C, Correia S, Ba Abdullah MM, Watson SJ, Cotten M, Arrand JR, Murray PG, Allday MJ, Rickinson AB, Young LS, Farrell PJ, Kellam P. Genome diversity of Epstein-Barr virus from multiple tumor types and normal infection. J Virol 2015; 89:5222-37. [PMID: 25787276 PMCID: PMC4442510 DOI: 10.1128/jvi.03614-14] [Citation(s) in RCA: 169] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Accepted: 03/08/2015] [Indexed: 02/06/2023] Open
Abstract
UNLABELLED Epstein-Barr virus (EBV) infects most of the world's population and is causally associated with several human cancers, but little is known about how EBV genetic variation might influence infection or EBV-associated disease. There are currently no published wild-type EBV genome sequences from a healthy individual and very few genomes from EBV-associated diseases. We have sequenced 71 geographically distinct EBV strains from cell lines, multiple types of primary tumor, and blood samples and the first EBV genome from the saliva of a healthy carrier. We show that the established genome map of EBV accurately represents all strains sequenced, but novel deletions are present in a few isolates. We have increased the number of type 2 EBV genomes sequenced from one to 12 and establish that the type 1/type 2 classification is a major feature of EBV genome variation, defined almost exclusively by variation of EBNA2 and EBNA3 genes, but geographic variation is also present. Single nucleotide polymorphism (SNP) density varies substantially across all known open reading frames and is highest in latency-associated genes. Some T-cell epitope sequences in EBNA3 genes show extensive variation across strains, and we identify codons under positive selection, both important considerations for the development of vaccines and T-cell therapy. We also provide new evidence for recombination between strains, which provides a further mechanism for the generation of diversity. Our results provide the first global view of EBV sequence variation and demonstrate an effective method for sequencing large numbers of genomes to further understand the genetics of EBV infection. IMPORTANCE Most people in the world are infected by Epstein-Barr virus (EBV), and it causes several human diseases, which occur at very different rates in different parts of the world and are linked to host immune system variation. Natural variation in EBV DNA sequence may be important for normal infection and for causing disease. Here we used rapid, cost-effective sequencing to determine 71 new EBV sequences from different sample types and locations worldwide. We showed geographic variation in EBV genomes and identified the most variable parts of the genome. We identified protein sequences that seem to have been selected by the host immune system and detected variability in known immune epitopes. This gives the first overview of EBV genome variation, important for designing vaccines and immune therapy for EBV, and provides techniques to investigate relationships between viral sequence variation and EBV-associated diseases.
Collapse
MESH Headings
- Amino Acid Sequence
- Antigens, Viral/genetics
- Carrier State/virology
- Cell Line, Tumor
- DNA, Viral/genetics
- Epitopes, T-Lymphocyte/genetics
- Epstein-Barr Virus Infections/virology
- Epstein-Barr Virus Nuclear Antigens/genetics
- Genetic Variation
- Genome, Viral
- Herpesvirus 4, Human/classification
- Herpesvirus 4, Human/genetics
- Herpesvirus 4, Human/isolation & purification
- Humans
- Phylogeny
- Polymorphism, Single Nucleotide
- Recombination, Genetic
- Viral Matrix Proteins/genetics
Collapse
Affiliation(s)
- Anne L Palser
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| | | | - Robert E White
- Section of Virology, Imperial College Faculty of Medicine, London, United Kingdom
| | - Craig Corton
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Samantha Correia
- Section of Virology, Imperial College Faculty of Medicine, London, United Kingdom
| | | | - Simon J Watson
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Matthew Cotten
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - John R Arrand
- School of Cancer Sciences, University of Birmingham, Birmingham, United Kingdom
| | - Paul G Murray
- School of Cancer Sciences, University of Birmingham, Birmingham, United Kingdom
| | - Martin J Allday
- Section of Virology, Imperial College Faculty of Medicine, London, United Kingdom
| | - Alan B Rickinson
- School of Cancer Sciences, University of Birmingham, Birmingham, United Kingdom
| | - Lawrence S Young
- University of Warwick, University House, Coventry, United Kingdom
| | - Paul J Farrell
- Section of Virology, Imperial College Faculty of Medicine, London, United Kingdom
| | - Paul Kellam
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom Division of Infection and Immunity, UCL, London, United Kingdom
| |
Collapse
|
133
|
Sahl JW, Allender CJ, Colman RE, Califf KJ, Schupp JM, Currie BJ, Van Zandt KE, Gelhaus HC, Keim P, Tuanyok A. Genomic characterization of Burkholderia pseudomallei isolates selected for medical countermeasures testing: comparative genomics associated with differential virulence. PLoS One 2015; 10:e0121052. [PMID: 25803742 PMCID: PMC4372212 DOI: 10.1371/journal.pone.0121052] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Accepted: 01/27/2015] [Indexed: 01/06/2023] Open
Abstract
Burkholderia pseudomallei is the causative agent of melioidosis and a potential bioterrorism agent. In the development of medical countermeasures against B. pseudomallei infection, the US Food and Drug Administration (FDA) animal Rule recommends using well-characterized strains in animal challenge studies. In this study, whole genome sequence data were generated for 6 B. pseudomallei isolates previously identified as candidates for animal challenge studies; an additional 5 isolates were sequenced that were associated with human inhalational melioidosis. A core genome single nucleotide polymorphism (SNP) phylogeny inferred from a concatenated SNP alignment from the 11 isolates sequenced in this study and a diverse global collection of isolates demonstrated the diversity of the proposed Animal Rule isolates. To understand the genomic composition of each isolate, a large-scale blast score ratio (LS-BSR) analysis was performed on the entire pan-genome; this demonstrated the variable composition of genes across the panel and also helped to identify genes unique to individual isolates. In addition, a set of ~550 genes associated with pathogenesis in B. pseudomallei were screened against the 11 sequenced genomes with LS-BSR. Differential gene distribution for 54 virulence-associated genes was observed between genomes and three of these genes were correlated with differential virulence observed in animal challenge studies using BALB/c mice. Differentially conserved genes and SNPs associated with disease severity were identified and could be the basis for future studies investigating the pathogenesis of B. pseudomallei. Overall, the genetic characterization of the 11 proposed Animal Rule isolates provides context for future studies involving B. pseudomallei pathogenesis, differential virulence, and efficacy to therapeutics.
Collapse
Affiliation(s)
- Jason W. Sahl
- Department of Pathogen Genomics, Translational Genomics Research Institute, Flagstaff, Arizona, United States of America
- Center for Microbial Genetics and Genomics, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Christopher J. Allender
- Center for Microbial Genetics and Genomics, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Rebecca E. Colman
- Department of Pathogen Genomics, Translational Genomics Research Institute, Flagstaff, Arizona, United States of America
| | - Katy J. Califf
- Center for Microbial Genetics and Genomics, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - James M. Schupp
- Department of Pathogen Genomics, Translational Genomics Research Institute, Flagstaff, Arizona, United States of America
| | - Bart J. Currie
- Department of Tropical and Emerging Infectious Diseases, Menzies School of Health Research, Casuarina NT, Australia
| | | | - H. Carl Gelhaus
- Battelle Biomedical Research Center (BBRC), Columbus, Ohio, United States of America
| | - Paul Keim
- Department of Pathogen Genomics, Translational Genomics Research Institute, Flagstaff, Arizona, United States of America
- Center for Microbial Genetics and Genomics, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Apichai Tuanyok
- Department of Tropical Medicine, Medical Microbiology and Pharmacology, and Pacific Center for Emerging Infections Diseases Research, University of Hawaii at Manoa, Honolulu, Hawaii, United States of America
| |
Collapse
|
134
|
Tsai IJ, Tanaka E, Masuya H, Tanaka R, Hirooka Y, Endoh R, Sahashi N, Kikuchi T. Comparative genomics of Taphrina fungi causing varying degrees of tumorous deformity in plants. Genome Biol Evol 2015; 6:861-72. [PMID: 24682155 PMCID: PMC4007546 DOI: 10.1093/gbe/evu067] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Taphrina fungi are biotrophic plant pathogens that cause plant deformity diseases. We sequenced the genomes of four Taphrina species—Taphrina wiesneri, T. deformans, T. flavorubra, and T. populina—which parasitize Prunus, Cerasus, and Populus hosts with varying severity of disease symptoms. High levels of gene synteny within Taphrina species were observed, and our comparative analysis further revealed that these fungi may utilize multiple strategies in coping with the host environment that are also found in some specialized dimorphic species. These include species-specific aneuploidy and clusters of highly diverged secreted proteins located at subtelomeres. We also identified species differences in plant hormone biosynthesis pathways, which may contribute to varying degree of disease symptoms. The genomes provide a rich resource for investigation into Taphrina biology and evolutionary studies across the basal ascomycetes clade.
Collapse
Affiliation(s)
- Isheng J Tsai
- Division of Parasitology, Faculty of Medicine, University of Miyazaki, Japan
| | | | | | | | | | | | | | | |
Collapse
|
135
|
Eastman AW, Yuan ZC. Development and validation of an rDNA operon based primer walking strategy applicable to de novo bacterial genome finishing. Front Microbiol 2015; 5:769. [PMID: 25653642 PMCID: PMC4301005 DOI: 10.3389/fmicb.2014.00769] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2014] [Accepted: 12/16/2014] [Indexed: 01/10/2023] Open
Abstract
Advances in sequencing technology have drastically increased the depth and feasibility of bacterial genome sequencing. However, little information is available that details the specific techniques and procedures employed during genome sequencing despite the large numbers of published genomes. Shotgun approaches employed by second-generation sequencing platforms has necessitated the development of robust bioinformatics tools for in silico assembly, and complete assembly is limited by the presence of repetitive DNA sequences and multi-copy operons. Typically, re-sequencing with multiple platforms and laborious, targeted Sanger sequencing are employed to finish a draft bacterial genome. Here we describe a novel strategy based on the identification and targeted sequencing of repetitive rDNA operons to expedite bacterial genome assembly and finishing. Our strategy was validated by finishing the genome of Paenibacillus polymyxa strain CR1, a bacterium with potential in sustainable agriculture and bio-based processes. An analysis of the 38 contigs contained in the P. polymyxa strain CR1 draft genome revealed 12 repetitive rDNA operons with varied intragenic and flanking regions of variable length, unanimously located at contig boundaries and within contig gaps. These highly similar but not identical rDNA operons were experimentally verified and sequenced simultaneously with multiple, specially designed primer sets. This approach also identified and corrected significant sequence rearrangement generated during the initial in silico assembly of sequencing reads. Our approach reduces the required effort associated with blind primer walking for contig assembly, increasing both the speed and feasibility of genome finishing. Our study further reinforces the notion that repetitive DNA elements are major limiting factors for genome finishing. Moreover, we provided a step-by-step workflow for genome finishing, which may guide future bacterial genome finishing projects.
Collapse
Affiliation(s)
- Alexander W Eastman
- Southern Crop Protection and Food Research Centre, Agriculture and Agri-Food Canada, Government of Canada London, ON, Canada ; Department of Microbiology and Immunology, Schulich School of Medicine and Dentistry, University of Western Ontario London, ON, Canada
| | - Ze-Chun Yuan
- Southern Crop Protection and Food Research Centre, Agriculture and Agri-Food Canada, Government of Canada London, ON, Canada ; Department of Microbiology and Immunology, Schulich School of Medicine and Dentistry, University of Western Ontario London, ON, Canada
| |
Collapse
|
136
|
Abstract
Shigellae cause significant diarrheal disease and mortality in humans, as there are approximately 163 million episodes of shigellosis and 1.1 million deaths annually. While significant strides have been made in the understanding of the pathogenesis, few studies on the genomic content of the Shigella species have been completed. The goal of this study was to characterize the genomic diversity of Shigella species through sequencing of 55 isolates representing members of each of the four Shigella species: S. flexneri, S. sonnei, S. boydii, and S. dysenteriae. Phylogeny inferred from 336 available Shigella and Escherichia coli genomes defined exclusive clades of Shigella; conserved genomic markers that can identify each clade were then identified. PCR assays were developed for each clade-specific marker, which was combined with an amplicon for the conserved Shigella invasion antigen, IpaH3, into a multiplex PCR assay. This assay demonstrated high specificity, correctly identifying 218 of 221 presumptive Shigella isolates, and sensitivity, by not identifying any of 151 diverse E. coli isolates incorrectly as Shigella. This new phylogenomics-based PCR assay represents a valuable tool for rapid typing of uncharacterized Shigella isolates and provides a framework that can be utilized for the identification of novel genomic markers from genomic data.
Collapse
|
137
|
Cantacessi C, Hofmann A, Campbell BE, Gasser RB. Impact of next-generation technologies on exploring socioeconomically important parasites and developing new interventions. Methods Mol Biol 2015; 1247:437-474. [PMID: 25399114 DOI: 10.1007/978-1-4939-2004-4_31] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
High-throughput molecular and computer technologies have become instrumental for systems biological explorations of pathogens, including parasites. For instance, investigations of the transcriptomes of different developmental stages of parasitic nematodes give insights into gene expression, regulation and function in a parasite, which is a significant step to understanding their biology, as well as interactions with their host(s) and disease. This chapter (1) gives a background on some key parasitic nematodes of socioeconomic importance, (2) describes sequencing and bioinformatic technologies for large-scale studies of the transcriptomes and genomes of these parasites, (3) provides some recent examples of applications and (4) emphasizes the prospects of fundamental biological explorations of parasites using these technologies for the development of new interventions to combat parasitic diseases.
Collapse
Affiliation(s)
- Cinzia Cantacessi
- Department of Veterinary and Agricultural Sciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| | | | | | | |
Collapse
|
138
|
Abstract
The development of "next-generation" high-throughput sequencing technologies has made it possible for many labs to undertake sequencing-based research projects that were unthinkable just a few years ago. Although the scientific applications are diverse, e.g., new genome projects, gene expression analysis, genome-wide functional screens, or epigenetics-the sequence data are usually processed in one of two ways: sequence reads are either mapped to an existing reference sequence, or they are built into a new sequence ("de novo assembly"). In this chapter, we first discuss some limitations of the mapping process and how these may be overcome through local sequence assembly. We then introduce the concept of de novo assembly and describe essential assembly improvement procedures such as scaffolding, contig ordering, gap closure, error evaluation, gene annotation transfer and ab initio gene annotation. The results are high-quality draft assemblies that will facilitate informative downstream analyses.
Collapse
Affiliation(s)
- Thomas D Otto
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, CB10 1SA, UK,
| |
Collapse
|
139
|
Blanc-Mathieu R, Verhelst B, Derelle E, Rombauts S, Bouget FY, Carré I, Château A, Eyre-Walker A, Grimsley N, Moreau H, Piégu B, Rivals E, Schackwitz W, Van de Peer Y, Piganeau G. An improved genome of the model marine alga Ostreococcus tauri unfolds by assessing Illumina de novo assemblies. BMC Genomics 2014; 15:1103. [PMID: 25494611 PMCID: PMC4378021 DOI: 10.1186/1471-2164-15-1103] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Accepted: 11/19/2014] [Indexed: 12/17/2022] Open
Abstract
Background Cost effective next generation sequencing technologies now enable the production of genomic datasets for many novel planktonic eukaryotes, representing an understudied reservoir of genetic diversity. O. tauri is the smallest free-living photosynthetic eukaryote known to date, a coccoid green alga that was first isolated in 1995 in a lagoon by the Mediterranean sea. Its simple features, ease of culture and the sequencing of its 13 Mb haploid nuclear genome have promoted this microalga as a new model organism for cell biology. Here, we investigated the quality of genome assemblies of Illumina GAIIx 75 bp paired-end reads from Ostreococcus tauri, thereby also improving the existing assembly and showing the genome to be stably maintained in culture. Results The 3 assemblers used, ABySS, CLCBio and Velvet, produced 95% complete genomes in 1402 to 2080 scaffolds with a very low rate of misassembly. Reciprocally, these assemblies improved the original genome assembly by filling in 930 gaps. Combined with additional analysis of raw reads and PCR sequencing effort, 1194 gaps have been solved in total adding up to 460 kb of sequence. Mapping of RNAseq Illumina data on this updated genome led to a twofold reduction in the proportion of multi-exon protein coding genes, representing 19% of the total 7699 protein coding genes. The comparison of the DNA extracted in 2001 and 2009 revealed the fixation of 8 single nucleotide substitutions and 2 deletions during the approximately 6000 generations in the lab. The deletions either knocked out or truncated two predicted transmembrane proteins, including a glutamate-receptor like gene. Conclusion High coverage (>80 fold) paired-end Illumina sequencing enables a high quality 95% complete genome assembly of a compact ~13 Mb haploid eukaryote. This genome sequence has remained stable for 6000 generations of lab culture. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-1103) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | - Gwenaël Piganeau
- CNRS, UMR 7232, Observatoire Océanologique, Avenue du Fontaulé, BP44, 66650 Banyuls-sur-Mer, France.
| |
Collapse
|
140
|
Lin K, Smit S, Bonnema G, Sanchez-Perez G, de Ridder D. Making the difference: integrating structural variation detection tools. Brief Bioinform 2014; 16:852-64. [PMID: 25504367 DOI: 10.1093/bib/bbu047] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2014] [Indexed: 01/01/2023] Open
Abstract
From prokaryotes to eukaryotes, phenotypic variation, adaptation and speciation has been associated with structural variation between genomes of individuals within the same species. Many computer algorithms detecting such variations (callers) have recently been developed, spurred by the advent of the next-generation sequencing technology. Such callers mainly exploit split-read mapping or paired-end read mapping. However, as different callers are geared towards different types of structural variation, there is still no single caller that can be considered a community standard; instead, increasingly the various callers are combined in integrated pipelines. In this article, we review a wide range of callers, discuss challenges in the integration step and present a survey of pipelines used in population genomics studies. Based on our findings, we provide general recommendations on how to set-up such pipelines. Finally, we present an outlook on future challenges in structural variation detection.
Collapse
|
141
|
Abbas MM, Malluhi QM, Balakrishnan P. Assessment of de novo assemblers for draft genomes: a case study with fungal genomes. BMC Genomics 2014; 15 Suppl 9:S10. [PMID: 25521762 PMCID: PMC4290589 DOI: 10.1186/1471-2164-15-s9-s10] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Recently, large bio-projects dealing with the release of different genomes have transpired. Most of these projects use next-generation sequencing platforms. As a consequence, many de novo assembly tools have evolved to assemble the reads generated by these platforms. Each tool has its own inherent advantages and disadvantages, which make the selection of an appropriate tool a challenging task. RESULTS We have evaluated the performance of frequently used de novo assemblers namely ABySS, IDBA-UD, Minia, SOAP, SPAdes, Sparse, and Velvet. These assemblers are assessed based on their output quality during the assembly process conducted over fungal data. We compared the performance of these assemblers by considering both computational as well as quality metrics. By analyzing these performance metrics, the assemblers are ranked and a procedure for choosing the candidate assembler is illustrated. CONCLUSIONS In this study, we propose an assessment method for the selection of de novo assemblers by considering their computational as well as quality metrics at the draft genome level. We divide the quality metrics into three groups: g1 measures the goodness of the assemblies, g2 measures the problems of the assemblies, and g3 measures the conservation elements in the assemblies. Our results demonstrate that the assemblers ABySS and IDBA-UD exhibit a good performance for the studied data from fungal genomes in terms of running time, memory, and quality. The results suggest that whole genome shotgun sequencing projects should make use of different assemblers by considering their merits.
Collapse
|
142
|
Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 2014; 9:e112963. [PMID: 25409509 PMCID: PMC4237348 DOI: 10.1371/journal.pone.0112963] [Citation(s) in RCA: 5570] [Impact Index Per Article: 557.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2014] [Accepted: 10/16/2014] [Indexed: 02/06/2023] Open
Abstract
Advances in modern sequencing technologies allow us to generate sufficient data to analyze hundreds of bacterial genomes from a single machine in a single day. This potential for sequencing massive numbers of genomes calls for fully automated methods to produce high-quality assemblies and variant calls. We introduce Pilon, a fully automated, all-in-one tool for correcting draft assemblies and calling sequence variants of multiple sizes, including very large insertions and deletions. Pilon works with many types of sequence data, but is particularly strong when supplied with paired end data from two Illumina libraries with small e.g., 180 bp and large e.g., 3–5 Kb inserts. Pilon significantly improves draft genome assemblies by correcting bases, fixing mis-assemblies and filling gaps. For both haploid and diploid genomes, Pilon produces more contiguous genomes with fewer errors, enabling identification of more biologically relevant genes. Furthermore, Pilon identifies small variants with high accuracy as compared to state-of-the-art tools and is unique in its ability to accurately identify large sequence variants including duplications and resolve large insertions. Pilon is being used to improve the assemblies of thousands of new genomes and to identify variants from thousands of clinically relevant bacterial strains. Pilon is freely available as open source software.
Collapse
Affiliation(s)
- Bruce J. Walker
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
- * E-mail: (BJW); (AME)
| | - Thomas Abeel
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
- VIB Department of Plant Systems Biology, Ghent University, Ghent, Belgium
| | - Terrance Shea
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Margaret Priest
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Amr Abouelliel
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Sharadha Sakthikumar
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Christina A. Cuomo
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Qiandong Zeng
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Jennifer Wortman
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Sarah K. Young
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Ashlee M. Earl
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
- * E-mail: (BJW); (AME)
| |
Collapse
|
143
|
Otto TD, Böhme U, Jackson AP, Hunt M, Franke-Fayard B, Hoeijmakers WAM, Religa AA, Robertson L, Sanders M, Ogun SA, Cunningham D, Erhart A, Billker O, Khan SM, Stunnenberg HG, Langhorne J, Holder AA, Waters AP, Newbold CI, Pain A, Berriman M, Janse CJ. A comprehensive evaluation of rodent malaria parasite genomes and gene expression. BMC Biol 2014; 12:86. [PMID: 25359557 PMCID: PMC4242472 DOI: 10.1186/s12915-014-0086-0] [Citation(s) in RCA: 199] [Impact Index Per Article: 19.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2014] [Accepted: 10/10/2014] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND Rodent malaria parasites (RMP) are used extensively as models of human malaria. Draft RMP genomes have been published for Plasmodium yoelii, P. berghei ANKA (PbA) and P. chabaudi AS (PcAS). Although availability of these genomes made a significant impact on recent malaria research, these genomes were highly fragmented and were annotated with little manual curation. The fragmented nature of the genomes has hampered genome wide analysis of Plasmodium gene regulation and function. RESULTS We have greatly improved the genome assemblies of PbA and PcAS, newly sequenced the virulent parasite P. yoelii YM genome, sequenced additional RMP isolates/lines and have characterized genotypic diversity within RMP species. We have produced RNA-seq data and utilised it to improve gene-model prediction and to provide quantitative, genome-wide, data on gene expression. Comparison of the RMP genomes with the genome of the human malaria parasite P. falciparum and RNA-seq mapping permitted gene annotation at base-pair resolution. Full-length chromosomal annotation permitted a comprehensive classification of all subtelomeric multigene families including the 'Plasmodium interspersed repeat genes' (pir). Phylogenetic classification of the pir family, combined with pir expression patterns, indicates functional diversification within this family. CONCLUSIONS Complete RMP genomes, RNA-seq and genotypic diversity data are excellent and important resources for gene-function and post-genomic analyses and to better interrogate Plasmodium biology. Genotypic diversity between P. chabaudi isolates makes this species an excellent parasite to study genotype-phenotype relationships. The improved classification of multigene families will enhance studies on the role of (variant) exported proteins in virulence and immune evasion/modulation.
Collapse
Affiliation(s)
- Thomas D Otto
- />Wellcome Trust Sanger Institute, Hinxton, Cambridge UK
| | - Ulrike Böhme
- />Wellcome Trust Sanger Institute, Hinxton, Cambridge UK
| | - Andrew P Jackson
- />Department of Infection Biology, Institute of Infection and Global Health, University of Liverpool, Liverpool, UK
| | - Martin Hunt
- />Wellcome Trust Sanger Institute, Hinxton, Cambridge UK
| | - Blandine Franke-Fayard
- />Leiden Malaria Research Group, Department of Parasitology, Leiden University Medical Center, Leiden, The Netherlands
| | - Wieteke A M Hoeijmakers
- />Department of Molecular Biology, Science faculty, Radboud Institute for Molecular Life Sciences, Radboud University, Nijmegen, The Netherlands
| | - Agnieszka A Religa
- />Institute of Infection, Immunity & Inflammation, School of Medical, Veterinary & Life Sciences, & Wellcome Centre for Molecular Parasitology, Glasgow Biomedical Research Centre, University of Glasgow, Glasgow, Scotland UK
| | | | - Mandy Sanders
- />Wellcome Trust Sanger Institute, Hinxton, Cambridge UK
| | - Solabomi A Ogun
- />Division of Parasitology, MRC National Institute for Medical Research, Mill Hill, London UK
| | - Deirdre Cunningham
- />Division of Parasitology, MRC National Institute for Medical Research, Mill Hill, London UK
| | - Annette Erhart
- />Unit of Malariology, Institute of Tropical Medicine, Antwerp, Belgium
| | - Oliver Billker
- />Wellcome Trust Sanger Institute, Hinxton, Cambridge UK
| | - Shahid M Khan
- />Leiden Malaria Research Group, Department of Parasitology, Leiden University Medical Center, Leiden, The Netherlands
| | - Hendrik G Stunnenberg
- />Department of Molecular Biology, Science faculty, Radboud Institute for Molecular Life Sciences, Radboud University, Nijmegen, The Netherlands
| | - Jean Langhorne
- />Division of Parasitology, MRC National Institute for Medical Research, Mill Hill, London UK
| | - Anthony A Holder
- />Division of Parasitology, MRC National Institute for Medical Research, Mill Hill, London UK
| | - Andrew P Waters
- />Institute of Infection, Immunity & Inflammation, School of Medical, Veterinary & Life Sciences, & Wellcome Centre for Molecular Parasitology, Glasgow Biomedical Research Centre, University of Glasgow, Glasgow, Scotland UK
| | - Chris I Newbold
- />Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, UK
- />Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, Headington, Oxford UK
| | - Arnab Pain
- />Biological and Environmental Sciences and Engineering (BESE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
| | | | - Chris J Janse
- />Leiden Malaria Research Group, Department of Parasitology, Leiden University Medical Center, Leiden, The Netherlands
| |
Collapse
|
144
|
McGaugh SE, Gross JB, Aken B, Blin M, Borowsky R, Chalopin D, Hinaux H, Jeffery WR, Keene A, Ma L, Minx P, Murphy D, O’Quin KE, Rétaux S, Rohner N, Searle SMJ, Stahl BA, Tabin C, Volff JN, Yoshizawa M, Warren WC. The cavefish genome reveals candidate genes for eye loss. Nat Commun 2014; 5:5307. [PMID: 25329095 PMCID: PMC4218959 DOI: 10.1038/ncomms6307] [Citation(s) in RCA: 177] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2014] [Accepted: 09/17/2014] [Indexed: 11/10/2022] Open
Abstract
Natural populations subjected to strong environmental selection pressures offer a window into the genetic underpinnings of evolutionary change. Cavefish populations, Astyanax mexicanus (Teleostei: Characiphysi), exhibit repeated, independent evolution for a variety of traits including eye degeneration, pigment loss, increased size and number of taste buds and mechanosensory organs, and shifts in many behavioural traits. Surface and cave forms are interfertile making this system amenable to genetic interrogation; however, lack of a reference genome has hampered efforts to identify genes responsible for changes in cave forms of A. mexicanus. Here we present the first de novo genome assembly for Astyanax mexicanus cavefish, contrast repeat elements to other teleost genomes, identify candidate genes underlying quantitative trait loci (QTL), and assay these candidate genes for potential functional and expression differences. We expect the cavefish genome to advance understanding of the evolutionary process, as well as, analogous human disease including retinal dysfunction.
Collapse
Affiliation(s)
- Suzanne E. McGaugh
- The Genome Institute, Washington University, Campus Box 8501, St Louis, Missouri 63108, USA
| | - Joshua B. Gross
- Department of Biological Sciences, University of Cincinnati, 711B Rieveschl Hall, 312 College Drive, Cincinnati, Ohio 45221, USA
| | - Bronwen Aken
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Maryline Blin
- DECA group, Neurobiology and Development Laboratory, CNRS-Institut de Neurobiologie Alfred Fessard, 91198 Gif-sur-Yvette, France
| | - Richard Borowsky
- Department of Biology, New York University, New York, New York 10003-6688, USA
| | - Domitille Chalopin
- Institut de Génomique Fonctionnelle de Lyon, Ecole Normale Supérieure de Lyon, CNRS, UMR 5242, UCBL, 46 allée d’Italie, Lyon F-69364, France
| | - Hélène Hinaux
- DECA group, Neurobiology and Development Laboratory, CNRS-Institut de Neurobiologie Alfred Fessard, 91198 Gif-sur-Yvette, France
| | - William R. Jeffery
- Department of Biology, University of Maryland, College Park, Maryland 20742, USA
| | - Alex Keene
- Department of Biology, University of Nevada, Reno, Nevada 89557, USA
| | - Li Ma
- Department of Biology, University of Maryland, College Park, Maryland 20742, USA
| | - Patrick Minx
- The Genome Institute, Washington University, Campus Box 8501, St Louis, Missouri 63108, USA
| | - Daniel Murphy
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Kelly E. O’Quin
- Department of Biology, Centre College, 600 West Walnut St, Danville, Kentucky 40422, USA
| | - Sylvie Rétaux
- DECA group, Neurobiology and Development Laboratory, CNRS-Institut de Neurobiologie Alfred Fessard, 91198 Gif-sur-Yvette, France
| | - Nicolas Rohner
- Harvard Medical School Department of Genetics, 77 Avenue Louis Pasteur; NRB 360, Boston, Massachusetts 02115, USA
| | - Steve M. J. Searle
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Bethany A. Stahl
- Department of Biological Sciences, University of Cincinnati, 711B Rieveschl Hall, 312 College Drive, Cincinnati, Ohio 45221, USA
| | - Cliff Tabin
- Harvard Medical School Department of Genetics, 77 Avenue Louis Pasteur; NRB 360, Boston, Massachusetts 02115, USA
| | - Jean-Nicolas Volff
- Institut de Génomique Fonctionnelle de Lyon, Ecole Normale Supérieure de Lyon, CNRS, UMR 5242, UCBL, 46 allée d’Italie, Lyon F-69364, France
| | - Masato Yoshizawa
- Department of Biology, University of Nevada, Reno, Nevada 89557, USA
| | - Wesley C. Warren
- The Genome Institute, Washington University, Campus Box 8501, St Louis, Missouri 63108, USA
| |
Collapse
|
145
|
Genome Sequence of Coxiella burnetii Strain AuQ01 (Arandale) from an Australian Patient with Acute Q Fever. GENOME ANNOUNCEMENTS 2014; 2:2/5/e00964-14. [PMID: 25278528 PMCID: PMC4183872 DOI: 10.1128/genomea.00964-14] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Coxiella burnetii strain AuQ01 was isolated from the serum of an Australian acute Q fever patient and represents the first whole genome from this historical Q fever country. This new genome shows distinct differences from existing genomic data and will enhance the understanding of this query pathogen.
Collapse
|
146
|
Bao E, Jiang T, Girke T. AlignGraph: algorithm for secondary de novo genome assembly guided by closely related references. Bioinformatics 2014; 30:i319-i328. [PMID: 24932000 PMCID: PMC4058956 DOI: 10.1093/bioinformatics/btu291] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Motivation:De novo assemblies of genomes remain one of the most challenging applications in next-generation sequencing. Usually, their results are incomplete and fragmented into hundreds of contigs. Repeats in genomes and sequencing errors are the main reasons for these complications. With the rapidly growing number of sequenced genomes, it is now feasible to improve assemblies by guiding them with genomes from related species. Results: Here we introduce AlignGraph, an algorithm for extending and joining de novo-assembled contigs or scaffolds guided by closely related reference genomes. It aligns paired-end (PE) reads and preassembled contigs or scaffolds to a close reference. From the obtained alignments, it builds a novel data structure, called the PE multipositional de Bruijn graph. The incorporated positional information from the alignments and PE reads allows us to extend the initial assemblies, while avoiding incorrect extensions and early terminations. In our performance tests, AlignGraph was able to substantially improve the contigs and scaffolds from several assemblers. For instance, 28.7–62.3% of the contigs of Arabidopsis thaliana and human could be extended, resulting in improvements of common assembly metrics, such as an increase of the N50 of the extendable contigs by 89.9–94.5% and 80.3–165.8%, respectively. In another test, AlignGraph was able to improve the assembly of a published genome (Arabidopsis strain Landsberg) by increasing the N50 of its extendable scaffolds by 86.6%. These results demonstrate AlignGraph’s efficiency in improving genome assemblies by taking advantage of closely related references. Availability and implementation: The AlignGraph software can be downloaded for free from this site: https://github.com/baoe/AlignGraph. Contact:thomas.girke@ucr.edu
Collapse
Affiliation(s)
- Ergude Bao
- Department of Computer Science and Engineering and Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA
| | - Tao Jiang
- Department of Computer Science and Engineering and Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA
| | - Thomas Girke
- Department of Computer Science and Engineering and Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA
| |
Collapse
|
147
|
Otto TD, Rayner JC, Böhme U, Pain A, Spottiswoode N, Sanders M, Quail M, Ollomo B, Renaud F, Thomas AW, Prugnolle F, Conway DJ, Newbold C, Berriman M. Genome sequencing of chimpanzee malaria parasites reveals possible pathways of adaptation to human hosts. Nat Commun 2014; 5:4754. [PMID: 25203297 PMCID: PMC4166903 DOI: 10.1038/ncomms5754] [Citation(s) in RCA: 114] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2014] [Accepted: 07/18/2014] [Indexed: 01/19/2023] Open
Abstract
Plasmodium falciparum causes most human malaria deaths, having prehistorically evolved from parasites of African Great Apes. Here we explore the genomic basis of P. falciparum adaptation to human hosts by fully sequencing the genome of the closely related chimpanzee parasite species P. reichenowi, and obtaining partial sequence data from a more distantly related chimpanzee parasite (P. gaboni). The close relationship between P. reichenowi and P. falciparum is emphasized by almost complete conservation of genomic synteny, but against this strikingly conserved background we observe major differences at loci involved in erythrocyte invasion. The organization of most virulence-associated multigene families, including the hypervariable var genes, is broadly conserved, but P. falciparum has a smaller subset of rif and stevor genes whose products are expressed on the infected erythrocyte surface. Genome-wide analysis identifies other loci under recent positive selection, but a limited number of changes at the host-parasite interface may have mediated host switching.
Collapse
Affiliation(s)
- Thomas D. Otto
- Parasite Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, UK
| | - Julian C. Rayner
- Parasite Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, UK
| | - Ulrike Böhme
- Parasite Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, UK
| | - Arnab Pain
- Parasite Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, UK
- Biological and Environmental Sciences and Engineering (BESE) Division, Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia
| | - Natasha Spottiswoode
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, Maryland 20852, USA
- Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DS, UK
| | - Mandy Sanders
- Parasite Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, UK
| | - Michael Quail
- Parasite Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, UK
| | - Benjamin Ollomo
- Centre International de Recherches Médicales de Franceville, CIRMF, BP 769 Franceville, Gabon
| | - François Renaud
- Laboratoire MIVEGEC, UMR 5290 CNRS-IRD-UMI-UMII, IRD, BP 64501, 34394 Montpellier, France
| | - Alan W. Thomas
- Biomedical Primate Research Centre, Department of Parasitology, 2280 GH Rijswijk, The Netherlands
| | - Franck Prugnolle
- Centre International de Recherches Médicales de Franceville, CIRMF, BP 769 Franceville, Gabon
- Laboratoire MIVEGEC, UMR 5290 CNRS-IRD-UMI-UMII, IRD, BP 64501, 34394 Montpellier, France
| | - David J. Conway
- Department of Pathogen Molecular Biology, London School of Hygiene & Tropical Medicine, London WC1E 7HT, UK
| | - Chris Newbold
- Parasite Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, UK
- Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DS, UK
- These authors contributed equally to this work
| | - Matthew Berriman
- Parasite Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, UK
- These authors contributed equally to this work
| |
Collapse
|
148
|
Reid AJ, Blake DP, Ansari HR, Billington K, Browne HP, Bryant J, Dunn M, Hung SS, Kawahara F, Miranda-Saavedra D, Malas TB, Mourier T, Naghra H, Nair M, Otto TD, Rawlings ND, Rivailler P, Sanchez-Flores A, Sanders M, Subramaniam C, Tay YL, Woo Y, Wu X, Barrell B, Dear PH, Doerig C, Gruber A, Ivens AC, Parkinson J, Rajandream MA, Shirley MW, Wan KL, Berriman M, Tomley FM, Pain A. Genomic analysis of the causative agents of coccidiosis in domestic chickens. Genome Res 2014; 24:1676-85. [PMID: 25015382 PMCID: PMC4199364 DOI: 10.1101/gr.168955.113] [Citation(s) in RCA: 150] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Global production of chickens has trebled in the past two decades and they are now the most important source of dietary animal protein worldwide. Chickens are subject to many infectious diseases that reduce their performance and productivity. Coccidiosis, caused by apicomplexan protozoa of the genus Eimeria, is one of the most important poultry diseases. Understanding the biology of Eimeria parasites underpins development of new drugs and vaccines needed to improve global food security. We have produced annotated genome sequences of all seven species of Eimeria that infect domestic chickens, which reveal the full extent of previously described repeat-rich and repeat-poor regions and show that these parasites possess the most repeat-rich proteomes ever described. Furthermore, while no other apicomplexan has been found to possess retrotransposons, Eimeria is home to a family of chromoviruses. Analysis of Eimeria genes involved in basic biology and host-parasite interaction highlights adaptations to a relatively simple developmental life cycle and a complex array of co-expressed surface proteins involved in host cell binding.
Collapse
Affiliation(s)
- Adam J Reid
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom
| | - Damer P Blake
- Royal Veterinary College, North Mymms, Hertfordshire AL9 7TA, United Kingdom; The Pirbright Institute, Compton Laboratory, Newbury, Berkshire RG20 7NN, United Kingdom
| | - Hifzur R Ansari
- Computational Bioscience Research Center, Biological Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Jeddah 23955-6900, Kingdom of Saudi Arabia
| | - Karen Billington
- The Pirbright Institute, Compton Laboratory, Newbury, Berkshire RG20 7NN, United Kingdom
| | - Hilary P Browne
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom
| | - Josephine Bryant
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom
| | - Matt Dunn
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom
| | - Stacy S Hung
- Program in Molecular Structure and Function, Hospital for Sick Children and Departments of Biochemistry and Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1X8, Canada
| | - Fumiya Kawahara
- Nippon Institute for Biological Science, Ome, Tokyo 198-0024, Japan
| | - Diego Miranda-Saavedra
- Fibrosis Laboratories, Institute of Cellular Medicine, Newcastle University Medical School, Framlington Place, Newcastle upon Tyne NE2 4HH, United Kingdom
| | - Tareq B Malas
- Computational Bioscience Research Center, Biological Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Jeddah 23955-6900, Kingdom of Saudi Arabia
| | - Tobias Mourier
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, 1350 Copenhagen, Denmark
| | - Hardeep Naghra
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom; School of Life Sciences, Centre for Biomolecular Sciences, University of Nottingham, Nottingham NG7 2RD, United Kingdom
| | - Mridul Nair
- Computational Bioscience Research Center, Biological Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Jeddah 23955-6900, Kingdom of Saudi Arabia
| | - Thomas D Otto
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom
| | - Neil D Rawlings
- European Bioinformatics Institute, Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom
| | - Pierre Rivailler
- The Pirbright Institute, Compton Laboratory, Newbury, Berkshire RG20 7NN, United Kingdom; Division of Viral Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia 30333, USA
| | - Alejandro Sanchez-Flores
- Unidad Universitaria de Apoyo Bioinformático, Institute of Biotechnology, Universidad Nacional Autónoma de México, Cuernavaca, Morelos 62210, Mexico
| | - Mandy Sanders
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom
| | - Chandra Subramaniam
- The Pirbright Institute, Compton Laboratory, Newbury, Berkshire RG20 7NN, United Kingdom
| | - Yea-Ling Tay
- School of Biosciences and Biotechnology, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor DE, Malaysia; Malaysia Genome Institute, Jalan Bangi, 43000 Kajang, Selangor DE, Malaysia
| | - Yong Woo
- Computational Bioscience Research Center, Biological Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Jeddah 23955-6900, Kingdom of Saudi Arabia
| | - Xikun Wu
- The Pirbright Institute, Compton Laboratory, Newbury, Berkshire RG20 7NN, United Kingdom; Amgen Limited, Uxbridge UB8 1DH, United Kingdom
| | - Bart Barrell
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom
| | - Paul H Dear
- MRC Laboratory of Molecular Biology, Cambridge CB2 0QH, United Kingdom
| | - Christian Doerig
- Department of Microbiology, Monash University, Clayton VIC 3800, Australia
| | - Arthur Gruber
- Departament of Parasitology, Institute of Biomedical Sciences, University of São Paulo, São Paulo, SP 05508-000, Brazil
| | - Alasdair C Ivens
- Centre for Immunity, Infection and Evolution, Ashworth Laboratories, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
| | - John Parkinson
- Program in Molecular Structure and Function, Hospital for Sick Children and Departments of Biochemistry and Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1X8, Canada
| | - Marie-Adèle Rajandream
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom
| | - Martin W Shirley
- The Pirbright Institute, Pirbright Laboratory, Pirbright, Surrey GU24 0NF, United Kingdom
| | - Kiew-Lian Wan
- School of Biosciences and Biotechnology, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor DE, Malaysia; Malaysia Genome Institute, Jalan Bangi, 43000 Kajang, Selangor DE, Malaysia
| | - Matthew Berriman
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom
| | - Fiona M Tomley
- Royal Veterinary College, North Mymms, Hertfordshire AL9 7TA, United Kingdom; The Pirbright Institute, Compton Laboratory, Newbury, Berkshire RG20 7NN, United Kingdom;
| | - Arnab Pain
- Computational Bioscience Research Center, Biological Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Jeddah 23955-6900, Kingdom of Saudi Arabia;
| |
Collapse
|
149
|
Draft Genome Sequence of the Oleaginous Yeast Yarrowia lipolytica PO1f, a Commonly Used Metabolic Engineering Host. GENOME ANNOUNCEMENTS 2014; 2:2/4/e00652-14. [PMID: 24994799 PMCID: PMC4081999 DOI: 10.1128/genomea.00652-14] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The draft genome sequence of the oleaginous yeast Yarrowia lipolytica stain PO1f, a commonly used metabolic engineering host, is presented here. The approximately 20.3-Mb genome sequence of PO1f will greatly facilitate research efforts in metabolic engineering of Yarrowia lipolytica for value-added chemical production.
Collapse
|
150
|
Ekblom R, Wolf JBW. A field guide to whole-genome sequencing, assembly and annotation. Evol Appl 2014; 7:1026-42. [PMID: 25553065 PMCID: PMC4231593 DOI: 10.1111/eva.12178] [Citation(s) in RCA: 188] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2014] [Accepted: 05/20/2014] [Indexed: 12/12/2022] Open
Abstract
Genome sequencing projects were long confined to biomedical model organisms and required the concerted effort of large consortia. Rapid progress in high-throughput sequencing technology and the simultaneous development of bioinformatic tools have democratized the field. It is now within reach for individual research groups in the eco-evolutionary and conservation community to generate de novo draft genome sequences for any organism of choice. Because of the cost and considerable effort involved in such an endeavour, the important first step is to thoroughly consider whether a genome sequence is necessary for addressing the biological question at hand. Once this decision is taken, a genome project requires careful planning with respect to the organism involved and the intended quality of the genome draft. Here, we briefly review the state of the art within this field and provide a step-by-step introduction to the workflow involved in genome sequencing, assembly and annotation with particular reference to large and complex genomes. This tutorial is targeted at scientists with a background in conservation genetics, but more generally, provides useful practical guidance for researchers engaging in whole-genome sequencing projects.
Collapse
Affiliation(s)
- Robert Ekblom
- Department of Evolutionary Biology, Uppsala University Uppsala, Sweden
| | - Jochen B W Wolf
- Department of Evolutionary Biology, Uppsala University Uppsala, Sweden
| |
Collapse
|