1
|
Bush ZD, Naftaly AFS, Dinwiddie D, Albers C, Hillers KJ, Libuda DE. Comprehensive detection of structural variation and transposable element differences between wild type laboratory lineages of C. elegans. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.13.523974. [PMID: 37961628 PMCID: PMC10634987 DOI: 10.1101/2023.01.13.523974] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Genomic structural variations (SVs) and transposable elements (TEs) can be significant contributors to genome evolution, altered gene expression, and risk of genetic diseases. Recent advancements in long-read sequencing have greatly improved the quality of de novo genome assemblies and enhanced the detection of sequence variants at the scale of hundreds or thousands of bases. Comparisons between two diverged wild isolates of Caenorhabditis elegans, the Bristol and Hawaiian strains, have been widely utilized in the analysis of small genetic variations. Genetic drift, including SVs and rearrangements of repeated sequences such as TEs, can occur over time from long-term maintenance of wild type isolates within the laboratory. To comprehensively detect both large and small structural variations as well as TEs due to genetic drift, we generated de novo genome assemblies and annotations for each strain from our lab collection using both long- and short-read sequencing and compared our assemblies and annotations with that of other lab wild type strains. Within our lab assemblies, we annotate over 3.1Mb of sequence divergence between the Bristol and Hawaiian isolates: 337,584 SNPs, 94,503 small insertion-deletions (<50bp), and 4,334 structural variations (>50bp). Further, we define the location and movement of specific DNA TEs between N2 Bristol and CB4856 Hawaiian wild type isolates. Specifically, we find the N2 Bristol genome has 20.6% more TEs from the Tc1/mariner family than the CB4856 Hawaiian genome. Moreover, we identified Zator elements as the most abundant and mobile TE family in the genome. Using specific TE sequences with unique SNPs, we also identify 38 TEs that moved intrachromosomally and 9 TEs that moved interchromosomally between the N2 Bristol and CB4856 Hawaiian genomes. By comparing the de novo genome assembly of our lab collection Bristol isolate to the VC2010 Bristol assembly, we also reveal that lab lineages display over 2 Mb of total variation: 1,162 SNPs, 1,528 indels, and 897 SVs with 95% of the variation due to SVs. Overall, our work demonstrates the unique contribution of SVs and TEs to variation and genetic drift between wild type laboratory strains assumed to be isogenic despite growing evidence of genetic drift and phenotypic variation.
Collapse
Affiliation(s)
- Zachary D. Bush
- Institute of Molecular Biology, Department of Biology, University of Oregon, 1229 Franklin Blvd Eugene, OR 97403, USA
| | - Alice F. S. Naftaly
- Institute of Molecular Biology, Department of Biology, University of Oregon, 1229 Franklin Blvd Eugene, OR 97403, USA
| | - Devin Dinwiddie
- Institute of Molecular Biology, Department of Biology, University of Oregon, 1229 Franklin Blvd Eugene, OR 97403, USA
| | - Cora Albers
- Institute of Molecular Biology, Department of Biology, University of Oregon, 1229 Franklin Blvd Eugene, OR 97403, USA
| | - Kenneth J. Hillers
- Biological Sciences Department, California Polytechnic State University, San Luis Obispo, California, USA
| | - Diana E. Libuda
- Institute of Molecular Biology, Department of Biology, University of Oregon, 1229 Franklin Blvd Eugene, OR 97403, USA
| |
Collapse
|
2
|
Moya ND, Stevens L, Miller IR, Sokol CE, Galindo JL, Bardas AD, Koh ESH, Rozenich J, Yeo C, Xu M, Andersen EC. Novel and improved Caenorhabditis briggsae gene models generated by community curation. BMC Genomics 2023; 24:486. [PMID: 37626289 PMCID: PMC10463891 DOI: 10.1186/s12864-023-09582-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Accepted: 08/12/2023] [Indexed: 08/27/2023] Open
Abstract
BACKGROUND The nematode Caenorhabditis briggsae has been used as a model in comparative genomics studies with Caenorhabditis elegans because of their striking morphological and behavioral similarities. However, the potential of C. briggsae for comparative studies is limited by the quality of its genome resources. The genome resources for the C. briggsae laboratory strain AF16 have not been developed to the same extent as C. elegans. The recent publication of a new chromosome-level reference genome for QX1410, a C. briggsae wild strain closely related to AF16, has provided the first step to bridge the gap between C. elegans and C. briggsae genome resources. Currently, the QX1410 gene models consist of software-derived gene predictions that contain numerous errors in their structure and coding sequences. In this study, a team of researchers manually inspected over 21,000 gene models and underlying transcriptomic data to repair software-derived errors. RESULTS We designed a detailed workflow to train a team of nine students to manually curate gene models using RNA read alignments. We manually inspected the gene models, proposed corrections to the coding sequences of over 8,000 genes, and modeled thousands of putative isoforms and untranslated regions. We exploited the conservation of protein sequence length between C. briggsae and C. elegans to quantify the improvement in protein-coding gene model quality and showed that manual curation led to substantial improvements in the protein sequence length accuracy of QX1410 genes. Additionally, collinear alignment analysis between the QX1410 and AF16 genomes revealed over 1,800 genes affected by spurious duplications and inversions in the AF16 genome that are now resolved in the QX1410 genome. CONCLUSIONS Community-based, manual curation using transcriptome data is an effective approach to improve the quality of software-derived protein-coding genes. The detailed protocols provided in this work can be useful for future large-scale manual curation projects in other species. Our manual curation efforts have brought the QX1410 gene models to a comparable level of quality as the extensively curated AF16 gene models. The improved genome resources for C. briggsae provide reliable tools for the study of Caenorhabditis biology and other related nematodes.
Collapse
Affiliation(s)
- Nicolas D Moya
- Department of Molecular Biosciences, Northwestern University, 4619 Silverman Hall 2205 Tech Drive, Evanston, IL, 60208, USA
- Interdisciplinary Biological Sciences Program, Northwestern University, Evanston, IL, 60208, USA
| | - Lewis Stevens
- Department of Molecular Biosciences, Northwestern University, 4619 Silverman Hall 2205 Tech Drive, Evanston, IL, 60208, USA
- Tree of Life, Wellcome Sanger Institute, Cambridge, UK
| | - Isabella R Miller
- Department of Molecular Biosciences, Northwestern University, 4619 Silverman Hall 2205 Tech Drive, Evanston, IL, 60208, USA
| | - Chloe E Sokol
- Department of Molecular Biosciences, Northwestern University, 4619 Silverman Hall 2205 Tech Drive, Evanston, IL, 60208, USA
| | - Joseph L Galindo
- Department of Molecular Biosciences, Northwestern University, 4619 Silverman Hall 2205 Tech Drive, Evanston, IL, 60208, USA
| | - Alexandra D Bardas
- Department of Molecular Biosciences, Northwestern University, 4619 Silverman Hall 2205 Tech Drive, Evanston, IL, 60208, USA
| | - Edward S H Koh
- Department of Molecular Biosciences, Northwestern University, 4619 Silverman Hall 2205 Tech Drive, Evanston, IL, 60208, USA
| | - Justine Rozenich
- Department of Molecular Biosciences, Northwestern University, 4619 Silverman Hall 2205 Tech Drive, Evanston, IL, 60208, USA
| | - Cassia Yeo
- Department of Molecular Biosciences, Northwestern University, 4619 Silverman Hall 2205 Tech Drive, Evanston, IL, 60208, USA
| | - Maryanne Xu
- Department of Molecular Biosciences, Northwestern University, 4619 Silverman Hall 2205 Tech Drive, Evanston, IL, 60208, USA
| | - Erik C Andersen
- Department of Molecular Biosciences, Northwestern University, 4619 Silverman Hall 2205 Tech Drive, Evanston, IL, 60208, USA.
| |
Collapse
|
3
|
Ragipani B, Albritton SE, Morao AK, Mesquita D, Kramer M, Ercan S. Increased gene dosage and mRNA expression from chromosomal duplications in Caenorhabditis elegans. G3 (BETHESDA, MD.) 2022; 12:jkac151. [PMID: 35731207 PMCID: PMC9339279 DOI: 10.1093/g3journal/jkac151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Accepted: 06/09/2022] [Indexed: 11/14/2022]
Abstract
Isolation of copy number variations and chromosomal duplications at high frequency in the laboratory suggested that Caenorhabditis elegans tolerates increased gene dosage. Here, we addressed if a general dosage compensation mechanism acts at the level of mRNA expression in C. elegans. We characterized gene dosage and mRNA expression in 3 chromosomal duplications and a fosmid integration strain using DNA-seq and mRNA-seq. Our results show that on average, increased gene dosage leads to increased mRNA expression, pointing to a lack of genome-wide dosage compensation. Different genes within the same chromosomal duplication show variable levels of mRNA increase, suggesting feedback regulation of individual genes. Somatic dosage compensation and germline repression reduce the level of mRNA increase from X chromosomal duplications. Together, our results show a lack of genome-wide dosage compensation mechanism acting at the mRNA level in C. elegans and highlight the role of epigenetic and individual gene regulation contributing to the varied consequences of increased gene dosage.
Collapse
Affiliation(s)
- Bhavana Ragipani
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
| | - Sarah Elizabeth Albritton
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
| | - Ana Karina Morao
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
| | - Diogo Mesquita
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
| | - Maxwell Kramer
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
| | - Sevinç Ercan
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
| |
Collapse
|
4
|
Asif MZ, Van der Gaag VL, Guo J, Nocilla KA, Muzio CJ, Edison AS. A Plate Based Assay for Determination of the Median Lethal Dose of 1-Hydroxyphenazine in Caenorhabditis elegans. MICROPUBLICATION BIOLOGY 2021; 2021. [PMID: 33458604 PMCID: PMC7807258 DOI: 10.17912/micropub.biology.000352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Caenorhabditiselegans is an ideal model organism for studying the xenobiotic detoxification pathways of various natural and synthetic toxins. We developed a new workflow to study the effects of 1-hydroxyphenazine (1-HP), a toxin produced by the bacterium Pseudomonas aeruginosa, on the survival of C. elegans. Prior research has demonstrated that C. elegans can detoxify 1-HP through the general mechanism of O-glycosylation. As part of the Vertically Integrated Projects (VIP) undergraduate research team, we have developed a workflow for a plate-based toxicity assay to determine the median lethal dose (LD50) of 1-HP. This was achieved through a toxin exposure method in which the worms were exposed to varying concentrations of 1-HP. The death rates were measured using a fluorescent bead assay. This workflow can be used to test C. elegans responses to different toxins and also the response of different mutant strains to a toxin of interest.
Collapse
Affiliation(s)
- Muhammad Zaka Asif
- Department of Biochemistry & Molecular Biology and Complex Carbohydrate Research Center, University of Georgia
| | - Victoria L Van der Gaag
- Department of Biochemistry & Molecular Biology and Complex Carbohydrate Research Center, University of Georgia
| | - Jane Guo
- Department of Biochemistry & Molecular Biology and Complex Carbohydrate Research Center, University of Georgia
| | - Kelsey A Nocilla
- Department of Biochemistry & Molecular Biology and Complex Carbohydrate Research Center, University of Georgia
| | - Cole J Muzio
- Department of Biochemistry & Molecular Biology and Complex Carbohydrate Research Center, University of Georgia
| | - Arthur S Edison
- Department of Biochemistry & Molecular Biology and Complex Carbohydrate Research Center, University of Georgia.,Department of Genetics.,Institute of Bioinformatics
| |
Collapse
|
5
|
Zhao Y, Long L, Wan J, Biliya S, Brady SC, Lee D, Ojemakinde A, Andersen EC, Vannberg FO, Lu H, McGrath PT. A spontaneous complex structural variant in rcan-1 increases exploratory behavior and laboratory fitness of Caenorhabditis elegans. PLoS Genet 2020; 16:e1008606. [PMID: 32092052 PMCID: PMC7058356 DOI: 10.1371/journal.pgen.1008606] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Revised: 03/05/2020] [Accepted: 01/11/2020] [Indexed: 01/02/2023] Open
Abstract
Over long evolutionary timescales, major changes to the copy number, function, and genomic organization of genes occur, however, our understanding of the individual mutational events responsible for these changes is lacking. In this report, we study the genetic basis of adaptation of two strains of C. elegans to laboratory food sources using competition experiments on a panel of 89 recombinant inbred lines (RIL). Unexpectedly, we identified a single RIL with higher relative fitness than either of the parental strains. This strain also displayed a novel behavioral phenotype, resulting in higher propensity to explore bacterial lawns. Using bulk-segregant analysis and short-read resequencing of this RIL, we mapped the change in exploration behavior to a spontaneous, complex rearrangement of the rcan-1 gene that occurred during construction of the RIL panel. We resolved this rearrangement into five unique tandem inversion/duplications using Oxford Nanopore long-read sequencing. rcan-1 encodes an ortholog to human RCAN1/DSCR1 calcipressin gene, which has been implicated as a causal gene for Down syndrome. The genomic rearrangement in rcan-1 creates two complete and two truncated versions of the rcan-1 coding region, with a variety of modified 5’ and 3’ non-coding regions. While most copy-number variations (CNVs) are thought to act by increasing expression of duplicated genes, these changes to rcan-1 ultimately result in the reduction of its whole-body expression due to changes in the upstream regions. By backcrossing this rearrangement into a common genetic background to create a near isogenic line (NIL), we demonstrate that both the competitive advantage and exploration behavioral changes are linked to this complex genetic variant. This NIL strain does not phenocopy a strain containing an rcan-1 loss-of-function allele, which suggests that the residual expression of rcan-1 is necessary for its fitness effects. Our results demonstrate how colonization of new environments, such as those encountered in the laboratory, can create evolutionary pressure to modify gene function. This evolutionary mismatch can be resolved by an unexpectedly complex genetic change that simultaneously duplicates and diversifies a gene into two uniquely regulated genes. Our work shows how complex rearrangements can act to modify gene expression in ways besides increased gene dosage. Evolution acts on genetic variants that modify phenotypes that increase the likelihood of staying alive and passing on these genetic changes to subsequent generations (i.e. fitness). There is general interest in understanding the types of genetic variants that can increase fitness in specific environments. One route that fitness can be increased is through changes in behavior, such as finding new food sources. Here, we identify a spontaneous genetic change that increases exploration behavior and fitness of animals in laboratory environments. Interestingly, this genetic change is not a simple genetic change that deletes or changes the sequence of a protein product, but rather a complex structural variant that simultaneously duplicates the rcan-1 gene and also modifies its expression in a number of tissues. Our work demonstrates how a complex structural change can duplicate a gene, modify the DNA control regions that determine its cellular sites of action, and confer a fitness advantage that could lead to its spread in a population.
Collapse
Affiliation(s)
- Yuehui Zhao
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Lijiang Long
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, United States of America
- Interdisciplinary Graduate Program in Quantitative Biosciences, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Jason Wan
- The Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, Georgia, United States of America
| | - Shweta Biliya
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Shannon C. Brady
- Department of Molecular Biosciences, Northwestern University, Evanston, Illinois, United States of America
| | - Daehan Lee
- Department of Molecular Biosciences, Northwestern University, Evanston, Illinois, United States of America
| | - Akinade Ojemakinde
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Erik C. Andersen
- Department of Molecular Biosciences, Northwestern University, Evanston, Illinois, United States of America
| | - Fredrik O. Vannberg
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, United States of America
- Parker H. Petit Institute of Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Hang Lu
- Parker H. Petit Institute of Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia, United States of America
- School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Patrick T. McGrath
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, United States of America
- Interdisciplinary Graduate Program in Quantitative Biosciences, Georgia Institute of Technology, Atlanta, Georgia, United States of America
- School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, Georgia, United States of America
- School of Physics, Georgia Institute of Technology, Atlanta, Georgia, United States of America
- * E-mail:
| |
Collapse
|
6
|
Yoshimura J, Ichikawa K, Shoura MJ, Artiles KL, Gabdank I, Wahba L, Smith CL, Edgley ML, Rougvie AE, Fire AZ, Morishita S, Schwarz EM. Recompleting the Caenorhabditis elegans genome. Genome Res 2019; 29:1009-1022. [PMID: 31123080 PMCID: PMC6581061 DOI: 10.1101/gr.244830.118] [Citation(s) in RCA: 83] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Accepted: 03/11/2019] [Indexed: 01/14/2023]
Abstract
Caenorhabditis elegans was the first multicellular eukaryotic genome sequenced to apparent completion. Although this assembly employed a standard C. elegans strain (N2), it used sequence data from several laboratories, with DNA propagated in bacteria and yeast. Thus, the N2 assembly has many differences from any C. elegans available today. To provide a more accurate C. elegans genome, we performed long-read assembly of VC2010, a modern strain derived from N2. Our VC2010 assembly has 99.98% identity to N2 but with an additional 1.8 Mb including tandem repeat expansions and genome duplications. For 116 structural discrepancies between N2 and VC2010, 97 structures matching VC2010 (84%) were also found in two outgroup strains, implying deficiencies in N2. Over 98% of N2 genes encoded unchanged products in VC2010; moreover, we predicted ≥53 new genes in VC2010. The recompleted genome of C. elegans should be a valuable resource for genetics, genomics, and systems biology.
Collapse
Affiliation(s)
- Jun Yoshimura
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba 277-8583, Japan
| | - Kazuki Ichikawa
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba 277-8583, Japan
| | - Massa J Shoura
- Department of Pathology, Stanford University, Stanford, California 94305, USA
| | - Karen L Artiles
- Department of Pathology, Stanford University, Stanford, California 94305, USA
| | - Idan Gabdank
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Lamia Wahba
- Department of Pathology, Stanford University, Stanford, California 94305, USA
| | - Cheryl L Smith
- Department of Pathology, Stanford University, Stanford, California 94305, USA.,Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Mark L Edgley
- Department of Zoology and Michael Smith Laboratories, University of British Columbia, Vancouver V6T 1Z3, British Columbia, Canada
| | - Ann E Rougvie
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, Minnesota 55454, USA
| | - Andrew Z Fire
- Department of Pathology, Stanford University, Stanford, California 94305, USA.,Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Shinichi Morishita
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba 277-8583, Japan
| | - Erich M Schwarz
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| |
Collapse
|
7
|
The laboratory domestication of Caenorhabditis elegans. Trends Genet 2015; 31:224-31. [PMID: 25804345 DOI: 10.1016/j.tig.2015.02.009] [Citation(s) in RCA: 127] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2015] [Revised: 02/19/2015] [Accepted: 02/20/2015] [Indexed: 12/17/2022]
Abstract
Model organisms are of great importance to our understanding of basic biology and to making advances in biomedical research. However, the influence of laboratory cultivation on these organisms is underappreciated, and especially how that environment can affect research outcomes. Recent experiments led to insights into how the widely used laboratory reference strain of the nematode Caenorhabditis elegans compares with natural strains. Here we describe potential selective pressures that led to the fixation of laboratory-derived alleles for the genes npr-1, glb-5, and nath-10. These alleles influence a large number of traits, resulting in behaviors that affect experimental interpretations. Furthermore, strong phenotypic effects caused by these laboratory-derived alleles hinder the discovery of natural alleles. We highlight strategies to reduce the influence of laboratory-derived alleles and to harness the full power of C. elegans.
Collapse
|
8
|
Vergara IA, Tarailo-Graovac M, Frech C, Wang J, Qin Z, Zhang T, She R, Chu JSC, Wang K, Chen N. Genome-wide variations in a natural isolate of the nematode Caenorhabditis elegans. BMC Genomics 2014; 15:255. [PMID: 24694239 PMCID: PMC4023591 DOI: 10.1186/1471-2164-15-255] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2013] [Accepted: 03/03/2014] [Indexed: 12/02/2022] Open
Abstract
Background Increasing genetic and phenotypic differences found among natural isolates of C. elegans have encouraged researchers to explore the natural variation of this nematode species. Results Here we report on the identification of genomic differences between the reference strain N2 and the Hawaiian strain CB4856, one of the most genetically distant strains from N2. To identify both small- and large-scale genomic variations (GVs), we have sequenced the CB4856 genome using both Roche 454 (~400 bps single reads) and Illumina GA DNA sequencing methods (101 bps paired-end reads). Compared to previously described variants (available in WormBase), our effort uncovered twice as many single nucleotide variants (SNVs) and increased the number of small InDels almost 20-fold. Moreover, we identified and validated large insertions, most of which range from 150 bps to 1.2 kb in length in the CB4856 strain. Identified GVs had a widespread impact on protein-coding sequences, including 585 single-copy genes that have associated severe phenotypes of reduced viability in RNAi and genetics studies. Sixty of these genes are homologs of human genes associated with diseases. Furthermore, our work confirms previously identified GVs associated with differences in behavioural and biological traits between the N2 and CB4856 strains. Conclusions The identified GVs provide a rich resource for future studies that aim to explain the genetic basis for other trait differences between the N2 and CB4856 strains.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | - Nansheng Chen
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, British Columbia V5A 1S6, Canada.
| |
Collapse
|
9
|
Abstract
Background Segmental duplications (SDs) or low-copy repeats play important roles in both gene and genome evolution. SDs have been extensively investigated in many organisms, however, there is no information about SDs in the silkworm, Bombyx mori. Result In this study, we identified and annotated the SDs in the silkworm genome. Our results suggested that SDs constitute ~1.4% of the silkworm genome sequence (≥1 kb in length and ≥90% in the identity of sequence); the number is similar to that in Drosophila melanogaster but smaller than mammalian organisms. Almost half (42%) of the SD sequences are not assigned to chromosomes, indicating that the SDs are challenges for the assembling of genome sequences. We also provided experimental validation of large duplications using qPCR. The analysis of SD content indicated that the genes related to immunity, detoxification, reproduction, and environmental signal recognition are significantly enriched in the silkworm SDs. Conclusion Our results suggested that segmental duplications have been problematic for sequencing and assembling of the silkworm genome. SDs may have important biological significances in immunity, detoxification, reproduction, and environmental signal recognition in the silkworm. This study provides insight into the evolution of the silkworm genome and an invaluable resource for insect genomics research.
Collapse
|
10
|
Gaglia MM, Jeong DE, Ryu EA, Lee D, Kenyon C, Lee SJ. Genes that act downstream of sensory neurons to influence longevity, dauer formation, and pathogen responses in Caenorhabditis elegans. PLoS Genet 2012; 8:e1003133. [PMID: 23284299 PMCID: PMC3527274 DOI: 10.1371/journal.pgen.1003133] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2012] [Accepted: 10/15/2012] [Indexed: 12/11/2022] Open
Abstract
The sensory systems of multicellular organisms are designed to provide information about the environment and thus elicit appropriate changes in physiology and behavior. In the nematode Caenorhabditis elegans, sensory neurons affect the decision to arrest during development in a diapause state, the dauer larva, and modulate the lifespan of the animals in adulthood. However, the mechanisms underlying these effects are incompletely understood. Using whole-genome microarray analysis, we identified transcripts whose levels are altered by mutations in the intraflagellar transport protein daf-10, which result in impaired development and function of many sensory neurons in C. elegans. In agreement with existing genetic data, the expression of genes regulated by the transcription factor DAF-16/FOXO was affected by daf-10 mutations. In addition, we found altered expression of transcriptional targets of the DAF-12/nuclear hormone receptor in the daf-10 mutants and showed that this pathway influences specifically the dauer formation phenotype of these animals. Unexpectedly, pathogen-responsive genes were repressed in daf-10 mutant animals, and these sensory mutants exhibited altered susceptibility to and behavioral avoidance of bacterial pathogens. Moreover, we found that a solute transporter gene mct-1/2, which was induced by daf-10 mutations, was necessary and sufficient for longevity. Thus, sensory input seems to influence an extensive transcriptional network that modulates basic biological processes in C. elegans. This situation is reminiscent of the complex regulation of physiology by the mammalian hypothalamus, which also receives innervations from sensory systems, most notably the visual and olfactory systems. The senses provide animals with information about their environment, which affects not only their behavior but also their internal state and physiological outputs. How this information is processed is still unclear. In this study, we used mutant C. elegans roundworms that had defective sensory neurons to investigate how changes in sensation alter the expression of genes and regulate physiology, specifically the worms' choice to hibernate during growth and their longevity as fully-grown adults. We showed that defects in sensory neurons change the pattern of gene expression and regulate these outputs through known hormonal pathways, including insulin/IGF-1 and steroid pathways. We also identified a new regulator of longevity, MCT-1, that is predicted to transport small metabolites and hormones in the body. Unexpectedly, we found that sensory impairment altered yet another physiological output, the response to infectious agents. It prevented the worms from avoiding infectious bacteria and reduced the expression of potentially protective factors, but also increased the worms' resistance to infection, suggesting a complex network of responses to environmental stimuli. Understanding how sensory information is relayed in this relatively simple organism may inform our understanding of sensory processing in higher organisms like mammals.
Collapse
Affiliation(s)
- Marta M Gaglia
- Neuroscience Graduate Program and Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, USA
| | | | | | | | | | | |
Collapse
|
11
|
Vergara IA, Chen N. Large synteny blocks revealed between Caenorhabditis elegans and Caenorhabditis briggsae genomes using OrthoCluster. BMC Genomics 2010; 11:516. [PMID: 20868500 PMCID: PMC2997010 DOI: 10.1186/1471-2164-11-516] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2009] [Accepted: 09/24/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Accurate identification of synteny blocks is an important step in comparative genomics towards the understanding of genome architecture and expression. Most computer programs developed in the last decade for identifying synteny blocks have limitations. To address these limitations, we recently developed a robust program called OrthoCluster, and an online database OrthoClusterDB. In this work, we have demonstrated the application of OrthoCluster in identifying synteny blocks between the genomes of Caenorhabditis elegans and Caenorhabditis briggsae, two closely related hermaphrodite nematodes. RESULTS Initial identification and analysis of synteny blocks using OrthoCluster enabled us to systematically improve the genome annotation of C. elegans and C. briggsae, identifying 52 potential novel genes in C. elegans, 582 in C. briggsae, and 949 novel orthologous relationships between these two species. Using the improved annotation, we have detected 3,058 perfect synteny blocks that contain no mismatches between C. elegans and C. briggsae. Among these synteny blocks, the majority are mapped to homologous chromosomes, as previously reported. The largest perfect synteny block contains 42 genes, which spans 201.2 kb in Chromosome V of C. elegans. On average, perfect synteny blocks span 18.8 kb in length. When some mismatches (interruptions) are allowed, synteny blocks ("imperfect synteny blocks") that are much larger in size are identified. We have shown that the majority (80%) of the C. elegans and C. briggsae genomes are covered by imperfect synteny blocks. The largest imperfect synteny block spans 6.14 Mb in Chromosome X of C. elegans and there are 11 synteny blocks that are larger than 1 Mb in size. On average, imperfect synteny blocks span 63.6 kb in length, larger than previously reported. CONCLUSIONS We have demonstrated that OrthoCluster can be used to accurately identify synteny blocks and have found that synteny blocks between C. elegans and C. briggsae are almost three-folds larger than previously identified.
Collapse
Affiliation(s)
- Ismael A Vergara
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, Canada
| | | |
Collapse
|
12
|
Martens C, Van de Peer Y. The hidden duplication past of the plant pathogen Phytophthora and its consequences for infection. BMC Genomics 2010; 11:353. [PMID: 20525264 PMCID: PMC2996974 DOI: 10.1186/1471-2164-11-353] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2010] [Accepted: 06/03/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Oomycetes of the genus Phytophthora are pathogens that infect a wide range of plant species. For dicot hosts such as tomato, potato and soybean, Phytophthora is even the most important pathogen. Previous analyses of Phytophthora genomes uncovered many genes, large gene families and large genome sizes that can partially be explained by significant repeat expansion patterns. RESULTS Analysis of the complete genomes of three different Phytophthora species, using a newly developed approach, unveiled a large number of small duplicated blocks, mainly consisting of two or three consecutive genes. Further analysis of these duplicated genes and comparison with the known gene and genome duplication history of ten other eukaryotes including parasites, algae, plants, fungi, vertebrates and invertebrates, suggests that the ancestor of P. infestans, P. sojae and P. ramorum most likely underwent a whole genome duplication (WGD). Genes that have survived in duplicate are mainly genes that are known to be preferentially retained following WGDs, but also genes important for pathogenicity and infection of the different hosts seem to have been retained in excess. As a result, the WGD might have contributed to the evolutionary and pathogenic success of Phytophthora. CONCLUSIONS The fact that we find many small blocks of duplicated genes indicates that the genomes of Phytophthora species have been heavily rearranged following the WGD. Most likely, the high repeat content in these genomes have played an important role in this rearrangement process. As a consequence, the paucity of retained larger duplicated blocks has greatly complicated previous attempts to detect remnants of a large-scale duplication event in Phytophthora. However, as we show here, our newly developed strategy to identify very small duplicated blocks might be a useful approach to uncover ancient polyploidy events, in particular for heavily rearranged genomes.
Collapse
Affiliation(s)
- Cindy Martens
- Department of Plant Systems Biology, VIB, Technologiepark 927, B-9052 Ghent, Belgium
- Bioinformatics and Evolutionary Genomics, Department of Molecular Genetics, Technologiepark 927, Ghent University, B-9052 Ghent, Belgium
| | - Yves Van de Peer
- Department of Plant Systems Biology, VIB, Technologiepark 927, B-9052 Ghent, Belgium
- Bioinformatics and Evolutionary Genomics, Department of Molecular Genetics, Technologiepark 927, Ghent University, B-9052 Ghent, Belgium
| |
Collapse
|
13
|
Abstract
Deep sequencing offers an unprecedented view of an organism's genome. We describe the spectrum of mutations induced by three commonly used mutagens: ethyl methanesulfonate (EMS), N-ethyl-N-nitrosourea (ENU), and ultraviolet trimethylpsoralen (UV/TMP) in the nematode Caenorhabditis elegans. Our analysis confirms the strong GC to AT transition bias of EMS. We found that ENU mainly produces A to T and T to A transversions, but also all possible transitions. We found no bias for any specific transition or transversion in the spectrum of UV/TMP-induced mutations. In 10 mutagenized strains we identified 2723 variants, of which 508 are expected to alter or disrupt gene function, including 21 nonsense mutations and 10 mutations predicted to affect mRNA splicing. This translates to an average of 50 informative mutations per strain. We also present evidence of genetic drift among laboratory wild-type strains derived from the Bristol N2 strain. We make several suggestions for best practice using massively parallel short read sequencing to ensure mutation detection.
Collapse
|
14
|
Rose AM, O'Neil NJ, Bilenky M, Butterfield YS, Malhis N, Flibotte S, Jones MR, Marra M, Baillie DL, Jones SJM. Genomic sequence of a mutant strain of Caenorhabditis elegans with an altered recombination pattern. BMC Genomics 2010; 11:131. [PMID: 20178641 PMCID: PMC2837035 DOI: 10.1186/1471-2164-11-131] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2009] [Accepted: 02/23/2010] [Indexed: 11/10/2022] Open
Abstract
Background The original sequencing and annotation of the Caenorhabditis elegans genome along with recent advances in sequencing technology provide an exceptional opportunity for the genomic analysis of wild-type and mutant strains. Using the Illumina Genome Analyzer, we sequenced the entire genome of Rec-1, a strain that alters the distribution of meiotic crossovers without changing the overall frequency. Rec-1 was derived from ethylmethane sulfonate (EMS)-treated strains, one of which had a high level of transposable element mobility. Sequencing of this strain provides an opportunity to examine the consequences on the genome of altering the distribution of meiotic recombination events. Results Using Illumina sequencing and MAQ software, 83% of the base pair sequence reads were aligned to the reference genome available at Wormbase, providing a 21-fold coverage of the genome. Using the software programs MAQ and Slider, we observed 1124 base pair differences between Rec-1 and the reference genome in Wormbase (WS190), and 441 between the mutagenized Rec-1 (BC313) and the wild-type N2 strain (VC2010). The most frequent base-substitution was G:C to A:T, 141 for the entire genome most of which were on chromosomes I or X, 55 and 31 respectively. With this data removed, no obvious pattern in the distribution of the base differences along the chromosomes was apparent. No major chromosomal rearrangements were observed, but additional insertions of transposable elements were detected. There are 11 extra copies of Tc1, and 8 of Tc2 in the Rec-1 genome, most likely the remains of past high-hopper activity in a progenitor strain. Conclusion Our analysis of high-throughput sequencing was able to detect regions of direct repeat sequences, deletions, insertions of transposable elements, and base pair differences. A subset of sequence alterations affecting coding regions were confirmed by an independent approach using oligo array comparative genome hybridization. The major phenotype of the Rec-1 strain is an alteration in the preferred position of the meiotic recombination event with no other significant phenotypic consequences. In this study, we observed no evidence of a mutator effect at the nucleotide level attributable to the Rec-1 mutation.
Collapse
Affiliation(s)
- Ann M Rose
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Vergara IA, Chen N. Using OrthoCluster for the detection of synteny blocks among multiple genomes. ACTA ACUST UNITED AC 2009; Chapter 6:Unit 6.10 6.10.1-18. [PMID: 19728289 DOI: 10.1002/0471250953.bi0610s27] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Synteny blocks are composed of two or more orthologous genes conserved among species, resulting from speciation from their last common ancestor. OrthoCluster (Zeng et al., 2008) is a fast and easy-to-use program for the identification of synteny blocks among multiple genomes. It allows users to identify synteny blocks that contain different types of mismatches, and to decide whether they require conservation of gene orientation and conservation of gene order within the blocks. OrthoCluster can also be used to find duplicated blocks within genomes. Although genes and their correspondence are usually used as input for OrthoCluster, in fact, OrthoCluster can be applied using any type of markers as input as long as their relationships can be established. OrthoClusterDB provides a Web interface for running OrthoCluster with user-defined datasets and parameters, as well as for browsing and downloading precomputed synteny blocks for different groups of genomes.
Collapse
Affiliation(s)
- Ismael A Vergara
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia, Canada
| | | |
Collapse
|